<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Splunk truncating large input json data and not indexing. in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-truncating-large-input-json-data-and-not-indexing/m-p/465612#M99576</link>
    <description>&lt;P&gt;Hi &lt;/P&gt;

&lt;P&gt;I have used python script to call some api's and sending the response to splunk.&lt;/P&gt;

&lt;P&gt;If the response is small, splunk taking the data and indexing it, but if the data is large, splunk is truncating the data at some point and displaying only the truncated data:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;source_type= _Json
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;props.conf:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[sourcetype]
TRUNCATE = 1000000
KV_MODE = json
CHARSET = UTF-8
LINE_BREAKER = +[^}]+}(,[\r\n]+)
SHOULD_LINEMERGE = false
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;input.conf:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[default]
host = F1WQGT1
time_before_close = 5
multiline_event_extra_waittime = true
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;Please suggest me some better ideas.&lt;/P&gt;</description>
    <pubDate>Fri, 13 Dec 2019 06:16:55 GMT</pubDate>
    <dc:creator>swithinb</dc:creator>
    <dc:date>2019-12-13T06:16:55Z</dc:date>
    <item>
      <title>Splunk truncating large input json data and not indexing.</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-truncating-large-input-json-data-and-not-indexing/m-p/465612#M99576</link>
      <description>&lt;P&gt;Hi &lt;/P&gt;

&lt;P&gt;I have used python script to call some api's and sending the response to splunk.&lt;/P&gt;

&lt;P&gt;If the response is small, splunk taking the data and indexing it, but if the data is large, splunk is truncating the data at some point and displaying only the truncated data:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;source_type= _Json
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;props.conf:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[sourcetype]
TRUNCATE = 1000000
KV_MODE = json
CHARSET = UTF-8
LINE_BREAKER = +[^}]+}(,[\r\n]+)
SHOULD_LINEMERGE = false
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;input.conf:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[default]
host = F1WQGT1
time_before_close = 5
multiline_event_extra_waittime = true
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;Please suggest me some better ideas.&lt;/P&gt;</description>
      <pubDate>Fri, 13 Dec 2019 06:16:55 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Splunk-truncating-large-input-json-data-and-not-indexing/m-p/465612#M99576</guid>
      <dc:creator>swithinb</dc:creator>
      <dc:date>2019-12-13T06:16:55Z</dc:date>
    </item>
    <item>
      <title>Re: Splunk truncating large input json data and not indexing.</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-truncating-large-input-json-data-and-not-indexing/m-p/465613#M99577</link>
      <description>&lt;P&gt;If you are sure that your settings are correct (and it looks like &lt;CODE&gt;TRUNCATE&lt;/CODE&gt; is), it must be something else.  If you are doing a sourcetype override/overwrite, you must use the &lt;EM&gt;ORIGINAL&lt;/EM&gt; value, &lt;EM&gt;NOT&lt;/EM&gt; the new value.  You must deploy your settings to the first full instance(s) of Splunk that handle the events (usually either the HF tier if you use one, or else your Indexer tier) UNLESS you are using HEC's JSON endpoint (it gets pre-cooked) or INDEXED_EXTRACTIONS (configs go on the UF in that case), then restart all Splunk instances there.  When (re)evaluating, you must send in new events (old events will stay broken), then test using &lt;CODE&gt;_index_earliest=-5m&lt;/CODE&gt; to be absolutely certain that you are only examining the newly indexed events.&lt;/P&gt;</description>
      <pubDate>Wed, 30 Sep 2020 03:22:27 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Splunk-truncating-large-input-json-data-and-not-indexing/m-p/465613#M99577</guid>
      <dc:creator>woodcock</dc:creator>
      <dc:date>2020-09-30T03:22:27Z</dc:date>
    </item>
    <item>
      <title>Re: Splunk truncating large input json data and not indexing.</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-truncating-large-input-json-data-and-not-indexing/m-p/465614#M99578</link>
      <description>&lt;P&gt;It is &lt;CODE&gt;sourcetype&lt;/CODE&gt;, not &lt;CODE&gt;source_type&lt;/CODE&gt;.  It is &lt;CODE&gt;inputs.conf&lt;/CODE&gt;, not &lt;CODE&gt;input.conf&lt;/CODE&gt;.  It is &lt;CODE&gt;_json&lt;/CODE&gt;, not &lt;CODE&gt;_Json&lt;/CODE&gt;.  You should NEVER use &lt;CODE&gt;_json&lt;/CODE&gt; as your &lt;CODE&gt;sourcetype&lt;/CODE&gt;; make up your own appropriate to your data and add &lt;CODE&gt;KV_MODE = json&lt;/CODE&gt; to your &lt;CODE&gt;props.conf&lt;/CODE&gt; on your &lt;CODE&gt;Search Head&lt;/CODE&gt; to get all the "fee" JSON &lt;CODE&gt;field extractions&lt;/CODE&gt;.&lt;/P&gt;</description>
      <pubDate>Fri, 13 Dec 2019 10:59:48 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Splunk-truncating-large-input-json-data-and-not-indexing/m-p/465614#M99578</guid>
      <dc:creator>woodcock</dc:creator>
      <dc:date>2019-12-13T10:59:48Z</dc:date>
    </item>
  </channel>
</rss>

