<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic splunk didn't ingest all json-objects in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/splunk-didn-t-ingested-all-json-objects/m-p/697205#M115659</link>
    <description>&lt;P&gt;I have a JSON file with 23,904 objects in it. They all look like this:&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;{
  "1.Entry": "1.Data",
  ...
  "44.Entry": "44.Data"
},


... 23,902 similar entries ...

{
  "1.Entry": "1.Data",
  ...
  "44.Entry": "44.Data"
}

&lt;/LI-CODE&gt;
&lt;P&gt;But forwarding the JSON file resulted in only 22,256 events (representing 22,256 JSON objects).&lt;/P&gt;
&lt;P&gt;My props.conf&lt;/P&gt;
&lt;P&gt;[json_test]&lt;BR /&gt;DATETIME_CONFIG =&lt;BR /&gt;TIMESTAMP_FIELDS = test.sys_created_on&lt;BR /&gt;INDEXED_EXTRACTIONS = json&lt;BR /&gt;KV_MODE = none&lt;BR /&gt;LINE_BREAKER = ([\r\n]+)&lt;BR /&gt;NO_BINARY_CHECK = true&lt;BR /&gt;category = Structured&lt;BR /&gt;description = test json&lt;BR /&gt;disabled = false&lt;BR /&gt;pulldown_type = true&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;So the problem is not that a single event is truncated, but that the JSON file as a whole is.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Sat, 24 Aug 2024 21:02:32 GMT</pubDate>
    <dc:creator>a101755</dc:creator>
    <dc:date>2024-08-24T21:02:32Z</dc:date>
    <item>
      <title>splunk didn't ingest all json-objects</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/splunk-didn-t-ingested-all-json-objects/m-p/697205#M115659</link>
      <description>&lt;P&gt;I have a JSON file with 23,904 objects in it. They all look like this:&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;{
  "1.Entry": "1.Data",
  ...
  "44.Entry": "44.Data"
},


... 23,902 similar entries ...

{
  "1.Entry": "1.Data",
  ...
  "44.Entry": "44.Data"
}

&lt;/LI-CODE&gt;
&lt;P&gt;But forwarding the JSON file resulted in only 22,256 events (representing 22,256 JSON objects).&lt;/P&gt;
&lt;P&gt;My props.conf&lt;/P&gt;
&lt;P&gt;[json_test]&lt;BR /&gt;DATETIME_CONFIG =&lt;BR /&gt;TIMESTAMP_FIELDS = test.sys_created_on&lt;BR /&gt;INDEXED_EXTRACTIONS = json&lt;BR /&gt;KV_MODE = none&lt;BR /&gt;LINE_BREAKER = ([\r\n]+)&lt;BR /&gt;NO_BINARY_CHECK = true&lt;BR /&gt;category = Structured&lt;BR /&gt;description = test json&lt;BR /&gt;disabled = false&lt;BR /&gt;pulldown_type = true&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;So the problem is not that a single event is truncated, but that the JSON file as a whole is.&lt;/P&gt;
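&lt;P&gt;A minimal sketch for cross-checking the object count outside Splunk. It assumes the file is a plain sequence of objects separated by commas and blank lines, exactly as in the sample above; the function name count_top_level_objects is made up for illustration. It uses only Python's standard json module and reports the character offset at which parsing first fails, if it fails at all.&lt;/P&gt;

```python
import json


def count_top_level_objects(text):
    """Count top-level JSON values in text, tolerating commas between them.

    Returns (count, error_offset). error_offset is None when the whole
    text was consumed without a parse error.
    Assumption: values are separated only by whitespace and commas.
    If the whole file is one JSON array, it is counted as a single value.
    """
    decoder = json.JSONDecoder()
    pos = 0
    count = 0
    end = len(text)
    while True:
        # Skip whitespace and the commas that separate the objects.
        while pos != end and text[pos] in " \t\r\n,":
            pos += 1
        if pos == end:
            return count, None
        try:
            # raw_decode parses one value and returns its end offset.
            _, pos = decoder.raw_decode(text, pos)
        except json.JSONDecodeError as err:
            return count, err.pos
        count += 1
```

&lt;P&gt;Reading the forwarded file as text and passing it to count_top_level_objects gives a number that can be compared with the event count in the index.&lt;/P&gt;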
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 24 Aug 2024 21:02:32 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/splunk-didn-t-ingested-all-json-objects/m-p/697205#M115659</guid>
      <dc:creator>a101755</dc:creator>
      <dc:date>2024-08-24T21:02:32Z</dc:date>
    </item>
    <item>
      <title>Re: splunk didn't ingest all json-objects</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/splunk-didn-t-ingested-all-json-objects/m-p/697208#M115661</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/145296"&gt;@a101755&lt;/a&gt;,&lt;BR /&gt;&lt;BR /&gt;Try adding the settings below to the monitor stanza in inputs.conf.&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;crcSalt = &amp;lt;SOURCE&amp;gt;
initCrcLength = 2048&lt;/LI-CODE&gt;</description>
      <pubDate>Sun, 25 Aug 2024 03:28:34 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/splunk-didn-t-ingested-all-json-objects/m-p/697208#M115661</guid>
      <dc:creator>manjunathmeti</dc:creator>
      <dc:date>2024-08-25T03:28:34Z</dc:date>
    </item>
    <item>
      <title>Re: splunk didn't ingest all json-objects</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/splunk-didn-t-ingested-all-json-objects/m-p/697225#M115664</link>
      <description>&lt;P&gt;Thank you &lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/129090"&gt;@manjunathmeti&lt;/a&gt;.&lt;/P&gt;&lt;P&gt;But it doesn't work. The result is the same as before.&lt;/P&gt;&lt;P&gt;I think your advice helps when Splunk doesn't import a file at all, because it is not salted and/or its first characters don't differ from those of a file imported before.&lt;/P&gt;&lt;P&gt;Further investigation:&lt;/P&gt;&lt;P&gt;I exported the events from Splunk (CSV) and compared the original file with the export.&lt;/P&gt;&lt;P&gt;I can't see any pattern in which objects are imported and which are not. A pattern would be, for example, that only the first 22,256 objects were imported.&lt;/P&gt;&lt;P&gt;Instead I see that object 66 is not imported, and 104, 108, 113, and so on are not imported either.&lt;/P&gt;&lt;P&gt;I think there is a limit on importing JSON objects. But which one is it?&lt;/P&gt;</description>
      <pubDate>Sun, 25 Aug 2024 13:09:10 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/splunk-didn-t-ingested-all-json-objects/m-p/697225#M115664</guid>
      <dc:creator>a101755</dc:creator>
      <dc:date>2024-08-25T13:09:10Z</dc:date>
    </item>
    <item>
      <title>Re: splunk didn't ingest all json-objects</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/splunk-didn-t-ingested-all-json-objects/m-p/697233#M115667</link>
      <description>&lt;P&gt;Further investigation:&lt;/P&gt;&lt;P&gt;I shortened the json-objects from 44 to 43 lines.&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;{
  "1.Entry": "1.Data",
  ...
  "43.Entry": "43.Data"
},


... 48,186 similar entries ...

{
  "1.Entry": "1.Data",
  ...
  "43.Entry": "43.Data"
}&lt;/LI-CODE&gt;&lt;P&gt;But forwarding the JSON file resulted in 45,352 events (representing 45,352 JSON objects) instead of 48,188 objects.&lt;/P&gt;&lt;P&gt;That's a little bit 'loco', I think.&lt;/P&gt;</description>
      <pubDate>Sun, 25 Aug 2024 17:45:56 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/splunk-didn-t-ingested-all-json-objects/m-p/697233#M115667</guid>
      <dc:creator>a101755</dc:creator>
      <dc:date>2024-08-25T17:45:56Z</dc:date>
    </item>
    <item>
      <title>Re: splunk didn't ingest all json-objects</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/splunk-didn-t-ingested-all-json-objects/m-p/697234#M115668</link>
      <description>&lt;P&gt;There is something not right about this. If your events are indeed formed this way (as multiline entries) and your LINE_BREAKER is set to ([\r\n]+), there is no way they are ingested as a whole.&lt;/P&gt;&lt;P&gt;Tell us more about how you are ingesting the data (and if you're reading a file with a forwarder, show us the relevant inputs.conf and props.conf stanzas from the forwarder).&lt;/P&gt;</description>
      <pubDate>Sun, 25 Aug 2024 18:20:14 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/splunk-didn-t-ingested-all-json-objects/m-p/697234#M115668</guid>
      <dc:creator>PickleRick</dc:creator>
      <dc:date>2024-08-25T18:20:14Z</dc:date>
    </item>
    <item>
      <title>Re: splunk didn't ingest all json-objects</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/splunk-didn-t-ingested-all-json-objects/m-p/697235#M115669</link>
      <description>&lt;P&gt;Thank you for your questions, &lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/231884"&gt;@PickleRick&lt;/a&gt;.&lt;/P&gt;&lt;P&gt;I'm using the forwarding mechanism.&lt;/P&gt;&lt;P&gt;Here are the stanzas from the forwarder:&lt;/P&gt;&lt;P&gt;inputs.conf&lt;/P&gt;&lt;P&gt;[monitor:///daten/datasources/data/mg_test/entry2group/*.json]&lt;BR /&gt;disabled = false&lt;BR /&gt;index = mg_test&lt;BR /&gt;sourcetype = json_test&lt;BR /&gt;crcSalt = &amp;lt;SOURCE&amp;gt;&lt;BR /&gt;whitelist = .*\d{8}_Q\d_entry_entry2group\.v\d\.(\d\d\.){2}json$&lt;/P&gt;&lt;P&gt;props.conf&lt;/P&gt;&lt;P&gt;[json_test]&lt;BR /&gt;DATETIME_CONFIG =&lt;BR /&gt;TIMESTAMP_FIELDS = test.sys_created_on&lt;BR /&gt;INDEXED_EXTRACTIONS = json&lt;BR /&gt;KV_MODE = none&lt;BR /&gt;LINE_BREAKER = ([\r\n]+)&lt;BR /&gt;NO_BINARY_CHECK = true&lt;BR /&gt;category = Structured&lt;BR /&gt;description = test json&lt;BR /&gt;disabled = false&lt;BR /&gt;pulldown_type = true&lt;/P&gt;&lt;P&gt;I copied this props.conf from my first attempt to upload the file (via Splunk Web).&lt;/P&gt;&lt;P&gt;Here is the stanza from ../etc/system/local/props.conf&lt;/P&gt;&lt;P&gt;[test_json]&lt;BR /&gt;DATETIME_CONFIG =&lt;BR /&gt;INDEXED_EXTRACTIONS = json&lt;BR /&gt;KV_MODE = none&lt;BR /&gt;LINE_BREAKER = ([\r\n]+)&lt;BR /&gt;TIMESTAMP_FIELDS = test.sys_created_on&lt;BR /&gt;category = Structured&lt;BR /&gt;description = JavaScript Object Notation format. For more information, visit &lt;A href="http://json.org/" target="_blank" rel="noopener"&gt;http://json.org/&lt;/A&gt;&lt;BR /&gt;disabled = false&lt;BR /&gt;pulldown_type = true&lt;/P&gt;&lt;P&gt;Further investigation shows that you are on the right track!&lt;/P&gt;&lt;P&gt;I found the following event in _internal.&lt;/P&gt;&lt;PRE&gt;08-25-2024 19:31:28.338 +0200 ERROR JsonLineBreaker [1737739 structuredparsing] - JSON StreamId:1586716756715697390 had parsing error:Unexpected character while looking for value: ',' - data_source="daten/datasources/data/mg_test/entry2group/20240825_Q2_entry_entry2group.v0.03.01.json]", data_host="socmg_local_fw", data_sourcetype="json_test"&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So as the next step I will isolate one event (object) that gets lost and check whether there are special characters in its data.&lt;/P&gt;</description>
      <pubDate>Sun, 25 Aug 2024 19:20:51 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/splunk-didn-t-ingested-all-json-objects/m-p/697235#M115669</guid>
      <dc:creator>a101755</dc:creator>
      <dc:date>2024-08-25T19:20:51Z</dc:date>
    </item>
    <item>
      <title>Re: splunk didn't ingest all json-objects</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/splunk-didn-t-ingested-all-json-objects/m-p/697236#M115670</link>
      <description>&lt;P&gt;I've parsed my input file (with a JSON parser), and just before one of the missing events there is an error like "unexpected non-whitespace character".&lt;/P&gt;&lt;P&gt;So I think it is not a Splunk problem!&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sun, 25 Aug 2024 20:23:33 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/splunk-didn-t-ingested-all-json-objects/m-p/697236#M115670</guid>
      <dc:creator>a101755</dc:creator>
      <dc:date>2024-08-25T20:23:33Z</dc:date>
    </item>
    <item>
      <title>Re: splunk didn't ingest all json-objects</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/splunk-didn-t-ingested-all-json-objects/m-p/697363#M115683</link>
      <description>&lt;P&gt;Hello again,&lt;/P&gt;&lt;P&gt;my last entry&lt;/P&gt;&lt;P&gt;&lt;EM&gt;"I've parsed my input file (with a JSON parser), and just before one of the missing events there is an error like unexpected non-whitespace character.&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;So I think it is not a Splunk problem!&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;&lt;EM&gt;" &lt;/EM&gt;was a wrong result. I made a mistake in my investigation.&lt;/P&gt;&lt;P&gt;So I used the program jq (on Ubuntu Linux) to validate the whole JSON file.&lt;/P&gt;&lt;P&gt;Surprise: there is no error in the JSON file. I checked the copy of the JSON file in the forwarder directory.&lt;/P&gt;&lt;P&gt;So I guess there is a character in the data that Splunk "misunderstands" and that breaks the JSON structure.&lt;/P&gt;</description>
      <pubDate>Mon, 26 Aug 2024 20:02:35 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/splunk-didn-t-ingested-all-json-objects/m-p/697363#M115683</guid>
      <dc:creator>a101755</dc:creator>
      <dc:date>2024-08-26T20:02:35Z</dc:date>
    </item>
  </channel>
</rss>

