<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Is it possible to remove unnecessary JSON wrapping before it's ingested to save license? in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-remove-unnecessary-JSON-wrapping-before-it-s/m-p/629365#M107961</link>
    <description>&lt;P&gt;As you have pure json event you probably could try INGEST_EVAL with json_extract?&amp;nbsp;&lt;A href="https://docs.splunk.com/Documentation/Splunk/9.0.3/SearchReference/JSONFunctions#json_extract.28.26lt.3Bjson.26gt.3B.2C_.26lt.3Bpaths.26gt.3B.29" target="_blank"&gt;https://docs.splunk.com/Documentation/Splunk/9.0.3/SearchReference/JSONFunctions#json_extract.28.26lt.3Bjson.26gt.3B.2C_.26lt.3Bpaths.26gt.3B.29&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 02 Feb 2023 16:29:27 GMT</pubDate>
    <dc:creator>isoutamo</dc:creator>
    <dc:date>2023-02-02T16:29:27Z</dc:date>
    <item>
      <title>Is it possible to remove unnecessary JSON wrapping before it's ingested to save license?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-remove-unnecessary-JSON-wrapping-before-it-s/m-p/629354#M107959</link>
      <description>&lt;P&gt;Hey there, we have a large volume (about 500-600gb) of data coming in daily but about 200gb of this is a JSON wrapper from Amazon Firehose. The data essentially looks like this:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;{
    "message": "ACTUAL_DATA_WE_WANT",
    "logGroup": "/use1/prod/eks/primary/containers",
    "logStream": "fluent-bit/cross-services/settings-7dbb9dbdb4-qjz5b/settings-api/81d3685eaaeae0effab5931590784016ce75a8171ad7e3e76152e30bd732a739",
    "timestamp": 1675349068034
}&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;As you can see,&amp;nbsp;ACTUAL_DATA_WE_WANT is what we need. This contains everything including timestamp and application information. The JSON wrapper is added by Firehose and makes up at least 250 bytes of every event.&lt;/P&gt;&lt;P&gt;Is it possible to remove all of this unnecessary data so that we can save ingestion for more useful things? I have heard that the SEDCMD can do this but it is resource intensive and we ingest almost a billion events a day.&lt;/P&gt;</description>
      <pubDate>Thu, 02 Feb 2023 15:16:46 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-remove-unnecessary-JSON-wrapping-before-it-s/m-p/629354#M107959</guid>
      <dc:creator>michael_sleep</dc:creator>
      <dc:date>2023-02-02T15:16:46Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to remove unnecessary JSON wrapping before it's ingested to save license?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-remove-unnecessary-JSON-wrapping-before-it-s/m-p/629357#M107960</link>
      <description>&lt;P&gt;Usually, this is done with &lt;FONT face="courier new,courier"&gt;SEDCMD&lt;/FONT&gt;.&amp;nbsp; The resource use depends on the efficiency of the regex used.&amp;nbsp; Test the regex on regex101.com and evaluate the resource usage on your dev/test instances.&lt;/P&gt;&lt;P&gt;Another option is to use Cribl to remove the unwanted bytes.&lt;/P&gt;</description>
      <pubDate>Thu, 02 Feb 2023 15:37:46 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-remove-unnecessary-JSON-wrapping-before-it-s/m-p/629357#M107960</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2023-02-02T15:37:46Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to remove unnecessary JSON wrapping before it's ingested to save license?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-remove-unnecessary-JSON-wrapping-before-it-s/m-p/629365#M107961</link>
      <description>&lt;P&gt;As you have pure json event you probably could try INGEST_EVAL with json_extract?&amp;nbsp;&lt;A href="https://docs.splunk.com/Documentation/Splunk/9.0.3/SearchReference/JSONFunctions#json_extract.28.26lt.3Bjson.26gt.3B.2C_.26lt.3Bpaths.26gt.3B.29" target="_blank"&gt;https://docs.splunk.com/Documentation/Splunk/9.0.3/SearchReference/JSONFunctions#json_extract.28.26lt.3Bjson.26gt.3B.2C_.26lt.3Bpaths.26gt.3B.29&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 02 Feb 2023 16:29:27 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-remove-unnecessary-JSON-wrapping-before-it-s/m-p/629365#M107961</guid>
      <dc:creator>isoutamo</dc:creator>
      <dc:date>2023-02-02T16:29:27Z</dc:date>
    </item>
  </channel>
</rss>

