<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: De-nesting JSON during indexing in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/De-nesting-JSON-during-indexing/m-p/581071#M102472</link>
    <description>&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;When you are using TA/SA etc. it's usually best to use those same sourcetype names what has defined there. There could be additional TA/SA/Apps etc which are expecting those names. But if you want, you can choose what ever you like.&lt;/P&gt;&lt;P&gt;If you are doing ingestion time data management then it must be in the first full splunk instance in path from source to indexers. If only search time, then SC is the correct place. And if there is both then you needs it in both places. But read the TA's installation instructions which told how to install it in distributed environment! I'm not sure if you can install it by yourself int SC with Victoria experience or not? With GUI you could add those individual KOs, but I prefer to use TA as is if possible.&lt;/P&gt;&lt;P&gt;If I recall right this TA is using polling from GCP to get those events. This means that you are needing some place where it can do it. And there can be only one instance running at same time, otherwise you will get (at least partially) duplicate events as individual instances cannot know what other have already gotten. Also you have full control to separate HF instead of SC where you haven't any control to indexer layer. At least I prefer separate HF for those.&lt;/P&gt;&lt;P&gt;r. Imo&lt;/P&gt;</description>
    <pubDate>Fri, 14 Jan 2022 10:17:16 GMT</pubDate>
    <dc:creator>isoutamo</dc:creator>
    <dc:date>2022-01-14T10:17:16Z</dc:date>
    <item>
      <title>De-nesting JSON during indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/De-nesting-JSON-during-indexing/m-p/561758#M100073</link>
      <description>&lt;P&gt;I have the following event from GCP pubsub:&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;TABLE&gt;&lt;TBODY&gt;&lt;TR&gt;&lt;TD&gt;&lt;DIV class="shared-eventsviewer-shared-rawfield"&gt;&lt;DIV class="json-event  wrap "&gt;&lt;DIV class="json-tree shared-jsontree"&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;SPAN class="key level-1"&gt;&lt;SPAN class="key-name"&gt;attributes&lt;/SPAN&gt;:&amp;nbsp;{&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;}&lt;/SPAN&gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;SPAN class="key level-1"&gt;&lt;SPAN class="key-name"&gt;data&lt;/SPAN&gt;:&amp;nbsp;{&amp;nbsp;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;SPAN class="key level-2"&gt;&lt;SPAN class="key-name"&gt;insertId&lt;/SPAN&gt;:&amp;nbsp;&lt;SPAN class="t string"&gt;dbp95qcbup&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;SPAN class="key level-2"&gt;&lt;SPAN class="key-name"&gt;logName&lt;/SPAN&gt;:&amp;nbsp;&lt;SPAN class="t string"&gt;organizations/xxxxxxx/logs/cloudaudit.googleapis.com%2Fdata_access&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;SPAN class="key level-2"&gt;&lt;SPAN class="key-name"&gt;protoPayload&lt;/SPAN&gt;:&amp;nbsp;{&amp;nbsp;&lt;A href="https://ids.clong.network:8000/en-US/app/search/search?q=search%20index%3Dgoogle&amp;amp;sid=1627943843.7&amp;amp;display.page.search.mode=fast&amp;amp;dispatch.sample_ratio=1&amp;amp;workload_pool=&amp;amp;earliest=-24h%40h&amp;amp;latest=now#" target="_blank" rel="noopener"&gt;[+]&lt;/A&gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;}&lt;/SPAN&gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;SPAN class="key level-2"&gt;&lt;SPAN class="key-name"&gt;receiveTimestamp&lt;/SPAN&gt;:&amp;nbsp;&lt;SPAN class="t string"&gt;2021-08-02T05:52:58.861079027Z&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;SPAN class="key level-2"&gt;&lt;SPAN class="key-name"&gt;resource&lt;/SPAN&gt;:&amp;nbsp;{&amp;nbsp;&lt;A href="https://ids.clong.network:8000/en-US/app/search/search?q=search%20index%3Dgoogle&amp;amp;sid=1627943843.7&amp;amp;display.page.search.mode=fast&amp;amp;dispatch.sample_ratio=1&amp;amp;workload_pool=&amp;amp;earliest=-24h%40h&amp;amp;latest=now#" target="_blank" rel="noopener"&gt;[+]&lt;/A&gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;}&lt;/SPAN&gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;SPAN class="key level-2"&gt;&lt;SPAN class="key-name"&gt;severity&lt;/SPAN&gt;:&amp;nbsp;&lt;SPAN class="t string"&gt;NOTICE&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;SPAN class="key level-2"&gt;&lt;SPAN class="key-name"&gt;timestamp&lt;/SPAN&gt;:&amp;nbsp;&lt;SPAN class="t string"&gt;2021-08-02T04:01:48.076823Z&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;}&lt;/SPAN&gt;&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;SPAN class="key level-1"&gt;&lt;SPAN class="key-name"&gt;publish_time&lt;/SPAN&gt;:&amp;nbsp;&lt;SPAN class="t number"&gt;1627883579.307&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/TD&gt;&lt;/TR&gt;&lt;/TBODY&gt;&lt;/TABLE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Is there any way to use a forwarder to only send the contents of data{} to Splunk? I essentially want to strip off the outer parts of the JSON attributes{}, publishtime and have the event sent as the contents of the data{} field:"&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;"insertId"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;"dbp95qcbup"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;"logName"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;"organizations/xxxxxxx/logs/cloudaudit.googleapis.com%2Fdata_access"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;"protoPayload"&lt;/SPAN&gt;&lt;SPAN&gt;: {},&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;"receiveTimestamp"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;"2021-08-02T05:52:58.861079027Z"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;"resource"&lt;/SPAN&gt;&lt;SPAN&gt;: {},&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;"severity"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;"NOTICE"&lt;/SPAN&gt;&lt;SPAN&gt;,&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;"timestamp"&lt;/SPAN&gt;&lt;SPAN&gt;: &lt;/SPAN&gt;&lt;SPAN&gt;"2021-08-02T04:01:48.076823Z"&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;}&lt;/SPAN&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Mon, 02 Aug 2021 22:44:07 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/De-nesting-JSON-during-indexing/m-p/561758#M100073</guid>
      <dc:creator>clong_</dc:creator>
      <dc:date>2021-08-02T22:44:07Z</dc:date>
    </item>
    <item>
      <title>Re: De-nesting JSON during indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/De-nesting-JSON-during-indexing/m-p/562069#M100117</link>
      <description>&lt;P&gt;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/149949"&gt;@clong_&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Can you please try this props.conf configuration&amp;nbsp;?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;[YOUR_SOURCETYPE]
SHOULD_LINEMERGE=true
LINE_BREAKER=([\r\n]+)
NO_BINARY_CHECK=true
CHARSET=UTF-8
INDEXED_EXTRACTIONS=json
SEDCMD-a=s/{ \"attributes\": \{.*\},\"data\":\s//g
SEDCMD-b=s/,\"publish_time\":\s\".*//g&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;My Sample Event:&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;{ "attributes": {},"data": { "insertId": "dbp95qcbup","logName": "organizations/xxxxxxx/logs/cloudaudit.googleapis.com%2Fdata_access","protoPayload" : { },"receiveTimestamp" : "2021-08-02T05:52:58.861079027Z","resource": {},"severity": "NOTICE","timestamp": "2021-08-02T04:01:48.076823Z"},"publish_time": "1627883579.307"}&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Screenshot 2021-08-04 at 6.10.04 PM.png" style="width: 400px;"&gt;&lt;img src="https://community.splunk.com/t5/image/serverpage/image-id/15425i859D3028211CDB86/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Screenshot 2021-08-04 at 6.10.04 PM.png" alt="Screenshot 2021-08-04 at 6.10.04 PM.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;KV &lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 04 Aug 2021 12:40:19 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/De-nesting-JSON-during-indexing/m-p/562069#M100117</guid>
      <dc:creator>kamlesh_vaghela</dc:creator>
      <dc:date>2021-08-04T12:40:19Z</dc:date>
    </item>
    <item>
      <title>Re: De-nesting JSON during indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/De-nesting-JSON-during-indexing/m-p/580250#M102377</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/127939"&gt;@kamlesh_vaghela&lt;/a&gt;,&lt;BR /&gt;&lt;BR /&gt;I also want to de-nest logs that have been pulled from GCP. have Splunk Cloud and as far as I know, I can't edit props.conf, is it possible to achieve this using the GUI in "Field transformations"?&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jan 2022 10:41:42 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/De-nesting-JSON-during-indexing/m-p/580250#M102377</guid>
      <dc:creator>briqpayerik</dc:creator>
      <dc:date>2022-01-07T10:41:42Z</dc:date>
    </item>
    <item>
      <title>Re: De-nesting JSON during indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/De-nesting-JSON-during-indexing/m-p/580251#M102378</link>
      <description>&lt;P&gt;Isn't spath command working for you, can't think of anything else..&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jan 2022 10:49:35 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/De-nesting-JSON-during-indexing/m-p/580251#M102378</guid>
      <dc:creator>SinghK</dc:creator>
      <dc:date>2022-01-07T10:49:35Z</dc:date>
    </item>
    <item>
      <title>Re: De-nesting JSON during indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/De-nesting-JSON-during-indexing/m-p/580293#M102385</link>
      <description>&lt;P&gt;You could/should setup a HF which do that before logs have sent to SC.&lt;/P&gt;&lt;P&gt;r. Ismo&lt;/P&gt;</description>
      <pubDate>Fri, 07 Jan 2022 16:46:14 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/De-nesting-JSON-during-indexing/m-p/580293#M102385</guid>
      <dc:creator>isoutamo</dc:creator>
      <dc:date>2022-01-07T16:46:14Z</dc:date>
    </item>
    <item>
      <title>Re: De-nesting JSON during indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/De-nesting-JSON-during-indexing/m-p/580651#M102431</link>
      <description>&lt;P&gt;Hi!&lt;/P&gt;&lt;P&gt;I followed your advice to setup a Heavy Forwarder and installed the Splunk Add-on for Google Cloud Platform. Then, I configured it to forward logs to Splunk Cloud.&lt;BR /&gt;I configured props.conf like in the original answer (on the HF) and can verify in the GUI that the changes has applied but the logs that are forwarded to SC have not been transformed / de-nested. I don't know where the problem lies.&lt;BR /&gt;&lt;BR /&gt;Question 1: Should the source type in props.conf be [google:gcp:pubsub:message] ?&lt;BR /&gt;Question 2: Should props.conf be configured on the HF or SC?&lt;BR /&gt;Question 3: Since I realised I can make this configuration in the GUI (on SC or IDM would work?), do I even need the HF?&lt;BR /&gt;&lt;BR /&gt;HF config:&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Screenshot 2022-01-11 at 17.40.02.png" style="width: 915px;"&gt;&lt;img src="https://community.splunk.com/t5/image/serverpage/image-id/17510i0F7F69A253552285/image-size/large?v=v2&amp;amp;px=999" role="button" title="Screenshot 2022-01-11 at 17.40.02.png" alt="Screenshot 2022-01-11 at 17.40.02.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Logs on SC:&lt;BR /&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Screenshot 2022-01-11 at 17.49.48.png" style="width: 400px;"&gt;&lt;img src="https://community.splunk.com/t5/image/serverpage/image-id/17512i5C7640C64A81B26B/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Screenshot 2022-01-11 at 17.49.48.png" alt="Screenshot 2022-01-11 at 17.49.48.png" /&gt;&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jan 2022 16:56:20 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/De-nesting-JSON-during-indexing/m-p/580651#M102431</guid>
      <dc:creator>briqpayerik</dc:creator>
      <dc:date>2022-01-11T16:56:20Z</dc:date>
    </item>
    <item>
      <title>Re: De-nesting JSON during indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/De-nesting-JSON-during-indexing/m-p/581069#M102471</link>
      <description>&lt;P&gt;We were able to get everything working in the end! Not sure what the exact problem was but the regex was invalid, we used regex101 with the&amp;nbsp; python flavor to fix and build our regex.&lt;/P&gt;</description>
      <pubDate>Fri, 14 Jan 2022 10:10:38 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/De-nesting-JSON-during-indexing/m-p/581069#M102471</guid>
      <dc:creator>briqpayerik</dc:creator>
      <dc:date>2022-01-14T10:10:38Z</dc:date>
    </item>
    <item>
      <title>Re: De-nesting JSON during indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/De-nesting-JSON-during-indexing/m-p/581071#M102472</link>
      <description>&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;When you are using TA/SA etc. it's usually best to use those same sourcetype names what has defined there. There could be additional TA/SA/Apps etc which are expecting those names. But if you want, you can choose what ever you like.&lt;/P&gt;&lt;P&gt;If you are doing ingestion time data management then it must be in the first full splunk instance in path from source to indexers. If only search time, then SC is the correct place. And if there is both then you needs it in both places. But read the TA's installation instructions which told how to install it in distributed environment! I'm not sure if you can install it by yourself int SC with Victoria experience or not? With GUI you could add those individual KOs, but I prefer to use TA as is if possible.&lt;/P&gt;&lt;P&gt;If I recall right this TA is using polling from GCP to get those events. This means that you are needing some place where it can do it. And there can be only one instance running at same time, otherwise you will get (at least partially) duplicate events as individual instances cannot know what other have already gotten. Also you have full control to separate HF instead of SC where you haven't any control to indexer layer. At least I prefer separate HF for those.&lt;/P&gt;&lt;P&gt;r. Imo&lt;/P&gt;</description>
      <pubDate>Fri, 14 Jan 2022 10:17:16 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/De-nesting-JSON-during-indexing/m-p/581071#M102472</guid>
      <dc:creator>isoutamo</dc:creator>
      <dc:date>2022-01-14T10:17:16Z</dc:date>
    </item>
  </channel>
</rss>

