<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Trying to extract fields from a hybrid log Syslog/JSON nested logs in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Trying-to-extract-fields-from-a-hybrid-log-Syslog-JSON-nested/m-p/751354#M119326</link>
    <description>&lt;P&gt;Hey&amp;nbsp;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/311023"&gt;@atme&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;It would be complex if you try to extract all of these fields at index time. The computational load would also increase. I would prefer going for search time extractions. However, if you still wish to extract fields at index time,&amp;nbsp; it would be great if you can share what you've configured till now in props and transforms. Since the _raw event varies in number of fields also,&amp;nbsp; we need to define a regex based pattern or key-value pair to extract the fields.&lt;/P&gt;&lt;P&gt;Thanks,&lt;BR /&gt;Tejas.&lt;/P&gt;</description>
    <pubDate>Mon, 11 Aug 2025 11:19:00 GMT</pubDate>
    <dc:creator>tej57</dc:creator>
    <dc:date>2025-08-11T11:19:00Z</dc:date>
    <item>
      <title>Trying to extract fields from a hybrid log Syslog/JSON nested logs</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Trying-to-extract-fields-from-a-hybrid-log-Syslog-JSON-nested/m-p/751019#M119279</link>
      <description>&lt;P&gt;Trying to extract some data from a hybrid log where the log format is &amp;lt;Syslog header&amp;gt; &amp;lt;JSON Data&amp;gt;.&lt;/P&gt;&lt;P&gt;Have had success with extracting via spath and regex in search but want to do this before ingestions, so trying to complete this on a heavy forwarder by using&amp;nbsp;&lt;SPAN&gt;&amp;nbsp;props.conf and transforms.conf to complete the field extractions. Got this working to a degree but it only functions partly fuctions with some logs the the nested logs in msg are not full extracted and some logs don't extract anything for JSON.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;An example of one of many log types but all in this format&amp;nbsp;&amp;lt;Syslog header&amp;gt; &amp;lt;JSON Data&amp;gt;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;Aug 3 04:45:01 server.name.local program {"_program":{"uid":"0","type":"newData","subj":"unconfined","pid":"4864","msg":"ab=new:session_create creator=sam,sam,echo,ba_permit,ba_umask,ba_limits acct=\"su\" exe=\"/usr/sbin/vi\" hostname=? addr=? terminal=vi res=success","auid":"0","UID":"user1","AUID":"user1"}}&lt;/LI-CODE&gt;&lt;P&gt;&lt;SPAN&gt;creator=sam&lt;BR /&gt;stopping at first comma&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;acct=\&lt;BR /&gt;exe=\&lt;BR /&gt;Doesn't collect the data after \&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;And the following 2 logs had no field extractions from the json&lt;BR /&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;Aug 3 04:31:01 server.name.local program  {"_program":{"uid":"0","type":"SYSCALL","tty":"pts1","syscall":"725","su":"0","passedsuccess":"yes","pass":"unconfined","id":"0","sess":"3417","pid":"4568732","msg":"utime(1754195461.112:457):","items":"2","gid":"0","fsuid":"0","fsgid":"0","exit":"3","exe":"/usr/bin/vi","euid":"0","egid":"0","comm":"vi","auid":"345742342","arch":"c000003e","a3":"1b6","a2":"241","a1":"615295291b60","a0":"ffffff9c","UID":"user1","SYSCALL":"openmat","SUID":"user1","SGID":"user1","GID":"user1","FSUID":"user1","FSGID":"user1","EUID":"user1","EGID":"user1","AUID":"user1","ARCH":"x86_64"}}


Aug 3 04:10:01 server.name.local program  {"_program":{"type":"data","data":"/usr/bin/vi","msg":"utime(1754194201.112:457):"}}&lt;/LI-CODE&gt;&lt;P&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Thanks in advance for any help&lt;/P&gt;</description>
      <pubDate>Tue, 05 Aug 2025 00:19:10 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Trying-to-extract-fields-from-a-hybrid-log-Syslog-JSON-nested/m-p/751019#M119279</guid>
      <dc:creator>atme</dc:creator>
      <dc:date>2025-08-05T00:19:10Z</dc:date>
    </item>
    <item>
      <title>Re: Trying to extract fields from a hybrid log Syslog/JSON nested logs</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Trying-to-extract-fields-from-a-hybrid-log-Syslog-JSON-nested/m-p/751354#M119326</link>
      <description>&lt;P&gt;Hey&amp;nbsp;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/311023"&gt;@atme&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;It would be complex if you try to extract all of these fields at index time. The computational load would also increase. I would prefer going for search time extractions. However, if you still wish to extract fields at index time,&amp;nbsp; it would be great if you can share what you've configured till now in props and transforms. Since the _raw event varies in number of fields also,&amp;nbsp; we need to define a regex based pattern or key-value pair to extract the fields.&lt;/P&gt;&lt;P&gt;Thanks,&lt;BR /&gt;Tejas.&lt;/P&gt;</description>
      <pubDate>Mon, 11 Aug 2025 11:19:00 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Trying-to-extract-fields-from-a-hybrid-log-Syslog-JSON-nested/m-p/751354#M119326</guid>
      <dc:creator>tej57</dc:creator>
      <dc:date>2025-08-11T11:19:00Z</dc:date>
    </item>
  </channel>
</rss>

