<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Field Extraction for different types of data in Splunk Enterprise Security</title>
    <link>https://community.splunk.com/t5/Splunk-Enterprise-Security/Field-Extraction-for-different-types-of-data/m-p/489347#M8202</link>
    <description>&lt;OL&gt;
&lt;LI&gt;&lt;P&gt;Field extraction settings for structured data must be configured on the forwarder.&lt;/P&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;P&gt;If structured data has fields then those are automatically extracted. If not then &lt;EM&gt;FIELD_NAMES&lt;/EM&gt; attribute can be configured in props.conf to set field names.&lt;/P&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;P&gt;For structured data all the fields in data are extracted during index time only.&lt;/P&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;P&gt;For unstructured data it's better to extract data during search time as Splunk doc says:&lt;/P&gt;&lt;/LI&gt;
&lt;/OL&gt;

&lt;P&gt;####Index-time custom field extraction can degrade performance at both index time and search time. When you add to the number of fields extracted during indexing, the indexing process slows. Later, searches on the index are also slower, because the index has been enlarged by the additional fields, and a search on a larger index takes longer. You can avoid such performance issues by instead relying on search-time field extraction. &lt;/P&gt;</description>
    <pubDate>Sun, 08 Mar 2020 14:41:14 GMT</pubDate>
    <dc:creator>manjunathmeti</dc:creator>
    <dc:date>2020-03-08T14:41:14Z</dc:date>
    <item>
      <title>Field Extraction for different types of data</title>
      <link>https://community.splunk.com/t5/Splunk-Enterprise-Security/Field-Extraction-for-different-types-of-data/m-p/489345#M8200</link>
      <description>&lt;P&gt;Hi Splunkers,&lt;/P&gt;

&lt;P&gt;Splunk suggests to extract fields at forwarders for structured data, why? and what if i have field names in the log / no filed field names in the log?&lt;/P&gt;

&lt;P&gt;I have a confusion that whether my license usage get affected by structured field extraction at index time/ at forwarders.&lt;BR /&gt;
I understand that splunk license counts against what you index , so if i do indexed field extractions then those field value pairs will be added to _raw and cause license usage, is that correct?&lt;/P&gt;

&lt;P&gt;For unstructured data Splunk suggests us to do extraction at search time?.&lt;/P&gt;

&lt;P&gt;I'm clear with these but sometimes not,.&lt;/P&gt;

&lt;P&gt;any advises will be appreciated..&lt;/P&gt;

&lt;P&gt;Pramodh B&lt;/P&gt;</description>
      <pubDate>Sun, 08 Mar 2020 11:29:53 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Enterprise-Security/Field-Extraction-for-different-types-of-data/m-p/489345#M8200</guid>
      <dc:creator>PramodhKumar</dc:creator>
      <dc:date>2020-03-08T11:29:53Z</dc:date>
    </item>
    <item>
      <title>Re: Field Extraction for different types of data</title>
      <link>https://community.splunk.com/t5/Splunk-Enterprise-Security/Field-Extraction-for-different-types-of-data/m-p/489346#M8201</link>
      <description>&lt;P&gt;It has nothing to do with license because you are metered for that with length of &lt;CODE&gt;_raw&lt;/CODE&gt; in bytes.&lt;/P&gt;

&lt;P&gt;First, that guidance is overly-smplistic to the point of being fallacious; please post a followup comment here with the URL where you read that so that I can submit some feedback.&lt;/P&gt;

&lt;P&gt;The MAIN reason that this advice is wrong is because it will lead people to the very bad and generally &lt;EM&gt;WRONG&lt;/EM&gt; decision to use &lt;CODE&gt;Heavy Forwarders&lt;/CODE&gt; (which can do every kind of &lt;CODE&gt;index-time&lt;/CODE&gt; field extractions) instead of &lt;CODE&gt;Universal Forwarders&lt;/CODE&gt;: &lt;A href="https://www.splunk.com/en_us/blog/tips-and-tricks/universal-or-heavy-that-is-the-question.html"&gt;https://www.splunk.com/en_us/blog/tips-and-tricks/universal-or-heavy-that-is-the-question.html&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;Another reason it is wrong is because &lt;CODE&gt;index-time&lt;/CODE&gt; field extractions consume a significant amount of disk space, often for no actual benefit (nobody is &lt;CODE&gt;tstats&lt;/CODE&gt;ing them).&lt;/P&gt;

&lt;P&gt;Also, the only sensible way to do &lt;CODE&gt;index-time&lt;/CODE&gt; field extractions on a &lt;CODE&gt;Universal Forwarder&lt;/CODE&gt; is with &lt;CODE&gt;INDEXED_EXTRACTIONS&lt;/CODE&gt; which should generally be avoided because it is "all or none".&lt;/P&gt;

&lt;P&gt;The only shred of this advice that is true is the universal distributed architecture rule that, all other considerations being equal (note my previously voiced inequalities above) as much as possible should be done at the leaves of the tree.&lt;/P&gt;</description>
      <pubDate>Sun, 08 Mar 2020 14:26:04 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Enterprise-Security/Field-Extraction-for-different-types-of-data/m-p/489346#M8201</guid>
      <dc:creator>woodcock</dc:creator>
      <dc:date>2020-03-08T14:26:04Z</dc:date>
    </item>
    <item>
      <title>Re: Field Extraction for different types of data</title>
      <link>https://community.splunk.com/t5/Splunk-Enterprise-Security/Field-Extraction-for-different-types-of-data/m-p/489347#M8202</link>
      <description>&lt;OL&gt;
&lt;LI&gt;&lt;P&gt;Field extraction settings for structured data must be configured on the forwarder.&lt;/P&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;P&gt;If structured data has fields then those are automatically extracted. If not then &lt;EM&gt;FIELD_NAMES&lt;/EM&gt; attribute can be configured in props.conf to set field names.&lt;/P&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;P&gt;For structured data all the fields in data are extracted during index time only.&lt;/P&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;P&gt;For unstructured data it's better to extract data during search time as Splunk doc says:&lt;/P&gt;&lt;/LI&gt;
&lt;/OL&gt;

&lt;P&gt;####Index-time custom field extraction can degrade performance at both index time and search time. When you add to the number of fields extracted during indexing, the indexing process slows. Later, searches on the index are also slower, because the index has been enlarged by the additional fields, and a search on a larger index takes longer. You can avoid such performance issues by instead relying on search-time field extraction. &lt;/P&gt;</description>
      <pubDate>Sun, 08 Mar 2020 14:41:14 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Enterprise-Security/Field-Extraction-for-different-types-of-data/m-p/489347#M8202</guid>
      <dc:creator>manjunathmeti</dc:creator>
      <dc:date>2020-03-08T14:41:14Z</dc:date>
    </item>
    <item>
      <title>Re: Field Extraction for different types of data</title>
      <link>https://community.splunk.com/t5/Splunk-Enterprise-Security/Field-Extraction-for-different-types-of-data/m-p/489348#M8203</link>
      <description>&lt;P&gt;Thank you, &lt;/P&gt;

&lt;P&gt;All your suggestions are good, I really appreciate your effort.&lt;/P&gt;

&lt;P&gt;My question is what does INDEXED_EXTRACTIONS do at UF, lets say I have a csv file having 20 lines, no field names.&lt;BR /&gt;
I did INDEXED_EXTRACTIONS at UF now what exactly my forwarder sends to Indexer? That's all...&lt;/P&gt;

&lt;P&gt;My doubt is if UF forwards _raw + new field vales then while passing throgh index pipeline, does it counts all?&lt;/P&gt;

&lt;P&gt;There are 4 pipelines - Parsing-Mergine-Typing-Index , license metered at last pipeline, is that correct when data being written to disk?&lt;/P&gt;

&lt;P&gt;Can you please elaborate on precedence of props attributes vs license meter.&lt;/P&gt;

&lt;P&gt;Pramodh&lt;/P&gt;</description>
      <pubDate>Wed, 30 Sep 2020 04:31:40 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Enterprise-Security/Field-Extraction-for-different-types-of-data/m-p/489348#M8203</guid>
      <dc:creator>PramodhKumar</dc:creator>
      <dc:date>2020-09-30T04:31:40Z</dc:date>
    </item>
    <item>
      <title>Re: Field Extraction for different types of data</title>
      <link>https://community.splunk.com/t5/Splunk-Enterprise-Security/Field-Extraction-for-different-types-of-data/m-p/489349#M8204</link>
      <description>&lt;P&gt;Thank you,&lt;/P&gt;

&lt;P&gt;I appreciate your efforts here.&lt;/P&gt;

&lt;OL&gt;
&lt;LI&gt;why splunk suggests to extarct fields at search time, is this same for structured/unstructured?&lt;/LI&gt;
&lt;LI&gt;What if I do extraction during index-time for unstructured data? license usage?&lt;/LI&gt;
&lt;LI&gt;I didn't understand, this in comparison to your 1st point.&lt;/LI&gt;
&lt;LI&gt;This is clear.&lt;/LI&gt;
&lt;/OL&gt;

&lt;P&gt;Pramodh&lt;/P&gt;</description>
      <pubDate>Sun, 08 Mar 2020 15:42:53 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Enterprise-Security/Field-Extraction-for-different-types-of-data/m-p/489349#M8204</guid>
      <dc:creator>PramodhKumar</dc:creator>
      <dc:date>2020-03-08T15:42:53Z</dc:date>
    </item>
    <item>
      <title>Re: Field Extraction for different types of data</title>
      <link>https://community.splunk.com/t5/Splunk-Enterprise-Security/Field-Extraction-for-different-types-of-data/m-p/489350#M8205</link>
      <description>&lt;OL&gt;
&lt;LI&gt;Refer point 4. This is same for both structured and unstructured.&lt;/LI&gt;
&lt;LI&gt;License is measured based on the amount of raw data that the indexer ingests into its indexing pipeline. Basically it is counted against _raw data.&lt;/LI&gt;
&lt;LI&gt;Actually parsing , merging and typing for structured data happens in forwarder only and indexing happens in indexer server.&lt;/LI&gt;
&lt;/OL&gt;</description>
      <pubDate>Sun, 08 Mar 2020 16:16:51 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Enterprise-Security/Field-Extraction-for-different-types-of-data/m-p/489350#M8205</guid>
      <dc:creator>manjunathmeti</dc:creator>
      <dc:date>2020-03-08T16:16:51Z</dc:date>
    </item>
    <item>
      <title>Re: Field Extraction for different types of data</title>
      <link>https://community.splunk.com/t5/Splunk-Enterprise-Security/Field-Extraction-for-different-types-of-data/m-p/489351#M8206</link>
      <description>&lt;P&gt;There is no impact on license, only disk and CPU.&lt;/P&gt;</description>
      <pubDate>Sun, 08 Mar 2020 17:14:38 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Enterprise-Security/Field-Extraction-for-different-types-of-data/m-p/489351#M8206</guid>
      <dc:creator>woodcock</dc:creator>
      <dc:date>2020-03-08T17:14:38Z</dc:date>
    </item>
    <item>
      <title>Re: Field Extraction for different types of data</title>
      <link>https://community.splunk.com/t5/Splunk-Enterprise-Security/Field-Extraction-for-different-types-of-data/m-p/489352#M8207</link>
      <description>&lt;P&gt;what is the splunk DB connetion?&lt;/P&gt;</description>
      <pubDate>Sun, 08 Mar 2020 17:47:17 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Enterprise-Security/Field-Extraction-for-different-types-of-data/m-p/489352#M8207</guid>
      <dc:creator>valishaik</dc:creator>
      <dc:date>2020-03-08T17:47:17Z</dc:date>
    </item>
  </channel>
</rss>

