<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: search-time versus index-time field extractions in Splunk Search</title>
    <link>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/525332#M148228</link>
    <description>&lt;P&gt;I don't. I think it's a good architectural design to be able to determine which context must implement indexed fields and which context must resort to the default search-time field extractions.&lt;/P&gt;&lt;P&gt;For example, TAs that make logs CIM compliant like TAs for network devices (e.g. Aruba or Checkpoint), they extract fields at search time and are best to remain that way because of the sheer size of network devices logs. Also, they are most likely going to be accelerated anyway in a sort of "Network Traffic" `datamodel`.&lt;/P&gt;&lt;P&gt;Another example is for specific contexts that could help your unique-to-your-organisation use cases. I used to work in a project where we had to implement index-time field extractions so we can pull them quickly in `tstats` so we can display the data quickly on a dashboard that's viewed by a high-profile boss every Monday. He gets frustrated if the dashboard loads very slowly and we don't want to upset the boss. We could've used accel-datamodel but we decided to do it at indexing phase instead.&lt;/P&gt;</description>
    <pubDate>Mon, 19 Oct 2020 11:40:59 GMT</pubDate>
    <dc:creator>morethanyell</dc:creator>
    <dc:date>2020-10-19T11:40:59Z</dc:date>
    <item>
      <title>search-time versus index-time field extractions</title>
      <link>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/33892#M7235</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;I've recently noticed the recommendations the move to search-time versus index-time field extractions.  I'm trying to get an idea of exactly how much of the configuration that we've got in place doesn't follow this paradigm.  We especially have a lot of DELIMS/FIELDS-based field extractions, and I'm not clear on where we stand with these, especially since there's no obvious way to configure them in the GUI.&lt;/P&gt;

&lt;P&gt;I'm assuming when an extract says 'uses transform' as opposed to 'inline' in the GUI then it is an index-time field extraction?  &lt;STRONG&gt;Is this the case or am I oversimplifying the distinction?&lt;/STRONG&gt;&lt;/P&gt;

&lt;P&gt;I've looked over the documentation on search-time indexing and &lt;A href="http://www.splunk.com/base/Documentation/latest/Knowledge/Addfieldsatsearchtime" rel="nofollow"&gt;http://www.splunk.com/base/Documentation/latest/Knowledge/Addfieldsatsearchtime&lt;/A&gt; says:&lt;/P&gt;

&lt;BLOCKQUOTE&gt;
  &lt;P&gt;You can also create and maintain field
  extractions by making edits directly
  to props.conf and transforms.conf. If
  this sounds like your kind of
  thing--and it may be, especially if
  you are an old-timey Splunk user, or
  just prefer working at the
  configuration file level of things,
  you can find all the details in
  "Create and maintain search-time
  extractions through configuration
  files," in this manual.&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;

&lt;P&gt;This being said, other documentation at &lt;A href="http://www.splunk.com/base/Splexicon:Transform" rel="nofollow"&gt;http://www.splunk.com/base/Splexicon:Transform&lt;/A&gt; says:&lt;/P&gt;

&lt;BLOCKQUOTE&gt;
  &lt;P&gt;Transforms are always involved in the
  setup of custom index-time field
  extractions.&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;

&lt;P&gt;Can somebody please help us clear this up? Thanks!&lt;/P&gt;

&lt;P&gt;-Frank&lt;/P&gt;</description>
      <pubDate>Fri, 20 Aug 2010 01:45:40 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/33892#M7235</guid>
      <dc:creator>fervin</dc:creator>
      <dc:date>2010-08-20T01:45:40Z</dc:date>
    </item>
    <item>
      <title>Re: search-time versus index-time field extractions</title>
      <link>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/33893#M7236</link>
      <description>&lt;P&gt;In general, we recommend search-time extractions rather than index-time extractions. There are relatively few cases where index-time extractions are better, and they come at the cost of brittleness of configuration and an increase in index size (which in turn makes searches slower).&lt;/P&gt;

&lt;P&gt;The distinction in the UI of "uses transform" vs. inline doesn't have anything to do with search-time vs index-time. It is referring to where the regex itself is stored: in an &lt;CODE&gt;EXTRACT-&lt;/CODE&gt; line in props.conf (for inline) as opposed to in a &lt;CODE&gt;REPORT-&lt;/CODE&gt; line that refers to a stanza in transforms.conf (for uses transform).&lt;/P&gt;

&lt;P&gt;Index time extractions are also set in props.conf and transforms.conf by means of the &lt;CODE&gt;TRANSFORM-&lt;/CODE&gt; line. Again, they should rarely be used. They are appropriate when the heuristic of search for the value of the field fails (either because the value is ubiquitous outside of cases where the field equals the value, or because the value isn't an indexed token) or when you commonly search for &lt;CODE&gt;field!=value&lt;/CODE&gt; without other terms to constrain the search.&lt;/P&gt;</description>
      <pubDate>Fri, 20 Aug 2010 01:53:27 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/33893#M7236</guid>
      <dc:creator>Stephen_Sorkin</dc:creator>
      <dc:date>2010-08-20T01:53:27Z</dc:date>
    </item>
    <item>
      <title>Re: search-time versus index-time field extractions</title>
      <link>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/33894#M7237</link>
      <description>&lt;P&gt;Thanks, that's exactly what I was hoping to hear.  Now, if we could just get an easy way to configure DELIMS/FIELDS in the UI, I'd be even happier...&lt;/P&gt;</description>
      <pubDate>Fri, 20 Aug 2010 02:31:07 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/33894#M7237</guid>
      <dc:creator>fervin</dc:creator>
      <dc:date>2010-08-20T02:31:07Z</dc:date>
    </item>
    <item>
      <title>Re: search-time versus index-time field extractions</title>
      <link>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/33895#M7238</link>
      <description>&lt;P&gt;Here is a related discussion (which highlights some additional use-cases for using indexed fields)&lt;/P&gt;

&lt;UL&gt;
&lt;LI&gt;&lt;A href="http://answers.splunk.com/questions/842/do-search-time-fields-have-performance-considerations/847#847" rel="nofollow"&gt;Do search-time fields have performance considerations?&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;</description>
      <pubDate>Fri, 20 Aug 2010 03:17:12 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/33895#M7238</guid>
      <dc:creator>Lowell</dc:creator>
      <dc:date>2010-08-20T03:17:12Z</dc:date>
    </item>
    <item>
      <title>Re: search-time versus index-time field extractions</title>
      <link>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/33896#M7239</link>
      <description>&lt;P&gt;I will clarify here that DELIMS/FIELDS extraction are search-time extractions, and thus of the preferred type already.&lt;/P&gt;</description>
      <pubDate>Fri, 20 Aug 2010 05:24:04 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/33896#M7239</guid>
      <dc:creator>gkanapathy</dc:creator>
      <dc:date>2010-08-20T05:24:04Z</dc:date>
    </item>
    <item>
      <title>Re: search-time versus index-time field extractions</title>
      <link>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/33897#M7240</link>
      <description>&lt;P&gt;Yup, still waiting on the DELIMS/FIELDS UI thing in 2016.  And now with Splunk Cloud that's become an even bigger pain because of the lack of access to the .conf files.  ;-(&lt;/P&gt;</description>
      <pubDate>Fri, 08 Jan 2016 16:16:48 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/33897#M7240</guid>
      <dc:creator>Lowell</dc:creator>
      <dc:date>2016-01-08T16:16:48Z</dc:date>
    </item>
    <item>
      <title>Re: search-time versus index-time field extractions</title>
      <link>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/33898#M7241</link>
      <description>&lt;P&gt;Does this "recommend" still stand nearly 8 years later?&lt;/P&gt;</description>
      <pubDate>Thu, 12 Jul 2018 20:20:40 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/33898#M7241</guid>
      <dc:creator>wmyersas</dc:creator>
      <dc:date>2018-07-12T20:20:40Z</dc:date>
    </item>
    <item>
      <title>Re: search-time versus index-time field extractions</title>
      <link>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/33899#M7242</link>
      <description>&lt;P&gt;There are 2 different &lt;CODE&gt;transform&lt;/CODE&gt; things.&lt;/P&gt;

&lt;P&gt;One is &lt;CODE&gt;transforms.conf&lt;/CODE&gt; which contains &lt;CODE&gt;transform&lt;/CODE&gt; definitions and the word &lt;CODE&gt;transform&lt;/CODE&gt; only occurs in the file name, not in the contents of the file.  That is one thing.&lt;/P&gt;

&lt;P&gt;Then there is the &lt;CODE&gt;TRANFORMS-&lt;/CODE&gt; definition inside of &lt;CODE&gt;props.conf&lt;/CODE&gt; that is part of the &lt;CODE&gt;REPORT-&lt;/CODE&gt;, &lt;CODE&gt;EXTRACT-&lt;/CODE&gt;, and &lt;CODE&gt;TRANSFORMS-&lt;/CODE&gt; triad.  The first two are &lt;CODE&gt;search-time&lt;/CODE&gt; things that are really the same thing (just that &lt;CODE&gt;REPORT-&lt;/CODE&gt; definitions will reference &lt;CODE&gt;transforms&lt;/CODE&gt; defined in &lt;CODE&gt;transforms.conf&lt;/CODE&gt; whereas &lt;CODE&gt;EXTRACT-&lt;/CODE&gt; definitions are inlined completely in &lt;CODE&gt;props.conf&lt;/CODE&gt;).  The last, &lt;CODE&gt;TRANSFORMS-&lt;/CODE&gt; is how &lt;CODE&gt;index-time&lt;/CODE&gt; extractions are configured.&lt;/P&gt;

&lt;P&gt;I agree that this is a bit confusing.&lt;/P&gt;</description>
      <pubDate>Thu, 12 Jul 2018 21:25:31 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/33899#M7242</guid>
      <dc:creator>woodcock</dc:creator>
      <dc:date>2018-07-12T21:25:31Z</dc:date>
    </item>
    <item>
      <title>Re: search-time versus index-time field extractions</title>
      <link>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/33900#M7243</link>
      <description>&lt;P&gt;@wmyersas I think it's much more recommended now that Splunk is moving to "compute" rather than daily volume type of billing customers. Search time extractions will defo use more compute to load into RAM rather than displaying fields that have already been burned onto the disk.&lt;/P&gt;</description>
      <pubDate>Thu, 09 Jan 2020 05:17:56 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/33900#M7243</guid>
      <dc:creator>morethanyell</dc:creator>
      <dc:date>2020-01-09T05:17:56Z</dc:date>
    </item>
    <item>
      <title>Re: search-time versus index-time field extractions</title>
      <link>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/525274#M148207</link>
      <description>&lt;P&gt;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/115107"&gt;@morethanyell&lt;/a&gt;&amp;nbsp;&amp;gt;&amp;gt;&amp;gt;&lt;SPAN&gt;Search time extractions will defo use more compute to load into RAM rather than displaying fields that have already been burned onto the disk&amp;lt;&amp;lt;&amp;lt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;so, do you suggest index-time field extraction, than the search-time field extraction?&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 19 Oct 2020 01:26:03 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/525274#M148207</guid>
      <dc:creator>inventsekar</dc:creator>
      <dc:date>2020-10-19T01:26:03Z</dc:date>
    </item>
    <item>
      <title>Re: search-time versus index-time field extractions</title>
      <link>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/525332#M148228</link>
      <description>&lt;P&gt;I don't. I think it's a good architectural design to be able to determine which context must implement indexed fields and which context must resort to the default search-time field extractions.&lt;/P&gt;&lt;P&gt;For example, TAs that make logs CIM compliant like TAs for network devices (e.g. Aruba or Checkpoint), they extract fields at search time and are best to remain that way because of the sheer size of network devices logs. Also, they are most likely going to be accelerated anyway in a sort of "Network Traffic" `datamodel`.&lt;/P&gt;&lt;P&gt;Another example is for specific contexts that could help your unique-to-your-organisation use cases. I used to work in a project where we had to implement index-time field extractions so we can pull them quickly in `tstats` so we can display the data quickly on a dashboard that's viewed by a high-profile boss every Monday. He gets frustrated if the dashboard loads very slowly and we don't want to upset the boss. We could've used accel-datamodel but we decided to do it at indexing phase instead.&lt;/P&gt;</description>
      <pubDate>Mon, 19 Oct 2020 11:40:59 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/search-time-versus-index-time-field-extractions/m-p/525332#M148228</guid>
      <dc:creator>morethanyell</dc:creator>
      <dc:date>2020-10-19T11:40:59Z</dc:date>
    </item>
  </channel>
</rss>

