<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to filter or extract fields  before indexing time in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/How-to-filter-or-extract-fields-before-indexing-time/m-p/127138#M26159</link>
    <description>&lt;P&gt;THANK YOU!&lt;BR /&gt;
I have read some of related answers, and thought SEDCMD was just able to drop the content after a tag, nullQueue was "at event level" &lt;BR /&gt;
Than i have to dig into "sed" tool. is there a awkcmd?&lt;/P&gt;</description>
    <pubDate>Mon, 04 Nov 2013 09:07:23 GMT</pubDate>
    <dc:creator>crazyeva</dc:creator>
    <dc:date>2013-11-04T09:07:23Z</dc:date>
    <item>
      <title>How to filter or extract fields  before indexing time</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-filter-or-extract-fields-before-indexing-time/m-p/127135#M26156</link>
      <description>&lt;P&gt;I have got very large orginal data, with events strictly formatted as "f1,f2,f3,..."&lt;BR /&gt;
most of the fields are meaningless: "0,f2,0,0,0,0,f7,0,f9,..." i only want  f2,f7,f9&lt;BR /&gt;
Can I filter fields before indexing,  drop unnecessary data, avoid reach license limit?&lt;/P&gt;</description>
      <pubDate>Mon, 04 Nov 2013 04:32:58 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-filter-or-extract-fields-before-indexing-time/m-p/127135#M26156</guid>
      <dc:creator>crazyeva</dc:creator>
      <dc:date>2013-11-04T04:32:58Z</dc:date>
    </item>
    <item>
      <title>Re: How to filter or extract fields  before indexing time</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-filter-or-extract-fields-before-indexing-time/m-p/127136#M26157</link>
      <description>&lt;P&gt;Yes, as long as you can formulate a regular expression that defines how Splunk should include or exclude data. You can either to nullQueue routing (= drop events altogether) or &lt;/P&gt;

&lt;P&gt;Docs on how to use each: nullQueue routing - &lt;A href="http://docs.splunk.com/Documentation/Splunk/6.0/Forwarding/Routeandfilterdatad"&gt;http://docs.splunk.com/Documentation/Splunk/6.0/Forwarding/Routeandfilterdatad&lt;/A&gt;&lt;BR /&gt;
SEDCMD - &lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/admin/Propsconf"&gt;http://docs.splunk.com/Documentation/Splunk/latest/admin/Propsconf&lt;/A&gt; - see specification on SEDCMD at the middle of the page.&lt;/P&gt;</description>
      <pubDate>Mon, 04 Nov 2013 07:43:38 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-filter-or-extract-fields-before-indexing-time/m-p/127136#M26157</guid>
      <dc:creator>Ayn</dc:creator>
      <dc:date>2013-11-04T07:43:38Z</dc:date>
    </item>
    <item>
      <title>Re: How to filter or extract fields  before indexing time</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-filter-or-extract-fields-before-indexing-time/m-p/127137#M26158</link>
      <description>&lt;P&gt;source file&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;2013-11-01 11:11:11 f1 f2 f3 f4 f5 f6
2013-11-02 13:15:11 d1 d2 d3 d4 d5 d6
2013-11-02 14:23:22 e1 e2 e3 e4 e5 e6
2013-11-03 12:23:21 g1 g2 g3 g4 g5 g6
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;props.conf&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[your_sourcetype]
TRANSFORMS-blah = keep235
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;transforms.conf&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[keep235]
DEST_KEY = _raw
REGEX = ^(\S+)\s+(\S+)\s+(\S+)\s+(\S+)\s+(\S+)\s+(\S+)\s+(\S+)\s+(\S+)
FORMAT = $1 $2 $4 $5 $7
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;Result:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;03/11/2013 12:23:21.000  2013-11-03 12:23:21 g2 g3 g5
02/11/2013 14:23:22.000  2013-11-02 14:23:22 e2 e3 e5

^Splunk parsed timestamp  ^event timestamp     ^less columns/fields
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;Hope this helps,&lt;/P&gt;

&lt;P&gt;/K&lt;/P&gt;</description>
      <pubDate>Mon, 04 Nov 2013 08:07:12 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-filter-or-extract-fields-before-indexing-time/m-p/127137#M26158</guid>
      <dc:creator>kristian_kolb</dc:creator>
      <dc:date>2013-11-04T08:07:12Z</dc:date>
    </item>
    <item>
      <title>Re: How to filter or extract fields  before indexing time</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-filter-or-extract-fields-before-indexing-time/m-p/127138#M26159</link>
      <description>&lt;P&gt;THANK YOU!&lt;BR /&gt;
I have read some of related answers, and thought SEDCMD was just able to drop the content after a tag, nullQueue was "at event level" &lt;BR /&gt;
Than i have to dig into "sed" tool. is there a awkcmd?&lt;/P&gt;</description>
      <pubDate>Mon, 04 Nov 2013 09:07:23 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-filter-or-extract-fields-before-indexing-time/m-p/127138#M26159</guid>
      <dc:creator>crazyeva</dc:creator>
      <dc:date>2013-11-04T09:07:23Z</dc:date>
    </item>
    <item>
      <title>Re: How to filter or extract fields  before indexing time</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-filter-or-extract-fields-before-indexing-time/m-p/127139#M26160</link>
      <description>&lt;P&gt;Thank you, that's sweet!&lt;/P&gt;</description>
      <pubDate>Mon, 04 Nov 2013 09:07:47 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-filter-or-extract-fields-before-indexing-time/m-p/127139#M26160</guid>
      <dc:creator>crazyeva</dc:creator>
      <dc:date>2013-11-04T09:07:47Z</dc:date>
    </item>
  </channel>
</rss>

