<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Do not index file based on content in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Do-not-index-file-based-on-content/m-p/543972#M90791</link>
    <description>&lt;P&gt;Hi Vardhan,&lt;/P&gt;&lt;P&gt;Thanks for the hint, but unfortunately this only drops the line with the Timstamp. I need to discard the entire file in the case where the file contains this 'flag' timestamp.&lt;/P&gt;&lt;P&gt;I'm thinking it may need to be scripted input, but I'm wondering if there are any other clever tricks.&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;R.&lt;/P&gt;</description>
    <pubDate>Tue, 16 Mar 2021 10:36:29 GMT</pubDate>
    <dc:creator>Random_Walk</dc:creator>
    <dc:date>2021-03-16T10:36:29Z</dc:date>
    <item>
      <title>Do not index file based on content</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Do-not-index-file-based-on-content/m-p/543968#M90788</link>
      <description>&lt;P&gt;Greetings All,&lt;/P&gt;&lt;P&gt;I'm indexing a bunch of metrics files written every 10 minutes. Just after midnight I get a file containing the same format metrics, but each value is the sum for the previous day. This totals file I want to ignore (It messes up all sorts of use cases of the metric data). The only way to reliably identify a totals file is that the third line holds a timestamp, and this will be all zero. Any other file will have a normal ISO timestamp in this point&lt;/P&gt;&lt;P&gt;REGEX = ^TimeStamp\s+:\s+0000-00-00\s00.00.00.000&lt;/P&gt;&lt;P&gt;Is there a way to block that file's ingestion based on the content of a single line?&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;R.&lt;/P&gt;</description>
      <pubDate>Tue, 16 Mar 2021 09:45:09 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Do-not-index-file-based-on-content/m-p/543968#M90788</guid>
      <dc:creator>Random_Walk</dc:creator>
      <dc:date>2021-03-16T09:45:09Z</dc:date>
    </item>
    <item>
      <title>Re: Do not index file based on content</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Do-not-index-file-based-on-content/m-p/543969#M90789</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;you can drop the events before indexing with the help of below settings.&lt;/P&gt;&lt;P&gt;props.conf&lt;/P&gt;&lt;P&gt;TRANSFORMS-information = eventsDrop&lt;/P&gt;&lt;P&gt;Transforms.conf&lt;/P&gt;&lt;P&gt;[eventsDrop]&lt;BR /&gt;REGEX =&lt;SPAN&gt;^TimeStamp\s+:\s+0000-00-00\s00.00.00.000&lt;/SPAN&gt;&lt;BR /&gt;DEST_KEY = queue&lt;BR /&gt;FORMAT = nullQueue&lt;/P&gt;</description>
      <pubDate>Tue, 16 Mar 2021 09:53:44 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Do-not-index-file-based-on-content/m-p/543969#M90789</guid>
      <dc:creator>Vardhan</dc:creator>
      <dc:date>2021-03-16T09:53:44Z</dc:date>
    </item>
    <item>
      <title>Re: Do not index file based on content</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Do-not-index-file-based-on-content/m-p/543972#M90791</link>
      <description>&lt;P&gt;Hi Vardhan,&lt;/P&gt;&lt;P&gt;Thanks for the hint, but unfortunately this only drops the line with the Timstamp. I need to discard the entire file in the case where the file contains this 'flag' timestamp.&lt;/P&gt;&lt;P&gt;I'm thinking it may need to be scripted input, but I'm wondering if there are any other clever tricks.&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;R.&lt;/P&gt;</description>
      <pubDate>Tue, 16 Mar 2021 10:36:29 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Do-not-index-file-based-on-content/m-p/543972#M90791</guid>
      <dc:creator>Random_Walk</dc:creator>
      <dc:date>2021-03-16T10:36:29Z</dc:date>
    </item>
    <item>
      <title>Re: Do not index file based on content</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Do-not-index-file-based-on-content/m-p/543976#M90792</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/228386"&gt;@Random_Walk&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;Then use a script to write those kinds of events in a separate file. And if your are ingesting these files through UF then use Blacklist option to ignore the files without reading.&lt;/P&gt;</description>
      <pubDate>Tue, 16 Mar 2021 11:10:02 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Do-not-index-file-based-on-content/m-p/543976#M90792</guid>
      <dc:creator>Vardhan</dc:creator>
      <dc:date>2021-03-16T11:10:02Z</dc:date>
    </item>
  </channel>
</rss>

