<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Whitelisting/Blacklisting files inside tgz files in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Whitelisting-Blacklisting-files-inside-tgz-files/m-p/23857#M3757</link>
    <description>&lt;P&gt;Hi,&lt;BR /&gt;
Did you ever find a way to do this? &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
    <pubDate>Tue, 09 Jul 2013 06:25:02 GMT</pubDate>
    <dc:creator>gelica</dc:creator>
    <dc:date>2013-07-09T06:25:02Z</dc:date>
    <item>
      <title>Whitelisting/Blacklisting files inside tgz files</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Whitelisting-Blacklisting-files-inside-tgz-files/m-p/23853#M3753</link>
      <description>&lt;P&gt;I have a bunch of .tgz files that are being regularly uploaded to a directory and I'd like to only index a subset of the files inside the archive files.&lt;/P&gt;

&lt;P&gt;Example archive files:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;  tar tzvf archive.1.2.tgz
     -rw-r--r--  0 wdh    wdh       948 Jan 10 09:24 app1.log
     -rw-r--r--  0 wdh    wdh       414 Jan 10 09:24 foo.log
     -rw-r--r--  0 wdh    wdh       770 Jan 10 09:24 splat.log

  tar tzvf archive.5.8.tgz 
     -rw-r--r--  0 wdh    wdh       148 Jan 10 09:24 app3.log
     -rw-r--r--  0 wdh    wdh       216 Jan 10 09:24 bad.log
     -rw-r--r--  0 wdh    wdh       789 Jan 10 09:24 splat.log
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;From the example above, I'd like only the "splat.log" file inside archive.*.tgz to be indexed.  It appears to me that the whitelist/blacklist settings for an inputs.conf stanza only apply to the archive file name, not to files inside the archive.  &lt;/P&gt;

&lt;P&gt;While I know I can have some external batch process run and pull the 'splat.log' files out, is there any way I can use whitelist/blacklist, or some other Splunk configuration mechanism to filter based on the internal filenames inside the archive files?&lt;/P&gt;</description>
      <pubDate>Tue, 11 Jan 2011 04:34:17 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Whitelisting-Blacklisting-files-inside-tgz-files/m-p/23853#M3753</guid>
      <dc:creator>wdhathaway</dc:creator>
      <dc:date>2011-01-11T04:34:17Z</dc:date>
    </item>
    <item>
      <title>Re: Whitelisting/Blacklisting files inside tgz files</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Whitelisting-Blacklisting-files-inside-tgz-files/m-p/23854#M3754</link>
      <description>&lt;P&gt;I've just run into this issue myself and have been beating my head against the wall trying to figure it out. It's odd that Splunk supports using the name of a file inside a tgz with a regex to specify the hostname, but it can't look inside the tarball for the blacklist. Very frustrating!&lt;/P&gt;</description>
      <pubDate>Tue, 10 May 2011 23:32:18 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Whitelisting-Blacklisting-files-inside-tgz-files/m-p/23854#M3754</guid>
      <dc:creator>jstockamp</dc:creator>
      <dc:date>2011-05-10T23:32:18Z</dc:date>
    </item>
    <item>
      <title>Re: Whitelisting/Blacklisting files inside tgz files</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Whitelisting-Blacklisting-files-inside-tgz-files/m-p/23855#M3755</link>
      <description>&lt;P&gt;Not quite what you're looking for, but if nothing else you could route the events to &lt;A href="http://www.splunk.com/base/Documentation/4.2.1/Deploy/Routeandfilterdatad#Discard_specific_events_and_keep_the_rest"&gt;&lt;CODE&gt;nullQueue&lt;/CODE&gt;&lt;/A&gt; to discard the events from the unwanted files at index time.&lt;/P&gt;</description>
      <pubDate>Wed, 11 May 2011 00:44:56 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Whitelisting-Blacklisting-files-inside-tgz-files/m-p/23855#M3755</guid>
      <dc:creator>southeringtonp</dc:creator>
      <dc:date>2011-05-11T00:44:56Z</dc:date>
    </item>
    <item>
      <title>Re: Whitelisting/Blacklisting files inside tgz files</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Whitelisting-Blacklisting-files-inside-tgz-files/m-p/23856#M3756</link>
      <description>&lt;P&gt;Is this still an issue in 4.3? I've been beating my head against this one as well.&lt;/P&gt;</description>
      <pubDate>Tue, 14 Feb 2012 23:41:10 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Whitelisting-Blacklisting-files-inside-tgz-files/m-p/23856#M3756</guid>
      <dc:creator>robsenk</dc:creator>
      <dc:date>2012-02-14T23:41:10Z</dc:date>
    </item>
    <item>
      <title>Re: Whitelisting/Blacklisting files inside tgz files</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Whitelisting-Blacklisting-files-inside-tgz-files/m-p/23857#M3757</link>
      <description>&lt;P&gt;Hi,&lt;BR /&gt;
Did you ever find a way to do this? &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 09 Jul 2013 06:25:02 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Whitelisting-Blacklisting-files-inside-tgz-files/m-p/23857#M3757</guid>
      <dc:creator>gelica</dc:creator>
      <dc:date>2013-07-09T06:25:02Z</dc:date>
    </item>
  </channel>
</rss>

