<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic load compressed files in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/load-compressed-files/m-p/66776#M13405</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;As we know, before Splunk consumes a compressed file, it decompresses the file first and then indexes it.&lt;/P&gt;

&lt;P&gt;But if we have many compressed files in the same directory (e.g. ap_20110301.zip, ap_20110302.zip, ...) and the original file inside each one has the same name (e.g. ap.log), what happens?&lt;/P&gt;

&lt;P&gt;Does Splunk decompress all of those files and then index them, or does it decompress and index them one by one?&lt;/P&gt;

&lt;P&gt;Because the original file names are the same, if Splunk decompresses all of the files first, each one will overwrite the previous one. (This is actually what we observed, but we want to make sure.)&lt;/P&gt;

&lt;P&gt;Thanks.&lt;/P&gt;</description>
    <pubDate>Thu, 24 Mar 2011 08:06:11 GMT</pubDate>
    <dc:creator>dmlee</dc:creator>
    <dc:date>2011-03-24T08:06:11Z</dc:date>
    <item>
      <title>load compressed files</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/load-compressed-files/m-p/66776#M13405</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;As we know, before Splunk consumes a compressed file, it decompresses the file first and then indexes it.&lt;/P&gt;

&lt;P&gt;But if we have many compressed files in the same directory (e.g. ap_20110301.zip, ap_20110302.zip, ...) and the original file inside each one has the same name (e.g. ap.log), what happens?&lt;/P&gt;

&lt;P&gt;Does Splunk decompress all of those files and then index them, or does it decompress and index them one by one?&lt;/P&gt;

&lt;P&gt;Because the original file names are the same, if Splunk decompresses all of the files first, each one will overwrite the previous one. (This is actually what we observed, but we want to make sure.)&lt;/P&gt;

&lt;P&gt;Thanks.&lt;/P&gt;</description>
      <pubDate>Thu, 24 Mar 2011 08:06:11 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/load-compressed-files/m-p/66776#M13405</guid>
      <dc:creator>dmlee</dc:creator>
      <dc:date>2011-03-24T08:06:11Z</dc:date>
    </item>
    <item>
      <title>Re: load compressed files</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/load-compressed-files/m-p/66777#M13406</link>
      <description>&lt;P&gt;Splunk never actually decompresses the files within archives to a temporary location on disk. Instead, we use a library called "libarchive" that lets us stream through the contents of each archive, and the streamed contents are then indexed. Because nothing is extracted to disk, identically named files in different archives cannot overwrite one another.&lt;/P&gt;</description>
      <pubDate>Thu, 24 Mar 2011 08:56:54 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/load-compressed-files/m-p/66777#M13406</guid>
      <dc:creator>Stephen_Sorkin</dc:creator>
      <dc:date>2011-03-24T08:56:54Z</dc:date>
    </item>
    <item>
      <title>Re: load compressed files</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/load-compressed-files/m-p/66778#M13407</link>
      <description>&lt;P&gt;Lesson learned, thanks.&lt;/P&gt;</description>
      <pubDate>Thu, 24 Mar 2011 13:29:12 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/load-compressed-files/m-p/66778#M13407</guid>
      <dc:creator>dmlee</dc:creator>
      <dc:date>2011-03-24T13:29:12Z</dc:date>
    </item>
  </channel>
</rss>

