<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Multiple gzip broken pipe errors in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Multiple-gzip-broken-pipe-errors/m-p/190690#M37962</link>
    <description>&lt;P&gt;What version of Splunk are you running?&lt;/P&gt;</description>
    <pubDate>Wed, 30 Sep 2015 02:45:25 GMT</pubDate>
    <dc:creator>muebel</dc:creator>
    <dc:date>2015-09-30T02:45:25Z</dc:date>
    <item>
      <title>Multiple gzip broken pipe errors</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Multiple-gzip-broken-pipe-errors/m-p/190685#M37957</link>
      <description>&lt;P&gt;I am seeing many errors like the below:&lt;/P&gt;

&lt;UL&gt;
&lt;LI&gt;{timestamp} INFO ArchiveProcessor - handling file=/path/to/file.gz&lt;/LI&gt;
&lt;LI&gt;{timestamp} INFO ArchiveProcessor - reading path=/path/to/file.gz (seek=0 len={some number that is actually equal to the length of the file on disk})&lt;/LI&gt;
&lt;LI&gt;{timestamp} ERROR ArchiveContext - from archive='/path/to/file.gz': gzip: stdout: Broken pipe&lt;/LI&gt;
&lt;LI&gt;{timestamp} INFO ArchiveProcessor - Finished processing file '/path/to/file.gz', removing from stats&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;These files are created by IBM InfoSphere Streams, using the 'gzip' compression option of its FileSink operator. They are on an NFS mount. The odd thing is that I don't get these error from all of the files, but definitely from most. If I try to use regular gunzip to decompress the files, I get no errors or warnings even in verbose mode and they decompress just fine.&lt;/P&gt;

&lt;P&gt;What is causing all these errors?&lt;/P&gt;</description>
      <pubDate>Fri, 20 Mar 2015 12:39:12 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Multiple-gzip-broken-pipe-errors/m-p/190685#M37957</guid>
      <dc:creator>MasterDuke</dc:creator>
      <dc:date>2015-03-20T12:39:12Z</dc:date>
    </item>
    <item>
      <title>Re: Multiple gzip broken pipe errors</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Multiple-gzip-broken-pipe-errors/m-p/190686#M37958</link>
      <description>&lt;P&gt;I'm getting the same error when trying to ingest .gz files into Splunk.  Please let me know if you found a resolution.&lt;/P&gt;</description>
      <pubDate>Fri, 27 Mar 2015 20:16:29 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Multiple-gzip-broken-pipe-errors/m-p/190686#M37958</guid>
      <dc:creator>ericlarsen</dc:creator>
      <dc:date>2015-03-27T20:16:29Z</dc:date>
    </item>
    <item>
      <title>Re: Multiple gzip broken pipe errors</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Multiple-gzip-broken-pipe-errors/m-p/190687#M37959</link>
      <description>&lt;P&gt;also dealing with this,  please let me know if you find more info&lt;/P&gt;</description>
      <pubDate>Thu, 23 Apr 2015 12:28:02 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Multiple-gzip-broken-pipe-errors/m-p/190687#M37959</guid>
      <dc:creator>atorrrr</dc:creator>
      <dc:date>2015-04-23T12:28:02Z</dc:date>
    </item>
    <item>
      <title>Re: Multiple gzip broken pipe errors</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Multiple-gzip-broken-pipe-errors/m-p/190688#M37960</link>
      <description>&lt;P&gt;I'm having the same issue.  Looking for a solution now..&lt;/P&gt;</description>
      <pubDate>Wed, 03 Jun 2015 18:38:36 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Multiple-gzip-broken-pipe-errors/m-p/190688#M37960</guid>
      <dc:creator>pj_elia</dc:creator>
      <dc:date>2015-06-03T18:38:36Z</dc:date>
    </item>
    <item>
      <title>Re: Multiple gzip broken pipe errors</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Multiple-gzip-broken-pipe-errors/m-p/190689#M37961</link>
      <description>&lt;P&gt;I am seeing these to .. any updates ? &lt;/P&gt;

&lt;P&gt;09-24-2015 05:49:35.834 -0400 INFO  ArchiveProcessor - handling file=/var/cdnlog/cdn.n1.paychexinc.com_20150924003358_112706.log.gz&lt;BR /&gt;
09-24-2015 05:49:35.834 -0400 INFO  ArchiveProcessor - reading path=/var/cdnlog/cdn.n1.paychexinc.com_20150924003358_112706.log.gz (seek=0 len=863894)&lt;BR /&gt;
09-24-2015 05:49:36.295 -0400 ERROR ArchiveContext - From archive='/var/cdnlog/cdn.n1.paychexinc.com_20150924003358_112706.log.gz':  gzip: stdout: Broken pipe&lt;BR /&gt;
09-24-2015 05:49:37.667 -0400 INFO  ArchiveProcessor - Finished processing file '/var/cdnlog/cdn.n1.paychexinc.com_20150924003358_112706.log.gz', removing from stats&lt;BR /&gt;
09-24-2015 05:49:37.667 -0400 INFO  ArchiveProcessor - handling file=/var/cdnlog/cdn.paychexinc.com_20150924020241_122706.log.gz&lt;BR /&gt;
09-24-2015 05:49:37.668 -0400 INFO  ArchiveProcessor - reading path=/var/cdnlog/cdn.paychexinc.com_20150924020241_122706.log.gz (seek=0 len=53513669)&lt;BR /&gt;
09-24-2015 05:49:37.668 -0400 WARN  TcpOutputProc - The event is missing source information. Event : &lt;BR /&gt;
09-24-2015 05:49:38.041 -0400 ERROR ArchiveContext - From archive='/var/cdnlog/cdn.paychexinc.com_20150924020241_122706.log.gz':  gzip: stdout: Broken pipe&lt;/P&gt;</description>
      <pubDate>Tue, 29 Sep 2020 07:20:15 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Multiple-gzip-broken-pipe-errors/m-p/190689#M37961</guid>
      <dc:creator>wsnyder2</dc:creator>
      <dc:date>2020-09-29T07:20:15Z</dc:date>
    </item>
    <item>
      <title>Re: Multiple gzip broken pipe errors</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Multiple-gzip-broken-pipe-errors/m-p/190690#M37962</link>
      <description>&lt;P&gt;What version of Splunk are you running?&lt;/P&gt;</description>
      <pubDate>Wed, 30 Sep 2015 02:45:25 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Multiple-gzip-broken-pipe-errors/m-p/190690#M37962</guid>
      <dc:creator>muebel</dc:creator>
      <dc:date>2015-09-30T02:45:25Z</dc:date>
    </item>
    <item>
      <title>Re: Multiple gzip broken pipe errors</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Multiple-gzip-broken-pipe-errors/m-p/190691#M37963</link>
      <description>&lt;P&gt;Tried several different 6.x.y versions, on 6.2.1 now I believe.&lt;/P&gt;</description>
      <pubDate>Sun, 04 Oct 2015 04:12:10 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Multiple-gzip-broken-pipe-errors/m-p/190691#M37963</guid>
      <dc:creator>MasterDuke</dc:creator>
      <dc:date>2015-10-04T04:12:10Z</dc:date>
    </item>
    <item>
      <title>Re: Multiple gzip broken pipe errors</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Multiple-gzip-broken-pipe-errors/m-p/190692#M37964</link>
      <description>&lt;P&gt;Since you ruled out gzip itself as the culprit, this looks to me like a pipeline problem, and not exactly a gzip failure - which means the trouble would be in ArchiveProcessor.&lt;/P&gt;

&lt;P&gt;In other words ArchiveProcessor may not be handling the pipeline correctly (a bug! ... perhaps). There are at least 2 other possibly related issues to be found on this site:&lt;/P&gt;

&lt;P&gt;&lt;A href="http://answers.splunk.com/answers/57272/large-data-archives-zip-being-corrupted-on-indexing.html"&gt;http://answers.splunk.com/answers/57272/large-data-archives-zip-being-corrupted-on-indexing.html&lt;/A&gt;&lt;BR /&gt;
&lt;A href="http://answers.splunk.com/answers/132045/error-archiveprocessor-with-zip-files.html"&gt;http://answers.splunk.com/answers/132045/error-archiveprocessor-with-zip-files.html&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;these refer to zip files, but i wonder if it might be common cause, specifically pipeline handling in the ArchiveProcessor logic.&lt;/P&gt;

&lt;P&gt;As an aside, i'm completely unsure if your issue is related, but these are food for thought:&lt;/P&gt;

&lt;P&gt;&lt;A href="https://blog.nelhage.com/2010/02/a-very-subtle-bug/"&gt;https://blog.nelhage.com/2010/02/a-very-subtle-bug/&lt;/A&gt;&lt;BR /&gt;
&lt;A href="http://bugs.python.org/issue1652"&gt;http://bugs.python.org/issue1652&lt;/A&gt; &lt;/P&gt;</description>
      <pubDate>Sun, 04 Oct 2015 05:56:03 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Multiple-gzip-broken-pipe-errors/m-p/190692#M37964</guid>
      <dc:creator>sjalexander</dc:creator>
      <dc:date>2015-10-04T05:56:03Z</dc:date>
    </item>
  </channel>
</rss>

