<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Splunkd fatal error in Monitoring Splunk</title>
    <link>https://community.splunk.com/t5/Monitoring-Splunk/Splunkd-fatal-error/m-p/60336#M679</link>
    <description>&lt;P&gt;Recently I migrated the Windows Splunk server in our QA environment to Ubuntu 10.04.&lt;/P&gt;

&lt;P&gt;Things were working well for a week, today splunkd crashed. I suspect it is because the open file limit was set to a low number on the linux server. I have increased the open file limit and restarted splunkd.&lt;/P&gt;

&lt;P&gt;Looking at the logs, can anyone confirm if this theory is true? If not, any thoughts on why this happened? Thanks.&lt;/P&gt;

&lt;P&gt;Tailing &lt;STRONG&gt;Splunkd.log&lt;/STRONG&gt;, the last error messages before the crash (after a bunch of info messages) are:&lt;/P&gt;

&lt;P&gt;08-18-2011 15:40:31.187 +0000 ERROR JournalSlice - Cannot create new journal slice file: Too many open files, file="/opt/splunk/var/lib/splunk/defaultdb/db/hot_v1_42901/rawdata/0"&lt;/P&gt;

&lt;P&gt;08-18-2011 15:40:31.188 +0000 ERROR JournalSlice - Failed to write header for rawdata&lt;/P&gt;

&lt;P&gt;08-18-2011 15:40:31.188 +0000 INFO  HotDBManager - no hot found for event ts=1303840686, closest match=id=42900 [et,lt,span,flush,lru]=[1303676631,1303676631,14400,9223372036854775807,1313682031]  [expanded span=164055]&lt;/P&gt;

&lt;P&gt;08-18-2011 15:40:31.188 +0000 FATAL HotDBManager - hot dir with id already exists in createDir: /opt/splunk/var/lib/splunk/defaultdb/db/hot_v1_42901&lt;/P&gt;

&lt;P&gt;08-18-2011 15:40:31.357 +0000 WARN  EventLoop - Main Thread: about to throw a EventLoopException: error from PolledSocket write: Broken pipe&lt;/P&gt;

&lt;P&gt;Tailing &lt;STRONG&gt;/var/log/messages&lt;/STRONG&gt;:&lt;/P&gt;

&lt;P&gt;Aug 18 06:41:13 QAIFSPLUNK02 rsyslogd: [origin software="rsyslogd" swVersion="4.2.0" x-pid="790" x-info="&lt;A href="http://www.rsyslog.com%22" target="_blank"&gt;http://www.rsyslog.com"&lt;/A&gt;] rsyslogd was HUPed, type 'lightweight'.&lt;/P&gt;

&lt;P&gt;Aug 18 15:40:31 QAIFSPLUNK02 kernel: [781860.711631] __ratelimit: 3 callbacks suppressed&lt;/P&gt;

&lt;P&gt;Aug 18 15:40:31 QAIFSPLUNK02 kernel: [781860.711641] splunkd[12731]: segfault at 157a000 ip 0000000000f33280 sp 00007fb9ff7b90d0 error 4 in splunkd[400000+1017000]&lt;/P&gt;</description>
    <pubDate>Mon, 28 Sep 2020 09:48:29 GMT</pubDate>
    <dc:creator>sdevadas</dc:creator>
    <dc:date>2020-09-28T09:48:29Z</dc:date>
    <item>
      <title>Splunkd fatal error</title>
      <link>https://community.splunk.com/t5/Monitoring-Splunk/Splunkd-fatal-error/m-p/60336#M679</link>
      <description>&lt;P&gt;Recently I migrated the Windows Splunk server in our QA environment to Ubuntu 10.04.&lt;/P&gt;

&lt;P&gt;Things were working well for a week, today splunkd crashed. I suspect it is because the open file limit was set to a low number on the linux server. I have increased the open file limit and restarted splunkd.&lt;/P&gt;

&lt;P&gt;Looking at the logs, can anyone confirm if this theory is true? If not, any thoughts on why this happened? Thanks.&lt;/P&gt;

&lt;P&gt;Tailing &lt;STRONG&gt;Splunkd.log&lt;/STRONG&gt;, the last error messages before the crash (after a bunch of info messages) are:&lt;/P&gt;

&lt;P&gt;08-18-2011 15:40:31.187 +0000 ERROR JournalSlice - Cannot create new journal slice file: Too many open files, file="/opt/splunk/var/lib/splunk/defaultdb/db/hot_v1_42901/rawdata/0"&lt;/P&gt;

&lt;P&gt;08-18-2011 15:40:31.188 +0000 ERROR JournalSlice - Failed to write header for rawdata&lt;/P&gt;

&lt;P&gt;08-18-2011 15:40:31.188 +0000 INFO  HotDBManager - no hot found for event ts=1303840686, closest match=id=42900 [et,lt,span,flush,lru]=[1303676631,1303676631,14400,9223372036854775807,1313682031]  [expanded span=164055]&lt;/P&gt;

&lt;P&gt;08-18-2011 15:40:31.188 +0000 FATAL HotDBManager - hot dir with id already exists in createDir: /opt/splunk/var/lib/splunk/defaultdb/db/hot_v1_42901&lt;/P&gt;

&lt;P&gt;08-18-2011 15:40:31.357 +0000 WARN  EventLoop - Main Thread: about to throw a EventLoopException: error from PolledSocket write: Broken pipe&lt;/P&gt;

&lt;P&gt;Tailing &lt;STRONG&gt;/var/log/messages&lt;/STRONG&gt;:&lt;/P&gt;

&lt;P&gt;Aug 18 06:41:13 QAIFSPLUNK02 rsyslogd: [origin software="rsyslogd" swVersion="4.2.0" x-pid="790" x-info="&lt;A href="http://www.rsyslog.com%22" target="_blank"&gt;http://www.rsyslog.com"&lt;/A&gt;] rsyslogd was HUPed, type 'lightweight'.&lt;/P&gt;

&lt;P&gt;Aug 18 15:40:31 QAIFSPLUNK02 kernel: [781860.711631] __ratelimit: 3 callbacks suppressed&lt;/P&gt;

&lt;P&gt;Aug 18 15:40:31 QAIFSPLUNK02 kernel: [781860.711641] splunkd[12731]: segfault at 157a000 ip 0000000000f33280 sp 00007fb9ff7b90d0 error 4 in splunkd[400000+1017000]&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2020 09:48:29 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Monitoring-Splunk/Splunkd-fatal-error/m-p/60336#M679</guid>
      <dc:creator>sdevadas</dc:creator>
      <dc:date>2020-09-28T09:48:29Z</dc:date>
    </item>
    <item>
      <title>Re: Splunkd fatal error</title>
      <link>https://community.splunk.com/t5/Monitoring-Splunk/Splunkd-fatal-error/m-p/60337#M680</link>
      <description>&lt;P&gt;I would suggest that you restart splunk to roll this particular hotbucket to warm, it looks like something has gone haywire in your hot bucket. &lt;/P&gt;</description>
      <pubDate>Wed, 12 Oct 2011 17:21:27 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Monitoring-Splunk/Splunkd-fatal-error/m-p/60337#M680</guid>
      <dc:creator>jbsplunk</dc:creator>
      <dc:date>2011-10-12T17:21:27Z</dc:date>
    </item>
  </channel>
</rss>

