<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How to investigate crashing indexer in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/How-to-investigate-crashing-indexer/m-p/426056#M74667</link>
    <description>&lt;P&gt;Our indexers are in a cluster. We have 4 indexers and they are crashing once a week, I do not how to start investigating. &lt;BR /&gt;
I tried several ways but not able to identify anything with that.&lt;/P&gt;

&lt;P&gt;1) Whenever they crash, CPU load and memory and swap will be exhausted. It only happened during the crash time most of the time they are so less utilized.&lt;/P&gt;

&lt;P&gt;free -g&lt;BR /&gt;
             total       used       free     shared    buffers     cached&lt;BR /&gt;
Mem:            11         10          0          0          0          9&lt;BR /&gt;
-/+ buffers/cache:          0         10&lt;BR /&gt;
Swap:            3          0          3&lt;/P&gt;

&lt;P&gt;2) Also, we are seeing one accelerated search the is getting completed during this time.&lt;/P&gt;

&lt;P&gt;Just let me know if any there is any info I should provide.&lt;/P&gt;

&lt;P&gt;Please help us with this.&lt;/P&gt;</description>
    <pubDate>Fri, 02 Aug 2019 19:14:32 GMT</pubDate>
    <dc:creator>shivanandbm</dc:creator>
    <dc:date>2019-08-02T19:14:32Z</dc:date>
    <item>
      <title>How to investigate crashing indexer</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-investigate-crashing-indexer/m-p/426056#M74667</link>
      <description>&lt;P&gt;Our indexers are in a cluster. We have 4 indexers and they are crashing once a week, I do not how to start investigating. &lt;BR /&gt;
I tried several ways but not able to identify anything with that.&lt;/P&gt;

&lt;P&gt;1) Whenever they crash, CPU load and memory and swap will be exhausted. It only happened during the crash time most of the time they are so less utilized.&lt;/P&gt;

&lt;P&gt;free -g&lt;BR /&gt;
             total       used       free     shared    buffers     cached&lt;BR /&gt;
Mem:            11         10          0          0          0          9&lt;BR /&gt;
-/+ buffers/cache:          0         10&lt;BR /&gt;
Swap:            3          0          3&lt;/P&gt;

&lt;P&gt;2) Also, we are seeing one accelerated search the is getting completed during this time.&lt;/P&gt;

&lt;P&gt;Just let me know if any there is any info I should provide.&lt;/P&gt;

&lt;P&gt;Please help us with this.&lt;/P&gt;</description>
      <pubDate>Fri, 02 Aug 2019 19:14:32 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-investigate-crashing-indexer/m-p/426056#M74667</guid>
      <dc:creator>shivanandbm</dc:creator>
      <dc:date>2019-08-02T19:14:32Z</dc:date>
    </item>
    <item>
      <title>Re: How to investigate crashing indexer</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-investigate-crashing-indexer/m-p/426057#M74668</link>
      <description>&lt;P&gt;There are some helpful searches in case of crash:&lt;/P&gt;

&lt;P&gt;1) When did Splunk last crash?&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;index=_internal sourcetype=splunkd_crash_log | stats count by host
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;2) Show me all Splunk restarts based on loader?&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;index=_internal sourcetype=splunkd loader message=*xml
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;3) Lengthy search?&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;index="_audit" action="search" (id=* OR search_id=*) | eval
user=if(user=="n/a",null(),user) | stats max(total_run_time)
as total_run_time first(user) as user by search_id | stats
count perc95(total_run_time) median(total_run_time) by user
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;You might need to check &lt;CODE&gt;Monitoring Console &amp;gt;&amp;gt; Resource Usage&lt;/CODE&gt; to check memory and disk space usage within crash time period.&lt;/P&gt;</description>
      <pubDate>Fri, 02 Aug 2019 20:41:59 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-investigate-crashing-indexer/m-p/426057#M74668</guid>
      <dc:creator>mayurr98</dc:creator>
      <dc:date>2019-08-02T20:41:59Z</dc:date>
    </item>
    <item>
      <title>Re: How to investigate crashing indexer</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-investigate-crashing-indexer/m-p/426058#M74669</link>
      <description>&lt;P&gt;Thanks alot for replying. I am not getting any output in crash search and restart based on loader search.&lt;BR /&gt;
Splunk system user are the top performer during that time.&lt;BR /&gt;
Also memory and swap are completely exhausted and then our splunk process stopped in the indexer..I restarted manually.&lt;/P&gt;

&lt;P&gt;just want to know why the memory and swap are getting exhausted for short duration of time. also i see high load during that time.&lt;/P&gt;

&lt;P&gt;also i had seen all the forwarders are disconnected during that time as i got forwarder missing alerts for so many forwarders.&lt;/P&gt;

&lt;P&gt;I am not seeing anything in crashlog..&lt;/P&gt;

&lt;P&gt;Our splunk process stopped and i restarted it manually.&lt;/P&gt;

&lt;P&gt;Regards,Shivanand&lt;/P&gt;</description>
      <pubDate>Sat, 03 Aug 2019 05:00:12 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-investigate-crashing-indexer/m-p/426058#M74669</guid>
      <dc:creator>shivanandbm</dc:creator>
      <dc:date>2019-08-03T05:00:12Z</dc:date>
    </item>
  </channel>
</rss>

