<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to troubleshoot why a scheduled saved search that populates a summary index does not finalize? in Knowledge Management</title>
    <link>https://community.splunk.com/t5/Knowledge-Management/How-to-troubleshoot-why-a-scheduled-saved-search-that-populates/m-p/239547#M2104</link>
    <description>&lt;P&gt;How frequent do you run your summary index search? 50 M is a lot of events and would be great if you can increase the frequency to reduce the no of events to be written.&lt;/P&gt;</description>
    <pubDate>Wed, 20 Jan 2016 15:37:55 GMT</pubDate>
    <dc:creator>somesoni2</dc:creator>
    <dc:date>2016-01-20T15:37:55Z</dc:date>
    <item>
      <title>How to troubleshoot why a scheduled saved search that populates a summary index does not finalize?</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/How-to-troubleshoot-why-a-scheduled-saved-search-that-populates/m-p/239546#M2103</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;

&lt;P&gt;I have a scheduled saved search which populates a summary index with ~50M events. As the search is triggered, I monitor the progress in the &lt;STRONG&gt;Job Inspector&lt;/STRONG&gt;. I noticed that &lt;STRONG&gt;it reaches 100% in about 30mins&lt;/STRONG&gt;. After this point, the search fails to report any more progress. I have checked the &lt;CODE&gt;search.log&lt;/CODE&gt; and the last update is about the &lt;CODE&gt;"StatsProcessor - flushed stats...."&lt;/CODE&gt; into a gzipped file in  the scheduler's job directory. I also checked this directory and in the file &lt;CODE&gt;status.csv&lt;/CODE&gt; it reports as "&lt;STRONG&gt;FINALIZING&lt;/STRONG&gt;". Waiting a couple of hours and no progress. &lt;/P&gt;

&lt;P&gt;My concern is that in the flushed results reported a count of ~9M events which is correct for based on the indexed events. However, I use &lt;CODE&gt;timechart&lt;/CODE&gt;and then &lt;CODE&gt;untable&lt;/CODE&gt; in order to fill empty buckets and this is the expansion to 50M events.&lt;BR /&gt;
Also, scheduler's log in index &lt;CODE&gt;_internal&lt;/CODE&gt; does not report any error or whatever.&lt;/P&gt;

&lt;P&gt;Is there another log/process I could check to gather more details? or any more ideas? My limits.conf are configured to handled such big searches as well.&lt;/P&gt;

&lt;P&gt;thanks, Dimoklis&lt;/P&gt;</description>
      <pubDate>Wed, 20 Jan 2016 14:13:16 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/How-to-troubleshoot-why-a-scheduled-saved-search-that-populates/m-p/239546#M2103</guid>
      <dc:creator>dimoklis</dc:creator>
      <dc:date>2016-01-20T14:13:16Z</dc:date>
    </item>
    <item>
      <title>Re: How to troubleshoot why a scheduled saved search that populates a summary index does not finalize?</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/How-to-troubleshoot-why-a-scheduled-saved-search-that-populates/m-p/239547#M2104</link>
      <description>&lt;P&gt;How frequent do you run your summary index search? 50 M is a lot of events and would be great if you can increase the frequency to reduce the no of events to be written.&lt;/P&gt;</description>
      <pubDate>Wed, 20 Jan 2016 15:37:55 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/How-to-troubleshoot-why-a-scheduled-saved-search-that-populates/m-p/239547#M2104</guid>
      <dc:creator>somesoni2</dc:creator>
      <dc:date>2016-01-20T15:37:55Z</dc:date>
    </item>
    <item>
      <title>Re: How to troubleshoot why a scheduled saved search that populates a summary index does not finalize?</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/How-to-troubleshoot-why-a-scheduled-saved-search-that-populates/m-p/239548#M2105</link>
      <description>&lt;P&gt;Hi somesoni2 thanks for the reply. I am running it once a day in dead quiet period. Think there is a bottleneck with timechart/untable&lt;/P&gt;</description>
      <pubDate>Wed, 20 Jan 2016 16:56:00 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/How-to-troubleshoot-why-a-scheduled-saved-search-that-populates/m-p/239548#M2105</guid>
      <dc:creator>dimoklis</dc:creator>
      <dc:date>2016-01-20T16:56:00Z</dc:date>
    </item>
    <item>
      <title>Re: How to troubleshoot why a scheduled saved search that populates a summary index does not finalize?</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/How-to-troubleshoot-why-a-scheduled-saved-search-that-populates/m-p/239549#M2106</link>
      <description>&lt;P&gt;Ok... can you provide your search? Also, consider it running multiple times a day but processing different hours of the day to reduce the number of rows processed per run.&lt;/P&gt;</description>
      <pubDate>Wed, 20 Jan 2016 18:18:32 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/How-to-troubleshoot-why-a-scheduled-saved-search-that-populates/m-p/239549#M2106</guid>
      <dc:creator>somesoni2</dc:creator>
      <dc:date>2016-01-20T18:18:32Z</dc:date>
    </item>
    <item>
      <title>Re: How to troubleshoot why a scheduled saved search that populates a summary index does not finalize?</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/How-to-troubleshoot-why-a-scheduled-saved-search-that-populates/m-p/239550#M2107</link>
      <description>&lt;P&gt;Converting this to an answer because running the search multiple times a day will likely fix the problem. I've found that when you have a search that populates a summary index, it doesn't actually write any data to the summary index until the "finalizing" stage. So, Splunk taking a long time to write 50M results is not surprising to me.&lt;/P&gt;

&lt;P&gt;I have to wonder what the use case is for doing this though, as there may be a better way to implement @dimoklis desired outcome. I usually use summary indexing to aggregate data, although I have seen it misused as a lazy way to filter the data within indexes into new indexes (instead of using event routing via props/transforms).&lt;/P&gt;</description>
      <pubDate>Wed, 20 Jan 2016 21:08:31 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/How-to-troubleshoot-why-a-scheduled-saved-search-that-populates/m-p/239550#M2107</guid>
      <dc:creator>masonmorales</dc:creator>
      <dc:date>2016-01-20T21:08:31Z</dc:date>
    </item>
    <item>
      <title>Re: How to troubleshoot why a scheduled saved search that populates a summary index does not finalize?</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/How-to-troubleshoot-why-a-scheduled-saved-search-that-populates/m-p/239551#M2108</link>
      <description>&lt;P&gt;Did some work around on the way i aggregate the result with stats first. thanks @somesoni2. Marked as answer as it consists a general &lt;/P&gt;

&lt;BLOCKQUOTE&gt;
&lt;P&gt;best practice&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;</description>
      <pubDate>Fri, 22 Jan 2016 08:07:03 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/How-to-troubleshoot-why-a-scheduled-saved-search-that-populates/m-p/239551#M2108</guid>
      <dc:creator>dimoklis</dc:creator>
      <dc:date>2016-01-22T08:07:03Z</dc:date>
    </item>
  </channel>
</rss>

