<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Some events missing from data model - only if accelerated in Knowledge Management</title>
    <link>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391382#M3363</link>
    <description>&lt;P&gt;Forgot to mention, our fix was to pipe that single event to "delete" and rebuild dm acceleration. &lt;/P&gt;</description>
    <pubDate>Tue, 04 Feb 2020 23:07:57 GMT</pubDate>
    <dc:creator>codebuilder</dc:creator>
    <dc:date>2020-02-04T23:07:57Z</dc:date>
    <item>
      <title>Some events missing from data model - only if accelerated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391368#M3349</link>
      <description>&lt;H2&gt;The problem&lt;/H2&gt;

&lt;P&gt;Sometimes specific events are missing from an accelerated data model. These events can be found with a regular SPL search. &lt;BR /&gt;
When searching the data model, the events are only returned when the data model is &lt;EM&gt;not&lt;/EM&gt; accelerated. Once the acceleration is enabled, the events do not show up any more. Of course, we checked that the DMA's status is "100% completed". Rebuilding the accelerated data model does not help.&lt;/P&gt;

&lt;P&gt;What is really strange: it happens that &lt;STRONG&gt;events are disappearing from the accelerated data model&lt;/STRONG&gt; that showed up just fine mere minutes earlier. I noticed that today when I refreshed a dashboard and some charts suddenly were empty (which led me to investigate this again and post this question).&lt;/P&gt;

&lt;H2&gt;An example search&lt;/H2&gt;

&lt;P&gt;Here is an example of a search that is not returning all expected events (we have noticed this issue with different datasets in the past, too):&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;| pivot uberAgent Process_NetworkTargetPerformance
count(Process_NetworkTargetPerformance) as "Event count"
splitrow AppName
filter host is "Client11"
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;We have seen this issue on various versions of Splunk Enterprise 7.1.x and 7.2.x (including 7.2.1).&lt;/P&gt;

&lt;H2&gt;Screenshots&lt;/H2&gt;

&lt;P&gt;&lt;STRONG&gt;Screenshots&lt;/STRONG&gt; of the above search, first with acceleration disabled (9 results), then with acceleration enabled (6 results):&lt;/P&gt;

&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="alt text"&gt;&lt;img src="https://community.splunk.com/t5/image/serverpage/image-id/6115i4389B23B9474DDE3/image-size/large?v=v2&amp;amp;px=999" role="button" title="alt text" alt="alt text" /&gt;&lt;/span&gt;&lt;BR /&gt;
&lt;span class="lia-inline-image-display-wrapper" image-alt="alt text"&gt;&lt;img src="https://community.splunk.com/t5/image/serverpage/image-id/6116iCC7F52AA404F1F2C/image-size/large?v=v2&amp;amp;px=999" role="button" title="alt text" alt="alt text" /&gt;&lt;/span&gt;&lt;/P&gt;

&lt;H2&gt;Scheduler.log&lt;/H2&gt;

&lt;P&gt;I ran a search to query &lt;CODE&gt;scheduler.log&lt;/CODE&gt; over a time range of 8 hours, which covered the time of the issue:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;index=_internal source=*scheduler.log 
| stats count by log_level
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;Result:&lt;/P&gt;

&lt;UL&gt;
&lt;LI&gt;DEBUG: 22537&lt;/LI&gt;
&lt;LI&gt;ERROR: 3&lt;/LI&gt;
&lt;LI&gt;INFO: 5811&lt;/LI&gt;
&lt;/UL&gt;

&lt;H2&gt;Update 2020-02-04: back in Splunk 8.x&lt;/H2&gt;

&lt;P&gt;Dominik's answer fixed the problem for Splunk versions 7.x. However, it is back in Splunk 8 (we tested 8.0 and 8.0.1).&lt;/P&gt;</description>
      <pubDate>Mon, 12 Nov 2018 18:33:59 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391368#M3349</guid>
      <dc:creator>helge</dc:creator>
      <dc:date>2018-11-12T18:33:59Z</dc:date>
    </item>
    <item>
      <title>Re: Some events missing from data model - only if accelerated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391369#M3350</link>
      <description>&lt;P&gt;Did you get your lack of scheduler logs fixed?&lt;/P&gt;</description>
      <pubDate>Mon, 12 Nov 2018 23:03:02 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391369#M3350</guid>
      <dc:creator>martin_mueller</dc:creator>
      <dc:date>2018-11-12T23:03:02Z</dc:date>
    </item>
    <item>
      <title>Re: Some events missing from data model - only if accelerated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391370#M3351</link>
      <description>&lt;P&gt;Hi @martin_mueller, I am not sure I understand your question. However, I just re-checked (several hours after my initial post) and the missing events "magically" reappeared. I do not consider that a solution, because we have seen this before, and I cannot have searches sometimes showing incomplete results.&lt;BR /&gt;
Even more interesting, the acceleration was turned off in these past hours. When I just turned it back on, it immediately showed a status of "100% completed" and it immediately returned the events that had been missing.&lt;/P&gt;</description>
      <pubDate>Mon, 12 Nov 2018 23:20:49 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391370#M3351</guid>
      <dc:creator>helge</dc:creator>
      <dc:date>2018-11-12T23:20:49Z</dc:date>
    </item>
    <item>
      <title>Re: Some events missing from data model - only if accelerated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391371#M3352</link>
      <description>&lt;P&gt;Dominik's environment had an almost-empty scheduler.log, indicating something really fishy going on with the scheduler.&lt;/P&gt;</description>
      <pubDate>Mon, 12 Nov 2018 23:26:17 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391371#M3352</guid>
      <dc:creator>martin_mueller</dc:creator>
      <dc:date>2018-11-12T23:26:17Z</dc:date>
    </item>
    <item>
      <title>Re: Some events missing from data model - only if accelerated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391372#M3353</link>
      <description>&lt;P&gt;According to this machine's &lt;CODE&gt;scheduler.log&lt;/CODE&gt; Splunk has been busy (in a good way). Lots of DEBUG and INFO messages, nearly no errors. I updated the question with details.&lt;/P&gt;</description>
      <pubDate>Tue, 13 Nov 2018 00:19:36 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391372#M3353</guid>
      <dc:creator>helge</dc:creator>
      <dc:date>2018-11-13T00:19:36Z</dc:date>
    </item>
    <item>
      <title>Re: Some events missing from data model - only if accelerated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391373#M3354</link>
      <description>&lt;P&gt;Dominik here. The empty &lt;EM&gt;scheduler.log&lt;/EM&gt; was the result of a misconfiguration in &lt;EM&gt;$SPLUNK_HOME\etc\log.cfg&lt;/EM&gt;. It is fixed now, the &lt;EM&gt;scheduler.log&lt;/EM&gt; fills up again, but the problem that Helge has is still there.&lt;/P&gt;

&lt;P&gt;I also gave the scheduler more resources by setting &lt;EM&gt;max_searches_perc&lt;/EM&gt; and &lt;EM&gt;auto_summary_perc&lt;/EM&gt; from 50 to 80 in the &lt;EM&gt;limits.conf&lt;/EM&gt;.&lt;/P&gt;

&lt;P&gt;Splunk version: 7.2.1&lt;/P&gt;</description>
      <pubDate>Tue, 29 Sep 2020 21:58:35 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391373#M3354</guid>
      <dc:creator>dominikatvastli</dc:creator>
      <dc:date>2020-09-29T21:58:35Z</dc:date>
    </item>
    <item>
      <title>Re: Some events missing from data model - only if accelerated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391374#M3355</link>
      <description>&lt;P&gt;Good to hear that you at least got the log fixed. Now that you have scheduler logs, what's your skip ratio in the MC?&lt;/P&gt;</description>
      <pubDate>Tue, 13 Nov 2018 09:48:03 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391374#M3355</guid>
      <dc:creator>martin_mueller</dc:creator>
      <dc:date>2018-11-13T09:48:03Z</dc:date>
    </item>
    <item>
      <title>Re: Some events missing from data model - only if accelerated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391375#M3356</link>
      <description>&lt;P&gt;For most hours, the skip ratio is between 0% and 0.5%.&lt;/P&gt;</description>
      <pubDate>Tue, 13 Nov 2018 12:00:02 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391375#M3356</guid>
      <dc:creator>helge</dc:creator>
      <dc:date>2018-11-13T12:00:02Z</dc:date>
    </item>
    <item>
      <title>Re: Some events missing from data model - only if accelerated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391376#M3357</link>
      <description>&lt;P&gt;We did some further testing and it seems that the problem only occurs when Splunk is running on Windows. Splunk on Ubuntu is fine.&lt;/P&gt;</description>
      <pubDate>Mon, 10 Dec 2018 14:49:35 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391376#M3357</guid>
      <dc:creator>dominikatvastli</dc:creator>
      <dc:date>2018-12-10T14:49:35Z</dc:date>
    </item>
    <item>
      <title>Re: Some events missing from data model - only if accelerated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391377#M3358</link>
      <description>&lt;P&gt;Splunk support told us it is a known bug which will be fixed in the future. They came up with the following workaround which fixes the problem for us:&lt;/P&gt;

&lt;P&gt;&lt;EM&gt;Please edit/create the file $SPLUNK_HOME/etc/system/local/limits.conf .&lt;BR /&gt;
Add the following stanza&lt;/EM&gt;&lt;/P&gt;

&lt;P&gt;&lt;EM&gt;[directives] &lt;BR /&gt;
read_summary = disabled&lt;/EM&gt;&lt;/P&gt;

&lt;P&gt;&lt;EM&gt;Restart the instance for changes to take effect.&lt;BR /&gt;
In a distributed env it should be done on the SH.&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 17 May 2019 12:42:11 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391377#M3358</guid>
      <dc:creator>dominikatvastli</dc:creator>
      <dc:date>2019-05-17T12:42:11Z</dc:date>
    </item>
    <item>
      <title>Re: Some events missing from data model - only if accelerated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391378#M3359</link>
      <description>&lt;P&gt;Which Splunk version &amp;amp; OS?&lt;/P&gt;</description>
      <pubDate>Sat, 18 May 2019 06:42:06 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391378#M3359</guid>
      <dc:creator>gjanders</dc:creator>
      <dc:date>2019-05-18T06:42:06Z</dc:date>
    </item>
    <item>
      <title>Re: Some events missing from data model - only if accelerated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391379#M3360</link>
      <description>&lt;P&gt;Splunk 7.x on Windows. Our Ubuntu Splunk servers are not affected.&lt;/P&gt;</description>
      <pubDate>Mon, 20 May 2019 05:53:00 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391379#M3360</guid>
      <dc:creator>dominikatvastli</dc:creator>
      <dc:date>2019-05-20T05:53:00Z</dc:date>
    </item>
    <item>
      <title>Re: Some events missing from data model - only if accelerated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391380#M3361</link>
      <description>&lt;P&gt;This fixes the problem for Splunk 7.x. However, the issue is back in 8.x and the &lt;CODE&gt;read_summary&lt;/CODE&gt; setting does not help.&lt;/P&gt;</description>
      <pubDate>Tue, 04 Feb 2020 22:28:19 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391380#M3361</guid>
      <dc:creator>helge</dc:creator>
      <dc:date>2020-02-04T22:28:19Z</dc:date>
    </item>
    <item>
      <title>Re: Some events missing from data model - only if accelerated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391381#M3362</link>
      <description>&lt;P&gt;We just recently chased this very issue, though our root cause may be different than yours, not certain.&lt;BR /&gt;
The issue manifested itself in some of our dashboards, which are powered by accelerated datamodels. Many began to run very slow, while others (different datamodels) were fine.&lt;/P&gt;

&lt;P&gt;During RCA, we found that the event count for a given datamodel was different, depending on acceleration. Easy to check by toggling summariesonly=t / f. We also found that there were gaps in the acceleration, though the UI was showing 100% accelerated.&lt;/P&gt;

&lt;P&gt;In any case, check your internal logs for PCRE_ERROR_MATCHLIMIT entries:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;index=_internal source=*splunkd.log* PCRE_ERROR_MATCHLIMIT | cluster showcount=t | table cluster_count _raw | sort -cluster_count
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;What we found, was that in certain periods of the accelerated date/time range, the forwarder had encountered corrupted events. Though the data was still indexed, one or more datamodel acceleration jobs/searches would fail due to a regex exceeding the  PCRE_ERROR_MATCHLIMIT (default is 100k).  Later jobs would pick up at a later time and succeed, thus causing the gap(s) in our acceleration.&lt;/P&gt;

&lt;P&gt;So, when a user would open a dashboard and pick a date/time range that included the "gap", the query would drop back to searching non-accelerated data, causing the slowness.&lt;/P&gt;

&lt;P&gt;In our case, a single corrupted event was the cause. It was full of hex/binary data. We narrowed this down using timechart and a span of 1 second. Note that the "problem" event was just prior to the gap in acceleration, not within the gap itself.&lt;/P&gt;

&lt;P&gt;We are on Splunk 7.2.6, no other errors were thrown, and the datamodel acceleration showed 100% complete.&lt;/P&gt;

&lt;P&gt;Hope this helps.&lt;/P&gt;</description>
      <pubDate>Wed, 30 Sep 2020 04:02:18 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391381#M3362</guid>
      <dc:creator>codebuilder</dc:creator>
      <dc:date>2020-09-30T04:02:18Z</dc:date>
    </item>
    <item>
      <title>Re: Some events missing from data model - only if accelerated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391382#M3363</link>
      <description>&lt;P&gt;Forgot to mention, our fix was to pipe that single event to "delete" and rebuild dm acceleration. &lt;/P&gt;</description>
      <pubDate>Tue, 04 Feb 2020 23:07:57 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391382#M3363</guid>
      <dc:creator>codebuilder</dc:creator>
      <dc:date>2020-02-04T23:07:57Z</dc:date>
    </item>
    <item>
      <title>Re: Some events missing from data model - only if accelerated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391383#M3364</link>
      <description>&lt;P&gt;That is an interesting issue you have described, @codebuilder, it seems to be different from what we are seeing, though.&lt;/P&gt;</description>
      <pubDate>Wed, 05 Feb 2020 12:47:42 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391383#M3364</guid>
      <dc:creator>helge</dc:creator>
      <dc:date>2020-02-05T12:47:42Z</dc:date>
    </item>
    <item>
      <title>Re: Some events missing from data model - only if accelerated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391384#M3365</link>
      <description>&lt;P&gt;For finding the corrupt event, you said you "narrowed this down using timechart and a span of 1 second." Could you elaborate? I'm seeing something similar, and I'm not sure I have a corrupt event or not... but not sure how to reveal it.&lt;/P&gt;</description>
      <pubDate>Tue, 17 Mar 2020 17:57:06 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/Some-events-missing-from-data-model-only-if-accelerated/m-p/391384#M3365</guid>
      <dc:creator>wryanthomas</dc:creator>
      <dc:date>2020-03-17T17:57:06Z</dc:date>
    </item>
  </channel>
</rss>

