<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Accelerated data model 100% complete even though most populating searches are skipped in Other Usage</title>
    <link>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328083#M1476</link>
    <description>&lt;P&gt;thanks @helge&lt;/P&gt;</description>
    <pubDate>Sat, 02 Jun 2018 04:21:23 GMT</pubDate>
    <dc:creator>Esky73</dc:creator>
    <dc:date>2018-06-02T04:21:23Z</dc:date>
    <item>
      <title>Accelerated data model 100% complete even though most populating searches are skipped</title>
      <link>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328068#M1461</link>
      <description>&lt;P&gt;Our app uses an accelerated data model for all searches, which works really well.&lt;/P&gt;

&lt;P&gt;I recently stumbled about a discrepancy which I cannot explain. The Data Models UI always shows the acceleration status as 100% completed, and the field &lt;EM&gt;Updated&lt;/EM&gt; is always within a few seconds of the current time. That is good, of course, however when looking at how often populating searches are actually run, things seem to be different. In &lt;EM&gt;scheduler.log&lt;/EM&gt; nearly all searches of type &lt;EM&gt;datamodel_acceleration&lt;/EM&gt; have a status of &lt;EM&gt;skipped&lt;/EM&gt;.&lt;/P&gt;

&lt;P&gt;When I visualize how often populating searchs for a specific data model object are run successfully, I get only one successful search every 20 minutes:&lt;/P&gt;

&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper" image-alt="alt text"&gt;&lt;img src="https://community.splunk.com/t5/image/serverpage/image-id/3009i32226E71889FF2DB/image-size/large?v=v2&amp;amp;px=999" role="button" title="alt text" alt="alt text" /&gt;&lt;/span&gt;&lt;/P&gt;

&lt;P&gt;&lt;STRONG&gt;My question is:&lt;/STRONG&gt; how can the data model be 100% complete (and dashboards always show current data) when populating searches only run once every 20 minutes?&lt;/P&gt;

&lt;P&gt;I have observed this on Splunk Enterprise 6.5.3 on Windows (simple single-instance Splunk environment).&lt;/P&gt;

&lt;P&gt;&lt;STRONG&gt;Update 2017-06-15:&lt;/STRONG&gt; I tested Splunk Enterprise 6.6.0 on Linux and Splunk Enterprise 6.6.1 on Windows. The issue occurs on those versions &amp;amp; platforms, too.&lt;/P&gt;</description>
      <pubDate>Tue, 06 Jun 2017 00:17:32 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328068#M1461</guid>
      <dc:creator>helge</dc:creator>
      <dc:date>2017-06-06T00:17:32Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerated data model 100% complete even though most populating searches are skipped</title>
      <link>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328069#M1462</link>
      <description>&lt;P&gt;I assume that you either have scheduled real-time searches OR are using ITSI (which does so under the hood).  There is a bug in all versions of Splunk that "correctly" but misleadingly says that it is skipping the real-time search because a real-time search will never stop.  If it did crash for some reason, it would restart on the next cycle and NOT generate this misleading log and then every cycle after that complain.  This is a known problem and there is a &lt;CODE&gt;jira&lt;/CODE&gt; on it and it should be fixed soon.&lt;/P&gt;</description>
      <pubDate>Tue, 06 Jun 2017 05:12:17 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328069#M1462</guid>
      <dc:creator>woodcock</dc:creator>
      <dc:date>2017-06-06T05:12:17Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerated data model 100% complete even though most populating searches are skipped</title>
      <link>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328070#M1463</link>
      <description>&lt;P&gt;ITSI is not installed on that machine, and there are no realtime searches either. Our app does have scheduled historic searches, plus the accelerated data model.&lt;/P&gt;</description>
      <pubDate>Tue, 06 Jun 2017 13:05:13 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328070#M1463</guid>
      <dc:creator>helge</dc:creator>
      <dc:date>2017-06-06T13:05:13Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerated data model 100% complete even though most populating searches are skipped</title>
      <link>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328071#M1464</link>
      <description>&lt;P&gt;I would be more than happy to submit log files to support if that helps.&lt;/P&gt;</description>
      <pubDate>Tue, 06 Jun 2017 13:06:08 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328071#M1464</guid>
      <dc:creator>helge</dc:creator>
      <dc:date>2017-06-06T13:06:08Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerated data model 100% complete even though most populating searches are skipped</title>
      <link>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328072#M1465</link>
      <description>&lt;P&gt;Noticing the exact same behavior, with the exact same app (uberAgent).  The App developer has indicated that this is normal behavior and not to worry about it. For me, this is running on a staging environment and is under powered, but it almost seems like skipped data model accelerated searches are continually rescheduled every minute (or less) until they are successful.  Because each of the 28 data model objects is eventually successful within the 5 minute schedule that is defined for the App, but for every successful search, there can be anywhere from 0 to 30 skipped searches.&lt;/P&gt;

&lt;P&gt;Does anyone know if this is normal data model acceleration behavior?  And if so, what kind of impact would this have on other scheduled and adhoc searches?&lt;/P&gt;

&lt;P&gt;Trying to figure out if it is acceptable to move this into production - or if there is a problem with the App or our Splunk configuration.&lt;/P&gt;</description>
      <pubDate>Tue, 06 Jun 2017 18:14:57 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328072#M1465</guid>
      <dc:creator>itwebmaintenanc</dc:creator>
      <dc:date>2017-06-06T18:14:57Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerated data model 100% complete even though most populating searches are skipped</title>
      <link>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328073#M1466</link>
      <description>&lt;P&gt;I noticed the following fixed issues in the release notes for Splunk 6.6.3 - maybe they help with this?&lt;/P&gt;

&lt;UL&gt;
&lt;LI&gt;SPL-142801, SPL-142771: Only one root event search in a DM gets accelerated&lt;/LI&gt;
&lt;LI&gt;SPL-141887, SPL-141823: SearchParser Errors for Datamodel Acceleration prevents other scheduled searches from being executed&lt;/LI&gt;
&lt;/UL&gt;</description>
      <pubDate>Wed, 23 Aug 2017 11:27:13 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328073#M1466</guid>
      <dc:creator>helge</dc:creator>
      <dc:date>2017-08-23T11:27:13Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerated data model 100% complete even though most populating searches are skipped</title>
      <link>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328074#M1467</link>
      <description>&lt;P&gt;is it possible that there is only new data every 20 mins? I recall noticing that dm accelerations that I had running and which had no data, showed as skipped, which triggered me to think something was wrong, but after investigating I found it was only DMAs that had no events. &lt;/P&gt;</description>
      <pubDate>Wed, 23 Aug 2017 12:32:30 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328074#M1467</guid>
      <dc:creator>mattymo</dc:creator>
      <dc:date>2017-08-23T12:32:30Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerated data model 100% complete even though most populating searches are skipped</title>
      <link>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328075#M1468</link>
      <description>&lt;P&gt;Not really. Every endpoint sends at least a few dozen events twice a minute, and uberAgent is typically deployed to hundreds/thousands of endpoints.&lt;/P&gt;</description>
      <pubDate>Wed, 23 Aug 2017 12:34:45 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328075#M1468</guid>
      <dc:creator>helge</dc:creator>
      <dc:date>2017-08-23T12:34:45Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerated data model 100% complete even though most populating searches are skipped</title>
      <link>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328076#M1469</link>
      <description>&lt;P&gt;interesting. I also vaguely remember searches showing up as skipped because they were already running....I'd have to take a look at the app itself...&lt;/P&gt;</description>
      <pubDate>Wed, 23 Aug 2017 12:53:11 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328076#M1469</guid>
      <dc:creator>mattymo</dc:creator>
      <dc:date>2017-08-23T12:53:11Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerated data model 100% complete even though most populating searches are skipped</title>
      <link>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328077#M1470</link>
      <description>&lt;P&gt;In case you want to take a look, uberAgent is available here: &lt;A href="https://uberagent.com/download/"&gt;https://uberagent.com/download/&lt;/A&gt;&lt;BR /&gt;
If you have any questions please email &lt;A href="mailto:support@uberagent.com"&gt;support@uberagent.com&lt;/A&gt;. Thanks!&lt;/P&gt;</description>
      <pubDate>Wed, 23 Aug 2017 13:58:56 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328077#M1470</guid>
      <dc:creator>helge</dc:creator>
      <dc:date>2017-08-23T13:58:56Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerated data model 100% complete even though most populating searches are skipped</title>
      <link>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328078#M1471</link>
      <description>&lt;P&gt;Ok.  We spent some time debugging this at #conf2017. We observed that all of the "base event" searches within the data model required separate &lt;EM&gt;single&lt;/EM&gt; search processes (acceleration jobs) to accelerate &lt;EM&gt;that piece&lt;/EM&gt; of the model. Put another way, if there were 42 separate "root" searches within the model, attempting to accelerate the model would want to run 42 separate search jobs. However, due to the number of cores on my laptop (8), my resulting limit for the number of acceleration jobs was 3. Tracing this in the _audit log showed that three jobs started (and completed quickly) to attempt to accelerate the model. However, that meant that the models "turn" at accelerating was done for this scheduled slot, and the remaining 39 searches were "skipped" by the scheduler. On the next iteration, a different set of (3) searches from the root objects were chosen, so it feels like the models would eventually get a chance to accelerate. &lt;/P&gt;

&lt;P&gt;Finally, due to the "mixed mode" nature of &lt;CODE&gt;|pivot&lt;/CODE&gt; and &lt;CODE&gt;| tstats&lt;/CODE&gt; with their default arguments, any events that were in buckets not yet summarized would be searched ad-hoc, resulting in a "complete" result set, while not being fully accelerated, per se.&lt;/P&gt;

&lt;P&gt;The suggestion is to break the model into separate (possibly related) root searches, so that any given acceleration run can accelerate all of the child searches therein.&lt;/P&gt;</description>
      <pubDate>Wed, 27 Sep 2017 02:51:53 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328078#M1471</guid>
      <dc:creator>sowings</dc:creator>
      <dc:date>2017-09-27T02:51:53Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerated data model 100% complete even though most populating searches are skipped</title>
      <link>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328079#M1472</link>
      <description>&lt;P&gt;Thanks for your time, Sanford. After our discussion I noticed that I still do not understand the following: why is the data model showing a status of "100% completed" when it clearly is not?&lt;/P&gt;</description>
      <pubDate>Wed, 27 Sep 2017 15:22:13 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328079#M1472</guid>
      <dc:creator>helge</dc:creator>
      <dc:date>2017-09-27T15:22:13Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerated data model 100% complete even though most populating searches are skipped</title>
      <link>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328080#M1473</link>
      <description>&lt;P&gt;Hi @helge &lt;/P&gt;

&lt;P&gt;Was this ever resolved in later versions of splunk do you know ? &lt;/P&gt;

&lt;P&gt;We are using uberagent (with ITSI) on splunk 6.6.3 and seeing this skipped search 'feature' is there any updates to these discussions other than here ?&lt;/P&gt;

&lt;P&gt;Sample:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;Report Name App User    Cron Schedule   Schedule Interval (sec) Average Runtime (sec)   Interval Load Factor    Total Executions    Skipped Executions  Skip Ratio  Deferred Executions Average Execution Latency sec)
_ACCELERATE_DM_uberAgent_uberAgent.Citrix_Applications_ACCELERATE_  uberAgent   nobody          15  12744   12732   99.91 % 0   0
_ACCELERATE_DM_uberAgent_uberAgent.Logon_All_ACCELERATE_    uberAgent   nobody          15  12565   12553   99.90 % 0   0
_ACCELERATE_DM_uberAgent_uberAgent.Citrix_Databases_ACCELERATE_ uberAgent   nobody          5   12489   12477   99.90 % 0   0
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Fri, 01 Jun 2018 06:38:54 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328080#M1473</guid>
      <dc:creator>Esky73</dc:creator>
      <dc:date>2018-06-01T06:38:54Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerated data model 100% complete even though most populating searches are skipped</title>
      <link>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328081#M1474</link>
      <description>&lt;P&gt;To fix the issue of the high number of skipped data model acceleration searches Splunk added auto-skewing in Splunk 7.1. Auto-skewing needs to be enabled per data model. Once enabled, Splunk distributes the searches across the available time range instead of trying to run them all at the same time.&lt;/P&gt;

&lt;P&gt;To enable auto-skewing add the following to your &lt;CODE&gt;datamodels.conf&lt;/CODE&gt;:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;acceleration.allow_skew = 100%
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;More information on the &lt;A href="https://uberagent.com/blog/uberagent-5-0-1-splunk-7-1-data-model-acceleration-auto-skewing/"&gt;uberAgent blog&lt;/A&gt;.&lt;/P&gt;</description>
      <pubDate>Fri, 01 Jun 2018 15:55:57 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328081#M1474</guid>
      <dc:creator>helge</dc:creator>
      <dc:date>2018-06-01T15:55:57Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerated data model 100% complete even though most populating searches are skipped</title>
      <link>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328082#M1475</link>
      <description>&lt;P&gt;@Esky73 Please see the accepted answer which I just added to this question.&lt;/P&gt;</description>
      <pubDate>Fri, 01 Jun 2018 15:57:54 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328082#M1475</guid>
      <dc:creator>helge</dc:creator>
      <dc:date>2018-06-01T15:57:54Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerated data model 100% complete even though most populating searches are skipped</title>
      <link>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328083#M1476</link>
      <description>&lt;P&gt;thanks @helge&lt;/P&gt;</description>
      <pubDate>Sat, 02 Jun 2018 04:21:23 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/328083#M1476</guid>
      <dc:creator>Esky73</dc:creator>
      <dc:date>2018-06-02T04:21:23Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerated data model 100% complete even though most populating searches are skipped</title>
      <link>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/581574#M1477</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/120361"&gt;@helge&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;&lt;P&gt;I am facing the similar issue. Apart from adding this configuration in .conf, is there a way I can make this allow_skew changes from UI?&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Tue, 18 Jan 2022 23:34:46 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/581574#M1477</guid>
      <dc:creator>bsanjeeva</dc:creator>
      <dc:date>2022-01-18T23:34:46Z</dc:date>
    </item>
    <item>
      <title>Re: Accelerated data model 100% complete even though most populating searches are skipped</title>
      <link>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/581644#M1478</link>
      <description>&lt;P&gt;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/242111"&gt;@bsanjeeva&lt;/a&gt;I'm afraid I don't know if/how this can be configured via Splunk's UI.&lt;/P&gt;</description>
      <pubDate>Wed, 19 Jan 2022 14:11:09 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Accelerated-data-model-100-complete-even-though-most-populating/m-p/581644#M1478</guid>
      <dc:creator>helge</dc:creator>
      <dc:date>2022-01-19T14:11:09Z</dc:date>
    </item>
  </channel>
</rss>

