<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Event sampling and Count - accuracy? in Splunk Search</title>
    <link>https://community.splunk.com/t5/Splunk-Search/Event-sampling-and-Count-accuracy/m-p/614874#M213682</link>
    <description>&lt;P&gt;I have a need for approximate statistics/metrics and am currently using Event Sampling, which drastically speeds up the queries. For queries that calculate averages this works great, but I also have a need to do counts. If you set the Event Sampling to for example 1:100, then Splunk seems to look at every 100 Events, which is also reflected in how many Events that are matched when doing 1:100 vs 1:1.&lt;/P&gt;&lt;P&gt;Example count with and without sampling:&lt;/P&gt;&lt;P&gt;1:100 = 26311&lt;BR /&gt;1:1 = 2623658&lt;BR /&gt;1:100 scaled up to 1:1 = 2631100&lt;BR /&gt;Diff = 7442, which is 0.3%&lt;/P&gt;&lt;P&gt;The Time Period was a previous hour (not the latest hour) as not to have incoming Events affect the Count.&lt;/P&gt;&lt;P&gt;0.3% difference is perfectly ok for my purpose.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Am I thinking of this correctly, or is there any risk of much bigger differences in Count (after upscaling the count)?&lt;/P&gt;</description>
    <pubDate>Wed, 28 Sep 2022 06:59:06 GMT</pubDate>
    <dc:creator>dmoberg</dc:creator>
    <dc:date>2022-09-28T06:59:06Z</dc:date>
    <item>
      <title>Event sampling and Count - accuracy?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Event-sampling-and-Count-accuracy/m-p/614874#M213682</link>
      <description>&lt;P&gt;I have a need for approximate statistics/metrics and am currently using Event Sampling, which drastically speeds up the queries. For queries that calculate averages this works great, but I also have a need to do counts. If you set the Event Sampling to for example 1:100, then Splunk seems to look at every 100 Events, which is also reflected in how many Events that are matched when doing 1:100 vs 1:1.&lt;/P&gt;&lt;P&gt;Example count with and without sampling:&lt;/P&gt;&lt;P&gt;1:100 = 26311&lt;BR /&gt;1:1 = 2623658&lt;BR /&gt;1:100 scaled up to 1:1 = 2631100&lt;BR /&gt;Diff = 7442, which is 0.3%&lt;/P&gt;&lt;P&gt;The Time Period was a previous hour (not the latest hour) as not to have incoming Events affect the Count.&lt;/P&gt;&lt;P&gt;0.3% difference is perfectly ok for my purpose.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Am I thinking of this correctly, or is there any risk of much bigger differences in Count (after upscaling the count)?&lt;/P&gt;</description>
      <pubDate>Wed, 28 Sep 2022 06:59:06 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Event-sampling-and-Count-accuracy/m-p/614874#M213682</guid>
      <dc:creator>dmoberg</dc:creator>
      <dc:date>2022-09-28T06:59:06Z</dc:date>
    </item>
    <item>
      <title>Re: Event sampling and Count - accuracy?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Event-sampling-and-Count-accuracy/m-p/615235#M213808</link>
      <description>&lt;P&gt;I think you're correct, at least with regard to event counts.&lt;/P&gt;</description>
      <pubDate>Thu, 29 Sep 2022 15:23:36 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Event-sampling-and-Count-accuracy/m-p/615235#M213808</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2022-09-29T15:23:36Z</dc:date>
    </item>
  </channel>
</rss>

