<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Incorrect results when using  PERC with TSTATS? in Splunk Search</title>
    <link>https://community.splunk.com/t5/Splunk-Search/Incorrect-results-when-using-PERC-with-TSTATS/m-p/169031#M48257</link>
    <description>&lt;P&gt;Thanks to Brian M. at Splunk for pointing me to this answers post&lt;BR /&gt;
&lt;A href="http://answers.splunk.com/answers/44336/percentile-implementation.html"&gt;http://answers.splunk.com/answers/44336/percentile-implementation.html&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;After reading the answers link it seems like my raw data search, which is using perc99, is trying to process a percentile for more than 1000 distinct values, which will then use an algorithm to approximate the final percentile values.&lt;/P&gt;

&lt;P&gt;The accelerated data model that was created for the same dataset, when ran over the same time frame, will not approximate the final percentile values and give me an exact percentile, even though I am still using perc99.  &lt;/P&gt;

&lt;P&gt;This explains when I was  comparing results they were just slightly off from each other&lt;/P&gt;

&lt;P&gt;I was able to confirm this by using exactperc99 in my raw search, and then used perc99 in my data model search.  The result set came out to be identical!&lt;/P&gt;

&lt;P&gt;Being new to data models I wanted to verify 100% I was getting the same results as the raw search, so it was a bit concerning to see the differences in the results.  However I can understand there are HUGE differences between data models and raw data, and the limitations for doing calculations between both can be very different.  It would be helpful to know at what point data models will use this approximation approach to percentiles.&lt;/P&gt;</description>
    <pubDate>Thu, 16 Oct 2014 23:20:32 GMT</pubDate>
    <dc:creator>cramasta</dc:creator>
    <dc:date>2014-10-16T23:20:32Z</dc:date>
    <item>
      <title>Incorrect results when using  PERC with TSTATS?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Incorrect-results-when-using-PERC-with-TSTATS/m-p/169029#M48255</link>
      <description>&lt;P&gt;Has anyone had any luck using PERC with TSTATS on a tsidx file created from data model?&lt;/P&gt;

&lt;P&gt;here is my tstats search&lt;/P&gt;

&lt;P&gt;| tstats PERC90("PerformanceMetricBaseSearch.duration")  AS count from datamodel="PerformanceMetrics" where (nodename="PerformanceMetricBaseSearch")  groupby "PerformanceMetricBaseSearch.ownerClass" "_time"   span=1m  | eval "ownerClass"='PerformanceMetricBaseSearch.ownerClass' | timechart span=1m perc90(count) by ownerClass limit=100&lt;/P&gt;

&lt;P&gt;here is the equivalent regular search &lt;/P&gt;

&lt;P&gt;index=perf PerformanceMetric | timechart span=1m PERC90(duration) by ownerClass limit=100&lt;/P&gt;

&lt;P&gt;When i compare the timecharts in a line chart they look almost the same however Im finding the values returned from tstats are always a bit higher.&lt;/P&gt;

&lt;P&gt;If i run the same two searches and change perc90 to avg I get the exact same result set.&lt;/P&gt;

&lt;P&gt;im on 6.0.3&lt;/P&gt;</description>
      <pubDate>Thu, 16 Oct 2014 16:41:40 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Incorrect-results-when-using-PERC-with-TSTATS/m-p/169029#M48255</guid>
      <dc:creator>cramasta</dc:creator>
      <dc:date>2014-10-16T16:41:40Z</dc:date>
    </item>
    <item>
      <title>Re: Incorrect results when using  PERC with TSTATS?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Incorrect-results-when-using-PERC-with-TSTATS/m-p/169030#M48256</link>
      <description>&lt;P&gt;This is not just limited to timechart. Stats is doing the same thing&lt;/P&gt;</description>
      <pubDate>Thu, 16 Oct 2014 17:11:19 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Incorrect-results-when-using-PERC-with-TSTATS/m-p/169030#M48256</guid>
      <dc:creator>cramasta</dc:creator>
      <dc:date>2014-10-16T17:11:19Z</dc:date>
    </item>
    <item>
      <title>Re: Incorrect results when using  PERC with TSTATS?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Incorrect-results-when-using-PERC-with-TSTATS/m-p/169031#M48257</link>
      <description>&lt;P&gt;Thanks to Brian M. at Splunk for pointing me to this answers post&lt;BR /&gt;
&lt;A href="http://answers.splunk.com/answers/44336/percentile-implementation.html"&gt;http://answers.splunk.com/answers/44336/percentile-implementation.html&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;After reading the answers link it seems like my raw data search, which is using perc99, is trying to process a percentile for more than 1000 distinct values, which will then use an algorithm to approximate the final percentile values.&lt;/P&gt;

&lt;P&gt;The accelerated data model that was created for the same dataset, when ran over the same time frame, will not approximate the final percentile values and give me an exact percentile, even though I am still using perc99.  &lt;/P&gt;

&lt;P&gt;This explains when I was  comparing results they were just slightly off from each other&lt;/P&gt;

&lt;P&gt;I was able to confirm this by using exactperc99 in my raw search, and then used perc99 in my data model search.  The result set came out to be identical!&lt;/P&gt;

&lt;P&gt;Being new to data models I wanted to verify 100% I was getting the same results as the raw search, so it was a bit concerning to see the differences in the results.  However I can understand there are HUGE differences between data models and raw data, and the limitations for doing calculations between both can be very different.  It would be helpful to know at what point data models will use this approximation approach to percentiles.&lt;/P&gt;</description>
      <pubDate>Thu, 16 Oct 2014 23:20:32 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Incorrect-results-when-using-PERC-with-TSTATS/m-p/169031#M48257</guid>
      <dc:creator>cramasta</dc:creator>
      <dc:date>2014-10-16T23:20:32Z</dc:date>
    </item>
    <item>
      <title>Re: Incorrect results when using  PERC with TSTATS?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Incorrect-results-when-using-PERC-with-TSTATS/m-p/169032#M48258</link>
      <description>&lt;P&gt;Something I am finding now is that when using PERC for tstats, i start to see all the perc results being populated, but as soon as the search gets half way through all the perc fields disappear.  Could this be a issue when the dataset gets too large?&lt;/P&gt;

&lt;P&gt;If i use the exactperc function the results remain. Seems like there might be a issue with the perc function working with TSTATS.&lt;/P&gt;</description>
      <pubDate>Fri, 17 Oct 2014 21:40:58 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Incorrect-results-when-using-PERC-with-TSTATS/m-p/169032#M48258</guid>
      <dc:creator>cramasta</dc:creator>
      <dc:date>2014-10-17T21:40:58Z</dc:date>
    </item>
  </channel>
</rss>

