<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Summary index for non aggregated data: How to read only delta data each time? in Other Usage</title>
    <link>https://community.splunk.com/t5/Other-Usage/Summary-index-for-non-aggregated-data-How-to-read-only-delta/m-p/653378#M596</link>
    <description>&lt;P&gt;Hi,&lt;BR /&gt;&lt;BR /&gt;I'm working with a large amount of data.&lt;BR /&gt;I wrote a main report that extracts all events (let's call them events A,B,C,D) from the last 30 days and do some manipulations for fields.&lt;BR /&gt;And then i wrote 5 reports that filter the main saved report by events type and get only the relevant fields for each event:&lt;BR /&gt;For example- the report for event A contain all fields relevant for event A,&amp;nbsp;&lt;BR /&gt;report for event B contains all fields relevant for event B and etc.&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;My dashboard contains 5 tabs, one for each event (tab 1 for report A, tab 2 for report B,..), and triggers the relevant saved search report (reports A/B/C,..)&lt;BR /&gt;&lt;BR /&gt;Problems- all the reports run very slow&lt;BR /&gt;&lt;BR /&gt;My questions:&lt;BR /&gt;1. How to read only delta data each time? i mean, how to not read 30 days each time at once, if the query was already run today and i execute it one more time it should read only new data and use the history data that have already read in the previous run.&lt;BR /&gt;&lt;BR /&gt;2. i read a bit about summary index. my reports extract all fields and not aggregate data. how to create my 6 reports (main+5 others) with summary index? As i said, - i use table command and not functions like top,count,.. in my query (my reports just extract relevant fields with some naming manipulations)&lt;BR /&gt;&lt;BR /&gt;* in case that you would recommend to use summary index i will appreciate if you could provide me example code, because i have 6 reports and not sure how work with summary index&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;thanks,&lt;BR /&gt;Maayan&lt;/P&gt;</description>
    <pubDate>Mon, 07 Aug 2023 18:56:06 GMT</pubDate>
    <dc:creator>maayan</dc:creator>
    <dc:date>2023-08-07T18:56:06Z</dc:date>
    <item>
      <title>Summary index for non aggregated data: How to read only delta data each time?</title>
      <link>https://community.splunk.com/t5/Other-Usage/Summary-index-for-non-aggregated-data-How-to-read-only-delta/m-p/653378#M596</link>
      <description>&lt;P&gt;Hi,&lt;BR /&gt;&lt;BR /&gt;I'm working with a large amount of data.&lt;BR /&gt;I wrote a main report that extracts all events (let's call them events A,B,C,D) from the last 30 days and do some manipulations for fields.&lt;BR /&gt;And then i wrote 5 reports that filter the main saved report by events type and get only the relevant fields for each event:&lt;BR /&gt;For example- the report for event A contain all fields relevant for event A,&amp;nbsp;&lt;BR /&gt;report for event B contains all fields relevant for event B and etc.&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;My dashboard contains 5 tabs, one for each event (tab 1 for report A, tab 2 for report B,..), and triggers the relevant saved search report (reports A/B/C,..)&lt;BR /&gt;&lt;BR /&gt;Problems- all the reports run very slow&lt;BR /&gt;&lt;BR /&gt;My questions:&lt;BR /&gt;1. How to read only delta data each time? i mean, how to not read 30 days each time at once, if the query was already run today and i execute it one more time it should read only new data and use the history data that have already read in the previous run.&lt;BR /&gt;&lt;BR /&gt;2. i read a bit about summary index. my reports extract all fields and not aggregate data. how to create my 6 reports (main+5 others) with summary index? As i said, - i use table command and not functions like top,count,.. in my query (my reports just extract relevant fields with some naming manipulations)&lt;BR /&gt;&lt;BR /&gt;* in case that you would recommend to use summary index i will appreciate if you could provide me example code, because i have 6 reports and not sure how work with summary index&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;thanks,&lt;BR /&gt;Maayan&lt;/P&gt;</description>
      <pubDate>Mon, 07 Aug 2023 18:56:06 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Summary-index-for-non-aggregated-data-How-to-read-only-delta/m-p/653378#M596</guid>
      <dc:creator>maayan</dc:creator>
      <dc:date>2023-08-07T18:56:06Z</dc:date>
    </item>
    <item>
      <title>Re: Summary index for non aggregated data</title>
      <link>https://community.splunk.com/t5/Other-Usage/Summary-index-for-non-aggregated-data-How-to-read-only-delta/m-p/653382#M597</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/256320"&gt;@maayan&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;yes the solutions could be summary indexes or data models.&lt;/P&gt;&lt;P&gt;In bothe cases, you have to schedule a search, e.g. for&amp;nbsp; report 1:&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;index=index1
| table _time field 1 field2, field3
| collect index=summary1&lt;/LI-CODE&gt;&lt;P&gt;the frequency dependa on your requirements.&lt;/P&gt;&lt;P&gt;then you can run a search on this index, e.g. calculate sum of field 2 for each field1:&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;index=summary1
| stats sum(field2) AS field2_sum BY field1&lt;/LI-CODE&gt;&lt;P&gt;Ciao.&lt;/P&gt;&lt;P&gt;Giuseppe&lt;/P&gt;</description>
      <pubDate>Mon, 07 Aug 2023 09:17:53 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Summary-index-for-non-aggregated-data-How-to-read-only-delta/m-p/653382#M597</guid>
      <dc:creator>gcusello</dc:creator>
      <dc:date>2023-08-07T09:17:53Z</dc:date>
    </item>
    <item>
      <title>Re: Summary index for non aggregated data</title>
      <link>https://community.splunk.com/t5/Other-Usage/Summary-index-for-non-aggregated-data-How-to-read-only-delta/m-p/653393#M598</link>
      <description>&lt;P&gt;ok, I will try,thanks!&lt;BR /&gt;&lt;BR /&gt;And regarding my first question, is it something that I can do in Splunk? (read delta data)&lt;BR /&gt;&lt;BR /&gt;And which method do you recommend to use in my case? data model or summary index?&lt;BR /&gt;&lt;BR /&gt;thanks&lt;/P&gt;</description>
      <pubDate>Mon, 07 Aug 2023 10:48:59 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Summary-index-for-non-aggregated-data-How-to-read-only-delta/m-p/653393#M598</guid>
      <dc:creator>maayan</dc:creator>
      <dc:date>2023-08-07T10:48:59Z</dc:date>
    </item>
    <item>
      <title>Re: Summary index for non aggregated data</title>
      <link>https://community.splunk.com/t5/Other-Usage/Summary-index-for-non-aggregated-data-How-to-read-only-delta/m-p/653394#M599</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/256320"&gt;@maayan&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;yes, you can calculate delta, global and partial sum, etc...&lt;/P&gt;&lt;P&gt;the main job is building the scheduled search to extract the requested data.&lt;/P&gt;&lt;P&gt;in my opinion, I'd use summary index, scheduling the population search with the frequency you need (e.g. every month or every night.&lt;/P&gt;&lt;P&gt;Ciao.&lt;/P&gt;&lt;P&gt;Giuseppe&lt;/P&gt;</description>
      <pubDate>Mon, 07 Aug 2023 11:03:42 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Summary-index-for-non-aggregated-data-How-to-read-only-delta/m-p/653394#M599</guid>
      <dc:creator>gcusello</dc:creator>
      <dc:date>2023-08-07T11:03:42Z</dc:date>
    </item>
    <item>
      <title>Re: Summary index for non aggregated data</title>
      <link>https://community.splunk.com/t5/Other-Usage/Summary-index-for-non-aggregated-data-How-to-read-only-delta/m-p/653550#M600</link>
      <description>&lt;P&gt;Thanks&amp;nbsp;Gcusello!&lt;BR /&gt;Can you explain more about: "&lt;SPAN&gt;you can calculate delta, global and partial sum, etc..."&amp;nbsp; ?&lt;BR /&gt;I didn't find documentation and also asked in other communities and nobody knows.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 08 Aug 2023 09:00:56 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Summary-index-for-non-aggregated-data-How-to-read-only-delta/m-p/653550#M600</guid>
      <dc:creator>maayan</dc:creator>
      <dc:date>2023-08-08T09:00:56Z</dc:date>
    </item>
    <item>
      <title>Re: Summary index for non aggregated data</title>
      <link>https://community.splunk.com/t5/Other-Usage/Summary-index-for-non-aggregated-data-How-to-read-only-delta/m-p/654177#M601</link>
      <description>&lt;P&gt;Hi, i tried to implement the summary index as you suggested but i had a problem to extract the original fields from the main query. i read that i might use stats and stats. i posted a new post. maybe you can help. thanks&lt;/P&gt;</description>
      <pubDate>Sun, 13 Aug 2023 09:36:28 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Summary-index-for-non-aggregated-data-How-to-read-only-delta/m-p/654177#M601</guid>
      <dc:creator>maayan</dc:creator>
      <dc:date>2023-08-13T09:36:28Z</dc:date>
    </item>
    <item>
      <title>Re: Summary index for non aggregated data</title>
      <link>https://community.splunk.com/t5/Other-Usage/Summary-index-for-non-aggregated-data-How-to-read-only-delta/m-p/654198#M602</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/256320"&gt;@maayan&lt;/a&gt;,&lt;/P&gt;&lt;P&gt;it all depends on the data you have (that I don't know), so e.g. if field1 is the hostname and field2 is the CPU utilization, you save with the scheduled search the CPU utilization min, max and avg day by day.&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;index=index1
| stats 
   min(CPU) AS min_CPU 
   max(CPU) AS max_CPU 
   avg(CPU) AS avg_CPU 
   BY host
| collect index=summary1&lt;/LI-CODE&gt;&lt;P&gt;then you can calculate (using the normal commands as stats or timechart) the max, the avg and the min in a month&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;index=summary1
| stats 
   min(min_CPU) AS min_CPU 
   max(max_CPU) AS max_CPU 
   avg(avg_CPU) AS avg_CPU 
   BY host&lt;/LI-CODE&gt;&lt;P&gt;As I said, it depends on the data that you added to you summary index.&lt;/P&gt;&lt;P&gt;Ciao.&lt;/P&gt;&lt;P&gt;Giuseppe&lt;/P&gt;</description>
      <pubDate>Sun, 13 Aug 2023 14:25:28 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Other-Usage/Summary-index-for-non-aggregated-data-How-to-read-only-delta/m-p/654198#M602</guid>
      <dc:creator>gcusello</dc:creator>
      <dc:date>2023-08-13T14:25:28Z</dc:date>
    </item>
  </channel>
</rss>

