<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Load difference Indexed real-time search versus  real-time search in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Load-difference-Indexed-real-time-search-versus-real-time-search/m-p/359843#M93194</link>
    <description>&lt;P&gt;Indexed real-time reads the disk instead of watching the data flow between indexing steps in memory. When a search is running though, it consumes a memory core, regardless of whether it is a real-time search or an indexed real-time search or a historical search. You can set different limits on the number of concurrent searches by type (in limits.conf) - but using indexed real-time does not inherently increase the number of possible parallel searches.&lt;/P&gt;

&lt;P&gt;While running, an indexed real-time search performs like a historical (normal, not real-time) search at any point in time, in terms of memory use and cpu workload. The difference is that an indexed real-time search never &lt;EM&gt;finishes&lt;/EM&gt; reading the data, as it is always waiting for more data to arrive on disk. However, because it uses the normal OS I/O mechanisms, an indexed realtime search can take advantage of disk caching, concurrent reads, etc.,  making it more efficient than a real-time search.&lt;/P&gt;

&lt;P&gt;A real-time search monitors memory and tends to slow down the indexing of incoming data, which can lead to other problems.&lt;/P&gt;

&lt;P&gt;However, these differences occur on the indexers - where the data is being read and indexed - and not really on the search heads (AFAIK). So you might think, "this isn't going to affect my search head at all" - but it does. All searches essentially compete for time on the indexers, so when a less efficient search runs (like any real-time search), it can affect the throughput of all searches running on the search head. &lt;/P&gt;

&lt;P&gt;AFAIK, there are no hard numbers, as the cost/impact will depend on the workload mix (indexing vs. searching) on the indexers as well as the workload (number, size, and types of searches) running on the search head. I think the effects will be more obvious with a higher system load; for example, systems with unused cores might show little difference.&lt;/P&gt;

&lt;P&gt;But even though there are no hard numbers, I think it means a lot when Splunk says "All real-time searches in Splunk Enterprise Security use the indexed real-time setting to improve indexing performance" in the Enterprise Security Installation and Upgrade Manual under &lt;A href="http://docs.splunk.com/Documentation/ES/5.0.0/Install/DeploymentPlanning"&gt;Deployment Planning&lt;/A&gt;. But notice it says "indexing performance" not "search performance." I think your main performance improvement happens by not bogging down the indexers, which in turn will make the searches run faster on the search head.&lt;/P&gt;

&lt;P&gt;Hope this helps! You might want to benchmark this a bit in your own environment. &lt;/P&gt;

&lt;P&gt;I know this answer is not definitive and I would love to hear more from others...&lt;/P&gt;</description>
    <pubDate>Thu, 22 Mar 2018 00:01:51 GMT</pubDate>
    <dc:creator>lguinn2</dc:creator>
    <dc:date>2018-03-22T00:01:51Z</dc:date>
    <item>
      <title>Load difference Indexed real-time search versus  real-time search</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Load-difference-Indexed-real-time-search-versus-real-time-search/m-p/359842#M93193</link>
      <description>&lt;P&gt;Has anyone real world experience on the difference in the load on a search head if a real time search is executed as Indexed real-time search.&lt;BR /&gt;
Of course the number of possible parallel searchess should increase, but how much load do this searches generate.&lt;/P&gt;</description>
      <pubDate>Tue, 20 Mar 2018 07:48:45 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Load-difference-Indexed-real-time-search-versus-real-time-search/m-p/359842#M93193</guid>
      <dc:creator>FritzWittwer_ol</dc:creator>
      <dc:date>2018-03-20T07:48:45Z</dc:date>
    </item>
    <item>
      <title>Re: Load difference Indexed real-time search versus  real-time search</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Load-difference-Indexed-real-time-search-versus-real-time-search/m-p/359843#M93194</link>
      <description>&lt;P&gt;Indexed real-time reads the disk instead of watching the data flow between indexing steps in memory. When a search is running though, it consumes a memory core, regardless of whether it is a real-time search or an indexed real-time search or a historical search. You can set different limits on the number of concurrent searches by type (in limits.conf) - but using indexed real-time does not inherently increase the number of possible parallel searches.&lt;/P&gt;

&lt;P&gt;While running, an indexed real-time search performs like a historical (normal, not real-time) search at any point in time, in terms of memory use and cpu workload. The difference is that an indexed real-time search never &lt;EM&gt;finishes&lt;/EM&gt; reading the data, as it is always waiting for more data to arrive on disk. However, because it uses the normal OS I/O mechanisms, an indexed realtime search can take advantage of disk caching, concurrent reads, etc.,  making it more efficient than a real-time search.&lt;/P&gt;

&lt;P&gt;A real-time search monitors memory and tends to slow down the indexing of incoming data, which can lead to other problems.&lt;/P&gt;

&lt;P&gt;However, these differences occur on the indexers - where the data is being read and indexed - and not really on the search heads (AFAIK). So you might think, "this isn't going to affect my search head at all" - but it does. All searches essentially compete for time on the indexers, so when a less efficient search runs (like any real-time search), it can affect the throughput of all searches running on the search head. &lt;/P&gt;

&lt;P&gt;AFAIK, there are no hard numbers, as the cost/impact will depend on the workload mix (indexing vs. searching) on the indexers as well as the workload (number, size, and types of searches) running on the search head. I think the effects will be more obvious with a higher system load; for example, systems with unused cores might show little difference.&lt;/P&gt;

&lt;P&gt;But even though there are no hard numbers, I think it means a lot when Splunk says "All real-time searches in Splunk Enterprise Security use the indexed real-time setting to improve indexing performance" in the Enterprise Security Installation and Upgrade Manual under &lt;A href="http://docs.splunk.com/Documentation/ES/5.0.0/Install/DeploymentPlanning"&gt;Deployment Planning&lt;/A&gt;. But notice it says "indexing performance" not "search performance." I think your main performance improvement happens by not bogging down the indexers, which in turn will make the searches run faster on the search head.&lt;/P&gt;

&lt;P&gt;Hope this helps! You might want to benchmark this a bit in your own environment. &lt;/P&gt;

&lt;P&gt;I know this answer is not definitive and I would love to hear more from others...&lt;/P&gt;</description>
      <pubDate>Thu, 22 Mar 2018 00:01:51 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Load-difference-Indexed-real-time-search-versus-real-time-search/m-p/359843#M93194</guid>
      <dc:creator>lguinn2</dc:creator>
      <dc:date>2018-03-22T00:01:51Z</dc:date>
    </item>
    <item>
      <title>Re: Load difference Indexed real-time search versus  real-time search</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Load-difference-Indexed-real-time-search-versus-real-time-search/m-p/359844#M93195</link>
      <description>&lt;P&gt;closer and closer, as I am searching to understand the difference between real-time search and continuous on tsidx(using tstats on summariesonly=true, so basically on already indexed and accelersted data)&lt;BR /&gt;
the terms themselves zi find very counterintuitive ...continuous and rela-time,but that is another topic.&lt;BR /&gt;
back to some examples as they always help:&lt;BR /&gt;
one rule looking for same event and if it happens x times in an interval of y minutes to have an alert;let’s call this a fail event so I need x fails in y minutes and no succes event inside this 5 minutes and between the fails.&lt;BR /&gt;
let’s say I want to put as less pressure on system as possible and I look on acceler data model ;I want to be sure I loose no event from reading so then what should I choose? &lt;BR /&gt;
1.a real-time search which let’s say looks back 1hour and runs every minute and I can set inside search count by _time span=5min&lt;BR /&gt;
2.a continous search which looks back five minutes and runs every minute with the count by _time span=5min ;&lt;BR /&gt;
advantage for 1, as I am looking as this is not critical app and I want to offer other searches more space and time ,is that if it can not run it has 59 possible fails/canceled runs and it still can see the events I described above&lt;BR /&gt;
if I apply continuous I understand the run will not be ever csnceled so it will fight with other searches untill the moment when whole system can have delayed searches.&lt;/P&gt;

&lt;P&gt;Very confusing over all, looking fwd for an answer&lt;BR /&gt;
thank you&lt;/P&gt;</description>
      <pubDate>Sat, 23 Feb 2019 17:28:21 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Load-difference-Indexed-real-time-search-versus-real-time-search/m-p/359844#M93195</guid>
      <dc:creator>printul77700</dc:creator>
      <dc:date>2019-02-23T17:28:21Z</dc:date>
    </item>
  </channel>
</rss>

