<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: indexes.conf recommendation for large volume of data per day in 1index in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/indexes-conf-recommendation-for-large-volume-of-data-per-day-in/m-p/529573#M89211</link>
    <description>&lt;P&gt;See&amp;nbsp;&lt;A href="https://docs.splunk.com/Documentation/Splunk/8.1.0/Capacity/Summaryofperformancerecommendations" target="_blank"&gt;https://docs.splunk.com/Documentation/Splunk/8.1.0/Capacity/Summaryofperformancerecommendations&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Mon, 16 Nov 2020 16:39:50 GMT</pubDate>
    <dc:creator>richgalloway</dc:creator>
    <dc:date>2020-11-16T16:39:50Z</dc:date>
    <item>
      <title>indexes.conf recommendation for large volume of data per day in 1index</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/indexes-conf-recommendation-for-large-volume-of-data-per-day-in/m-p/529393#M89185</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I am looking for any recommendation when&amp;nbsp; dealing with such scenario. On one instance or one indexer, 300-400GB of data per day in a single index. Is there any recommended configuration for such index?&lt;/P&gt;&lt;P&gt;So far, I came with few changes:&lt;/P&gt;&lt;P&gt;-&amp;nbsp; increase&amp;nbsp;&lt;SPAN&gt;maxTotalDataSizeMB to go beyond&amp;nbsp;the 500GB default and meet my requirement in term of retention.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;- Enable &lt;/SPAN&gt;&lt;FONT face="inherit"&gt;maxDataSize&amp;nbsp;=&amp;nbsp;auto_high_volume (host &lt;/FONT&gt;bucket&lt;FONT face="inherit"&gt;&amp;nbsp;to 10GB)&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT face="inherit"&gt;What I am &lt;/FONT&gt;considering&lt;FONT face="inherit"&gt;&amp;nbsp;is to increase the &lt;/FONT&gt;number&lt;FONT face="inherit"&gt;&amp;nbsp;of hot bucket, cause with 6 Hot &lt;/FONT&gt;bucket&lt;FONT face="inherit"&gt;&amp;nbsp;by default, it is only 60GB of data which is not even 24 Hours of data. Should I increase it ? or should I only increase the number of warm bucket? or both?&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;Are warm buckets also 10GB each? If my disk capacity and performance allows it, can I keep only warm bucket for my max retention (30 days) and dont use Cold?&lt;/P&gt;&lt;P&gt;Any advice or feedbacks with this type of scenario?&lt;/P&gt;&lt;P&gt;thanks&lt;/P&gt;&lt;P&gt;/Fabien&lt;/P&gt;</description>
      <pubDate>Sat, 14 Nov 2020 02:33:19 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/indexes-conf-recommendation-for-large-volume-of-data-per-day-in/m-p/529393#M89185</guid>
      <dc:creator>fgu</dc:creator>
      <dc:date>2020-11-14T02:33:19Z</dc:date>
    </item>
    <item>
      <title>Re: indexes.conf recommendation for large volume of data per day in 1index</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/indexes-conf-recommendation-for-large-volume-of-data-per-day-in/m-p/529422#M89187</link>
      <description>&lt;P&gt;400GB to a single indexer is too much, especially if you want to search, too.&amp;nbsp; Splunk recommends 100GB per indexer, although you may be able to get away with 200GB / indexer.&amp;nbsp; Using multiple indexers will improve indexing and search performance.&lt;/P&gt;&lt;P&gt;Why put all that data into a single index?&amp;nbsp; Is it all from the same source/sourcetype or all related somehow?&amp;nbsp; If not, split it into separate indexes.&amp;nbsp; Having all your data in a single large index may seem convenient, but searching through so much data will be slower than searching smaller indexes.&lt;/P&gt;&lt;P&gt;Warm buckets are the exact same size as they were when they were hot.&amp;nbsp; The change from hot to warm is just a rename.&lt;/P&gt;&lt;P&gt;Yes, you can keep only warm buckets, if you choose.&amp;nbsp; There's no harm in using cold buckets, however, if the storage media has the same performance as for warm buckets.&amp;nbsp; .&lt;/P&gt;</description>
      <pubDate>Sat, 14 Nov 2020 19:01:30 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/indexes-conf-recommendation-for-large-volume-of-data-per-day-in/m-p/529422#M89187</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2020-11-14T19:01:30Z</dc:date>
    </item>
    <item>
      <title>Re: indexes.conf recommendation for large volume of data per day in 1index</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/indexes-conf-recommendation-for-large-volume-of-data-per-day-in/m-p/529446#M89191</link>
      <description>I agree with &lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/213957"&gt;@richgalloway&lt;/a&gt;, 400GB for individual indexers per daily is too much. With ES 100GB is absolutely max GB/day/indexer, with pure splunk enterprise that could be 150GB but not more. Of course it depends how much you are defining inputs and how much splunk needs to guess with events.&lt;BR /&gt;&lt;BR /&gt;I also prefer separate indexes not only for access and retention, but also for search profile define which data should put together and which needs separation.&lt;BR /&gt;&lt;BR /&gt;This &lt;A href="https://community.splunk.com/t5/Getting-Data-In/What-is-the-disadvantage-of-having-a-lot-of-small-buckets-and/m-p/258474#M49643" target="_blank"&gt;https://community.splunk.com/t5/Getting-Data-In/What-is-the-disadvantage-of-having-a-lot-of-small-buckets-and/m-p/258474#M49643&lt;/A&gt; probably gives you more thinking?&lt;BR /&gt;&lt;BR /&gt;r. Ismo</description>
      <pubDate>Sun, 15 Nov 2020 10:24:52 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/indexes-conf-recommendation-for-large-volume-of-data-per-day-in/m-p/529446#M89191</guid>
      <dc:creator>isoutamo</dc:creator>
      <dc:date>2020-11-15T10:24:52Z</dc:date>
    </item>
    <item>
      <title>Re: indexes.conf recommendation for large volume of data per day in 1index</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/indexes-conf-recommendation-for-large-volume-of-data-per-day-in/m-p/529566#M89209</link>
      <description>&lt;P&gt;Thank you guys for taking the time to share your insight! that is helpful.&amp;nbsp;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;The data is coming from 1 source. This is why it is in a single index. It is using also the same type of storage (local SSD), that is why I was considering keeping as many warm bucket as I can.&lt;/P&gt;&lt;P&gt;About the volume of data per indexer, I am much (much!) higher that what you recommend.&amp;nbsp; I thought it was Ok cause of the spec of the server I am using: (48c/96T cpu @2 .3 Ghz, 128GB of RAM, 5TB local SSD in RAID10). Would you recommend to&amp;nbsp; keep index volume low even with this type of configuration? Is there a rule of thumb I can follow to find out what the right volume of data per day based on HW specs?&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;thanks&lt;/P&gt;&lt;P&gt;/Fabien&lt;/P&gt;</description>
      <pubDate>Mon, 16 Nov 2020 15:59:31 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/indexes-conf-recommendation-for-large-volume-of-data-per-day-in/m-p/529566#M89209</guid>
      <dc:creator>fgu</dc:creator>
      <dc:date>2020-11-16T15:59:31Z</dc:date>
    </item>
    <item>
      <title>Re: indexes.conf recommendation for large volume of data per day in 1index</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/indexes-conf-recommendation-for-large-volume-of-data-per-day-in/m-p/529573#M89211</link>
      <description>&lt;P&gt;See&amp;nbsp;&lt;A href="https://docs.splunk.com/Documentation/Splunk/8.1.0/Capacity/Summaryofperformancerecommendations" target="_blank"&gt;https://docs.splunk.com/Documentation/Splunk/8.1.0/Capacity/Summaryofperformancerecommendations&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 16 Nov 2020 16:39:50 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/indexes-conf-recommendation-for-large-volume-of-data-per-day-in/m-p/529573#M89211</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2020-11-16T16:39:50Z</dc:date>
    </item>
  </channel>
</rss>

