<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Splunk data retention in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-retention/m-p/743711#M118136</link>
    <description>&lt;P&gt;One small correction. With smartstore there is no separate warm/cold storage. A bucket is getting uploaded to remote storage and is being cached locally if needed but it doesn't go through warm-&amp;gt;cold lifecycle.&lt;/P&gt;&lt;P&gt;It's also worth noting that with some use cases (especially when you often work with searches covering a significant portion of your remote storage which turns out to be way over your local storage) you might get a significant performance hit because you're effectively not caching anything locally.&lt;/P&gt;</description>
    <pubDate>Mon, 07 Apr 2025 18:59:02 GMT</pubDate>
    <dc:creator>PickleRick</dc:creator>
    <dc:date>2025-04-07T18:59:02Z</dc:date>
    <item>
      <title>Splunk data retention</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-retention/m-p/743704#M118133</link>
      <description>&lt;P&gt;I was newly aligned into a project and didn't have proper KT from the left ones. I have queries regarding my current architecture and configurations and I am not well versed with advanced admin concepts. Please help me in these queries:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;We have 6 indexers (hosted on AWS cloud as EC2 but not Splunk cloud) with 6.9TB disk storage and 1.5GB/day license. Is this ok? I am checking for retention period but nowhere set with frozentimeperiodinsecs or maxTotalDataSizeMB in local. But in default it is there... I am also looking whether archival location is set or not.&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;indexes.conf in Cluster Manager:&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;[new_index]&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;homePath&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;volume:primary/$_index_name/db&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;coldPath&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;volume:primary/$_index_name/colddb&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;thawedPath&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;$SPLUNK_DB/$_index_name/thaweddb&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;volumes indexes.conf:&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;[volume:primary]&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;path&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;$SPLUNK_DB&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;#maxVolumeDataSizeMB&amp;nbsp;=&amp;nbsp;6000000&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;there is one more app which is pushing to indexers with indexes.conf: (not at all aware of this)&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;[default]&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;remotePath&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;volume:aws_s&lt;/SPAN&gt;&lt;SPAN&gt;3&lt;/SPAN&gt;&lt;SPAN&gt;_vol/$_index_name&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;maxDataSize&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;750&lt;/SPAN&gt;&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;&lt;SPAN&gt;[volume:aws_s3_vol]&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;storageType&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;remote&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;path&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;s&lt;/SPAN&gt;&lt;SPAN&gt;3&lt;/SPAN&gt;&lt;SPAN&gt;://conn-splunk-prod-smartstore/&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;remote.s&lt;/SPAN&gt;&lt;SPAN&gt;3&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;auth_region&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;eu-west-&lt;/SPAN&gt;&lt;SPAN&gt;1&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;remote.s&lt;/SPAN&gt;&lt;SPAN&gt;3&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;bucket_name&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;conn-splunk-prod-smartstore&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;remote.s&lt;/SPAN&gt;&lt;SPAN&gt;3&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;encryption&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;sse-kms&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;remote.s&lt;/SPAN&gt;&lt;SPAN&gt;3&lt;/SPAN&gt;&lt;SPAN&gt;.kms.&lt;/SPAN&gt;&lt;SPAN&gt;key_id&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;= XXXX&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&lt;SPAN&gt;remote.s&lt;/SPAN&gt;&lt;SPAN&gt;3&lt;/SPAN&gt;&lt;SPAN&gt;.&lt;/SPAN&gt;&lt;SPAN&gt;supports_versioning&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;false&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;and I don't see&amp;nbsp;&lt;SPAN&gt;&lt;SPAN&gt;coldToFrozenDir and&amp;nbsp;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;SPAN&gt;coldToFrozenScript is also not mentioned anywhere.&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;Now are we storing archival data in S3 bucket now? but there mentioned&amp;nbsp;&lt;SPAN&gt;maxDataSize&lt;/SPAN&gt;&lt;SPAN&gt;&amp;nbsp;which is related to hot to warm. So apart from hot bucket data, rest all data is storing in s3 bucket now?&amp;nbsp;&lt;/SPAN&gt;&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;&lt;DIV&gt;So how will Splunk take the data from S3 bucket to search and make queries?&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Mon, 07 Apr 2025 17:31:57 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-retention/m-p/743704#M118133</guid>
      <dc:creator>Karthikeya</dc:creator>
      <dc:date>2025-04-07T17:31:57Z</dc:date>
    </item>
    <item>
      <title>Re: Splunk data retention</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-retention/m-p/743709#M118134</link>
      <description>&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;You are using Splunk SmartStore, which offloads warm and cold buckets to remote object storage (S3). Hot buckets remain on local indexer storage until they roll to warm, then get uploaded to S3.&lt;/P&gt;&lt;P&gt;Your remotePath and [volume:aws_s3_vol] config confirms SmartStore is enabled, meaning:&lt;/P&gt;&lt;UL&gt;&lt;UL&gt;&lt;LI&gt;Hot Data and cached warm/cold data resides on indexers&lt;/LI&gt;&lt;LI&gt;Warm and cold buckets are stored in S3&lt;/LI&gt;&lt;LI&gt;There is no need for coldToFrozenDir or coldToFrozenScript unless you want to archive frozen data elsewhere. This allows for archiving data which is passed the&amp;nbsp;frozenTimePeriodInSecs&amp;nbsp;to be moved elsewhere.&lt;/LI&gt;&lt;/UL&gt;&lt;/UL&gt;&lt;P&gt;Retention is controlled by frozenTimePeriodInSecs (age-based) or maxTotalDataSizeMB (size-based). If you don’t override these in local/, defaults apply (usually 6 years retention).&lt;/P&gt;&lt;P&gt;You can run the following command on one of your indexers to confirm the settings which have been applied:&amp;nbsp;&lt;/P&gt;&lt;PRE&gt;/opt/splunk/bin/splunk btool indexes list --debug | grep -A 10 new_index&lt;/PRE&gt;&lt;P&gt;Splunk automatically retrieves data from S3 to local cache when searches require it. This is transparent to users but may add latency for cold data which is not already in the cache. When the cache reaches capacity it will "evict" buckets based on the &lt;A href="https://docs.splunk.com/Documentation/Splunk/9.4.1/Indexer/ConfigureSmartStorecachemanager#:~:text=per%2Dindex%20basis.-,Set%20the%20cache%20eviction%20policy,-The%20eviction_policy%20setting" target="_self"&gt;eviction policy&lt;/A&gt;&amp;nbsp;which by default is the least-recently used bucket.&lt;/P&gt;&lt;P&gt;Some useful Docs relating to SmartStore and index configuration which might be useful:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;&lt;A href="https://docs.splunk.com/Documentation/Splunk/latest/Indexer/AboutSmartStore" target="_blank" rel="noopener"&gt;SmartStore Overview&lt;/A&gt;&lt;/LI&gt;&lt;LI&gt;&lt;A href="https://docs.splunk.com/Documentation/Splunk/latest/Admin/Indexesconf" target="_blank" rel="noopener"&gt;Indexes.conf retention settings&lt;/A&gt;&lt;/LI&gt;&lt;LI&gt;&lt;A href="https://docs.splunk.com/Documentation/Splunk/latest/Indexer/HowSplunkstoresindexes" target="_blank" rel="noopener"&gt;Data lifecycle and bucket types&lt;/A&gt;&lt;/LI&gt;&lt;/UL&gt;&lt;DIV&gt;&lt;P&gt;&lt;span class="lia-unicode-emoji" title=":glowing_star:"&gt;🌟&lt;/span&gt; &lt;STRONG&gt;Did this answer help you?&lt;/STRONG&gt; If so, please consider:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Adding karma to show it was useful&lt;/LI&gt;&lt;LI&gt;Marking it as the solution if it resolved your issue&lt;/LI&gt;&lt;LI&gt;Commenting if you need any clarification&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Your feedback encourages the volunteers in this community to continue contributing&lt;/P&gt;&lt;/DIV&gt;</description>
      <pubDate>Mon, 07 Apr 2025 18:45:31 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-retention/m-p/743709#M118134</guid>
      <dc:creator>livehybrid</dc:creator>
      <dc:date>2025-04-07T18:45:31Z</dc:date>
    </item>
    <item>
      <title>Re: Splunk data retention</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-retention/m-p/743710#M118135</link>
      <description>&lt;P&gt;Thanks for this... So my understanding is my index size which is of 500GB by default will never fill at all because once it reaches to 750 MB (Maxdatasize) it will roll over to warm bucket which is in S3 bucket? Am I correct?&lt;/P&gt;</description>
      <pubDate>Mon, 07 Apr 2025 18:57:20 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-retention/m-p/743710#M118135</guid>
      <dc:creator>Karthikeya</dc:creator>
      <dc:date>2025-04-07T18:57:20Z</dc:date>
    </item>
    <item>
      <title>Re: Splunk data retention</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-retention/m-p/743711#M118136</link>
      <description>&lt;P&gt;One small correction. With smartstore there is no separate warm/cold storage. A bucket is getting uploaded to remote storage and is being cached locally if needed but it doesn't go through warm-&amp;gt;cold lifecycle.&lt;/P&gt;&lt;P&gt;It's also worth noting that with some use cases (especially when you often work with searches covering a significant portion of your remote storage which turns out to be way over your local storage) you might get a significant performance hit because you're effectively not caching anything locally.&lt;/P&gt;</description>
      <pubDate>Mon, 07 Apr 2025 18:59:02 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-retention/m-p/743711#M118136</guid>
      <dc:creator>PickleRick</dc:creator>
      <dc:date>2025-04-07T18:59:02Z</dc:date>
    </item>
    <item>
      <title>Re: Splunk data retention</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-retention/m-p/743712#M118137</link>
      <description>&lt;P&gt;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/231884"&gt;@PickleRick&lt;/a&gt;&amp;nbsp;And will data be deleted in S3 if it reaches to any limit? I mean we didn't set frozentimeperiodinsecs so by default it is 6 years so the older data stays for 6 years in S3?&lt;/P&gt;</description>
      <pubDate>Mon, 07 Apr 2025 19:08:08 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-retention/m-p/743712#M118137</guid>
      <dc:creator>Karthikeya</dc:creator>
      <dc:date>2025-04-07T19:08:08Z</dc:date>
    </item>
    <item>
      <title>Re: Splunk data retention</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-retention/m-p/743714#M118139</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/273888"&gt;@Karthikeya&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;For reference, the following docs page is useful for SmartStore retention settings:&amp;nbsp;&lt;A href="https://docs.splunk.com/Documentation/Splunk/9.4.1/Indexer/SmartStoredataretention" target="_blank"&gt;https://docs.splunk.com/Documentation/Splunk/9.4.1/Indexer/SmartStoredataretention&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;maxDataSize = Bucket Size in MB, not the total size of the index&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Data will be "frozen" when either&amp;nbsp;&lt;SPAN&gt;maxGlobalDataSizeMB or&amp;nbsp;frozenTimePeriodInSecs is met (whichever is first!) - so it is not safe to assume the data will be retained for 6 years if the&amp;nbsp;maxGlobalDataSizeMB setting is not large enough to hold 6 years of data.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;To clarify my previous post as&amp;nbsp;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/231884"&gt;@PickleRick&lt;/a&gt;&amp;nbsp;mentioned -&amp;nbsp;&lt;A href="https://docs.splunk.com/Documentation/Splunk/9.4.1/Indexer/SmartStoreindexing#:~:text=cold%20buckets%20in%20SmartStore%20indexes%20are%20functionally%20equivalent%20to%20warm%20buckets" target="_self"&gt;cold buckets in SmartStore indexes are functionally equivalent to warm buckets&lt;/A&gt;&amp;nbsp; - They are essentially the same and cold buckets &lt;A href="https://docs.splunk.com/Documentation/Splunk/9.4.1/Indexer/SmartStoreindexing#:~:text=Cold%20buckets%20can%2C%20in%20fact%2C%20exist%20in%20a%20SmartStore%20index%2C%20but%20only%20under%20limited%20circumstances" target="_self"&gt;only exist in circumstances&lt;/A&gt;&amp;nbsp;and in any case, the storage on S3 is the same.&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;Let me know if you have any further questions or need clarity on any of these points &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-unicode-emoji" title=":glowing_star:"&gt;🌟&lt;/span&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;Did this answer help you?&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;If so, please consider:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Adding karma to show it was useful&lt;/LI&gt;&lt;LI&gt;Marking it as the solution if it resolved your issue&lt;/LI&gt;&lt;LI&gt;Commenting if you need any clarification&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Your feedback encourages the volunteers in this community to continue contributing&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 07 Apr 2025 19:50:27 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-retention/m-p/743714#M118139</guid>
      <dc:creator>livehybrid</dc:creator>
      <dc:date>2025-04-07T19:50:27Z</dc:date>
    </item>
  </channel>
</rss>

