<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: MIsmatch between df and Splunk size of volume in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/MIsmatch-between-df-and-Splunk-size-of-volume/m-p/50729#M9719</link>
    <description>&lt;P&gt;Hi &lt;BR /&gt;
"This can be customized but by default they get 100GB per index and a limit of 1TB in total in the volume called _splunk_summaries"&lt;/P&gt;

&lt;P&gt;where above statement is written in docs and can be customized it?&lt;/P&gt;</description>
    <pubDate>Wed, 30 Sep 2020 03:16:53 GMT</pubDate>
    <dc:creator>nagendra1111</dc:creator>
    <dc:date>2020-09-30T03:16:53Z</dc:date>
    <item>
      <title>MIsmatch between df and Splunk size of volume</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/MIsmatch-between-df-and-Splunk-size-of-volume/m-p/50724#M9714</link>
      <description>&lt;P&gt;We use a volume configuration for our storage, and the amount of disk being used is measured differently by Splunk and df:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;05-12-2012 14:34:21.961 -0400 INFO  VolumeManager - The size of volume 'local' exceeds the limit, will have to acquiesce it (size=901197562019, max_size=900228710400, path='/opt/splunk/var/lib/splunk')
05-12-2012 14:34:21.962 -0400 INFO  VolumeManager - Getting a list of candidate buckets for moving (chilling or freezing)
05-12-2012 14:34:21.992 -0400 INFO  VolumeManager - Will move bucket with latest=1205445442, path='/opt/splunk/var/lib/splunk/sharepoint/db/db_1205445442_1197673042_20'
05-12-2012 14:34:21.993 -0400 INFO  VolumeManager - Bucket moved successfully (current size=901197556937, max=900228710400)
05-12-2012 14:34:21.993 -0400 INFO  VolumeManager - Will move bucket with latest=1281595719, path='/opt/splunk/var/lib/splunk/sharepoint/db/db_1281595719_1281595719_21'
05-12-2012 14:34:21.993 -0400 INFO  VolumeManager - Bucket moved successfully (current size=901197553132, max=900228710400)
05-12-2012 14:34:21.993 -0400 INFO  VolumeManager - Will move bucket with latest=1294374586, path='/opt/splunk/var/lib/splunk/proxy/db/db_1294374586_1294203869_3'
05-12-2012 14:34:21.994 -0400 INFO  VolumeManager - Bucket moved successfully (current size=894885008853, max=900228710400)
05-12-2012 14:34:21.994 -0400 INFO  VolumeManager - Acquiescing volume 'local' completed.
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;at the same time, df --block-size=1 tells me:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;Filesystem           1B-blocks      Used Available Use% Mounted on
/dev/mapper/db_dg-opt
                     1005947170816 931457794048 23390171136  98% /opt/splunk/var/lib/splunk
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;or df -m&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;Filesystem           1M-blocks      Used Available Use% Mounted on
/dev/mapper/db_dg-opt
                        959346    888655     21960  98% /opt/splunk/var/lib/splunk
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;My volume configuration in indexes.conf is&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[volume:local]
path = /opt/splunk/var/lib/splunk
maxVolumeDataSizeMB = 858525
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;The "max_size" in the logs matches correctly to my maxVolumeDataSizeMB (using 1024 MB per KB). But what Splunk is measuring as "size" (901197562019) does not match with what df tells me (931457794048). Splunk's measurement is 3% smaller than what df tells me.&lt;/P&gt;

&lt;P&gt;Why the mismatch, and how to I measure/plan adequately to maximize disk utilization without leaving too little free space? And how much free space is necessary, since this volume is used only for hot/warm bucket storage and nothing else?&lt;/P&gt;</description>
      <pubDate>Sat, 12 May 2012 18:57:04 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/MIsmatch-between-df-and-Splunk-size-of-volume/m-p/50724#M9714</guid>
      <dc:creator>supersleepwalke</dc:creator>
      <dc:date>2012-05-12T18:57:04Z</dc:date>
    </item>
    <item>
      <title>Re: MIsmatch between df and Splunk size of volume</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/MIsmatch-between-df-and-Splunk-size-of-volume/m-p/50725#M9715</link>
      <description>&lt;P&gt;The short answer is that Splunk only accounts for volumes using the volume configuration.&lt;/P&gt;

&lt;P&gt;To explain further, there are two things that could be on your disk that are taking up space that Splunk is not accounting for:&lt;/P&gt;

&lt;P&gt;1) Files that are not current indexes.  This could include any file that is on the filesystem or possibly old indexes that were removed from the config and not cleaned up&lt;/P&gt;

&lt;P&gt;2) Indexes that use somethign other than volume:local to define where they live.  So, for example, it might use $SPLUNK_DB/mydb as the location.  While this might point to the same filesystem, it won't be counted against your maxVolumeDataSizeMB quota.&lt;/P&gt;

&lt;P&gt;Make sure to check all indexes.conf on your system for indexes of the second kind.  Some apps will define their own indexes and, in my experience, most will use $SPLUNK_DB.  Just change the definition by copying the indexes.conf to the local dir and using the correct volume setting.&lt;/P&gt;</description>
      <pubDate>Tue, 09 Oct 2012 18:35:16 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/MIsmatch-between-df-and-Splunk-size-of-volume/m-p/50725#M9715</guid>
      <dc:creator>christopher_hod</dc:creator>
      <dc:date>2012-10-09T18:35:16Z</dc:date>
    </item>
    <item>
      <title>Re: MIsmatch between df and Splunk size of volume</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/MIsmatch-between-df-and-Splunk-size-of-volume/m-p/50726#M9716</link>
      <description>&lt;P&gt;In my case, it was the Splunk built-in indexes that were causing the most pain (specifically audit and summarydb). Thank you the tip to use 'splunk cmd btool indexes list' to see how all the various config files got merged. That made it much easier to see which indexes were and weren't using my volumes configuration.&lt;/P&gt;</description>
      <pubDate>Tue, 09 Oct 2012 18:39:17 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/MIsmatch-between-df-and-Splunk-size-of-volume/m-p/50726#M9716</guid>
      <dc:creator>supersleepwalke</dc:creator>
      <dc:date>2012-10-09T18:39:17Z</dc:date>
    </item>
    <item>
      <title>Re: MIsmatch between df and Splunk size of volume</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/MIsmatch-between-df-and-Splunk-size-of-volume/m-p/50727#M9717</link>
      <description>&lt;P&gt;I still dont' get why the total volume size doesn't match what DF tells me, in this case linux on a dedicated volume.&lt;/P&gt;</description>
      <pubDate>Sat, 13 Aug 2016 16:35:23 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/MIsmatch-between-df-and-Splunk-size-of-volume/m-p/50727#M9717</guid>
      <dc:creator>mendesjo</dc:creator>
      <dc:date>2016-08-13T16:35:23Z</dc:date>
    </item>
    <item>
      <title>Re: MIsmatch between df and Splunk size of volume</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/MIsmatch-between-df-and-Splunk-size-of-volume/m-p/50728#M9718</link>
      <description>&lt;P&gt;Furthermore, keep in mind that accelerated data models by default will store data in the default index location as per:&lt;BR /&gt;
&lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/Knowledge/Acceleratedatamodels"&gt;Accelerate Data Models&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;This can be customized but in version 8.0 the splunk_summaries volume does not have a set maximum as per the &lt;A href="https://docs.splunk.com/Documentation/Splunk/latest/Admin/Indexesconf"&gt;indexes.conf.spec&lt;/A&gt; "maxVolumeDataSizeMB = , * Optional"&lt;BR /&gt;
On my test server on 8.0.0 it was not specified at all.&lt;/P&gt;

&lt;P&gt;This will not count towards your custom-made volume unless you customise the tstatsHomePath for each index with an accelerated data model...&lt;/P&gt;</description>
      <pubDate>Sun, 14 Aug 2016 06:18:00 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/MIsmatch-between-df-and-Splunk-size-of-volume/m-p/50728#M9718</guid>
      <dc:creator>gjanders</dc:creator>
      <dc:date>2016-08-14T06:18:00Z</dc:date>
    </item>
    <item>
      <title>Re: MIsmatch between df and Splunk size of volume</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/MIsmatch-between-df-and-Splunk-size-of-volume/m-p/50729#M9719</link>
      <description>&lt;P&gt;Hi &lt;BR /&gt;
"This can be customized but by default they get 100GB per index and a limit of 1TB in total in the volume called _splunk_summaries"&lt;/P&gt;

&lt;P&gt;where above statement is written in docs and can be customized it?&lt;/P&gt;</description>
      <pubDate>Wed, 30 Sep 2020 03:16:53 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/MIsmatch-between-df-and-Splunk-size-of-volume/m-p/50729#M9719</guid>
      <dc:creator>nagendra1111</dc:creator>
      <dc:date>2020-09-30T03:16:53Z</dc:date>
    </item>
    <item>
      <title>Re: MIsmatch between df and Splunk size of volume</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/MIsmatch-between-df-and-Splunk-size-of-volume/m-p/50730#M9720</link>
      <description>&lt;P&gt;FYI this post is from over 3 years ago!&lt;/P&gt;

&lt;P&gt;&lt;A href="https://docs.splunk.com/Documentation/Splunk/latest/Admin/Indexesconf" target="_blank"&gt;Splunk indexes.conf.spec&lt;/A&gt;&lt;BR /&gt;
* Default: volume:_splunk_summaries/$_index_name/datamodel_summary,&lt;BR /&gt;
  where "$_index_name" is runtime-expanded to the name of the index&lt;/P&gt;

&lt;P&gt;The max volume size I checked in 8.0.0 appears to be unlimited, I will remove the 100GB per index comment, and I've updated my above post.&lt;/P&gt;</description>
      <pubDate>Wed, 30 Sep 2020 03:17:01 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/MIsmatch-between-df-and-Splunk-size-of-volume/m-p/50730#M9720</guid>
      <dc:creator>gjanders</dc:creator>
      <dc:date>2020-09-30T03:17:01Z</dc:date>
    </item>
  </channel>
</rss>

