<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Creating large, multi-terabyte indexes in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Creating-large-multi-terabyte-indexes/m-p/80794#M97113</link>
    <description>&lt;P&gt;Splunk automatically creates buckets as needed. You don't need to do anything about buckets for a 2TB index; this is not considered a particularly large index in Splunk. (There are customers who add much more than 2TB &lt;EM&gt;every day&lt;/EM&gt;.) &lt;/P&gt;

&lt;P&gt;However, you do need to change the maximum size of your index, as the default maximum size is 500,000MB (or .5TB) You can change this setting in the configuation file indexes.conf (maxTotalDataSizeMB) or you can do it via the user interface in the Splunk Manager.&lt;/P&gt;

&lt;P&gt;Contrary to wdhathaway's post - a Splunk index is not implemented as a monolithic file; it is in fact a number of files. But I don't think that you will have a significant fsck problem anyway.&lt;/P&gt;

&lt;P&gt;FInally, for more about indexes and sizing take a look at&lt;BR /&gt;&lt;BR /&gt;
&lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/Admin/Aboutmanagingindexes"&gt;Managing Indexes&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;
&lt;A href="http://docs.splunk.com/Documentation/Splunk/4.3.1/Admin/Setupmultipleindexes#Create_and_edit_indexes"&gt;Create and edit indexes&lt;/A&gt;  &lt;/P&gt;</description>
    <pubDate>Fri, 09 Mar 2012 02:20:21 GMT</pubDate>
    <dc:creator>lguinn2</dc:creator>
    <dc:date>2012-03-09T02:20:21Z</dc:date>
    <item>
      <title>Creating large, multi-terabyte indexes</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Creating-large-multi-terabyte-indexes/m-p/80792#M97111</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;I have a few indexes that I want to expand to be multiple terabytes. Are there general guidelines about this?  Should I increase the number of buckets, and if so what's considered 'just right' for a 2TB (or more) index?&lt;/P&gt;

&lt;P&gt;What can I expect if I need to run an fsck?  Will large indexes make running this out of the question?&lt;/P&gt;

&lt;P&gt;Thanks,&lt;BR /&gt;
Will&lt;/P&gt;</description>
      <pubDate>Tue, 28 Feb 2012 18:38:35 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Creating-large-multi-terabyte-indexes/m-p/80792#M97111</guid>
      <dc:creator>williamsweat</dc:creator>
      <dc:date>2012-02-28T18:38:35Z</dc:date>
    </item>
    <item>
      <title>Re: Creating large, multi-terabyte indexes</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Creating-large-multi-terabyte-indexes/m-p/80793#M97112</link>
      <description>&lt;P&gt;I'm not sure on the bucket size part of your question, but as far as your fsck question goes, &lt;BR /&gt;
In general, fsck times are linear with number of inodes, so for a file system filled with a smaller number of large files (like Splunk indexes), it should be much faster to fsck than a file system filled with with a huge amount of small files.&lt;/P&gt;</description>
      <pubDate>Tue, 28 Feb 2012 19:23:55 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Creating-large-multi-terabyte-indexes/m-p/80793#M97112</guid>
      <dc:creator>wdhathaway</dc:creator>
      <dc:date>2012-02-28T19:23:55Z</dc:date>
    </item>
    <item>
      <title>Re: Creating large, multi-terabyte indexes</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Creating-large-multi-terabyte-indexes/m-p/80794#M97113</link>
      <description>&lt;P&gt;Splunk automatically creates buckets as needed. You don't need to do anything about buckets for a 2TB index; this is not considered a particularly large index in Splunk. (There are customers who add much more than 2TB &lt;EM&gt;every day&lt;/EM&gt;.) &lt;/P&gt;

&lt;P&gt;However, you do need to change the maximum size of your index, as the default maximum size is 500,000MB (or .5TB) You can change this setting in the configuation file indexes.conf (maxTotalDataSizeMB) or you can do it via the user interface in the Splunk Manager.&lt;/P&gt;

&lt;P&gt;Contrary to wdhathaway's post - a Splunk index is not implemented as a monolithic file; it is in fact a number of files. But I don't think that you will have a significant fsck problem anyway.&lt;/P&gt;

&lt;P&gt;FInally, for more about indexes and sizing take a look at&lt;BR /&gt;&lt;BR /&gt;
&lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/Admin/Aboutmanagingindexes"&gt;Managing Indexes&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;
&lt;A href="http://docs.splunk.com/Documentation/Splunk/4.3.1/Admin/Setupmultipleindexes#Create_and_edit_indexes"&gt;Create and edit indexes&lt;/A&gt;  &lt;/P&gt;</description>
      <pubDate>Fri, 09 Mar 2012 02:20:21 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Creating-large-multi-terabyte-indexes/m-p/80794#M97113</guid>
      <dc:creator>lguinn2</dc:creator>
      <dc:date>2012-03-09T02:20:21Z</dc:date>
    </item>
  </channel>
</rss>

