<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Fishbucket Growing Issue in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Fishbucket-Growing-Issue/m-p/668107#M111990</link>
    <description>&lt;P&gt;1.&amp;nbsp; Not exactly.&amp;nbsp; Here's what limits.conf.spec says about the fishbucket size limit:&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;file_tracking_db_threshold_mb = &amp;lt;integer&amp;gt;
* The size, in megabytes, at which point the file tracking
  database, otherwise known as the "fishbucket" or "btree", rolls over
  to a new file.
* The rollover process is as follows:
  * After the fishbucket reaches 'file_tracking_db_threshold_mb' megabytes
    in size, a new database file is created.
  * From this point forward, the processor writes new entries to the
    new database.
  * Initially, the processor attempts to read entries from the new database,
    but upon failure, falls back to the old database.
  * Successful reads from the old database are written to the new database.&lt;/LI-CODE&gt;&lt;P&gt;Notice the old database file stays around even when a new database file is created.&amp;nbsp; That implies the file_tracking_db_threshold_mb value is at least doubled.&amp;nbsp; When the database is saved, it's doubled again for each file (new and old) so 4x.&lt;/P&gt;&lt;P&gt;2. I see what you mean, although this is true for any TA, not just nmon.&amp;nbsp; The more input files you have, the more that must be tracked in the fishbucket.&lt;/P&gt;</description>
    <pubDate>Thu, 09 Nov 2023 15:53:42 GMT</pubDate>
    <dc:creator>richgalloway</dc:creator>
    <dc:date>2023-11-09T15:53:42Z</dc:date>
    <item>
      <title>Fishbucket Growing Issue</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Fishbucket-Growing-Issue/m-p/667646#M111955</link>
      <description>&lt;P&gt;Hi, I have an issue here with the &lt;STRONG&gt;fishbucket&lt;/STRONG&gt; of the Universal Forwarder. I have tried to look for quite a lot of documentation, but it seems that there is too little documentation, and there are also few topics on it.&lt;/P&gt;&lt;P&gt;The problem I am facing is that the &lt;STRONG&gt;fishbucket&lt;/STRONG&gt; is taking up a large amount of space, about &lt;STRONG&gt;2GB&lt;/STRONG&gt; on the hard drive, while the limit configuration in limits.conf is:&lt;/P&gt;&lt;P&gt;&lt;EM&gt;&amp;nbsp;file_tracking_db_threshold_mb = 500&lt;/EM&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;In some other topics, I read that the &lt;STRONG&gt;fishbucket&lt;/STRONG&gt; can be up to &lt;STRONG&gt;2 or 3 times larger&lt;/STRONG&gt; than the configured limit. And this happens because of its backup mechanism with file &lt;STRONG&gt;save and snapshot.tmp&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;However, &lt;FONT color="#FF0000"&gt;&lt;STRONG&gt;is there a limit to the size of the fishbucket? Will it continue to expand over time without limit, or only expand to a certain limit?&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;FONT color="#FF0000"&gt;&lt;STRONG&gt;PS:&amp;nbsp; i have nmon TA install on my server.&lt;/STRONG&gt;&lt;/FONT&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;Please, provide me with Splunk documentation on this part.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;Thank you.&lt;/P&gt;&lt;DIV class=""&gt;&lt;IMG border="0" /&gt;&lt;/DIV&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 07 Nov 2023 07:40:21 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Fishbucket-Growing-Issue/m-p/667646#M111955</guid>
      <dc:creator>hieuba6868</dc:creator>
      <dc:date>2023-11-07T07:40:21Z</dc:date>
    </item>
    <item>
      <title>Re: Fishbucket Growing Issue</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Fishbucket-Growing-Issue/m-p/667704#M111958</link>
      <description>&lt;P&gt;You are correct about the size being affected by backup and save files.&amp;nbsp; The limit.conf setting is for the base file only so the total could be 4x that value.&lt;/P&gt;&lt;P&gt;I don't see how the nmon TA has any bearing on this.&lt;/P&gt;</description>
      <pubDate>Tue, 07 Nov 2023 15:18:55 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Fishbucket-Growing-Issue/m-p/667704#M111958</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2023-11-07T15:18:55Z</dc:date>
    </item>
    <item>
      <title>Re: Fishbucket Growing Issue</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Fishbucket-Growing-Issue/m-p/667803#M111966</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/213957"&gt;@richgalloway&lt;/a&gt;,&amp;nbsp;&amp;nbsp;&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;&lt;P&gt;Do you have any documentation that validates the possibility of the fishbucket's size being up to four times larger than the limit specified in the limits.conf file? Any official resources or explanations that could clarify why the fishbucket index might exceed the configured threshold by such a significant margin would be extremely helpful.&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;Concerning TA-nmon: I've noticed that it monitors the server by generating new CSV files every minute, and it deletes the older ones. I suspect that this process could incrementally increase the size of the fishbucket, as it continuously logs the CRCs of newly created log files without removing the CRCs of the old, deleted logs. This situation seems to be evidenced by the _internal log errors related to checksum faild when the log files no longer exist.&lt;/P&gt;&lt;/LI&gt;&lt;/OL&gt;</description>
      <pubDate>Wed, 08 Nov 2023 06:53:32 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Fishbucket-Growing-Issue/m-p/667803#M111966</guid>
      <dc:creator>hieuba6868</dc:creator>
      <dc:date>2023-11-08T06:53:32Z</dc:date>
    </item>
    <item>
      <title>Re: Fishbucket Growing Issue</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Fishbucket-Growing-Issue/m-p/668107#M111990</link>
      <description>&lt;P&gt;1.&amp;nbsp; Not exactly.&amp;nbsp; Here's what limits.conf.spec says about the fishbucket size limit:&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;file_tracking_db_threshold_mb = &amp;lt;integer&amp;gt;
* The size, in megabytes, at which point the file tracking
  database, otherwise known as the "fishbucket" or "btree", rolls over
  to a new file.
* The rollover process is as follows:
  * After the fishbucket reaches 'file_tracking_db_threshold_mb' megabytes
    in size, a new database file is created.
  * From this point forward, the processor writes new entries to the
    new database.
  * Initially, the processor attempts to read entries from the new database,
    but upon failure, falls back to the old database.
  * Successful reads from the old database are written to the new database.&lt;/LI-CODE&gt;&lt;P&gt;Notice the old database file stays around even when a new database file is created.&amp;nbsp; That implies the file_tracking_db_threshold_mb value is at least doubled.&amp;nbsp; When the database is saved, it's doubled again for each file (new and old) so 4x.&lt;/P&gt;&lt;P&gt;2. I see what you mean, although this is true for any TA, not just nmon.&amp;nbsp; The more input files you have, the more that must be tracked in the fishbucket.&lt;/P&gt;</description>
      <pubDate>Thu, 09 Nov 2023 15:53:42 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Fishbucket-Growing-Issue/m-p/668107#M111990</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2023-11-09T15:53:42Z</dc:date>
    </item>
    <item>
      <title>Re: Fishbucket Growing Issue</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Fishbucket-Growing-Issue/m-p/669528#M112256</link>
      <description>&lt;P&gt;So, after asking for Splunk Support, they said:&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;1. The possible reasons and conditions under which the fishbucket could exceed the configured threshold of 500MB.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;It is because of the amount of data ingestion you are doing per day &amp;amp; the fishbucket can be up to 2 or 3 times larger than the configured limit. And this happens because of its backup mechanism with file save and snapshot.tmp&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;2. If there are any log files or diagnostic tools within Splunk that can help us track and understand the growth of the fishbucket index.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;If you have the nmon app installed we found that it was contributing to the fishbucket's rapid growth.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;3.The absolute maximum size that the fishbucket can reach within the Splunk system.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;There is no strict maximum size for the splunk fish bucket. It is the size influenced by factors like the volume of data being ingested, the frequency of indexing &amp;amp; the specific configuration of the your splunk environment.&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;4.Any factors that could contribute to the fishbucket exceeding the expected maximum by such a substantial margin.&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;It can only grow with the time, volume of the data, frequency of indexing.&lt;/P&gt;</description>
      <pubDate>Thu, 23 Nov 2023 02:27:14 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Fishbucket-Growing-Issue/m-p/669528#M112256</guid>
      <dc:creator>hieuba6868</dc:creator>
      <dc:date>2023-11-23T02:27:14Z</dc:date>
    </item>
  </channel>
</rss>

