<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to reindex data from a forwarder in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93320#M88036</link>
    <description>&lt;P&gt;Agreed. Also, you may need to check with the creator of that particular add-on. Once you create the new question thread, link to it here and we can jump over there.&lt;/P&gt;</description>
    <pubDate>Wed, 22 Nov 2017 14:08:25 GMT</pubDate>
    <dc:creator>sloshburch</dc:creator>
    <dc:date>2017-11-22T14:08:25Z</dc:date>
    <item>
      <title>How to reindex data from a forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93310#M88026</link>
      <description>&lt;P&gt;I have a Storm project and I want to clean all and reindex only the last days, and some specific files.&lt;BR /&gt;
I have Splunk Universal forwarders monitoring my files for now.&lt;/P&gt;

&lt;P&gt;I suppose that this is similar for a  Splunk Enterprise, when we clear an index and Storm when we manually empty a project.&lt;/P&gt;</description>
      <pubDate>Fri, 18 Jan 2013 23:39:28 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93310#M88026</guid>
      <dc:creator>mataharry</dc:creator>
      <dc:date>2013-01-18T23:39:28Z</dc:date>
    </item>
    <item>
      <title>Re: How to reindex data from a forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93311#M88027</link>
      <description>&lt;P&gt;First of all even before reindexing, to configure Splunk to index only recent data, you can use the 2 techniques:&lt;/P&gt;

&lt;UL&gt;
&lt;LI&gt;&lt;P&gt;for file monitoring, add the parameter &lt;CODE&gt;ignoreOlderThan&lt;/CODE&gt; in inputs.conf&lt;BR /&gt;
It will look at the &lt;STRONG&gt;modtime of the files&lt;/STRONG&gt;, example : &lt;CODE&gt;ignoreOlderThan=7d&lt;/CODE&gt; will index only files touched during the last 7 days. On linux you can couple this with the &lt;CODE&gt;touch&lt;/CODE&gt; command to change the modtime of a file and trigger the indexing.&lt;BR /&gt;
see &lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/admin/Inputsconf" target="_blank"&gt;http://docs.splunk.com/Documentation/Splunk/latest/admin/Inputsconf&lt;/A&gt;&lt;/P&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;P&gt;for WinEventLogs, you can setup the parameter &lt;CODE&gt;current_only=1&lt;/CODE&gt; in inputs.conf to exclude the historical logs, and starts only now.&lt;/P&gt;

&lt;HR /&gt;&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;Now that you have setup your inputs to avoid blastering your instance, you can focus on &lt;STRONG&gt;How to force a splunk instance to reindex a file&lt;/STRONG&gt; that has already be indexed.&lt;/P&gt;

&lt;UL&gt;
&lt;LI&gt;the radical method is to &lt;STRONG&gt;clean the fishbucket index&lt;/STRONG&gt; . That will remove the memory of every files, But it will reindex all.&lt;/LI&gt;
&lt;LI&gt;on an indexer &lt;CODE&gt;splunk clean eventdata -index _fishbucket&lt;/CODE&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;P&gt;on a forwarder by removing the folder $SPLUNK_HOME/var/lib/splunk/fishbucket&lt;/P&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;P&gt;or selectivelly &lt;STRONG&gt;forgot a single file&lt;/STRONG&gt; from the fishbucket&lt;/P&gt;

&lt;P&gt;splunk cmd btprobe -d $SPLUNK_HOME/var/lib/splunk/fishbucket/splunk_private_db --file $FILE --reset&lt;/P&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;P&gt;manually reindex each file with the &lt;STRONG&gt;oneshot&lt;/STRONG&gt; option,&lt;BR /&gt;
you also can edit the log file and add a comment on the first line that will force the file to detected as a new file.&lt;/P&gt;

&lt;P&gt;./splunk add oneshot "/path/to/my/file.log" -sourcetype mysourcetype&lt;/P&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;P&gt;&lt;STRONG&gt;modify the first line of the files to reindex&lt;/STRONG&gt;, by default splunk checks the first 256 chars of a file to differentiate them. If you had a simple comment on the first line it wil reindex it &lt;/P&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;P&gt;change the &lt;STRONG&gt;crcSalt&lt;/STRONG&gt;,  create a new input for a new folder, add all the correct sourcetypes, etc... &lt;BR /&gt;
using a static string  that will force a one time reindexing.&lt;/P&gt;

&lt;P&gt;crcSalt= REINDEXMEPLEASE&lt;/P&gt;&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;or add the option &lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;crcSalt= &amp;lt;SOURCE&amp;gt;
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;then move or copy the files to be reindex to the folder, they will be detected as new (because the path will be considered in the crc calculation). (ps the source field will be different of course.)&lt;/P&gt;

&lt;P&gt;see &lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/admin/Inputsconf" target="_blank"&gt;http://docs.splunk.com/Documentation/Splunk/latest/admin/Inputsconf&lt;/A&gt;&lt;/P&gt;

&lt;HR /&gt;

&lt;P&gt;Remark : before reindexing you may want to remove the existing data in splunk to avoid duplicates.&lt;/P&gt;

&lt;UL&gt;
&lt;LI&gt;You can empty the index if nothing has to be preserved.&lt;/LI&gt;
&lt;LI&gt;Or use the &lt;CODE&gt;|delete&lt;/CODE&gt; command to selectively hide some events.
see &lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/SearchReference/Delete" target="_blank"&gt;http://docs.splunk.com/Documentation/Splunk/latest/SearchReference/Delete&lt;/A&gt; &lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;Remark : if you are monitoring windows logs (wineventlog) or are using modular inputs, the counters are not in the fishbucket.&lt;BR /&gt;
you need to clear the checkpoints files in &lt;CODE&gt;$SPLUNK_HOME/var/lib/splunk/modinputs/&lt;/CODE&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2020 13:08:36 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93311#M88027</guid>
      <dc:creator>yannK</dc:creator>
      <dc:date>2020-09-28T13:08:36Z</dc:date>
    </item>
    <item>
      <title>Re: How to reindex data from a forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93312#M88028</link>
      <description>&lt;P&gt;This method doesn't work with the Splunk 6 Forwarder but I found if you remove all directories in C:\Program Files\SplunkUniversalForwarder\var\lib\splunk, this will force Splunk to reindex all the Window's logs. You have to remove all of them.&lt;/P&gt;</description>
      <pubDate>Tue, 30 Jun 2015 20:57:26 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93312#M88028</guid>
      <dc:creator>coleman07</dc:creator>
      <dc:date>2015-06-30T20:57:26Z</dc:date>
    </item>
    <item>
      <title>Re: How to reindex data from a forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93313#M88029</link>
      <description>&lt;P&gt;FYI the Splunk service must be stopped before removing files from the fishbucket&lt;/P&gt;</description>
      <pubDate>Tue, 07 Jul 2015 12:44:06 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93313#M88029</guid>
      <dc:creator>bnorthway</dc:creator>
      <dc:date>2015-07-07T12:44:06Z</dc:date>
    </item>
    <item>
      <title>Re: How to reindex data from a forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93314#M88030</link>
      <description>&lt;P&gt;Hi I am using Splunk 6.4, just use:&lt;BR /&gt;
splunk clean eventdata -index _fishbucket&lt;/P&gt;

&lt;P&gt;'-index' is no longer required.&lt;/P&gt;</description>
      <pubDate>Thu, 21 Apr 2016 01:41:20 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93314#M88030</guid>
      <dc:creator>season88481</dc:creator>
      <dc:date>2016-04-21T01:41:20Z</dc:date>
    </item>
    <item>
      <title>Re: How to reindex data from a forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93315#M88031</link>
      <description>&lt;P&gt;&lt;CODE&gt;$SPLUNK_HOME/bin/splunk clean all&lt;/CODE&gt; still works on 6.4.2 &lt;span class="lia-unicode-emoji" title=":winking_face:"&gt;😉&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 23 Aug 2016 02:31:00 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93315#M88031</guid>
      <dc:creator>MuS</dc:creator>
      <dc:date>2016-08-23T02:31:00Z</dc:date>
    </item>
    <item>
      <title>Re: How to reindex data from a forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93316#M88032</link>
      <description>&lt;P&gt;To re-index forwarded events for &lt;STRONG&gt;Windows Event logs&lt;/STRONG&gt;, I had to remove the &lt;CODE&gt;modinputs&lt;/CODE&gt; file that bookmarks the last &lt;STRONG&gt;RecordId&lt;/STRONG&gt; sent to splunk indexer....I also deleted the fishbuckets as suggested by @yannK&lt;/P&gt;

&lt;P&gt;&lt;STRONG&gt;Source Path&lt;/STRONG&gt;: &lt;CODE&gt;SplunkUniversalForwarder\var\lib\splunk\modinputs\WinEventLog&lt;/CODE&gt;&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;&amp;lt;BookmarkList&amp;gt;
  &amp;lt;Bookmark Channel='application' RecordId='48426' IsCurrent='true'/&amp;gt;
&amp;lt;/BookmarkList&amp;gt;
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;Removing the application file and restarting the forwarder will force a reindex for windows event logs. &lt;/P&gt;</description>
      <pubDate>Fri, 09 Sep 2016 00:27:43 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93316#M88032</guid>
      <dc:creator>slierninja</dc:creator>
      <dc:date>2016-09-09T00:27:43Z</dc:date>
    </item>
    <item>
      <title>Re: How to reindex data from a forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93317#M88033</link>
      <description>&lt;P&gt;FYI. Now it's &lt;CODE&gt;_thefishbucket&lt;/CODE&gt;, not &lt;CODE&gt;_fishbucket&lt;/CODE&gt;. At least in 6.6.2...&lt;/P&gt;</description>
      <pubDate>Fri, 28 Jul 2017 19:20:47 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93317#M88033</guid>
      <dc:creator>sloshburch</dc:creator>
      <dc:date>2017-07-28T19:20:47Z</dc:date>
    </item>
    <item>
      <title>Re: How to reindex data from a forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93318#M88034</link>
      <description>&lt;P&gt;Will any of these methods work for re-indexing the data from an API? Many of the resources I've found only mention log files when speaking of re-indexing. My data input is an API. I am able to clean the index for this API, but want to ensure I can re-index all the data.&lt;/P&gt;</description>
      <pubDate>Tue, 21 Nov 2017 19:57:54 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93318#M88034</guid>
      <dc:creator>jgreen12</dc:creator>
      <dc:date>2017-11-21T19:57:54Z</dc:date>
    </item>
    <item>
      <title>Re: How to reindex data from a forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93319#M88035</link>
      <description>&lt;P&gt;jgreen12 please open a new question as this question is answered (and very old)&lt;/P&gt;

&lt;P&gt;It may or may not relate to modular inputs and there may be a checkpoint file keeping track of the data it has obtained but a new question would make more sense here rather than guessing...&lt;/P&gt;</description>
      <pubDate>Tue, 21 Nov 2017 22:16:27 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93319#M88035</guid>
      <dc:creator>gjanders</dc:creator>
      <dc:date>2017-11-21T22:16:27Z</dc:date>
    </item>
    <item>
      <title>Re: How to reindex data from a forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93320#M88036</link>
      <description>&lt;P&gt;Agreed. Also, you may need to check with the creator of that particular add-on. Once you create the new question thread, link to it here and we can jump over there.&lt;/P&gt;</description>
      <pubDate>Wed, 22 Nov 2017 14:08:25 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93320#M88036</guid>
      <dc:creator>sloshburch</dc:creator>
      <dc:date>2017-11-22T14:08:25Z</dc:date>
    </item>
    <item>
      <title>Re: How to reindex data from a forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93321#M88037</link>
      <description>&lt;P&gt;You can also trigger the forwarder to re-ingest files by adding/modifying the crcsalt parameter within inputs.conf.&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;crcSalt = &amp;lt;string&amp;gt;
* Use this setting to force the input to consume files that have matching CRCs
  (cyclic redundancy checks).
    * By default, the input only performs CRC checks against the first 256
      bytes of a file. This behavior prevents the input from indexing the same
      file twice, even though you might have renamed it, as with rolling log
      files, for example. Because the CRC is based on only the first
      few lines of the file, it is possible for legitimately different files
      to have matching CRCs, particularly if they have identical headers.
* If set, &amp;lt;string&amp;gt; is added to the CRC.
* If set to the literal string "&amp;lt;SOURCE&amp;gt;" (including the angle brackets), the
  full directory path to the source file is added to the CRC. This ensures
  that each file being monitored has a unique CRC. When 'crcSalt' is invoked,
  it is usually set to &amp;lt;SOURCE&amp;gt;.
* Be cautious about using this setting with rolling log files; it could lead
  to the log file being re-indexed after it has rolled.
* In many situations, 'initCrcLength' can be used to achieve the same goals.
* Default: empty string.
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Fri, 07 Feb 2020 16:18:44 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/93321#M88037</guid>
      <dc:creator>codebuilder</dc:creator>
      <dc:date>2020-02-07T16:18:44Z</dc:date>
    </item>
    <item>
      <title>Re: How to reindex data from a forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/623629#M107295</link>
      <description>&lt;P&gt;In 8.2.6, the btprobe command is missing:&lt;/P&gt;&lt;PRE&gt;$ ./bin/splunk help commands | grep cmd&lt;BR /&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp; cmd [btool|exporttool|importtool|locktest|locktool|parsetest|pcregextest|signtool|walklex]&lt;/PRE&gt;&lt;P&gt;&lt;EM&gt;&amp;nbsp;EDIT: nevermind, yes it is. It's just not listed in the help menu.&lt;/EM&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 07 Dec 2022 18:30:35 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/623629#M107295</guid>
      <dc:creator>lyeager</dc:creator>
      <dc:date>2022-12-07T18:30:35Z</dc:date>
    </item>
    <item>
      <title>Re: How to reindex data from a forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/685204#M114327</link>
      <description>&lt;P&gt;Would it work to replace the Universal Forwarder with a new one?&lt;/P&gt;</description>
      <pubDate>Tue, 23 Apr 2024 20:17:36 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/685204#M114327</guid>
      <dc:creator>gabmsmith</dc:creator>
      <dc:date>2024-04-23T20:17:36Z</dc:date>
    </item>
    <item>
      <title>Re: How to reindex data from a forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/685341#M114342</link>
      <description>&lt;P&gt;This didn't work...&lt;/P&gt;</description>
      <pubDate>Wed, 24 Apr 2024 14:55:07 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/685341#M114342</guid>
      <dc:creator>gabmsmith</dc:creator>
      <dc:date>2024-04-24T14:55:07Z</dc:date>
    </item>
    <item>
      <title>Re: How to reindex data from a forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/700716#M116019</link>
      <description>&lt;P&gt;This list has aged quite a bit, is it still accurate?&amp;nbsp;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/1357"&gt;@yannK&lt;/a&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 01 Oct 2024 20:54:53 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/700716#M116019</guid>
      <dc:creator>R15</dc:creator>
      <dc:date>2024-10-01T20:54:53Z</dc:date>
    </item>
    <item>
      <title>Re: How to reindex data from a forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/700720#M116021</link>
      <description>&lt;P&gt;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/252191"&gt;@R15&lt;/a&gt;&amp;nbsp;&amp;nbsp;For monitoring Stanzas, it's still pretty much the same.&lt;BR /&gt;However, many new type of inputs exists too (modular, scripted, HEC etc...), who do not rely on the fishbucket.&lt;/P&gt;</description>
      <pubDate>Tue, 01 Oct 2024 21:18:54 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-reindex-data-from-a-forwarder/m-p/700720#M116021</guid>
      <dc:creator>yannK</dc:creator>
      <dc:date>2024-10-01T21:18:54Z</dc:date>
    </item>
  </channel>
</rss>

