<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Directory / Source in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Directory-Source/m-p/129122#M26499</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;First: I have a directory with five sources, all CSV files. Sometimes one source (always the same one) is not loaded into Splunk even though the file has been modified. I don't know why, because the file's modification date really does change.&lt;BR /&gt;
As a workaround, instead of modifying the file, my script could delete it and create a new one. I'm looking for solutions.&lt;/P&gt;

&lt;P&gt;Finally, is there a way to monitor whether all five sources were picked up by Splunk, and to alert me if only four were? I am looking for a query that would do this. Do you have an idea?&lt;/P&gt;

&lt;P&gt;Thank you&lt;BR /&gt;
Best regards&lt;/P&gt;</description>
    <pubDate>Fri, 06 Feb 2015 14:33:05 GMT</pubDate>
    <dc:creator>rene847</dc:creator>
    <dc:date>2015-02-06T14:33:05Z</dc:date>
    <item>
      <title>Directory / Source</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Directory-Source/m-p/129122#M26499</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;First: I have a directory with five sources, all CSV files. Sometimes one source (always the same one) is not loaded into Splunk even though the file has been modified. I don't know why, because the file's modification date really does change.&lt;BR /&gt;
As a workaround, instead of modifying the file, my script could delete it and create a new one. I'm looking for solutions.&lt;/P&gt;

&lt;P&gt;Finally, is there a way to monitor whether all five sources were picked up by Splunk, and to alert me if only four were? I am looking for a query that would do this. Do you have an idea?&lt;/P&gt;

&lt;P&gt;Thank you&lt;BR /&gt;
Best regards&lt;/P&gt;</description>
      <pubDate>Fri, 06 Feb 2015 14:33:05 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Directory-Source/m-p/129122#M26499</guid>
      <dc:creator>rene847</dc:creator>
      <dc:date>2015-02-06T14:33:05Z</dc:date>
    </item>
    <item>
      <title>Re: Directory / Source</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Directory-Source/m-p/129123#M26500</link>
      <description>&lt;P&gt;Splunk checks the beginning of a file (the first 256 bytes by default) to decide whether it has already been seen. Look at the documentation for crcSalt in inputs.conf.&lt;/P&gt;

&lt;P&gt;I had near-duplicate files that were not being indexed until I set crcSalt=&amp;lt;SOURCE&amp;gt; in inputs.conf. I found out about this by reviewing "index=_internal sourcetype=splunkd" and searching for the filename that was missed.&lt;/P&gt;

&lt;P&gt;With this setting, Splunk used the file name as part of the file identification process, and my nearly-identical files (with different paths) were happily imported.&lt;/P&gt;
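
&lt;P&gt;To illustrate why a path-based salt helps, here is a minimal Python sketch. It is not Splunk's actual implementation: the use of zlib.crc32, the 256-byte window, the example paths, and the helper name initial_fingerprint are all assumptions made only for illustration. It shows that two files sharing the same leading bytes get the same fingerprint unless something like the path is mixed in.&lt;/P&gt;

&lt;PRE&gt;
```python
import zlib

HEAD_BYTES = 256  # assumed window size, for illustration only


def initial_fingerprint(data: bytes, salt: str = "") -> int:
    """Checksum the leading bytes of a file, optionally mixed with a salt.

    Hypothetical helper: it mimics the idea of identifying a file by its
    first bytes, not Splunk's real algorithm.
    """
    head = data[:HEAD_BYTES]
    return zlib.crc32(salt.encode("utf-8") + head)


# Two CSV exports that share a long header block and differ only further down.
header = b"TimeGenerated,EventID,Source,Message\r\n" * 10
file_a = header + b"row from server A\r\n"
file_b = header + b"row from server B\r\n"

# Without a salt the fingerprints are equal, so the second file looks already seen.
same_without_salt = initial_fingerprint(file_a) == initial_fingerprint(file_b)

# Salting with each file's (made-up) path makes the fingerprints differ.
same_with_salt = (
    initial_fingerprint(file_a, salt="/data/a/events.csv")
    == initial_fingerprint(file_b, salt="/data/b/events.csv")
)

print(same_without_salt, same_with_salt)
```
&lt;/PRE&gt;

&lt;P&gt;The same reasoning suggests a salt would not help when the path stays the same and only the content changes, which may be why it did not apply to every case.&lt;/P&gt;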

&lt;P&gt;As for monitoring, I suggest searching splunkd.log for the filenames. I can't give you the exact syntax, but you may be able to group the relevant lines into a transaction and count them. If there are fewer than 5, Splunk has missed a file.&lt;/P&gt;</description>
      <pubDate>Fri, 06 Feb 2015 15:07:53 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Directory-Source/m-p/129123#M26500</guid>
      <dc:creator>jpvlsmv</dc:creator>
      <dc:date>2015-02-06T15:07:53Z</dc:date>
    </item>
    <item>
      <title>Re: Directory / Source</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Directory-Source/m-p/129124#M26501</link>
      <description>&lt;P&gt;I don't think this is caused by the CRC check.&lt;BR /&gt;
If I take the contents, paste them into another file, and put that file in the directory under the same name, Splunk picks it up immediately.&lt;BR /&gt;
It's really weird because, as I said, with the replacement file Splunk reads it, and when I query I find the data!&lt;/P&gt;

&lt;P&gt;Sorry, I have 6 files in the folder (not 5):&lt;BR /&gt;
DcServerX__eventlogs_application.CSV&lt;BR /&gt;
DcServerX__eventlogs_DS.csv&lt;BR /&gt;
DcServerX__eventlogs_FRS.csv&lt;BR /&gt;
DcServerX__eventlogs_security.csv&lt;BR /&gt;
DcServerX__eventlogs_system.csv&lt;BR /&gt;
and DcServerX__eventlogs_DNS.csv.&lt;BR /&gt;
The DNS file is the one that sometimes causes problems, and I do not know why.&lt;/P&gt;

&lt;P&gt;My inputs.conf is below (it works for the other 5 files):&lt;BR /&gt;
[monitor://D:SplunkDataInputTI-WINEVENTS-RNI.csv]&lt;BR /&gt;
index = TI-WINEVENTS-RNI&lt;BR /&gt;
host_segment = 3&lt;/P&gt;

&lt;P&gt;What solution would you suggest?&lt;BR /&gt;
.&lt;BR /&gt;
.&lt;BR /&gt;
.&lt;BR /&gt;
Here are the search results (newest to oldest):&lt;BR /&gt;
.&lt;BR /&gt;
.&lt;BR /&gt;
04/02/15 04:17:39,455&lt;BR /&gt;&lt;BR /&gt;
02-04-2015 04:17:39.455 +0000 INFO  WatchedFile - File too small to check seekcrc, probably truncated.  Will re-read entire file='D:/SplunkDataInput/TI-WINEVENTS-RNI/BLABLABLA_eventlogs_DNS.csv'.&lt;BR /&gt;
host = TOTO source = C:\Program Files/SplunkUniversalForwarder/var/log/splunk/splunkd.log sourcetype = splunkd&lt;BR /&gt;
.&lt;BR /&gt;
.&lt;BR /&gt;
03/02/15 04:15:58,401&lt;BR /&gt;&lt;BR /&gt;
02-03-2015 04:15:58.401 +0000 INFO  WatchedFile - Will begin reading at offset=0 for file='D:\SplunkDataInput\TI-WINEVENTS-RNI\BLABLABLA__eventlogs_DNS.csv'.&lt;BR /&gt;
host = TOTO source = C:/Program Files/SplunkUniversalForwarder/var/log/splunk/splunkd.log sourcetype = splunkd&lt;BR /&gt;
.&lt;BR /&gt;
.&lt;BR /&gt;
03/02/15 04:15:58,401&lt;BR /&gt;&lt;BR /&gt;
02-03-2015 04:15:58.401 +0000 INFO  WatchedFile - Checksum for seekptr didn't match, will re-read entire file='D:/SplunkDataInput/TI-WINEVENTS-RNI/BLABLABLA__eventlogs_DNS.csv'.&lt;BR /&gt;
host = TOTO source = C:/Program Files/SplunkUniversalForwarder/var/log/splunk/splunkd.log sourcetype = splunkd&lt;BR /&gt;
.&lt;BR /&gt;
.&lt;BR /&gt;
02/02/15 04:15:18,748&lt;BR /&gt;&lt;BR /&gt;
02-02-2015 04:15:18.748 +0000 INFO  WatchedFile - Will begin reading at offset=0 for file='D:/SplunkDataInput/TI-WINEVENTS-RNI/BLABLABLA__eventlogs_DNS.csv'.&lt;BR /&gt;
host = TOTO source = C:/Program Files/SplunkUniversalForwarder/var/log/splunk/splunkd.log sourcetype = splunkd&lt;BR /&gt;
.&lt;BR /&gt;
.&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2020 18:50:57 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Directory-Source/m-p/129124#M26501</guid>
      <dc:creator>rene847</dc:creator>
      <dc:date>2020-09-28T18:50:57Z</dc:date>
    </item>
    <item>
      <title>Re: Directory / Source</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Directory-Source/m-p/129125#M26502</link>
      <description>&lt;P&gt;In real time, I put a new file (with the same content) in place. Here is the log:&lt;BR /&gt;
.&lt;BR /&gt;
.&lt;BR /&gt;
06/02/15 19:45:51,454&lt;BR /&gt;&lt;BR /&gt;
02-06-2015 19:45:51.454 +0000 INFO  WatchedFile - File too small to check seekcrc, probably truncated.  Will re-read entire file='D:/SplunkDataInput/TI-WINEVENTS-RNI/BLABLABLA__eventlogs_DNS.csv'.&lt;BR /&gt;
host = TOTO source = C:/Program Files/SplunkUniversalForwarder/var/log/splunk/splunkd.log sourcetype = splunkd&lt;BR /&gt;
.&lt;BR /&gt;
.&lt;BR /&gt;
Then I put the old file back and got:&lt;BR /&gt;
.&lt;BR /&gt;
06/02/15 19:46:07,569&lt;BR /&gt;&lt;BR /&gt;
02-06-2015 19:46:07.569 +0000 INFO  WatchedFile - Will begin reading at offset=3717 for file='D:/SplunkDataInput/TI-WINEVENTS-RNI/BLABLABLA__eventlogs_DNS.csv'.&lt;BR /&gt;
host = TOTO source = C:/Program Files/SplunkUniversalForwarder/var/log/splunk/splunkd.log sourcetype = splunkd&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2020 18:51:00 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Directory-Source/m-p/129125#M26502</guid>
      <dc:creator>rene847</dc:creator>
      <dc:date>2020-09-28T18:51:00Z</dc:date>
    </item>
    <item>
      <title>Re: Directory / Source</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Directory-Source/m-p/129126#M26503</link>
      <description>&lt;P&gt;The "new file" with the same content is being reported as too small to check seekcrc and probably truncated. Perhaps Splunk is trying to read the file before it has finished copying into place?&lt;/P&gt;
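
&lt;P&gt;If a partially written file is indeed the cause (an assumption, since nothing in the logs above confirms it), one common mitigation is to have the export script write to a temporary file that the monitor input does not match and then rename it into place in a single step. A minimal Python sketch follows; the helper name write_then_rename, the .tmp suffix, and the demo file name are hypothetical, and whether this cures the problem described here is untested.&lt;/P&gt;

&lt;PRE&gt;
```python
import os
import tempfile


def write_then_rename(final_path: str, text: str) -> None:
    """Write text to a temporary file, then move it into place in one step."""
    directory = os.path.dirname(os.path.abspath(final_path))
    # Create the temporary file in the same directory so the rename stays on
    # one filesystem; ".tmp" is an assumed extension the monitor does not match.
    fd, tmp_path = tempfile.mkstemp(suffix=".tmp", dir=directory)
    try:
        with os.fdopen(fd, "w", encoding="utf-8", newline="") as handle:
            handle.write(text)
            handle.flush()
            os.fsync(handle.fileno())
        # os.replace overwrites an existing destination on POSIX and Windows.
        os.replace(tmp_path, final_path)
    except BaseException:
        # Remove the temporary file if anything failed before the rename.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise


if __name__ == "__main__":
    demo_dir = tempfile.mkdtemp()
    target = os.path.join(demo_dir, "example_eventlogs_DNS.csv")
    write_then_rename(target, "TimeGenerated,EventID\r\n2015-02-06,4000\r\n")
    print(os.listdir(demo_dir))
```
&lt;/PRE&gt;

&lt;P&gt;With this approach a reader of the final path sees either the old complete file or the new complete file, never a half-written one.&lt;/P&gt;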

&lt;P&gt;Otherwise, I'm out of ideas &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;  Sorry.&lt;/P&gt;</description>
      <pubDate>Mon, 09 Feb 2015 14:41:35 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Directory-Source/m-p/129126#M26503</guid>
      <dc:creator>jpvlsmv</dc:creator>
      <dc:date>2015-02-09T14:41:35Z</dc:date>
    </item>
    <item>
      <title>Re: Directory / Source</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Directory-Source/m-p/129127#M26504</link>
      <description>&lt;P&gt;Thanks for your support, I appreciate it!&lt;BR /&gt;
Best Regards&lt;/P&gt;</description>
      <pubDate>Mon, 09 Feb 2015 15:50:50 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Directory-Source/m-p/129127#M26504</guid>
      <dc:creator>rene847</dc:creator>
      <dc:date>2015-02-09T15:50:50Z</dc:date>
    </item>
    <item>
      <title>Re: Directory / Source</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Directory-Source/m-p/129128#M26505</link>
      <description>&lt;P&gt;Does anyone else have an idea about my problem?&lt;/P&gt;</description>
      <pubDate>Mon, 09 Feb 2015 17:38:59 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Directory-Source/m-p/129128#M26505</guid>
      <dc:creator>rene847</dc:creator>
      <dc:date>2015-02-09T17:38:59Z</dc:date>
    </item>
    <item>
      <title>Re: Directory / Source</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Directory-Source/m-p/129129#M26506</link>
      <description>&lt;P&gt;Finally, we deleted the directory, created a new one, and wrote a new query, and now it works.&lt;BR /&gt;
Weird.&lt;/P&gt;</description>
      <pubDate>Wed, 17 Jun 2015 18:51:03 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Directory-Source/m-p/129129#M26506</guid>
      <dc:creator>rene847</dc:creator>
      <dc:date>2015-06-17T18:51:03Z</dc:date>
    </item>
  </channel>
</rss>

