<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Issues with csv Splunk File Monitoring in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Issues-with-csv-Splunk-File-Monitoring/m-p/747798#M118808</link>
    <description>&lt;P&gt;Hi Splunkers, a colleague team si facing some issues related to .csv file collection. Let me share&amp;nbsp; the required context.&lt;/P&gt;&lt;P&gt;We have a .csv file that is sent to a sftp server. The sending is 1 per day: this means that every day, the file is write once and never modified. In addiction to this, even if the file is a csv one, it has a .log extension.&lt;/P&gt;&lt;P&gt;On this server, the Splunk UF is installed and configured to read this daily file.&lt;/P&gt;&lt;P&gt;What currently happen is the following:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;The file is read many time: multiple occurrence of error message like:&amp;nbsp;&lt;P&gt;&lt;STRONG&gt;INFO&amp;nbsp; WatchedFile [23227 tailreader0] - File too small to check seekcrc, probably truncated.&amp;nbsp; Will re-read entire file=&amp;lt;file name here&amp;gt;&lt;/STRONG&gt; can be got from internal logs&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;The csv header is viewed like an event. This means that, for example, the file contains 1000 events, performing a search in assigned index we have 1000 + x&amp;nbsp; events; each of this x events does not contains real events, but the csv header file. So, we see the header as an event/logs.&lt;/SPAN&gt;&lt;/P&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;&lt;SPAN&gt;For the first problem, I suggested to my team to use the&amp;nbsp;&lt;STRONG&gt;initCrcLength&lt;/STRONG&gt; parameter, properly set.&lt;BR /&gt;For the second one, I shared them to ensure that following parameter are set:&lt;/SPAN&gt;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;INDEXED_EXTRACTIONS = csv
HEADER_FIELD_LINE_NUMBER = 1
CHECK_FOR_HEADER = true
 &lt;/LI-CODE&gt;&lt;P&gt;In addition to this, I suggested them to avoid the default line breaker; in the inputs.conf file is set the following one:&lt;/P&gt;&lt;LI-CODE lang="markup"&gt; LINE_BREAKER = ([\r\n]+)&lt;/LI-CODE&gt;&lt;P&gt;That could be the root cause/one of the cause of header extraction as events.&lt;/P&gt;&lt;P&gt;I don't know if those changes has fixed the events (they are still performing required restarts), but I would ask you if any other possible fix should be applied.&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
    <pubDate>Wed, 11 Jun 2025 09:26:37 GMT</pubDate>
    <dc:creator>SplunkExplorer</dc:creator>
    <dc:date>2025-06-11T09:26:37Z</dc:date>
    <item>
      <title>Issues with csv Splunk File Monitoring</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Issues-with-csv-Splunk-File-Monitoring/m-p/747798#M118808</link>
      <description>&lt;P&gt;Hi Splunkers, a colleague team si facing some issues related to .csv file collection. Let me share&amp;nbsp; the required context.&lt;/P&gt;&lt;P&gt;We have a .csv file that is sent to a sftp server. The sending is 1 per day: this means that every day, the file is write once and never modified. In addiction to this, even if the file is a csv one, it has a .log extension.&lt;/P&gt;&lt;P&gt;On this server, the Splunk UF is installed and configured to read this daily file.&lt;/P&gt;&lt;P&gt;What currently happen is the following:&lt;/P&gt;&lt;OL&gt;&lt;LI&gt;The file is read many time: multiple occurrence of error message like:&amp;nbsp;&lt;P&gt;&lt;STRONG&gt;INFO&amp;nbsp; WatchedFile [23227 tailreader0] - File too small to check seekcrc, probably truncated.&amp;nbsp; Will re-read entire file=&amp;lt;file name here&amp;gt;&lt;/STRONG&gt; can be got from internal logs&lt;/P&gt;&lt;/LI&gt;&lt;LI&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;The csv header is viewed like an event. This means that, for example, the file contains 1000 events, performing a search in assigned index we have 1000 + x&amp;nbsp; events; each of this x events does not contains real events, but the csv header file. So, we see the header as an event/logs.&lt;/SPAN&gt;&lt;/P&gt;&lt;/LI&gt;&lt;/OL&gt;&lt;P&gt;&lt;SPAN&gt;For the first problem, I suggested to my team to use the&amp;nbsp;&lt;STRONG&gt;initCrcLength&lt;/STRONG&gt; parameter, properly set.&lt;BR /&gt;For the second one, I shared them to ensure that following parameter are set:&lt;/SPAN&gt;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;INDEXED_EXTRACTIONS = csv
HEADER_FIELD_LINE_NUMBER = 1
CHECK_FOR_HEADER = true
 &lt;/LI-CODE&gt;&lt;P&gt;In addition to this, I suggested them to avoid the default line breaker; in the inputs.conf file is set the following one:&lt;/P&gt;&lt;LI-CODE lang="markup"&gt; LINE_BREAKER = ([\r\n]+)&lt;/LI-CODE&gt;&lt;P&gt;That could be the root cause/one of the cause of header extraction as events.&lt;/P&gt;&lt;P&gt;I don't know if those changes has fixed the events (they are still performing required restarts), but I would ask you if any other possible fix should be applied.&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Wed, 11 Jun 2025 09:26:37 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Issues-with-csv-Splunk-File-Monitoring/m-p/747798#M118808</guid>
      <dc:creator>SplunkExplorer</dc:creator>
      <dc:date>2025-06-11T09:26:37Z</dc:date>
    </item>
    <item>
      <title>Re: Issues with csv Splunk File Monitoring</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Issues-with-csv-Splunk-File-Monitoring/m-p/747803#M118809</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/249714"&gt;@SplunkExplorer&lt;/a&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I think the message about re-reading the file shouldnt be an issue in your case.&lt;/P&gt;&lt;P&gt;You mentioned setting LINE_BREAKER in inputs.conf, however this should be in props.conf - having said that - I think the default should be sufficient for your CSV file.&lt;/P&gt;&lt;P&gt;If you set&amp;nbsp;HEADER_FIELD_LINE_NUMBER=0 (default) do you get the same results?&lt;/P&gt;&lt;P&gt;What does the first line with the headers look like, is it a typical comma (,) separated list of headers? No quotes, spaces,tabs etc etc? If so the default&amp;nbsp;FIELD_DELIMITER should suffice but want to check.&lt;/P&gt;&lt;P&gt;I'm not 100% sure I follow what you mean about the headers, do you mean that for each event you also see the header printed?&lt;/P&gt;&lt;P&gt;&lt;span class="lia-unicode-emoji" title=":glowing_star:"&gt;🌟&lt;/span&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;&lt;STRONG&gt;Did this answer help you?&lt;/STRONG&gt;&lt;SPAN&gt;&amp;nbsp;&lt;/SPAN&gt;If so, please consider:&lt;/P&gt;&lt;UL&gt;&lt;LI&gt;Adding karma to show it was useful&lt;/LI&gt;&lt;LI&gt;Marking it as the solution if it resolved your issue&lt;/LI&gt;&lt;LI&gt;Commenting if you need any clarification&lt;/LI&gt;&lt;/UL&gt;&lt;P&gt;Your feedback encourages the volunteers in this community to continue contributing&lt;/P&gt;</description>
      <pubDate>Wed, 11 Jun 2025 10:36:40 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Issues-with-csv-Splunk-File-Monitoring/m-p/747803#M118809</guid>
      <dc:creator>livehybrid</dc:creator>
      <dc:date>2025-06-11T10:36:40Z</dc:date>
    </item>
    <item>
      <title>Re: Issues with csv Splunk File Monitoring</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Issues-with-csv-Splunk-File-Monitoring/m-p/747828#M118810</link>
      <description>Can you show current inputs.conf and props.conf stanzas for this CSV file?&lt;BR /&gt;And example (modified) from 1st 2 lines (header + real masked events) from that file?</description>
      <pubDate>Wed, 11 Jun 2025 15:50:51 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Issues-with-csv-Splunk-File-Monitoring/m-p/747828#M118810</guid>
      <dc:creator>isoutamo</dc:creator>
      <dc:date>2025-06-11T15:50:51Z</dc:date>
    </item>
    <item>
      <title>Re: Issues with csv Splunk File Monitoring</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Issues-with-csv-Splunk-File-Monitoring/m-p/747938#M118836</link>
      <description>&lt;P&gt;Hi!&amp;nbsp;&lt;/P&gt;&lt;P&gt;here below what you have requested:&amp;nbsp;&lt;/P&gt;&lt;P&gt;props.conf&lt;BR /&gt;[GDPR_ZUORA]&lt;BR /&gt;SHOULD_LINEMERGE=false&lt;BR /&gt;#LINE_BREAKER=([\r\n]+)&lt;BR /&gt;NO_BINARY_CHECK=true&lt;BR /&gt;CHARSET=UTF-8&lt;BR /&gt;INDEXED_EXTRACTIONS=csv&lt;BR /&gt;KV_MODE=none&lt;BR /&gt;category=Structured&lt;BR /&gt;description=Comma-separated value format. Set header and other settings in "Delimited Settings"&lt;BR /&gt;pulldown_type=true&lt;BR /&gt;HEADER_FIELD_LINE_NUMBER = 1&lt;BR /&gt;CHECK_FOR_HEADER = true&lt;BR /&gt;#SHOULD_LINEMERGE = false&lt;BR /&gt;#FIELD_DELIMITER = ,&lt;BR /&gt;#FIELD_NAMES = date,hostname,app,action,ObjectName,user,operation,value_before,value_after,op_target,description&lt;/P&gt;&lt;P&gt;inputs.conf&lt;BR /&gt;[monitor:///sftp/Zuora/LOG-Zuora-*.log]&lt;BR /&gt;disabled = false&lt;BR /&gt;index = sftp_compliance&lt;BR /&gt;sourcetype = GDPR_ZUORA&lt;BR /&gt;source = GDPR_ZUORA&lt;BR /&gt;initCrcLength = 256&lt;/P&gt;&lt;P&gt;First 2 lines of the file monitored:&lt;BR /&gt;DataOra,ServerSorgente,Applicazione,TipoAzione,TipologiaOperazione,ServerDestinazione,UserID,UserName,OldValue,NewValue,Note&lt;BR /&gt;2025-06-05T23:22:01.157Z,,Zuora,Tenant Property,UPDATED,,3,ScheduledJobUser,2025-06-04T22:07:09.005473Z,2025-06-05T22:21:30.642092Z,BIN_DATA_UPDATE_FROM&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 13 Jun 2025 13:19:36 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Issues-with-csv-Splunk-File-Monitoring/m-p/747938#M118836</guid>
      <dc:creator>marsantamaria</dc:creator>
      <dc:date>2025-06-13T13:19:36Z</dc:date>
    </item>
  </channel>
</rss>

