<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Ingest entire XML file in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Ingest-entire-XML-file/m-p/530215#M89268</link>
    <description>&lt;P&gt;Splunk tries its best to avoid re-indexing entire files that are ingesting via a &lt;FONT face="courier new,courier"&gt;monitor&lt;/FONT&gt; stanza.&amp;nbsp; I'm not aware of any setting to override that behavior.&lt;/P&gt;&lt;P&gt;Consider using &lt;FONT face="courier new,courier"&gt;batch&lt;/FONT&gt; input, instead.&amp;nbsp; Splunk will read the entire file, but will delete it afterward.&amp;nbsp; That means your application must be prepared to re-create the file.&amp;nbsp; It also runs the risk of a race condition between Splunk and your app.&amp;nbsp; Can the application be configured to write a new file instead of overwriting existing files?&lt;/P&gt;</description>
    <pubDate>Fri, 20 Nov 2020 14:30:24 GMT</pubDate>
    <dc:creator>richgalloway</dc:creator>
    <dc:date>2020-11-20T14:30:24Z</dc:date>
    <item>
      <title>Ingest entire XML file</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Ingest-entire-XML-file/m-p/529280#M89174</link>
      <description>&lt;P&gt;Hello everybody,&lt;/P&gt;&lt;P&gt;we are monitoring via Universal Forwarder several directories with a large XML file in there (around 1000 lines). These files changes every few seconds, and the change also involves the timestamp which is written in the first 256 bytes of the file.&lt;/P&gt;&lt;P&gt;I need to ingest these files entirely at every change but, instead, Splunk ingest me these files only one time every some hours or even days. Do you have any suggestion on how can I fix this?&lt;/P&gt;&lt;P&gt;Here's the props.conf in my heavy forwarders (we have a distributed environment):&lt;BR /&gt;[xml_atm]&lt;BR /&gt;TRANSFORMS-routing=xmlatm-route&lt;BR /&gt;SHOULD_LINEMERGE=true&lt;BR /&gt;LINE_BREAKER=(?:restart)([\r\n]+)&lt;BR /&gt;CHARSET=ISO-8859-1&lt;BR /&gt;CHECK_METHOD = modtime&lt;BR /&gt;MAX_EVENTS=4000&lt;BR /&gt;TRUNCATE=0&lt;BR /&gt;disabled=false&lt;BR /&gt;TIME_PREFIX=restart-flag="&lt;BR /&gt;REPORT-xmlext=xml-extr&lt;BR /&gt;&lt;BR /&gt;While inputs.conf in UF is this:&lt;BR /&gt;&lt;SPAN&gt;[monitor://D:\ABC\Monitor\Monitor\Inputs\*\*.xml]&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;disabled = 0&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;host_segment = 5&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;index = my_index&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;sourcetype = xml_atm&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;Thanks in advance.&lt;/P&gt;</description>
      <pubDate>Fri, 13 Nov 2020 14:18:39 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Ingest-entire-XML-file/m-p/529280#M89174</guid>
      <dc:creator>nicofantinato</dc:creator>
      <dc:date>2020-11-13T14:18:39Z</dc:date>
    </item>
    <item>
      <title>Re: Ingest entire XML file</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Ingest-entire-XML-file/m-p/529300#M89176</link>
      <description>&lt;P&gt;Please share the relevant inputs.conf stanza from the UFs.&lt;/P&gt;</description>
      <pubDate>Fri, 13 Nov 2020 13:46:48 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Ingest-entire-XML-file/m-p/529300#M89176</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2020-11-13T13:46:48Z</dc:date>
    </item>
    <item>
      <title>Re: Ingest entire XML file</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Ingest-entire-XML-file/m-p/529301#M89177</link>
      <description>&lt;P&gt;Here is the inputs.conf on the UF&lt;BR /&gt;[monitor://D:\ABC\Monitor\Monitor\Inputs\*\*.xml]&lt;BR /&gt;disabled = 0&lt;BR /&gt;host_segment = 5&lt;BR /&gt;index = my_index&lt;BR /&gt;sourcetype = xml_atm&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 13 Nov 2020 13:55:12 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Ingest-entire-XML-file/m-p/529301#M89177</guid>
      <dc:creator>nicofantinato</dc:creator>
      <dc:date>2020-11-13T13:55:12Z</dc:date>
    </item>
    <item>
      <title>Re: Ingest entire XML file</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Ingest-entire-XML-file/m-p/530191#M89266</link>
      <description>&lt;P&gt;Hi, do you have any suggestion? I'm still unable to ingest entire XML files every time their modification time changes.&lt;/P&gt;</description>
      <pubDate>Fri, 20 Nov 2020 11:32:10 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Ingest-entire-XML-file/m-p/530191#M89266</guid>
      <dc:creator>nicofantinato</dc:creator>
      <dc:date>2020-11-20T11:32:10Z</dc:date>
    </item>
    <item>
      <title>Re: Ingest entire XML file</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Ingest-entire-XML-file/m-p/530215#M89268</link>
      <description>&lt;P&gt;Splunk tries its best to avoid re-indexing entire files that are ingesting via a &lt;FONT face="courier new,courier"&gt;monitor&lt;/FONT&gt; stanza.&amp;nbsp; I'm not aware of any setting to override that behavior.&lt;/P&gt;&lt;P&gt;Consider using &lt;FONT face="courier new,courier"&gt;batch&lt;/FONT&gt; input, instead.&amp;nbsp; Splunk will read the entire file, but will delete it afterward.&amp;nbsp; That means your application must be prepared to re-create the file.&amp;nbsp; It also runs the risk of a race condition between Splunk and your app.&amp;nbsp; Can the application be configured to write a new file instead of overwriting existing files?&lt;/P&gt;</description>
      <pubDate>Fri, 20 Nov 2020 14:30:24 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Ingest-entire-XML-file/m-p/530215#M89268</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2020-11-20T14:30:24Z</dc:date>
    </item>
    <item>
      <title>Re: Ingest entire XML file</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Ingest-entire-XML-file/m-p/532502#M89517</link>
      <description>&lt;P&gt;Hi, turned out we also needed to add directive&amp;nbsp;crcSalt = &amp;lt;SOURCE&amp;gt; in inputs.conf on UFs. Adding this all worked as expected.&lt;/P&gt;&lt;P&gt;inputs.conf became simply:&lt;/P&gt;&lt;P&gt;[&lt;SPAN&gt;monitor://D:\ABC\Monitor\Monitor\Inputs\*\*.xml&lt;/SPAN&gt;]&lt;BR /&gt;disabled = 0&lt;BR /&gt;host_segment = 5&lt;BR /&gt;crcSalt = &amp;lt;SOURCE&amp;gt;&lt;BR /&gt;index = &lt;SPAN&gt;my_index&lt;/SPAN&gt;&lt;BR /&gt;sourcetype = &lt;SPAN&gt;xml_atm&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 09 Dec 2020 16:54:21 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Ingest-entire-XML-file/m-p/532502#M89517</guid>
      <dc:creator>nicofantinato</dc:creator>
      <dc:date>2020-12-09T16:54:21Z</dc:date>
    </item>
    <item>
      <title>Re: Ingest entire XML file</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Ingest-entire-XML-file/m-p/553097#M91720</link>
      <description>&lt;P&gt;I have the exact same config as you, the only difference is that I want "source" also, if I define a source value then host names goes back to default servername, in this case host/host_regex/host_segment, nothing works if "source" is defined.&lt;/P&gt;&lt;P&gt;Do you have any suggestion what I can try? I am also configuring on UF.&lt;/P&gt;&lt;P&gt;Sample:&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;[&lt;/SPAN&gt;&lt;SPAN&gt;monitor://D:\ABC\Monitor\Monitor\Inputs\*\*.xml&lt;/SPAN&gt;&lt;SPAN&gt;]&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;disabled = 0&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;host_segment = 5&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;crcSalt = &amp;lt;SOURCE&amp;gt;&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;index =&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;my_index&lt;/SPAN&gt;&lt;BR /&gt;&lt;SPAN&gt;sourcetype =&amp;nbsp;&lt;/SPAN&gt;&lt;SPAN&gt;xml_atm&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN&gt;source = B_wks&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 26 May 2021 09:18:13 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Ingest-entire-XML-file/m-p/553097#M91720</guid>
      <dc:creator>ff9231</dc:creator>
      <dc:date>2021-05-26T09:18:13Z</dc:date>
    </item>
  </channel>
</rss>

