<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Configurations that Filters Existing Orders from AS/400 File in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Configurations-that-Filters-Existing-Orders-from-AS-400-File/m-p/175082#M35155</link>
    <description>&lt;P&gt;This &lt;EM&gt;might&lt;/EM&gt; be possible if you &lt;/P&gt;

&lt;P&gt;(1) Keep the same file name. In other words, overwrite the old file with the new file each hour.&lt;/P&gt;

&lt;P&gt;(2) Make sure that the beginning of the file (up to the point of the new data) has not changed.&lt;/P&gt;

&lt;P&gt;But if Splunk figures out that this is a different file, it will index it from the beginning, causing the duplication that you are trying to avoid.&lt;/P&gt;

&lt;P&gt;There is no way for Splunk to compare inbound data with existing data before indexing. However, it is possible to "dedup" data being retreived during a search - although you have to do it explicitly with the &lt;CODE&gt;uniq&lt;/CODE&gt; command.&lt;/P&gt;

&lt;P&gt;I would test it out.&lt;/P&gt;</description>
    <pubDate>Thu, 22 May 2014 20:05:03 GMT</pubDate>
    <dc:creator>lguinn2</dc:creator>
    <dc:date>2014-05-22T20:05:03Z</dc:date>
    <item>
      <title>Configurations that Filters Existing Orders from AS/400 File</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Configurations-that-Filters-Existing-Orders-from-AS-400-File/m-p/175081#M35154</link>
      <description>&lt;P&gt;I'm looking to come up with some configurations that filter out existing orders from files I (currently) manually copy to a local directory where Splunk then picks them up and indexes the order info for that hour.&lt;/P&gt;

&lt;P&gt;Each file that comes out from the AS/400 has a total of all orders with various information on that order(CustomerNumber, PONumber, Date, Time, etc.) up to that particular hour.&lt;/P&gt;

&lt;P&gt;Basically every time Splunk Picks up one of those files, I want it so that Splunk only indexes the NEW orders, rather than indexing the same order data from the previous hours.  Otherwise, there will be a large amount of duplicate data being indexed.&lt;/P&gt;

&lt;P&gt;Is there a way I can do this?  Let me know if any information is needed to dig in to this further.&lt;/P&gt;

&lt;P&gt;Thanks in advance!&lt;/P&gt;</description>
      <pubDate>Thu, 22 May 2014 12:29:35 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Configurations-that-Filters-Existing-Orders-from-AS-400-File/m-p/175081#M35154</guid>
      <dc:creator>_gkollias</dc:creator>
      <dc:date>2014-05-22T12:29:35Z</dc:date>
    </item>
    <item>
      <title>Re: Configurations that Filters Existing Orders from AS/400 File</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Configurations-that-Filters-Existing-Orders-from-AS-400-File/m-p/175082#M35155</link>
      <description>&lt;P&gt;This &lt;EM&gt;might&lt;/EM&gt; be possible if you &lt;/P&gt;

&lt;P&gt;(1) Keep the same file name. In other words, overwrite the old file with the new file each hour.&lt;/P&gt;

&lt;P&gt;(2) Make sure that the beginning of the file (up to the point of the new data) has not changed.&lt;/P&gt;

&lt;P&gt;But if Splunk figures out that this is a different file, it will index it from the beginning, causing the duplication that you are trying to avoid.&lt;/P&gt;

&lt;P&gt;There is no way for Splunk to compare inbound data with existing data before indexing. However, it is possible to "dedup" data being retreived during a search - although you have to do it explicitly with the &lt;CODE&gt;uniq&lt;/CODE&gt; command.&lt;/P&gt;

&lt;P&gt;I would test it out.&lt;/P&gt;</description>
      <pubDate>Thu, 22 May 2014 20:05:03 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Configurations-that-Filters-Existing-Orders-from-AS-400-File/m-p/175082#M35155</guid>
      <dc:creator>lguinn2</dc:creator>
      <dc:date>2014-05-22T20:05:03Z</dc:date>
    </item>
    <item>
      <title>Re: Configurations that Filters Existing Orders from AS/400 File</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Configurations-that-Filters-Existing-Orders-from-AS-400-File/m-p/175083#M35156</link>
      <description>&lt;P&gt;Awesome - thanks for your suggestions.  I will some testing around it a bit more and come back with some feedback&lt;/P&gt;</description>
      <pubDate>Fri, 23 May 2014 18:02:51 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Configurations-that-Filters-Existing-Orders-from-AS-400-File/m-p/175083#M35156</guid>
      <dc:creator>_gkollias</dc:creator>
      <dc:date>2014-05-23T18:02:51Z</dc:date>
    </item>
  </channel>
</rss>

