<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How to exclude duplicate data while onboarding the data in the scenario below in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/How-to-exclude-duplicate-Data-while-onboaring-the-data-in-below/m-p/548728#M91209</link>
    <description>&lt;P&gt;I have a Python script that runs daily and saves its output in a CSV file.&lt;/P&gt;&lt;P&gt;For example: if I run the script today, it gets the data from April 1st to today's date (04/21/2021), and if I run it tomorrow, it gets the data from April 1st to tomorrow's date (04/22/2021). The file name is different on every run.&lt;/P&gt;&lt;P&gt;I want to onboard this data into Splunk without duplicate data.&lt;/P&gt;&lt;P&gt;How can we do that?&lt;/P&gt;&lt;P&gt;We have a field called start_time, which we use as the time field.&lt;/P&gt;&lt;P&gt;For example: start_time field value = 04/21/2021 10.30&lt;/P&gt;&lt;P&gt;Example: start_time field value = 04/22/2021 10.30&lt;/P&gt;&lt;P&gt;Thanks in advance&lt;/P&gt;</description>
    <pubDate>Wed, 21 Apr 2021 06:57:23 GMT</pubDate>
    <dc:creator>vikram1583</dc:creator>
    <dc:date>2021-04-21T06:57:23Z</dc:date>
    <item>
      <title>How to exclude duplicate data while onboarding the data in the scenario below</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-exclude-duplicate-Data-while-onboaring-the-data-in-below/m-p/548728#M91209</link>
      <description>&lt;P&gt;I have a Python script that runs daily and saves its output in a CSV file.&lt;/P&gt;&lt;P&gt;For example: if I run the script today, it gets the data from April 1st to today's date (04/21/2021), and if I run it tomorrow, it gets the data from April 1st to tomorrow's date (04/22/2021). The file name is different on every run.&lt;/P&gt;&lt;P&gt;I want to onboard this data into Splunk without duplicate data.&lt;/P&gt;&lt;P&gt;How can we do that?&lt;/P&gt;&lt;P&gt;We have a field called start_time, which we use as the time field.&lt;/P&gt;&lt;P&gt;For example: start_time field value = 04/21/2021 10.30&lt;/P&gt;&lt;P&gt;Example: start_time field value = 04/22/2021 10.30&lt;/P&gt;&lt;P&gt;Thanks in advance&lt;/P&gt;</description>
      <pubDate>Wed, 21 Apr 2021 06:57:23 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-exclude-duplicate-Data-while-onboaring-the-data-in-below/m-p/548728#M91209</guid>
      <dc:creator>vikram1583</dc:creator>
      <dc:date>2021-04-21T06:57:23Z</dc:date>
    </item>
    <item>
      <title>Re: How to exclude duplicate data while onboarding the data in the scenario below</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-exclude-duplicate-Data-while-onboaring-the-data-in-below/m-p/548730#M91210</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/164079"&gt;@vikram1583&lt;/a&gt;&lt;/P&gt;&lt;P&gt;What does the data look like in the two files? Does it change every time the script runs?&lt;/P&gt;&lt;P&gt;Alternatively, index both files and remove the duplicates with Splunk commands such as dedup or dc, depending on your use case.&lt;/P&gt;</description>
      <pubDate>Wed, 21 Apr 2021 07:12:30 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-exclude-duplicate-Data-while-onboaring-the-data-in-below/m-p/548730#M91210</guid>
      <dc:creator>venkatasri</dc:creator>
      <dc:date>2021-04-21T07:12:30Z</dc:date>
    </item>
    <item>
      <title>Re: How to exclude duplicate data while onboarding the data in the scenario below</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-exclude-duplicate-Data-while-onboaring-the-data-in-below/m-p/548815#M91219</link>
      <description>&lt;P&gt;Hi&amp;nbsp;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/163730"&gt;@venkatasri&lt;/a&gt;, thanks for your response. It is not only about two files: I will run the script every day. If I ingest all of those files every day, license usage will increase, so I want to ingest only the new data.&lt;/P&gt;</description>
      <pubDate>Wed, 21 Apr 2021 15:14:03 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-exclude-duplicate-Data-while-onboaring-the-data-in-below/m-p/548815#M91219</guid>
      <dc:creator>vikram1583</dc:creator>
      <dc:date>2021-04-21T15:14:03Z</dc:date>
    </item>
    <item>
      <title>Re: How to exclude duplicate data while onboarding the data in the scenario below</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-exclude-duplicate-Data-while-onboaring-the-data-in-below/m-p/548816#M91220</link>
      <description>&lt;P&gt;The data for previous dates stays the same; each run just adds new data for the current date.&lt;/P&gt;</description>
      <pubDate>Wed, 21 Apr 2021 15:16:33 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-exclude-duplicate-Data-while-onboaring-the-data-in-below/m-p/548816#M91220</guid>
      <dc:creator>vikram1583</dc:creator>
      <dc:date>2021-04-21T15:16:33Z</dc:date>
    </item>
    <item>
      <title>Re: How to exclude duplicate data while onboarding the data in the scenario below</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-exclude-duplicate-Data-while-onboaring-the-data-in-below/m-p/548872#M91233</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;In that case, Splunk's built-in handling already avoids re-indexing duplicate data. Have you configured the monitor inputs? If so, please share your inputs.conf and some sample data files.&lt;/P&gt;</description>
      <pubDate>Thu, 22 Apr 2021 00:00:01 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-exclude-duplicate-Data-while-onboaring-the-data-in-below/m-p/548872#M91233</guid>
      <dc:creator>venkatasri</dc:creator>
      <dc:date>2021-04-22T00:00:01Z</dc:date>
    </item>
  </channel>
</rss>

