<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Modify _indextime to avoid duplication at indexation time in Splunk Dev</title>
    <link>https://community.splunk.com/t5/Splunk-Dev/Modify-indextime-to-avoid-duplication-at-indexation-time/m-p/396503#M6727</link>
    <description>&lt;P&gt;No problem, thanks for trying &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;

&lt;P&gt;In fact I have very few duplicate events, but I can't avoid them more by playing with the data source.&lt;/P&gt;</description>
    <pubDate>Fri, 11 May 2018 15:09:07 GMT</pubDate>
    <dc:creator>Clovisa</dc:creator>
    <dc:date>2018-05-11T15:09:07Z</dc:date>
    <item>
      <title>Modify _indextime to avoid duplication at indexation time</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/Modify-indextime-to-avoid-duplication-at-indexation-time/m-p/396499#M6723</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;I noticed that if I send two times the exact same event, _time included, they are not merged. While investigating, I discoverd the _indextime field that could explain why they are considered as two different events.&lt;/P&gt;

&lt;P&gt;Is it possible to set the _indextime with the value of _time ? Will it "merge" my identical events ? And if not, is there a way to do it at indexation time ?&lt;/P&gt;

&lt;P&gt;Thanks !&lt;/P&gt;</description>
      <pubDate>Fri, 11 May 2018 14:34:13 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/Modify-indextime-to-avoid-duplication-at-indexation-time/m-p/396499#M6723</guid>
      <dc:creator>Clovisa</dc:creator>
      <dc:date>2018-05-11T14:34:13Z</dc:date>
    </item>
    <item>
      <title>Re: Modify _indextime to avoid duplication at indexation time</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/Modify-indextime-to-avoid-duplication-at-indexation-time/m-p/396500#M6724</link>
      <description>&lt;P&gt;I don't think there is any way to influence the _indextime field. That is simply Splunk's internal recording of when it indexed the event. But even if you could: Splunk doesn't merge events that are identical.&lt;/P&gt;

&lt;P&gt;If you want to somehow filter part of the incoming events, provide some more details of how you are ingesting them (what input method etc.) and what the events look like.&lt;/P&gt;

&lt;P&gt;What you can do of course is apply the &lt;CODE&gt;dedup&lt;/CODE&gt; command at search time, to remove duplicates from your search results. It's typically more efficient to dedup based on a certain field, but you can also do a &lt;CODE&gt;dedup _raw&lt;/CODE&gt; to dedup based on the entire original event (note: _raw does not include metadata fields like _indextime, it is just the original raw event itself).&lt;/P&gt;</description>
      <pubDate>Fri, 11 May 2018 14:49:14 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/Modify-indextime-to-avoid-duplication-at-indexation-time/m-p/396500#M6724</guid>
      <dc:creator>FrankVl</dc:creator>
      <dc:date>2018-05-11T14:49:14Z</dc:date>
    </item>
    <item>
      <title>Re: Modify _indextime to avoid duplication at indexation time</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/Modify-indextime-to-avoid-duplication-at-indexation-time/m-p/396501#M6725</link>
      <description>&lt;P&gt;Thanks for your help. I receive the events via an HTTP entrypoint, and it is json. An simplified equivalent of the requests that are sent is :&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;curl -k "splunk:8088/services/collector" \
    -H "Authorization: Splunk 1c0afd4d-d802-gg2c-9fc2-0f428217adf7" \
    -d '{"event": {"Owner": "Toto", "Title": "Hello", "Date":"2018-02-02 11:45:23"}, "sourcetype": "sales", "time":"2018-02-02 11:45:23"}'
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;I will use &lt;CODE&gt;dedup&lt;/CODE&gt; if it is my last option, but I feel like it will be redundant to write this for every request&lt;/P&gt;</description>
      <pubDate>Fri, 11 May 2018 14:53:59 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/Modify-indextime-to-avoid-duplication-at-indexation-time/m-p/396501#M6725</guid>
      <dc:creator>Clovisa</dc:creator>
      <dc:date>2018-05-11T14:53:59Z</dc:date>
    </item>
    <item>
      <title>Re: Modify _indextime to avoid duplication at indexation time</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/Modify-indextime-to-avoid-duplication-at-indexation-time/m-p/396502#M6726</link>
      <description>&lt;P&gt;I don't see an obvious way to filter duplicates for such events during input/parsing phase, but perhaps someone else will come by who has a smart idea for that.&lt;/P&gt;

&lt;P&gt;Can't you tune the data source somehow to reduce sending the same event multiple times?&lt;/P&gt;</description>
      <pubDate>Fri, 11 May 2018 15:04:15 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/Modify-indextime-to-avoid-duplication-at-indexation-time/m-p/396502#M6726</guid>
      <dc:creator>FrankVl</dc:creator>
      <dc:date>2018-05-11T15:04:15Z</dc:date>
    </item>
    <item>
      <title>Re: Modify _indextime to avoid duplication at indexation time</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/Modify-indextime-to-avoid-duplication-at-indexation-time/m-p/396503#M6727</link>
      <description>&lt;P&gt;No problem, thanks for trying &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;

&lt;P&gt;In fact I have very few duplicate events, but I can't avoid them more by playing with the data source.&lt;/P&gt;</description>
      <pubDate>Fri, 11 May 2018 15:09:07 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/Modify-indextime-to-avoid-duplication-at-indexation-time/m-p/396503#M6727</guid>
      <dc:creator>Clovisa</dc:creator>
      <dc:date>2018-05-11T15:09:07Z</dc:date>
    </item>
    <item>
      <title>Re: Modify _indextime to avoid duplication at indexation time</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/Modify-indextime-to-avoid-duplication-at-indexation-time/m-p/396504#M6728</link>
      <description>&lt;P&gt;I don't see a way besides dedup either. This should be fixed at the source/input, if possible. &lt;/P&gt;</description>
      <pubDate>Sat, 12 May 2018 04:50:17 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/Modify-indextime-to-avoid-duplication-at-indexation-time/m-p/396504#M6728</guid>
      <dc:creator>xpac</dc:creator>
      <dc:date>2018-05-12T04:50:17Z</dc:date>
    </item>
  </channel>
</rss>

