<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Index access from scripted or modular input in Splunk Dev</title>
    <link>https://community.splunk.com/t5/Splunk-Dev/Index-access-from-scripted-or-modular-input/m-p/544371#M9760</link>
    <description>&lt;P&gt;Thank you.&lt;/P&gt;&lt;P&gt;Weighing up my options for deduplication of events at index time. Doesn't seem like the correct approach if it's bad practice to access an index from a&amp;nbsp; scripted/modular input.&lt;/P&gt;&lt;P&gt;I know that there's a dedup command at search time, however ideally I wouldn't have duplicate events in the index. The source of these events, which im not in control of, may produce duplicates.&lt;/P&gt;</description>
    <pubDate>Thu, 18 Mar 2021 16:57:06 GMT</pubDate>
    <dc:creator>chrisdev</dc:creator>
    <dc:date>2021-03-18T16:57:06Z</dc:date>
    <item>
      <title>Index access from scripted or modular input</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/Index-access-from-scripted-or-modular-input/m-p/544350#M9758</link>
      <description>&lt;P&gt;Is it possible to access the index from a scripted or modular input? And is the standard way of doing this via the SDK features as shown in examples such as search.py?&lt;/P&gt;</description>
      <pubDate>Thu, 18 Mar 2021 15:26:40 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/Index-access-from-scripted-or-modular-input/m-p/544350#M9758</guid>
      <dc:creator>chrisdev</dc:creator>
      <dc:date>2021-03-18T15:26:40Z</dc:date>
    </item>
    <item>
      <title>Re: Index access from scripted or modular input</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/Index-access-from-scripted-or-modular-input/m-p/544369#M9759</link>
      <description>&lt;P&gt;Scripted and modular inputs are input-only.&amp;nbsp; Technically, they don't access indexes at all, but produce output which Splunk then writes to an index.&lt;/P&gt;&lt;P&gt;What problem are you trying to solve?&lt;/P&gt;</description>
      <pubDate>Thu, 18 Mar 2021 16:49:09 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/Index-access-from-scripted-or-modular-input/m-p/544369#M9759</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2021-03-18T16:49:09Z</dc:date>
    </item>
    <item>
      <title>Re: Index access from scripted or modular input</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/Index-access-from-scripted-or-modular-input/m-p/544371#M9760</link>
      <description>&lt;P&gt;Thank you.&lt;/P&gt;&lt;P&gt;Weighing up my options for deduplication of events at index time. Doesn't seem like the correct approach if it's bad practice to access an index from a&amp;nbsp; scripted/modular input.&lt;/P&gt;&lt;P&gt;I know that there's a dedup command at search time, however ideally I wouldn't have duplicate events in the index. The source of these events, which im not in control of, may produce duplicates.&lt;/P&gt;</description>
      <pubDate>Thu, 18 Mar 2021 16:57:06 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/Index-access-from-scripted-or-modular-input/m-p/544371#M9760</guid>
      <dc:creator>chrisdev</dc:creator>
      <dc:date>2021-03-18T16:57:06Z</dc:date>
    </item>
    <item>
      <title>Re: Index access from scripted or modular input</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/Index-access-from-scripted-or-modular-input/m-p/544382#M9761</link>
      <description>&lt;P&gt;I suggest the modular input cache events internally rather than try to fetch them from an index.&amp;nbsp; If a new event is in the cache then it's a duplicate and can be discarded; otherwise, index it and add it to the cache.&amp;nbsp; Yes, the cache will limit your look-back for duplicates, but will perform vastly better than scanning an index for every incoming event.&amp;nbsp; You'll still need to handle duplicates at search time, but there will be far fewer of them.&lt;/P&gt;</description>
      <pubDate>Thu, 18 Mar 2021 17:56:05 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/Index-access-from-scripted-or-modular-input/m-p/544382#M9761</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2021-03-18T17:56:05Z</dc:date>
    </item>
  </channel>
</rss>

