Splunk Dev

Index access from scripted or modular input

chrisdev
Explorer

Is it possible to access the index from a scripted or modular input? And is the standard way of doing this via the SDK features as shown in examples such as search.py?

Labels (3)
0 Karma

richgalloway
SplunkTrust
SplunkTrust

Scripted and modular inputs are input-only.  Technically, they don't access indexes at all, but produce output which Splunk then writes to an index.

What problem are you trying to solve?

---
If this reply helps you, Karma would be appreciated.

chrisdev
Explorer

Thank you.

Weighing up my options for deduplication of events at index time. Doesn't seem like the correct approach if it's bad practice to access an index from a  scripted/modular input.

I know that there's a dedup command at search time, however ideally I wouldn't have duplicate events in the index. The source of these events, which im not in control of, may produce duplicates.

0 Karma

richgalloway
SplunkTrust
SplunkTrust

I suggest the modular input cache events internally rather than try to fetch them from an index.  If a new event is in the cache then it's a duplicate and can be discarded; otherwise, index it and add it to the cache.  Yes, the cache will limit your look-back for duplicates, but will perform vastly better than scanning an index for every incoming event.  You'll still need to handle duplicates at search time, but there will be far fewer of them.

---
If this reply helps you, Karma would be appreciated.
0 Karma
Get Updates on the Splunk Community!

Automatic Discovery Part 1: What is Automatic Discovery in Splunk Observability Cloud ...

If you’ve ever deployed a new database cluster, spun up a caching layer, or added a load balancer, you know it ...

Real-Time Fraud Detection: How Splunk Dashboards Protect Financial Institutions

Financial fraud isn't slowing down. If anything, it's getting more sophisticated. Account takeovers, credit ...

Splunk + ThousandEyes: Correlate frontend, app, and network data to troubleshoot ...

 Are you tired of troubleshooting delays caused by siloed frontend, application, and network data? We've got a ...