Getting Data In

Does the data flow through those same queues at the indexer?

dokaas_2
Communicator

If an HF is used for a intermediate / aggregation tier and the data is parsed,  what does the ingestion pipeline look like when it hits the indexer.  That is, if the HF does parsing, aggregation, typing, but not indexing, does the data flow through those same queues at the indexer? Or is the data injected directly in the the indexing queue?

Tags (2)
0 Karma
1 Solution

PickleRick
SplunkTrust
SplunkTrust

Of course the point of reference are the invaluable Masa diagrams

https://community.splunk.com/t5/Getting-Data-In/Diagrams-of-how-indexing-works-in-the-Splunk-platfor...

HF is just an indexer which doesn't do local indexing in the IndexerPipeline. So it processes the events up to the indexing queue but then sends the parsed (not cooked! the data is now fully parsed!) to the indexer.

It's not shown on the diagram (because only standalone indexer and UF->idx cases are covered) but as the indexer receives it via S2S tcp input (or http-embedded s2s input) and sees that it has parsed data (the forwarder tells the indexer at the beginning of the connection what kind of data it's sending), it bypasses all local processing and sends the parsed event stream straight to local indexing queue.

View solution in original post

PickleRick
SplunkTrust
SplunkTrust

Of course the point of reference are the invaluable Masa diagrams

https://community.splunk.com/t5/Getting-Data-In/Diagrams-of-how-indexing-works-in-the-Splunk-platfor...

HF is just an indexer which doesn't do local indexing in the IndexerPipeline. So it processes the events up to the indexing queue but then sends the parsed (not cooked! the data is now fully parsed!) to the indexer.

It's not shown on the diagram (because only standalone indexer and UF->idx cases are covered) but as the indexer receives it via S2S tcp input (or http-embedded s2s input) and sees that it has parsed data (the forwarder tells the indexer at the beginning of the connection what kind of data it's sending), it bypasses all local processing and sends the parsed event stream straight to local indexing queue.

gcusello
SplunkTrust
SplunkTrust

Hi @dokaas_2,

if data is passing through HFs they are coocked, so the pipeline is always the same but on Indexers Data arrive already coocked so they don't load Indexers.

Ciao.

Giuseppe

0 Karma
Career Survey
First 500 qualified respondents will receive a $20 gift card! Tell us about your professional Splunk journey.
Get Updates on the Splunk Community!

Tech Talk Recap | Mastering Threat Hunting

Mastering Threat HuntingDive into the world of threat hunting, exploring the key differences between ...

Observability for AI Applications: Troubleshooting Latency

If you’re working with proprietary company data, you’re probably going to have a locally hosted LLM or many ...

Splunk AI Assistant for SPL vs. ChatGPT: Which One is Better?

In the age of AI, every tool promises to make our lives easier. From summarizing content to writing code, ...