Getting Data In

Not all events indexed from file

woolfalr
New Member

I’m new to Splunk, and running with the trial licence exactly as it comes out of the box (no fancy config etc.). i.e. Single instance combining search and indexing, on Windows 7.

Using the downloaded sample data, I see that by inputting the same data file repeatedly, I get a different number of events indexed each time.

So: First I input a file which resulted in 21065 events indexed. (For info this was the apache3.splunk.com/access_combined.log from the sample data).

I input the same file again, using a different host value, and got 26776 events on the summary. Further loads (each with different host value) gave 26693 , 27888 and 24210 events.

Each input (except one - see CLUE below) seems to be associated with one or more “ERROR HTTPServer - InputStreamConduit write timed out after 5 seconds.” in Splunkd.log .

Can anyone help explain this?

CLUE 1: There were no errors in Splunkd.log at the time of the input which resulted in 27888 events.

CLUE 2: From inspecting the input file myself I’d say it contains 27705 events.

CLUE 3: I'm getting similar problem with a different file (apache2) of about the same size. But not with the apache1 sample file which is smaller (9199 events each time consistently)

Thanks for any help!
Rick

0 Karma

woolfalr
New Member

Further analysis after reading a couple of other posts - I exported the raw events (from Splunk) and compared to the original file. It shows we really do have events missing.

From each input of the file, I get one or more blocks of contiguous missing events; around 1200 events each time.

No obvious pattern to this (except fairly consistent size of the missing blocks). Buffering problem? (but as I said, this is straight out of the box, no configuration tweaks)

This seems to tie in with a couple of other threads on here but nobody has any answers yet...

(ps Please ignore Clue 2 earlier – just confusion on my part. The total events in my example is 27888)

0 Karma

Takajian
Builder

I have seen similar issue. I will share my case, but not sure if this help. I saw "linecount" field in left pane of search result, then there were some value "2" or "3". It means splunk accidentally indexed event with wrong break. Could you confirm if this is your case?

0 Karma

woolfalr
New Member

Nice try, but no ; they are all linecount=1 . Keep the suggestions coming 🙂

0 Karma
Got questions? Get answers!

Join the Splunk Community Slack to learn, troubleshoot, and make connections with fellow Splunk practitioners in real time!

Meet up IRL or virtually!

Join Splunk User Groups to connect and learn in-person by region or remotely by topic or industry.

Get Updates on the Splunk Community!

Announcing Modern Navigation: A New Era of Splunk User Experience

We are excited to introduce the Modern Navigation feature in the Splunk Platform, available to both cloud and ...

Modernize your Splunk Apps – Introducing Python 3.13 in Splunk

We are excited to announce that the upcoming releases of Splunk Enterprise 10.2.x and Splunk Cloud Platform ...

Step into “Hunt the Insider: An Splunk ES Premier Mystery” to catch a cybercriminal ...

After a whole week of being on call, you fell asleep on your keyboard, and you hit a sequence of buttons that ...