Splunk Search

Extracting an alphanumeric expression from a line

shilpi
New Member

I have a logger like below and I need to extract the alphanumeric word from this line-

"My employeeID E1233244345 is valid"

The alphanumeric word will always start with E (eg E1233244345) followed by 10 digits.

I need to extract this alphanumeric word from this line.
Please help

Tags (1)
0 Karma

kristian_kolb
Ultra Champion

You can do it from the search query with rex like so;

your base search | rex "My\semployeeID\s(?<EmpID>\w+)\sis\svalid\." | ...

This is a fairly explicit statement, and in order to extract the employeeID (as EmpID), it has to be inside the sentence as specified above. However, the capture specification as such is only defined as 'more than one character'.

An alternative - if the employeeID turns up in other sentences - is to make the specification stricter, and the anchoring looser. I.e. it could follow either a space or a colon, must start with a capital E, followed by 10 digits exactly, then space.

your base search | rex "[:\s](?<EmpID>E\d{10})\s" 

Hope this helps,

/K

shilpi
New Member

Thanks for this answer..It worked:)

0 Karma
Got questions? Get answers!

Join the Splunk Community Slack to learn, troubleshoot, and make connections with fellow Splunk practitioners in real time!

Meet up IRL or virtually!

Join Splunk User Groups to connect and learn in-person by region or remotely by topic or industry.

Get Updates on the Splunk Community!

[Puzzles] Solve, Learn, Repeat: Character substitutions with Regular Expressions

This challenge was first posted on Slack #puzzles channelFor BORE at .conf23, we had a puzzle question which ...

Splunk Community Badges!

  Hey everyone! Ready to earn some serious bragging rights in the community? Along with our existing badges ...

[Puzzles] Solve, Learn, Repeat: Matching cron expressions

This puzzle (first published here) is based on matching timestamps to cron expressions.All the timestamps ...