Splunk Search

Which is the best way to extract fields: field extractor, rex and eval commands, or props and transforms?

vkakani60
Path Finder

There are three ways to extract fields:
1. Using the rex or eval command in a search
2. Using the field extractor
3. Adding entries to props.conf and transforms.conf

Currently, I am using the first method, with rex commands such as:

index="PriorityLogs" sourcetype="WebPay" | rex field=_raw "ProcessId: (?<ProcessId>.*)" | rex field=_raw "EventId: (?<EventId>.*)"

This extracts the desired ProcessId from 99.9% of the events.

As for the field extractor, my logs don't have a consistent structure, which leads to poor field extractions when I use the Splunk field extractor.

Which one is the best method?

1 Solution

vkakani60
Path Finder

After going through a couple of Splunk Answers posts and the documentation, my conclusion is that search-time field extractions are generally preferable to index-time field extractions (which require editing conf files).

https://answers.splunk.com/answers/5817/search-time-versus-index-time-field-extractions.html
https://answers.splunk.com/answers/57247/index-time-field-extraction.html

jeremiahc4
Builder

I've found ".*" to be a very generous grab, usually netting me everything up to the next carriage return (\r) or newline (\n) character.
You might have better luck limiting that in some fashion. Note that ".+" is just as greedy and differs only in requiring at least one character, whereas something like "[\w\d]*" (which doesn't grab spaces or most punctuation) might get you decent results.
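
To illustrate the difference between the greedy and the bounded capture, here is a minimal sketch in Python rather than SPL. The log line is made up, since the thread does not show the real WebPay format, and Python's re module needs the (?P<name>...) spelling where rex accepts (?<name>...). Because \w already includes digits, "[\w\d]*" behaves the same as "\w*"; the sketch uses "\w+" so that an empty value is not accepted.

```python
import re

# Hypothetical log line; the real WebPay format is not shown in the thread.
line = "ProcessId: 4312 EventId: 77 status=ok"

# Python's re module needs (?P<name>...); Splunk's rex accepts (?<name>...).
greedy = re.search(r"ProcessId: (?P<ProcessId>.*)", line)
bounded = re.search(r"ProcessId: (?P<ProcessId>\w+)", line)

greedy_value = greedy.group("ProcessId")    # runs to the end of the line
bounded_value = bounded.group("ProcessId")  # stops at the first non-word character

print(repr(greedy_value))   # '4312 EventId: 77 status=ok'
print(repr(bounded_value))  # '4312'
```

If the ID is always followed by other text on the same line, the bounded form is the one that returns just the ID.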

somesoni2
Revered Legend

Technically, options 2 and 3 are the same: both produce search-time field extractions saved in configuration files. Option 2, the Interactive Field Extractor (IFX), provides an interactive way to build a field extraction and can be used if you're not proficient with regular expressions.

Options 1 and 3 are for advanced users who are comfortable with regular expressions. I'm guessing that the accuracy of your regex (99.9%) is the issue here, so if you can provide sample data for which your inline extraction works and sample data for which it doesn't, Splunkers here may be able to help.
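
One way to collect that kind of sample data is to split events into those the pattern matches and those it misses. The following is a rough sketch in Python, with invented events standing in for real ones and the same (?P<name>...) caveat as any Python port of a rex pattern. Inside Splunk, appending something like | where isnull(ProcessId) to the search should surface the same misses.

```python
import re

# Hypothetical sample events; substitute real lines from the sourcetype.
events = [
    "ProcessId: 4312 EventId: 77",
    "processid=4312 EventId: 77",   # different key format, so the pattern misses it
    "EventId: 78 ProcessId: 9001",
]

pattern = re.compile(r"ProcessId: (?P<ProcessId>\w+)")

# Partition the events by whether the extraction pattern finds a ProcessId.
matched = [e for e in events if pattern.search(e)]
unmatched = [e for e in events if not pattern.search(e)]

for event in unmatched:
    print("no ProcessId extracted:", event)
```

The unmatched lines are the ones worth posting, since they show which variations the regex still needs to handle.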
