Splunk Search

Which is the best way to extract fields: field extractor, rex and eval commands, or props and transforms?

vkakani60
Path Finder

Out of three ways to extract the fields,
1. BY using rex or eval command in search
2. By using field extractor option
3. By adding entries to the prop.conf and transforms.conf

Currently, I am using 1st method, by using rex commands such as

index="PriorityLogs" sourcetype="WebPay" | rex field=_raw "ProcessId: (?<ProcessId>.*)" |  rex field=_raw "EventId: (?<EventId>.*)" 

which extracts the desired ProcessID with 99.9% of the events.

Coming to the field extractor, my logs don't have a proper structure which leads to poor field extractions when i use the Splunk field extractor.

Which one is the best method?

0 Karma
1 Solution

vkakani60
Path Finder

After going through couple of splunk answers and documents. Search time field extractions are good compared to the index-time field extractions (editing conf files).

https://answers.splunk.com/answers/5817/search-time-versus-index-time-field-extractions.html
https://answers.splunk.com/answers/57247/index-time-field-extraction.html

View solution in original post

0 Karma

vkakani60
Path Finder

After going through couple of splunk answers and documents. Search time field extractions are good compared to the index-time field extractions (editing conf files).

https://answers.splunk.com/answers/5817/search-time-versus-index-time-field-extractions.html
https://answers.splunk.com/answers/57247/index-time-field-extraction.html

0 Karma

jeremiahc4
Builder

I've found ".*" to be a very generous grab, usually netting me everything up to the next line return (\r) or newline (\n) character.
You might have better luck limiting that in some fashion. Something like ".+" (slightly less generous grab) or "[\w\d]*" (doesn't grab spaces and various punctuation) might get you decent results.

0 Karma

somesoni2
Revered Legend

Technically, option 2 and option 3 are same (both are 'saved search time field extraction in configuration files'). The option 2 IFX provides an interactive method to do field extraction and can be used if you're not proficient with regular expression.

The option 1 and 3 is for advanced users who are good at regular expression. I'm guessing that efficiency of regex (99.9% ) is your issue here, so if you can provide sample data for which your inline extraction is working and for which it's not, splunkers here may be able to help you with that.

0 Karma
Get Updates on the Splunk Community!

Customer Experience | Splunk 2024: New Onboarding Resources

In 2023, we were routinely reminded that the digital world is ever-evolving and susceptible to new ...

Celebrate CX Day with Splunk: Take our interactive quiz, join our LinkedIn Live ...

Today and every day, Splunk celebrates the importance of customer experience throughout our product, ...

How to Get Started with Splunk Data Management Pipeline Builders (Edge Processor & ...

If you want to gain full control over your growing data volumes, check out Splunk’s Data Management pipeline ...