Splunk Search

Why won't my regular expression extract fields when the _raw field is greater than 56kb?

bitfhacker
New Member

Hi,

I'm trying to extract two fields with this regular expression:

Transaction\sID=\"(?P<Transaction_ID>\w*)\".*OperationCode=\"(?P<Transaction_OperationCode>\w*)\"

and it works in almost all records but it seems that when the _raw field is greater than 56kb, the regular expression doesn't work.

Any clue?

0 Karma

DalJeanis
Legend

Probably reaching catastrophic backtracking because of the greedy .* before "OperationCode=". That is going to take everything in the entire 56K until it hits the end, then back up and look for the very last place it finds "OperationCode=" before the end. Sometimes you want that behavior, but not in this case.

Instead, you want it to be lazy, and stop slurping its soup the first time it encounters "OperationCode=".

Change that middle snippet to this... we're just adding a question mark to make the .* before "OperationCode=" become "lazy" instead of "greedy".

\".*?OperationCode=\"
0 Karma

asimagu
Builder

are you able to provide a sample for the not working ones?

0 Karma
Career Survey
First 500 qualified respondents will receive a $20 gift card! Tell us about your professional Splunk journey.
Get Updates on the Splunk Community!

Thanks for the Memories! Splunk University, .conf25, and our Community

Thank you to everyone in the Splunk Community who joined us for .conf25, which kicked off with our iconic ...

Data Persistence in the OpenTelemetry Collector

This blog post is part of an ongoing series on OpenTelemetry. What happens if the OpenTelemetry collector ...

Introducing Splunk 10.0: Smarter, Faster, and More Powerful Than Ever

Now On Demand Whether you're managing complex deployments or looking to future-proof your data ...