Splunk Search

Why won't my regular expression extract fields when the _raw field is greater than 56kb?

New Member

Hi,

I'm trying to extract two fields with this regular expression:

Transaction\sID=\"(?P<Transaction_ID>\w*)\".*OperationCode=\"(?P<Transaction_OperationCode>\w*)\"

and it works in almost all records but it seems that when the _raw field is greater than 56kb, the regular expression doesn't work.

Any clue?

0 Karma

SplunkTrust
SplunkTrust

Probably reaching catastrophic backtracking because of the greedy .* before "OperationCode=". That is going to take everything in the entire 56K until it hits the end, then back up and look for the very last place it finds "OperationCode=" before the end. Sometimes you want that behavior, but not in this case.

Instead, you want it to be lazy, and stop slurping its soup the first time it encounters "OperationCode=".

Change that middle snippet to this... we're just adding a question mark to make the .* before "OperationCode=" become "lazy" instead of "greedy".

\".*?OperationCode=\"
0 Karma

Builder

are you able to provide a sample for the not working ones?

0 Karma
State of Splunk Careers

Access the Splunk Careers Report to see real data that shows how Splunk mastery increases your value and job satisfaction.

Find out what your skills are worth!