Splunk Search

Regex Help: Parse CSV with whatever it has got rather than failing on entire line

koshyk
Super Champion

Hi
We have a regex/requirement to extract col1,col2,col3,col4 everytime. But the data may not contain col3 onwards everytime.
How to write regex , so it will be forgiving and extract what it has got, rather than failing for the entire line?

(?<col1>[^\"]*?)\",\"(?<col2>[^\"]*?)\",\"(?<col3>[^\"]*?)\",\"(?<col4>[^\"]*?)\"

below is dataset

"r1col1","r1col2"
"r1col1","r1col2","r1col3"
"r3col1","r3col2","r3col3","r3col4"
"r4col1","r4col2","r4col3","r4col4","r4col5","r4col6","r4col7"

in above regex, it is failing for Line1 and Line2, but rather prefer to give atleast col1 and col2 if it doesn't find others.

https://regex101.com/r/Bkle5V/1

0 Karma
1 Solution

elliotproebstel
Champion

How about this:
(?<col1>[^\"]*?)\",(\"(?<col2>[^\"]*?)\",)?(\"(?<col3>[^\"]*?)\",)?(\"(?<col4>[^\"]*?)\")?

This makes col2, col3, and col4 optional by wrapping them in parenthesis and appending a question mark, to indicate that the field may occur 0 or 1 times - effectively making them optional.

https://regex101.com/r/Bkle5V/2

View solution in original post

elliotproebstel
Champion

How about this:
(?<col1>[^\"]*?)\",(\"(?<col2>[^\"]*?)\",)?(\"(?<col3>[^\"]*?)\",)?(\"(?<col4>[^\"]*?)\")?

This makes col2, col3, and col4 optional by wrapping them in parenthesis and appending a question mark, to indicate that the field may occur 0 or 1 times - effectively making them optional.

https://regex101.com/r/Bkle5V/2

koshyk
Super Champion

cheers. it works

0 Karma
Got questions? Get answers!

Join the Splunk Community Slack to learn, troubleshoot, and make connections with fellow Splunk practitioners in real time!

Meet up IRL or virtually!

Join Splunk User Groups to connect and learn in-person by region or remotely by topic or industry.

Get Updates on the Splunk Community!

Observability Simplified: Combining User Experience, Application Performance & ...

Tech Talk Observability Simplified: Combining User Experience, Application Performance & Network ...

Event Series May & June: From Network Visibility to Service Intelligence

Unifying the Network: Moving from Alert Noise to Service Intelligence with Splunk ITSI In today’s hybrid ...

Global Splunk User Group Events: May + June 2026

Your Splunk Community Awaits: Discover Upcoming User Group Events Worldwide    Staying ahead in the fast-paced ...