Splunk Search

rex

Jagaspu
Engager

Hi i need extract the below file name from extracted output 

 

MDTM|07/02/2023 23:58:59.007|[SFTP:3460819_0:eftpos:10.18.168.158] READ: *MDTM /eftpos/prod/AR-100-01_20230702_PAY.zip 16883063270

 

file name :- AR-100-01_20230702_PAY.zip

 

i need extract the above file name using rex command

Labels (1)
0 Karma

emzed
Path Finder

Or something like this

| makeresults 
| eval msg="MDTM|07/02/2023 23:58:59.007|[SFTP:3460819_0:eftpos:10.18.168.158] READ: *MDTM /eftpos/prod/AR-100-01_20230702_PAY.zip 16883063270" 
| rex field=msg "\w+:\s+\S+\s+(\/[^\/]+)*\/(?<filename>[^\s\/]+)"



0 Karma

Jagaspu
Engager

Hi emzed , sorry for your command i have not received an output , Attached screen shot for reference.

 

Jagaspu_0-1688453893464.png

 

 

 

0 Karma

emzed
Path Finder

I tested it on artificial data and I used a field "msg" in rex command. I thing you have data in the field "_raw". 

 

You should use

| rex field=_raw "\w+:\s+\S+\s+(\/[^\/]+)*\/(?<filename>[^\s\/]+)"

or 

| rex "\w+:\s+\S+\s+(\/[^\/]+)*\/(?<filename>[^\s\/]+)"

note: _raw field is default field for rex command

 

 

0 Karma

yuanliu
SplunkTrust
SplunkTrust

Something like

| rex "READ: \S+ (/[^/]+)*/(?<filename>[^\s/]+)

Jagaspu
Engager

Hi yuanliu , Thanks below provided rex command has worked and can i get any topic on provided command, Just for knowledge gain.

Tags (1)
0 Karma

yuanliu
SplunkTrust
SplunkTrust

"READ: \S+ (/[^/]+)*/(?<filename>[^\s/]+)

Rex is about compromises.  I have to make a few assumptions based on the illustrated sample data.

  1. "READ:" is perhaps a keyword and doesn't change from event to event.
  2. "*MDTM" is perhaps a classifier that may take different forms but that does not contain space. (\S)
  3. The path before file name is absolute, and can vary in depth. (See below.)
  4. File name contains no space. ([^s])  By convention, file name also does not include a path separator. (Combined with no space, that's [^\s/])
  5. After file name, there is either a space or end of the line.

The expression contains two different repetition tokens.  + means repeat at least once, up to any number of times.  * means repeat zero to unlimited times.  Parentheses in standard regex is just grouping.  So, (/[^/]+)* matches /abc, /abc/def, /abc/def/ghi; but (/[^/]+)* zero-length string, so (/[^/]+)*/ also matches /.

Hope this helps.

0 Karma
Got questions? Get answers!

Join the Splunk Community Slack to learn, troubleshoot, and make connections with fellow Splunk practitioners in real time!

Meet up IRL or virtually!

Join Splunk User Groups to connect and learn in-person by region or remotely by topic or industry.

Get Updates on the Splunk Community!

Index This | What travels the world but is also stuck in place?

April 2026 Edition  Hayyy Splunk Education Enthusiasts and the Eternally Curious!   We’re back with this ...

Discover New Use Cases: Unlock Greater Value from Your Existing Splunk Data

Realizing the full potential of your Splunk investment requires more than just understanding current usage; it ...

Continue Your Journey: Join Session 2 of the Data Management and Federation Bootcamp ...

As data volumes continue to grow and environments become more distributed, managing and optimizing data ...