Splunk Search

rex

Jagaspu
Engager

Hi i need extract the below file name from extracted output 

 

MDTM|07/02/2023 23:58:59.007|[SFTP:3460819_0:eftpos:10.18.168.158] READ: *MDTM /eftpos/prod/AR-100-01_20230702_PAY.zip 16883063270

 

file name :- AR-100-01_20230702_PAY.zip

 

i need extract the above file name using rex command

Labels (1)
0 Karma

emzed
Path Finder

Or something like this

| makeresults 
| eval msg="MDTM|07/02/2023 23:58:59.007|[SFTP:3460819_0:eftpos:10.18.168.158] READ: *MDTM /eftpos/prod/AR-100-01_20230702_PAY.zip 16883063270" 
| rex field=msg "\w+:\s+\S+\s+(\/[^\/]+)*\/(?<filename>[^\s\/]+)"



0 Karma

Jagaspu
Engager

Hi emzed , sorry for your command i have not received an output , Attached screen shot for reference.

 

Jagaspu_0-1688453893464.png

 

 

 

0 Karma

emzed
Path Finder

I tested it on artificial data and I used a field "msg" in rex command. I thing you have data in the field "_raw". 

 

You should use

| rex field=_raw "\w+:\s+\S+\s+(\/[^\/]+)*\/(?<filename>[^\s\/]+)"

or 

| rex "\w+:\s+\S+\s+(\/[^\/]+)*\/(?<filename>[^\s\/]+)"

note: _raw field is default field for rex command

 

 

0 Karma

yuanliu
SplunkTrust
SplunkTrust

Something like

| rex "READ: \S+ (/[^/]+)*/(?<filename>[^\s/]+)

Jagaspu
Engager

Hi yuanliu , Thanks below provided rex command has worked and can i get any topic on provided command, Just for knowledge gain.

Tags (1)
0 Karma

yuanliu
SplunkTrust
SplunkTrust

"READ: \S+ (/[^/]+)*/(?<filename>[^\s/]+)

Rex is about compromises.  I have to make a few assumptions based on the illustrated sample data.

  1. "READ:" is perhaps a keyword and doesn't change from event to event.
  2. "*MDTM" is perhaps a classifier that may take different forms but that does not contain space. (\S)
  3. The path before file name is absolute, and can vary in depth. (See below.)
  4. File name contains no space. ([^s])  By convention, file name also does not include a path separator. (Combined with no space, that's [^\s/])
  5. After file name, there is either a space or end of the line.

The expression contains two different repetition tokens.  + means repeat at least once, up to any number of times.  * means repeat zero to unlimited times.  Parentheses in standard regex is just grouping.  So, (/[^/]+)* matches /abc, /abc/def, /abc/def/ghi; but (/[^/]+)* zero-length string, so (/[^/]+)*/ also matches /.

Hope this helps.

0 Karma
Got questions? Get answers!

Join the Splunk Community Slack to learn, troubleshoot, and make connections with fellow Splunk practitioners in real time!

Meet up IRL or virtually!

Join Splunk User Groups to connect and learn in-person by region or remotely by topic or industry.

Get Updates on the Splunk Community!

[Puzzles] Solve, Learn, Repeat: Matching cron expressions

This puzzle (first published here) is based on matching timestamps to cron expressions.All the timestamps ...

Design, Compete, Win: Submit Your Best Splunk Dashboards for a .conf26 Pass

Hello Splunkers,  We’re excited to kick off a Splunk Dashboard contest! We know that dashboards are a primary ...

May 2026 Splunk Expert Sessions: Security & Observability

Level Up Your Operations: May 2026 Splunk Expert Sessions Whether you are refining your security posture or ...