Splunk Search

rex

Jagaspu
Engager

Hi i need extract the below file name from extracted output 

 

MDTM|07/02/2023 23:58:59.007|[SFTP:3460819_0:eftpos:10.18.168.158] READ: *MDTM /eftpos/prod/AR-100-01_20230702_PAY.zip 16883063270

 

file name :- AR-100-01_20230702_PAY.zip

 

i need extract the above file name using rex command

Labels (1)
0 Karma

emzed
Path Finder

Or something like this

| makeresults 
| eval msg="MDTM|07/02/2023 23:58:59.007|[SFTP:3460819_0:eftpos:10.18.168.158] READ: *MDTM /eftpos/prod/AR-100-01_20230702_PAY.zip 16883063270" 
| rex field=msg "\w+:\s+\S+\s+(\/[^\/]+)*\/(?<filename>[^\s\/]+)"



0 Karma

Jagaspu
Engager

Hi emzed , sorry for your command i have not received an output , Attached screen shot for reference.

 

Jagaspu_0-1688453893464.png

 

 

 

0 Karma

emzed
Path Finder

I tested it on artificial data and I used a field "msg" in rex command. I thing you have data in the field "_raw". 

 

You should use

| rex field=_raw "\w+:\s+\S+\s+(\/[^\/]+)*\/(?<filename>[^\s\/]+)"

or 

| rex "\w+:\s+\S+\s+(\/[^\/]+)*\/(?<filename>[^\s\/]+)"

note: _raw field is default field for rex command

 

 

0 Karma

yuanliu
SplunkTrust
SplunkTrust

Something like

| rex "READ: \S+ (/[^/]+)*/(?<filename>[^\s/]+)

Jagaspu
Engager

Hi yuanliu , Thanks below provided rex command has worked and can i get any topic on provided command, Just for knowledge gain.

Tags (1)
0 Karma

yuanliu
SplunkTrust
SplunkTrust

"READ: \S+ (/[^/]+)*/(?<filename>[^\s/]+)

Rex is about compromises.  I have to make a few assumptions based on the illustrated sample data.

  1. "READ:" is perhaps a keyword and doesn't change from event to event.
  2. "*MDTM" is perhaps a classifier that may take different forms but that does not contain space. (\S)
  3. The path before file name is absolute, and can vary in depth. (See below.)
  4. File name contains no space. ([^s])  By convention, file name also does not include a path separator. (Combined with no space, that's [^\s/])
  5. After file name, there is either a space or end of the line.

The expression contains two different repetition tokens.  + means repeat at least once, up to any number of times.  * means repeat zero to unlimited times.  Parentheses in standard regex is just grouping.  So, (/[^/]+)* matches /abc, /abc/def, /abc/def/ghi; but (/[^/]+)* zero-length string, so (/[^/]+)*/ also matches /.

Hope this helps.

0 Karma
Get Updates on the Splunk Community!

Developer Spotlight with Brett Adams

In our third Spotlight feature, we're excited to shine a light on Brett—a Splunk consultant, innovative ...

Index This | What can you do to make 55,555 equal 500?

April 2025 Edition Hayyy Splunk Education Enthusiasts and the Eternally Curious!  We’re back with this ...

Say goodbye to manually analyzing phishing and malware threats with Splunk Attack ...

In today’s evolving threat landscape, we understand you’re constantly bombarded with phishing and malware ...