Splunk Search

rex

Jagaspu
Engager

Hi i need extract the below file name from extracted output 

 

MDTM|07/02/2023 23:58:59.007|[SFTP:3460819_0:eftpos:10.18.168.158] READ: *MDTM /eftpos/prod/AR-100-01_20230702_PAY.zip 16883063270

 

file name :- AR-100-01_20230702_PAY.zip

 

i need extract the above file name using rex command

Labels (1)
0 Karma

emzed
Path Finder

Or something like this

| makeresults 
| eval msg="MDTM|07/02/2023 23:58:59.007|[SFTP:3460819_0:eftpos:10.18.168.158] READ: *MDTM /eftpos/prod/AR-100-01_20230702_PAY.zip 16883063270" 
| rex field=msg "\w+:\s+\S+\s+(\/[^\/]+)*\/(?<filename>[^\s\/]+)"



0 Karma

Jagaspu
Engager

Hi emzed , sorry for your command i have not received an output , Attached screen shot for reference.

 

Jagaspu_0-1688453893464.png

 

 

 

0 Karma

emzed
Path Finder

I tested it on artificial data and I used a field "msg" in rex command. I thing you have data in the field "_raw". 

 

You should use

| rex field=_raw "\w+:\s+\S+\s+(\/[^\/]+)*\/(?<filename>[^\s\/]+)"

or 

| rex "\w+:\s+\S+\s+(\/[^\/]+)*\/(?<filename>[^\s\/]+)"

note: _raw field is default field for rex command

 

 

0 Karma

yuanliu
SplunkTrust
SplunkTrust

Something like

| rex "READ: \S+ (/[^/]+)*/(?<filename>[^\s/]+)

Jagaspu
Engager

Hi yuanliu , Thanks below provided rex command has worked and can i get any topic on provided command, Just for knowledge gain.

Tags (1)
0 Karma

yuanliu
SplunkTrust
SplunkTrust

"READ: \S+ (/[^/]+)*/(?<filename>[^\s/]+)

Rex is about compromises.  I have to make a few assumptions based on the illustrated sample data.

  1. "READ:" is perhaps a keyword and doesn't change from event to event.
  2. "*MDTM" is perhaps a classifier that may take different forms but that does not contain space. (\S)
  3. The path before file name is absolute, and can vary in depth. (See below.)
  4. File name contains no space. ([^s])  By convention, file name also does not include a path separator. (Combined with no space, that's [^\s/])
  5. After file name, there is either a space or end of the line.

The expression contains two different repetition tokens.  + means repeat at least once, up to any number of times.  * means repeat zero to unlimited times.  Parentheses in standard regex is just grouping.  So, (/[^/]+)* matches /abc, /abc/def, /abc/def/ghi; but (/[^/]+)* zero-length string, so (/[^/]+)*/ also matches /.

Hope this helps.

0 Karma
Get Updates on the Splunk Community!

Accelerating Observability as Code with the Splunk AI Assistant

We’ve seen in previous posts what Observability as Code (OaC) is and how it’s now essential for managing ...

Integrating Splunk Search API and Quarto to Create Reproducible Investigation ...

 Splunk is More Than Just the Web Console For Digital Forensics and Incident Response (DFIR) practitioners, ...

Congratulations to the 2025-2026 SplunkTrust!

Hello, Splunk Community! We are beyond thrilled to announce our newest group of SplunkTrust members!  The ...