Splunk Search

xml field extraction

carmackd
Communicator

could someone please explain what stanza configuration i should include in the props.conf file to extract the size attribute from <email> into the field emailSize.

<stream TimeScanned="1271078550">
<command class="MAIL FROM" data="&lt;me@gmail.com&gt; SIZE=42446" />
<email size="42521">
    <header Name="attachmentname" Value="driver-install.pdf" />
    </email>

thanks,

Tags (1)
1 Solution

Lowell
Super Champion

This should do the trick:

EXTRACT-emailsize = <email\s+size=["'](?<emailSize>\d)["'][> ]

Of course, if size isn't always the first attribute in the email element, then the regex can start to get ugly.

If you are asking how you can automatically extract key value pairs using a combination of the element name and attribute name (like emailSize, headerName, headerValue, commandClass, commandData, ... and so on), then it's not something splunk does out of the box...

You may also find the xmlkv command helpful. The the search source=*.xml | xmlkv

View solution in original post

sideview
SplunkTrust
SplunkTrust

If the regexes just get too crazy one other option is to leave props.conf alone and just use the xpath command at search time:

<your search> | xpath "//email/@size" outfield=emailSize | search emailSize > 40000
0 Karma

Lowell
Super Champion

This should do the trick:

EXTRACT-emailsize = <email\s+size=["'](?<emailSize>\d)["'][> ]

Of course, if size isn't always the first attribute in the email element, then the regex can start to get ugly.

If you are asking how you can automatically extract key value pairs using a combination of the element name and attribute name (like emailSize, headerName, headerValue, commandClass, commandData, ... and so on), then it's not something splunk does out of the box...

You may also find the xmlkv command helpful. The the search source=*.xml | xmlkv

Get Updates on the Splunk Community!

[Puzzles] Solve, Learn, Repeat: Dynamic formatting from XML events

This challenge was first posted on Slack #puzzles channelFor a previous puzzle, I needed a set of fixed-length ...

Enter the Agentic Era with Splunk AI Assistant for SPL 1.4

  &#x1f680; Your data just got a serious AI upgrade — are you ready? Say hello to the Agentic Era with the ...

Stronger Security with Federated Search for S3, GCP SQL & Australian Threat ...

Splunk Lantern is a Splunk customer success center that provides advice from Splunk experts on valuable data ...