Splunk Search

xml field extraction

carmackd
Communicator

could someone please explain what stanza configuration i should include in the props.conf file to extract the size attribute from <email> into the field emailSize.

<stream TimeScanned="1271078550">
<command class="MAIL FROM" data="&lt;me@gmail.com&gt; SIZE=42446" />
<email size="42521">
    <header Name="attachmentname" Value="driver-install.pdf" />
    </email>

thanks,

Tags (1)
1 Solution

Lowell
Super Champion

This should do the trick:

EXTRACT-emailsize = <email\s+size=["'](?<emailSize>\d)["'][> ]

Of course, if size isn't always the first attribute in the email element, then the regex can start to get ugly.

If you are asking how you can automatically extract key value pairs using a combination of the element name and attribute name (like emailSize, headerName, headerValue, commandClass, commandData, ... and so on), then it's not something splunk does out of the box...

You may also find the xmlkv command helpful. The the search source=*.xml | xmlkv

View solution in original post

sideview
SplunkTrust
SplunkTrust

If the regexes just get too crazy one other option is to leave props.conf alone and just use the xpath command at search time:

<your search> | xpath "//email/@size" outfield=emailSize | search emailSize > 40000
0 Karma

Lowell
Super Champion

This should do the trick:

EXTRACT-emailsize = <email\s+size=["'](?<emailSize>\d)["'][> ]

Of course, if size isn't always the first attribute in the email element, then the regex can start to get ugly.

If you are asking how you can automatically extract key value pairs using a combination of the element name and attribute name (like emailSize, headerName, headerValue, commandClass, commandData, ... and so on), then it's not something splunk does out of the box...

You may also find the xmlkv command helpful. The the search source=*.xml | xmlkv

Got questions? Get answers!

Join the Splunk Community Slack to learn, troubleshoot, and make connections with fellow Splunk practitioners in real time!

Meet up IRL or virtually!

Join Splunk User Groups to connect and learn in-person by region or remotely by topic or industry.

Get Updates on the Splunk Community!

Modernize your Splunk Apps – Introducing Python 3.13 in Splunk

We are excited to announce that the upcoming releases of Splunk Enterprise 10.2.x and Splunk Cloud Platform ...

Step into “Hunt the Insider: An Splunk ES Premier Mystery” to catch a cybercriminal ...

After a whole week of being on call, you fell asleep on your keyboard, and you hit a sequence of buttons that ...

SplunkTrust Application Period is Officially OPEN!

It's that time, folks! The application/nomination period for the 2026-2027 SplunkTrust is officially open. If ...