<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic XML metadata field extraction in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/XML-metadata-field-extraction/m-p/348107#M94883</link>
    <description>&lt;P&gt;Hi Splunkers,&lt;/P&gt;

&lt;P&gt;I am working on field extraction for XML events. I have added regex in transforms.conf for extraction. XML looks like this,&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;&amp;lt;?xml version="1.0" encoding="UTF-8"?&amp;gt;
&amp;lt;transaction version="6.00" ID="1234" agentRole="sourceAgent" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:noNamespaceSchemaLocation="TransferLog.xsd" xmlns=""&amp;gt;
    &amp;lt;field1 time="2018-03-30T19:53:34.472Z"&amp;gt;abc&amp;lt;/field1&amp;gt;
    &amp;lt;field2 agent="abc" QMgr="abc" agentType="abc"&amp;gt;
        &amp;lt;systemInfo architecture="abc" name="abc" version="3.10.0-514.21.2.el7.x86_64"/&amp;gt;
    &amp;lt;/field1&amp;gt;
    &amp;lt;field3 agent="abc" QMgr="abc"/&amp;gt;
    &amp;lt;originator&amp;gt;
        &amp;lt;field4&amp;gt;abc&amp;lt;/field4&amp;gt;
        &amp;lt;field5&amp;gt;abc&amp;lt;/field5&amp;gt;
        &amp;lt;field6&amp;gt;abc&amp;lt;/field6&amp;gt;
    &amp;lt;/originator&amp;gt;
    &amp;lt;transferSet startTime="2018-03-30T19:53:34.473Z" total="1" bytesSent="0"&amp;gt;
        &amp;lt;metaDataSet&amp;gt;
            &amp;lt;metaData key="field7"&amp;gt;abc&amp;lt;/metaData&amp;gt;
            &amp;lt;metaData key="field8"&amp;gt;abc&amp;lt;/metaData&amp;gt;
            &amp;lt;metaData key="field9"&amp;gt;abc&amp;lt;/metaData&amp;gt;
            &amp;lt;metaData key="com.ibm.wmqfte.field10"&amp;gt;abc&amp;lt;/metaData&amp;gt;
            &amp;lt;metaData key="com.ibm.wmqfte.field11"&amp;gt;abc&amp;lt;/metaData&amp;gt;
            &amp;lt;metaData key="com.ibm.wmqfte.field12"&amp;gt;abc&amp;lt;/metaData&amp;gt;           
        &amp;lt;/metaDataSet&amp;gt;
    &amp;lt;/transferSet&amp;gt;
&amp;lt;/transaction&amp;gt;
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;&lt;STRONG&gt;props.conf&lt;/STRONG&gt;&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[mft]
category = Custom
pulldown_type = 1
NO_BINARY_CHECK = true
disabled = false
REPORT-xmlkv = xmlkv-alternative
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;&lt;STRONG&gt;transforms.conf&lt;/STRONG&gt;&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt; [xmlkv-alternative]
 REGEX = &amp;lt;([^\s\&amp;gt;]*)[^\&amp;gt;]*\&amp;gt;([^&amp;lt;]*)\&amp;lt;\/\1\&amp;gt;
 FORMAT = $1::$2
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;Using the above regex I am able extract tags named field1, field2, field3 field4, field5 and field6 in the XML. However remaining fields inside  tag ie field7 to field12 in XML cannot be extracted. Exception here is i need not extract/ ignore all fields in this form com.ibm.wmqfte.field10. Please help me with this extraction.&lt;/P&gt;

&lt;P&gt;Thanks&lt;/P&gt;</description>
    <pubDate>Wed, 18 Apr 2018 10:41:08 GMT</pubDate>
    <dc:creator>jsanjeb</dc:creator>
    <dc:date>2018-04-18T10:41:08Z</dc:date>
    <item>
      <title>XML metadata field extraction</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/XML-metadata-field-extraction/m-p/348107#M94883</link>
      <description>&lt;P&gt;Hi Splunkers,&lt;/P&gt;

&lt;P&gt;I am working on field extraction for XML events. I have added regex in transforms.conf for extraction. XML looks like this,&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;&amp;lt;?xml version="1.0" encoding="UTF-8"?&amp;gt;
&amp;lt;transaction version="6.00" ID="1234" agentRole="sourceAgent" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:noNamespaceSchemaLocation="TransferLog.xsd" xmlns=""&amp;gt;
    &amp;lt;field1 time="2018-03-30T19:53:34.472Z"&amp;gt;abc&amp;lt;/field1&amp;gt;
    &amp;lt;field2 agent="abc" QMgr="abc" agentType="abc"&amp;gt;
        &amp;lt;systemInfo architecture="abc" name="abc" version="3.10.0-514.21.2.el7.x86_64"/&amp;gt;
    &amp;lt;/field1&amp;gt;
    &amp;lt;field3 agent="abc" QMgr="abc"/&amp;gt;
    &amp;lt;originator&amp;gt;
        &amp;lt;field4&amp;gt;abc&amp;lt;/field4&amp;gt;
        &amp;lt;field5&amp;gt;abc&amp;lt;/field5&amp;gt;
        &amp;lt;field6&amp;gt;abc&amp;lt;/field6&amp;gt;
    &amp;lt;/originator&amp;gt;
    &amp;lt;transferSet startTime="2018-03-30T19:53:34.473Z" total="1" bytesSent="0"&amp;gt;
        &amp;lt;metaDataSet&amp;gt;
            &amp;lt;metaData key="field7"&amp;gt;abc&amp;lt;/metaData&amp;gt;
            &amp;lt;metaData key="field8"&amp;gt;abc&amp;lt;/metaData&amp;gt;
            &amp;lt;metaData key="field9"&amp;gt;abc&amp;lt;/metaData&amp;gt;
            &amp;lt;metaData key="com.ibm.wmqfte.field10"&amp;gt;abc&amp;lt;/metaData&amp;gt;
            &amp;lt;metaData key="com.ibm.wmqfte.field11"&amp;gt;abc&amp;lt;/metaData&amp;gt;
            &amp;lt;metaData key="com.ibm.wmqfte.field12"&amp;gt;abc&amp;lt;/metaData&amp;gt;           
        &amp;lt;/metaDataSet&amp;gt;
    &amp;lt;/transferSet&amp;gt;
&amp;lt;/transaction&amp;gt;
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;&lt;STRONG&gt;props.conf&lt;/STRONG&gt;&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[mft]
category = Custom
pulldown_type = 1
NO_BINARY_CHECK = true
disabled = false
REPORT-xmlkv = xmlkv-alternative
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;&lt;STRONG&gt;transforms.conf&lt;/STRONG&gt;&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt; [xmlkv-alternative]
 REGEX = &amp;lt;([^\s\&amp;gt;]*)[^\&amp;gt;]*\&amp;gt;([^&amp;lt;]*)\&amp;lt;\/\1\&amp;gt;
 FORMAT = $1::$2
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;Using the above regex I am able extract tags named field1, field2, field3 field4, field5 and field6 in the XML. However remaining fields inside  tag ie field7 to field12 in XML cannot be extracted. Exception here is i need not extract/ ignore all fields in this form com.ibm.wmqfte.field10. Please help me with this extraction.&lt;/P&gt;

&lt;P&gt;Thanks&lt;/P&gt;</description>
      <pubDate>Wed, 18 Apr 2018 10:41:08 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/XML-metadata-field-extraction/m-p/348107#M94883</guid>
      <dc:creator>jsanjeb</dc:creator>
      <dc:date>2018-04-18T10:41:08Z</dc:date>
    </item>
    <item>
      <title>Re: XML metadata field extraction</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/XML-metadata-field-extraction/m-p/529209#M94884</link>
      <description>&lt;P&gt;I am curios about this as well, i have a similar problem and trying to extract the field name from metadata&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 12 Nov 2020 20:03:00 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/XML-metadata-field-extraction/m-p/529209#M94884</guid>
      <dc:creator>785978</dc:creator>
      <dc:date>2020-11-12T20:03:00Z</dc:date>
    </item>
  </channel>
</rss>

