Getting Data In

How to configure SEDCMD in props.conf to delete XML event content at index-time?

himynamesdave
Contributor

Hi all -

I have content in XML events I'm indexing that I don't want:

<?xml version="1.0" encoding="UTF-8"?>

So I decided to use sedcmd in my props.conf like follows:

SEDCMD-header = <\?xml version\=\"1\.0\" encoding\=\"UTF\-8\"\?>

Alas, it does not work. I feel it may be the regex that is failing, but I cannot seem to get it to click.

Can anyone help?

Tags (3)
1 Solution

somesoni2
Revered Legend

Try something like this in your props.conf on Indexer/Heavy Forwarder

[Yoursourcetype]
...Other configurations...
SEDCMD-aremoveheader = s/^\<\?xml[^\>]*\>\n*//g

View solution in original post

woodcock
Esteemed Legend

Try this:

 SEDCMD-header = s/<?xml version="1.0" encoding="UTF-8"?>//

You need to have a full sed command, in this case it says to switch your string with nothing.

0 Karma

somesoni2
Revered Legend

Try something like this in your props.conf on Indexer/Heavy Forwarder

[Yoursourcetype]
...Other configurations...
SEDCMD-aremoveheader = s/^\<\?xml[^\>]*\>\n*//g

landen99
Motivator

You want to make sure to capture the last ">" too. Maybe even the line characters afterwards.

SEDCMD-aremoveheader = s/^\<\?xml[^\>]*\>\n*//g

somesoni2
Revered Legend

Good catch. Just updated the my answer to reflect that.

0 Karma
Get Updates on the Splunk Community!

Upcoming Webinar: Unmasking Insider Threats with Slunk Enterprise Security’s UEBA

Join us on Wed, Dec 10. at 10AM PST / 1PM EST for a live webinar and demo with Splunk experts! Discover how ...

.conf25 technical session recap of Observability for Gen AI: Monitoring LLM ...

If you’re unfamiliar, .conf is Splunk’s premier event where the Splunk community, customers, partners, and ...

A Season of Skills: New Splunk Courses to Light Up Your Learning Journey

There’s something special about this time of year—maybe it’s the glow of the holidays, maybe it’s the ...