Hi,
I would like to remove every occurrence of a specific pattern from my _raw events.
Specifically, in this case I want to delete these HTML tags: <b>, </b>, <br>
For example, I have this raw event:
<b>This</b> is an <b>example</b><br>of raw<br>event
And I would like to transform it like this:
This is an exampleof rawevent
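To make the intended transformation concrete, here is a minimal sketch in Python (not Splunk configuration) of a regex that removes only the three tags listed above. The function name `strip_html_tags` and the pattern itself are illustrative assumptions, and the sample event is written with `</b>` as the closing tag.

```python
import re

# Hypothetical pattern: matches <b>, </b> and <br>, ignoring case.
TAG_PATTERN = re.compile(r"</?b>|<br>", re.IGNORECASE)


def strip_html_tags(raw: str) -> str:
    """Remove every occurrence of the listed tags from a raw event string."""
    return TAG_PATTERN.sub("", raw)


event = "<b>This</b> is an <b>example</b><br>of raw<br>event"
print(strip_html_tags(event))  # This is an exampleof rawevent
```

The key point is that the replacement has to be applied to every match in the event, not just the first one.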
I tried to create this transforms.conf:
But the documentation includes this note: "NOTE: This setting is only valid for index-time field extractions. This setting is ignored if DEST_KEY is _raw."
And I must set DEST_KEY = _raw.
Can you help me?
Thank you in advance.
Hi @tommasoscarpa1,
If you remove the XML tags, how can you recognize the fields?
Maybe you could use INDEXED_EXTRACTIONS = XML in your sourcetype definition to have all the fields extracted.
Ciao.
Giuseppe
Hi Giuseppe,
I am not talking about XML tags, but HTML tags. HTML tags are used to format the text and do not give any information about fields. Text between <b> and </b> will be formatted in bold and <br> is a line break.
I would like to remove these unnecessary characters from my inputs.
Ciao!
Tommaso