Splunk Search

Field Extraction from html tags

Nadhiyaa
Path Finder

  • align="left"> Accepted

    <td align="right" class="mailViewRowReadEven"> 64399
    

    <td align="right" class="mailViewRowReadEven">
    

    56.32%

I want to extract the field Accepted=64399

Tags (1)
0 Karma

niketn
Legend

@Nadhiyaa, you will need to post the <html> code on Splunk Answers question using the code button i.e. with 101010 or Ctrl+K as the shortcut key after you have selected all html code, to ensure that it does not escape.

Based on bare minimal html required as per your question you can try the following rex command. Please try the following run anywhere search to test the same.

| makeresults
| eval _raw=" Accepted  
<td align=\"right\" class=\"mailViewRowReadEven\"> 64399
    </td>"
| rex "\sAccepted\s+\<td align=\"right\" class=\"mailViewRowReadEven\"\>\s(?<Accepted>\d+)\s+\<\/td\>"

Above regular expression uses \d+ to get Accepted numbers as per the sample data provided.
While I have not tested this, but check out the html2text app from Splunkbase as well.

____________________________________________
| makeresults | eval message= "Happy Splunking!!!"
0 Karma

Nadhiyaa
Path Finder

@niketnilay

0 Karma
Career Survey
First 500 qualified respondents will receive a $20 gift card! Tell us about your professional Splunk journey.

Can’t make it to .conf25? Join us online!

Get Updates on the Splunk Community!

Community Content Calendar, September edition

Welcome to another insightful post from our Community Content Calendar! We're thrilled to continue bringing ...

Splunkbase Unveils New App Listing Management Public Preview

Splunkbase Unveils New App Listing Management Public PreviewWe're thrilled to announce the public preview of ...

Leveraging Automated Threat Analysis Across the Splunk Ecosystem

Are you leveraging automation to its fullest potential in your threat detection strategy?Our upcoming Security ...