Splunk Search

Field Extraction from html tags

Nadhiyaa
Path Finder

  • align="left"> Accepted

    <td align="right" class="mailViewRowReadEven"> 64399
    

    <td align="right" class="mailViewRowReadEven">
    

    56.32%

I want to extract the field Accepted=64399

Tags (1)
0 Karma

niketn
Legend

@Nadhiyaa, you will need to post the <html> code on Splunk Answers question using the code button i.e. with 101010 or Ctrl+K as the shortcut key after you have selected all html code, to ensure that it does not escape.

Based on bare minimal html required as per your question you can try the following rex command. Please try the following run anywhere search to test the same.

| makeresults
| eval _raw=" Accepted  
<td align=\"right\" class=\"mailViewRowReadEven\"> 64399
    </td>"
| rex "\sAccepted\s+\<td align=\"right\" class=\"mailViewRowReadEven\"\>\s(?<Accepted>\d+)\s+\<\/td\>"

Above regular expression uses \d+ to get Accepted numbers as per the sample data provided.
While I have not tested this, but check out the html2text app from Splunkbase as well.

____________________________________________
| makeresults | eval message= "Happy Splunking!!!"
0 Karma

Nadhiyaa
Path Finder

@niketnilay

0 Karma
Get Updates on the Splunk Community!

Splunk Mobile: Your Brand-New Home Screen

Meet Your New Mobile Hub  Hello Splunk Community!  Staying connected to your data—no matter where you are—is ...

Introducing Value Insights (Beta): Understand the Business Impact your organization ...

Real progress on your strategic priorities starts with knowing the business outcomes your teams are delivering ...

Enterprise Security (ES) Essentials 8.3 is Now GA — Smarter Detections, Faster ...

As of today, Enterprise Security (ES) Essentials 8.3 is now generally available, helping SOC teams simplify ...