Archive

Field Extraction from html tags

Nadhiyaa
Path Finder

  • align="left"> Accepted

    <td align="right" class="mailViewRowReadEven"> 64399
    

    <td align="right" class="mailViewRowReadEven">
    

    56.32%

I want to extract the field Accepted=64399

Tags (1)
0 Karma

niketnilay
Legend

@Nadhiyaa, you will need to post the <html> code on Splunk Answers question using the code button i.e. with 101010 or Ctrl+K as the shortcut key after you have selected all html code, to ensure that it does not escape.

Based on bare minimal html required as per your question you can try the following rex command. Please try the following run anywhere search to test the same.

| makeresults
| eval _raw=" Accepted  
<td align=\"right\" class=\"mailViewRowReadEven\"> 64399
    </td>"
| rex "\sAccepted\s+\<td align=\"right\" class=\"mailViewRowReadEven\"\>\s(?<Accepted>\d+)\s+\<\/td\>"

Above regular expression uses \d+ to get Accepted numbers as per the sample data provided.
While I have not tested this, but check out the html2text app from Splunkbase as well.

____________________________________________
| makeresults | eval message= "Happy Splunking!!!"
0 Karma

Nadhiyaa
Path Finder

@niketnilay

0 Karma
Take the 2021 Splunk Career Survey

Help us learn about how Splunk has
impacted your career by taking the 2021 Splunk Career Survey.

Earn $50 in Amazon cash!