Field Extraction from HTML tags

Nadhiyaa
Path Finder

  • align="left"> Accepted

    <td align="right" class="mailViewRowReadEven"> 64399
    

    <td align="right" class="mailViewRowReadEven">
    

    56.32%

I want to extract a field named Accepted with the value 64399, i.e. Accepted=64399.


niketnilay
Legend

@Nadhiyaa, please post the HTML on Splunk Answers using the code button (the 101010 icon, or Ctrl+K as the shortcut) after selecting all of the HTML, so that the tags are not lost when the post is rendered.

Based on the bare minimum of HTML visible in your question, you can try the following rex command. The run-anywhere search below can be used to test it.

| makeresults
| eval _raw=" Accepted  
<td align=\"right\" class=\"mailViewRowReadEven\"> 64399
    </td>"
| rex "\sAccepted\s+\<td align=\"right\" class=\"mailViewRowReadEven\"\>\s(?<Accepted>\d+)\s+\<\/td\>"

The regular expression above uses \d+ to capture the Accepted number, based on the sample data provided.
I have not tested it myself, but the html2text app on Splunkbase may also be worth checking out.
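
If the attributes or whitespace in the real page differ from the sample, a looser pattern may be easier to maintain. The following is only a sketch using the same sample event: it assumes the number sits in the first <td> after the Accepted label, and it has not been checked against the full HTML, which was not posted intact.

```
| makeresults
| eval _raw=" Accepted  
<td align=\"right\" class=\"mailViewRowReadEven\"> 64399
    </td>"
| rex "Accepted\s*<td[^>]*>\s*(?<Accepted>\d+)"
| table Accepted
```

Here [^>]* skips whatever attributes the <td> tag carries, and \s* allows any amount of whitespace (including line breaks) around the value, so the pattern does not depend on the exact align and class values.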

____________________________________________
| makeresults | eval message= "Happy Splunking!!!"

Nadhiyaa
Path Finder

@niketnilay
