Getting Data In

Line Breaker help

richnavis
Contributor

Hi Guys,
I'm trying to ingest an entire html file as a single event everytime it gets written. The html file ALWAYS starts with ANGLEBRACKET p ANGLEBRACKET and always ends with ANGLEBRACKET p ANGLEBRACKET. Any suggestions about how to setup the line breaking?

0 Karma
1 Solution

FrankVl
Ultra Champion

Based on comments, I think this should work:

SHOULD_LINEMERGE = false
LINE_BREAKER = ([\r\n]*)\<p\>

View solution in original post

FrankVl
Ultra Champion

Based on comments, I think this should work:

SHOULD_LINEMERGE = false
LINE_BREAKER = ([\r\n]*)\<p\>

xpac
SplunkTrust
SplunkTrust

Just to make sure - it begins with <p> and ends with </p>, right?

0 Karma

richnavis
Contributor

Thanks FrankVI for the response. Correct that it doesn't have html tags around it, however it can function perfectly fine without those tags. There are only "P" tags at the beginning and end. Question about your solution. This file gets over-written 4 times a day. would setting MAX_Events and Should_LineMERGE be sufficient to keep each file as a single event, but also create a new event each time the file gets over-written?

0 Karma

FrankVl
Ultra Champion

That's a good question, never tried that.

But if there are indeed no further <p> elements inside the file, then you could just use that as a linebreaker.

0 Karma

FrankVl
Ultra Champion

Which is odd, because that means it isn't a valid HTML file, as that should at least have <html> tags around it all, right?

Also important to know before being able to answer this: are there any further <p> tags in the middle of the file?

But since you don't want Splunk to break anything, wouldn't it be sufficient to just increase the MAX_EVENTS setting to larger than the expected number of lines and leave SHOULD_LINEMERGE to its default true value?

0 Karma
Career Survey
First 500 qualified respondents will receive a $20 gift card! Tell us about your professional Splunk journey.

Can’t make it to .conf25? Join us online!

Get Updates on the Splunk Community!

Community Content Calendar, September edition

Welcome to another insightful post from our Community Content Calendar! We're thrilled to continue bringing ...

Splunkbase Unveils New App Listing Management Public Preview

Splunkbase Unveils New App Listing Management Public PreviewWe're thrilled to announce the public preview of ...

Leveraging Automated Threat Analysis Across the Splunk Ecosystem

Are you leveraging automation to its fullest potential in your threat detection strategy?Our upcoming Security ...