<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Why is LINE_BREAKER not always separating? in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Why-is-LINE-BREAKER-not-always-separating/m-p/282911#M54077</link>
    <description>&lt;P&gt;Give this atry&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[yoursourcetype]
LINE_BREAKER = ([\r\n]+)\d+-\d+-\d+T\d+:\d+:\d+\.\d+\s+\|
SHOULD_LINEMERGE=true
TIME_PREFIX = ^
TIME_FORMAT = %Y-%m-%dT%H:%M:%S.%N
MAX_TIMESTAMP_LOOKAHEAD = 23
&lt;/CODE&gt;&lt;/PRE&gt;</description>
    <pubDate>Mon, 04 Apr 2016 16:50:32 GMT</pubDate>
    <dc:creator>somesoni2</dc:creator>
    <dc:date>2016-04-04T16:50:32Z</dc:date>
    <item>
      <title>Why is LINE_BREAKER not always separating?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Why-is-LINE-BREAKER-not-always-separating/m-p/282910#M54076</link>
      <description>&lt;P&gt;I have a log that starts each event by a new line starting with a timestamp followed by a space and pipe, like the following:&lt;/P&gt;

&lt;P&gt;2016-04-01T02:55:24.030 |&lt;/P&gt;

&lt;P&gt;I have tried setting up props.conf with a new sourcetype, first trying BREAK_ONLY_BEFORE, then LINE_BREAKER. They both seem to work most of the time, but I am still finding chunks of several events lumped into one. I inspected the clumping events in the logs manually, expecting missing line end characters or some other red flag, but haven't found any real reason why the line break isn't working. &lt;/P&gt;

&lt;P&gt;BREAK_ONLY_BEFORE:&lt;BR /&gt;
[newsource]&lt;BR /&gt;
BREAK_ONLY_BEFORE=([0-9]{4}-[0-9]{2}-[0-9]{2}T[0-9]{2}:[0-9]{2}:[0-9]{2}.[0-9]*\s+|)&lt;BR /&gt;
SHOULD_LINEMERGE=true&lt;BR /&gt;
MAX_EVENTS=1000000&lt;BR /&gt;
TRUNCATE=0&lt;BR /&gt;
(Note: I have also tried this without the regex wrapped as a group)&lt;/P&gt;

&lt;P&gt;LINE_BREAKER:&lt;BR /&gt;
[newsource]&lt;BR /&gt;
LINE_BREAKER=([\r\n]+)([0-9]{4}-[0-9]{2}-[0-9]{2}T[0-9]{2}:[0-9]{2}:[0-9]{2}.[0-9]*\s+|)&lt;BR /&gt;
SHOULD_LINEMERGE=false&lt;BR /&gt;
TRUNCATE=0&lt;BR /&gt;
(I have also tried this without the timestamp portion group-wrapped)&lt;/P&gt;

&lt;P&gt;Any ideas why I'm getting these random clusters of events which should be separated?&lt;BR /&gt;
Thanks!&lt;/P&gt;

&lt;P&gt;(Using Splunk universal forwarder 6.2.3)&lt;/P&gt;</description>
      <pubDate>Tue, 29 Sep 2020 09:21:19 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Why-is-LINE-BREAKER-not-always-separating/m-p/282910#M54076</guid>
      <dc:creator>meburbo</dc:creator>
      <dc:date>2020-09-29T09:21:19Z</dc:date>
    </item>
    <item>
      <title>Re: Why is LINE_BREAKER not always separating?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Why-is-LINE-BREAKER-not-always-separating/m-p/282911#M54077</link>
      <description>&lt;P&gt;Give this atry&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[yoursourcetype]
LINE_BREAKER = ([\r\n]+)\d+-\d+-\d+T\d+:\d+:\d+\.\d+\s+\|
SHOULD_LINEMERGE=true
TIME_PREFIX = ^
TIME_FORMAT = %Y-%m-%dT%H:%M:%S.%N
MAX_TIMESTAMP_LOOKAHEAD = 23
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Mon, 04 Apr 2016 16:50:32 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Why-is-LINE-BREAKER-not-always-separating/m-p/282911#M54077</guid>
      <dc:creator>somesoni2</dc:creator>
      <dc:date>2016-04-04T16:50:32Z</dc:date>
    </item>
    <item>
      <title>Re: Why is LINE_BREAKER not always separating?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Why-is-LINE-BREAKER-not-always-separating/m-p/282912#M54078</link>
      <description>&lt;P&gt;That seems to have the same effect&lt;/P&gt;</description>
      <pubDate>Mon, 04 Apr 2016 18:31:27 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Why-is-LINE-BREAKER-not-always-separating/m-p/282912#M54078</guid>
      <dc:creator>meburbo</dc:creator>
      <dc:date>2016-04-04T18:31:27Z</dc:date>
    </item>
    <item>
      <title>Re: Why is LINE_BREAKER not always separating?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Why-is-LINE-BREAKER-not-always-separating/m-p/282913#M54079</link>
      <description>&lt;P&gt;Turns out both were correct. I assumed that queried events in the UI meant that indexing was complete. Looks like the events were not fully processed when I was checking on them. Apologies for the time waste.&lt;/P&gt;</description>
      <pubDate>Mon, 04 Apr 2016 19:57:40 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Why-is-LINE-BREAKER-not-always-separating/m-p/282913#M54079</guid>
      <dc:creator>meburbo</dc:creator>
      <dc:date>2016-04-04T19:57:40Z</dc:date>
    </item>
  </channel>
</rss>

