<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Linebreaking Failure - BREAK_ONLY_BEFORE_DATE in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88495#M18347</link>
    <description>&lt;P&gt;I'm not 100% sure, but here's something to try:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt; BREAK_ONLY_BEFORE = ^#\s+\w+\s+\d+\s+
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;There really isn't any value in trying to match the entire line.  And on top of that, trying to use &lt;CODE&gt;$&lt;/CODE&gt; could be your problem.  In regex terms, depending on the mode, &lt;CODE&gt;$&lt;/CODE&gt; can mean end-of-line or end-of-string, and I'm really not sure if this regex is applied line-by-line or not.  I just know that I've gotten into trouble before with assumptions about &lt;CODE&gt;^&lt;/CODE&gt; and &lt;CODE&gt;$&lt;/CODE&gt;, and that could be having an effect here too.  In this case, you certainly don't need the extra "&lt;CODE&gt;.*$&lt;/CODE&gt;" on the end, so I'd drop it in any case.&lt;/P&gt;

&lt;P&gt;If that doesn't work, try also replacing the &lt;CODE&gt;^&lt;/CODE&gt; (for the same reason).&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt; BREAK_ONLY_BEFORE = [\r\n]+#\s+\w+\s+\d+\s+
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;BTW, is there any reason not to put the word "modify" in there instead of &lt;CODE&gt;\w+&lt;/CODE&gt;?&lt;/P&gt;

&lt;P&gt;Also, I assume you realize that only newly indexed events will be effected by this index-time change.&lt;/P&gt;

&lt;P&gt;It wouldn't hurt to add a &lt;CODE&gt;TIME_PREFIX&lt;/CODE&gt; and &lt;CODE&gt;TIME_FORMAT&lt;/CODE&gt; in there too, since there seem to be multiple unix epoch time sprinkled in your example.&lt;/P&gt;</description>
    <pubDate>Fri, 07 Oct 2011 18:08:42 GMT</pubDate>
    <dc:creator>Lowell</dc:creator>
    <dc:date>2011-10-07T18:08:42Z</dc:date>
    <item>
      <title>Linebreaking Failure - BREAK_ONLY_BEFORE_DATE</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88494#M18346</link>
      <description>&lt;P&gt;So here's a new one. I have an input (OpenLDAP Audit Logs). Each event (from #modify to #end modify) is generated at the same time, but on multiple lines, which splunk indexes. In props.conf, I set BREAK_ONLY_BEFORE_DATE = False. However when I do the searches, the events are not being parsed correctly. Below is an example of an event, and then the event split. It is acting as if BREAK_ONLY_BEFORE_DATE = True! &lt;/P&gt;

&lt;P&gt;The results from a btool debug &lt;A href="http://pastebin.com/xLBtX0Yd" target="_blank"&gt;can be found&lt;/A&gt;. I used &lt;STRONG&gt;splunk cmd btool --debug props list&lt;/STRONG&gt;. Did I miss something?&lt;/P&gt;

&lt;P&gt;props.conf&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[ldap:open:audit]
SHOULD_LINEMERGE = True
BREAK_ONLY_BEFORE = ^#\s+\w+\s+\d+\s+.*$
BREAK_ONLY_BEFORE_DATE = False
REPORT-MultiValueAudit = loa-MultiValueAudit
REPORT-AuditUser = loa-audituser
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;transforms.conf&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[loa-MultiValueAudit]
DELIMS="\n", ":"
MV_ADD=true

[loa-audituser]
REGEX = ^# modify (\d+) (dc=[\w,=]+) (\w+=([\w\s]*),([,=\w]*))
FORMAT = modifiedTimestamp::$1 suffix::$2 ModifiedBy::$4 ModifierBase::$5 fullModifierName::$3
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;====== EVENT TO PARSE (sourcetype=ldap:open:audit) =========&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;# modify 1317923089 dc=ycp,dc=edu cn=Manager,dc=contoso,dc=com  conn=-1
dn: uid=USER,ou=People,dc=contoso,dc=com
changetype: modify
delete: pwdFailureTime
-
replace: entryCSN
entryCSN: 20111006174450.024605Z#000000#004#000000
-
replace: modifiersName
modifiersName: cn=Manager,dc=contoso,dc=com
-
replace: modifyTimestamp
modifyTimestamp: 20111006174450Z
-
# end modify 1317923089
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;==== End Event ====&lt;/P&gt;

&lt;P&gt;==== Results of Event Parse ====&lt;/P&gt;

&lt;P&gt;===event1===&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;# modify 1317923089 dc=ycp,dc=edu cn=Manager,dc=contoso,dc=com  conn=-1
dn: uid=USER,ou=People,dc=contoso,dc=com
changetype: modify
delete: pwdFailureTime
-
replace: entryCSN
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;===end event1===&lt;/P&gt;

&lt;P&gt;===event2===&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;entryCSN: 20111006174450.024605Z#000000#004#000000
-
replace: modifiersName
modifiersName: cn=Manager,dc=contoso,dc=com
-
replace: modifyTimestamp
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;===end event2===&lt;/P&gt;

&lt;P&gt;===event3===&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;modifyTimestamp: 20111006174450Z
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;===end event3===&lt;/P&gt;

&lt;P&gt;===event4===&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;# end modify 1317923089
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;===end event4===&lt;/P&gt;

&lt;P&gt;==== End Results of Event Parse ====&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2020 09:57:50 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88494#M18346</guid>
      <dc:creator>alacercogitatus</dc:creator>
      <dc:date>2020-09-28T09:57:50Z</dc:date>
    </item>
    <item>
      <title>Re: Linebreaking Failure - BREAK_ONLY_BEFORE_DATE</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88495#M18347</link>
      <description>&lt;P&gt;I'm not 100% sure, but here's something to try:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt; BREAK_ONLY_BEFORE = ^#\s+\w+\s+\d+\s+
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;There really isn't any value in trying to match the entire line.  And on top of that, trying to use &lt;CODE&gt;$&lt;/CODE&gt; could be your problem.  In regex terms, depending on the mode, &lt;CODE&gt;$&lt;/CODE&gt; can mean end-of-line or end-of-string, and I'm really not sure if this regex is applied line-by-line or not.  I just know that I've gotten into trouble before with assumptions about &lt;CODE&gt;^&lt;/CODE&gt; and &lt;CODE&gt;$&lt;/CODE&gt;, and that could be having an effect here too.  In this case, you certainly don't need the extra "&lt;CODE&gt;.*$&lt;/CODE&gt;" on the end, so I'd drop it in any case.&lt;/P&gt;

&lt;P&gt;If that doesn't work, try also replacing the &lt;CODE&gt;^&lt;/CODE&gt; (for the same reason).&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt; BREAK_ONLY_BEFORE = [\r\n]+#\s+\w+\s+\d+\s+
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;BTW, is there any reason not to put the word "modify" in there instead of &lt;CODE&gt;\w+&lt;/CODE&gt;?&lt;/P&gt;

&lt;P&gt;Also, I assume you realize that only newly indexed events will be effected by this index-time change.&lt;/P&gt;

&lt;P&gt;It wouldn't hurt to add a &lt;CODE&gt;TIME_PREFIX&lt;/CODE&gt; and &lt;CODE&gt;TIME_FORMAT&lt;/CODE&gt; in there too, since there seem to be multiple unix epoch time sprinkled in your example.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Oct 2011 18:08:42 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88495#M18347</guid>
      <dc:creator>Lowell</dc:creator>
      <dc:date>2011-10-07T18:08:42Z</dc:date>
    </item>
    <item>
      <title>Re: Linebreaking Failure - BREAK_ONLY_BEFORE_DATE</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88496#M18348</link>
      <description>&lt;P&gt;Thanks for the reply, but still not working.  Here's what I used. I assumed that %+ was a unix timestamp. &lt;/P&gt;

&lt;P&gt;[ldap:open:audit]&lt;BR /&gt;
SHOULD_LINEMERGE = True&lt;BR /&gt;
BREAK_ONLY_BEFORE = [\r\n]+#\s+modify\s+\d+\s+&lt;BR /&gt;
BREAK_ONLY_BEFORE_DATE = False&lt;BR /&gt;
TIME_PREFIX = #\s+modify\s+&lt;BR /&gt;
TIME_FORMAT = %+&lt;BR /&gt;
REPORT-MultiValueAudit = loa-MultiValueAudit&lt;BR /&gt;
REPORT-AuditUser = loa-audituser&lt;BR /&gt;
priority = 100&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2020 09:57:53 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88496#M18348</guid>
      <dc:creator>alacercogitatus</dc:creator>
      <dc:date>2020-09-28T09:57:53Z</dc:date>
    </item>
    <item>
      <title>Re: Linebreaking Failure - BREAK_ONLY_BEFORE_DATE</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88497#M18349</link>
      <description>&lt;P&gt;I think your &lt;CODE&gt;TIME_FORMAT&lt;/CODE&gt; should be "&lt;CODE&gt;%s&lt;/CODE&gt;" and not "&lt;CODE&gt;%+&lt;/CODE&gt;"; I've never heard of "&lt;CODE&gt;%+&lt;/CODE&gt;" before, but it's possible I'm missing something.  Of course, that doesn't solve your breaking issue.&lt;/P&gt;</description>
      <pubDate>Fri, 07 Oct 2011 19:42:30 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88497#M18349</guid>
      <dc:creator>Lowell</dc:creator>
      <dc:date>2011-10-07T19:42:30Z</dc:date>
    </item>
    <item>
      <title>Re: Linebreaking Failure - BREAK_ONLY_BEFORE_DATE</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88498#M18350</link>
      <description>&lt;P&gt;I switched it to &lt;CODE&gt;%s&lt;/CODE&gt;. I found it &lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/Data/Configuretimestamprecognition#Enhanced_strptime.28.29_support" target="_blank"&gt;here&lt;/A&gt;. But it's still acting as if BREAK_ONLY_BEFORE_DATE = True. So odd, especially since it used to work......&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2020 09:57:56 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88498#M18350</guid>
      <dc:creator>alacercogitatus</dc:creator>
      <dc:date>2020-09-28T09:57:56Z</dc:date>
    </item>
    <item>
      <title>Re: Linebreaking Failure - BREAK_ONLY_BEFORE_DATE</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88499#M18351</link>
      <description>&lt;P&gt;Hey alacercogitatus, Give this stanza a try: &lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[ldap:open:audit] 
TIME_PREFIX = ^#\s+modify\s
TIME_FORMAT= %s
MAX_TIMESTAMP_LOOKAHEAD = 11
LINE_BREAKER = ([\r\n]+)(?=^#\s+modify\s\d{10}\s+)
SHOULD_LINEMERGE = false
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Mon, 10 Oct 2011 21:42:48 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88499#M18351</guid>
      <dc:creator>_d_</dc:creator>
      <dc:date>2011-10-10T21:42:48Z</dc:date>
    </item>
    <item>
      <title>Re: Linebreaking Failure - BREAK_ONLY_BEFORE_DATE</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88500#M18352</link>
      <description>&lt;P&gt;nope, it didn't work. It seems no matter what I do, it splits at the date.&lt;/P&gt;</description>
      <pubDate>Tue, 11 Oct 2011 12:57:57 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88500#M18352</guid>
      <dc:creator>alacercogitatus</dc:creator>
      <dc:date>2011-10-11T12:57:57Z</dc:date>
    </item>
    <item>
      <title>Re: Linebreaking Failure - BREAK_ONLY_BEFORE_DATE</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88501#M18353</link>
      <description>&lt;P&gt;BTW, &lt;CODE&gt;%+&lt;/CODE&gt; gives a timestamp like:  &lt;CODE&gt;Sat Oct  8 06:41:52 EDT 2011&lt;/CODE&gt;, so &lt;CODE&gt;%s&lt;/CODE&gt; is certainly what you want here.  (Just a side note, I was curious about this.  This certainly doesn't solve your problem, unfortunately.)&lt;/P&gt;</description>
      <pubDate>Tue, 11 Oct 2011 15:20:00 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88501#M18353</guid>
      <dc:creator>Lowell</dc:creator>
      <dc:date>2011-10-11T15:20:00Z</dc:date>
    </item>
    <item>
      <title>Re: Linebreaking Failure - BREAK_ONLY_BEFORE_DATE</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88502#M18354</link>
      <description>&lt;P&gt;I updated the formatting in this answer and perhaps tweaked regex as well.  There seemed to be some extra "&lt;CODE&gt;\&lt;/CODE&gt;s" in there which sometimes happens with formatting on this site.  I'd double check what you have and maybe try this again.  This approach should take all event-breaking logic out of the picture by forcing splunk to use custom line-breaking logic instead of event-combining logic which is where you were having issues.  (This should also be faster.  The downside is that it's more rigid and could cause issues if your log file varies much.)&lt;/P&gt;</description>
      <pubDate>Tue, 11 Oct 2011 15:28:43 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88502#M18354</guid>
      <dc:creator>Lowell</dc:creator>
      <dc:date>2011-10-11T15:28:43Z</dc:date>
    </item>
    <item>
      <title>Re: Linebreaking Failure - BREAK_ONLY_BEFORE_DATE</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88503#M18355</link>
      <description>&lt;P&gt;Is any of the stuff on this page helpful?   &lt;A href="http://splunk-base.splunk.com/answers/4075/whats-the-best-way-to-track-down-propsconf-problems"&gt;http://splunk-base.splunk.com/answers/4075/whats-the-best-way-to-track-down-propsconf-problems&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 11 Oct 2011 15:46:00 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88503#M18355</guid>
      <dc:creator>Lowell</dc:creator>
      <dc:date>2011-10-11T15:46:00Z</dc:date>
    </item>
    <item>
      <title>Re: Linebreaking Failure - BREAK_ONLY_BEFORE_DATE</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88504#M18356</link>
      <description>&lt;P&gt;I believe I found the issue. The logs were being splunked to indexers that weren't doing the index time configurations from the main indexer. Only the search-time extractions were being performed. So, I added the configs from the main indexer to the secondary indexers and it seems to be working. Thanks for all the recommendations!&lt;/P&gt;</description>
      <pubDate>Tue, 11 Oct 2011 17:22:59 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88504#M18356</guid>
      <dc:creator>alacercogitatus</dc:creator>
      <dc:date>2011-10-11T17:22:59Z</dc:date>
    </item>
    <item>
      <title>Re: Linebreaking Failure - BREAK_ONLY_BEFORE_DATE</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88505#M18357</link>
      <description>&lt;P&gt;@alacercogitatus Did you create app or modified the local\props.conf on the indexer for the solution?&lt;/P&gt;</description>
      <pubDate>Fri, 24 Apr 2020 15:31:59 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88505#M18357</guid>
      <dc:creator>youngsuh</dc:creator>
      <dc:date>2020-04-24T15:31:59Z</dc:date>
    </item>
    <item>
      <title>Re: Linebreaking Failure - BREAK_ONLY_BEFORE_DATE</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88506#M18358</link>
      <description>&lt;P&gt;&lt;CODE&gt;LINE_BREAKER=([\r\n]+)time:&lt;/CODE&gt;  worked for me.&lt;/P&gt;</description>
      <pubDate>Tue, 05 May 2020 20:16:07 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Linebreaking-Failure-BREAK-ONLY-BEFORE-DATE/m-p/88506#M18358</guid>
      <dc:creator>youngsuh</dc:creator>
      <dc:date>2020-05-05T20:16:07Z</dc:date>
    </item>
  </channel>
</rss>

