<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: multiline events using line merge weird splitting issue in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/multiline-events-using-line-merge-weird-splitting-issue/m-p/30016#M5209</link>
    <description>&lt;P&gt;If you are using a buffered logger where there may be delays, this is likely the problem. Splunk will flush the file with a _doneKey after a certain time:&lt;/P&gt;

&lt;P&gt;&lt;A href="http://www.splunk.com/base/Documentation/latest/Admin/Inputsconf" rel="nofollow"&gt;http://www.splunk.com/base/Documentation/latest/Admin/Inputsconf&lt;/A&gt;&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;time_before_close = &amp;lt;integer&amp;gt;
* Modtime delta required before Splunk can close a file on EOF.
* Tells the system not to close files that have been updated in past &amp;lt;integer&amp;gt; seconds.
* Defaults to 3.
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;By the way, the below should be sufficient:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[my_log]
BREAK_ONLY_BEFORE = \-\-\-\sbegin
MAX_EVENTS = 10000
TRUNCATE = 100000
&lt;/CODE&gt;&lt;/PRE&gt;</description>
    <pubDate>Sat, 22 Jan 2011 04:38:42 GMT</pubDate>
    <dc:creator>araitz</dc:creator>
    <dc:date>2011-01-22T04:38:42Z</dc:date>
    <item>
      <title>multiline events using line merge weird splitting issue</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/multiline-events-using-line-merge-weird-splitting-issue/m-p/30014#M5207</link>
      <description>&lt;P&gt;Hello, I have a big log file that is set to be sourcetype=my_log and it basically looks like this:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;--- begin_request ---
blah blah
blah blah
--- end_request ---
--- begin_request ---
blah blah
--- end_request ---
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;and so on. With props.conf configuration below events are correctly split most of the time but sometimes they are just split somewhere in the middle. This happens to small and large events. &lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[my_log]
BREAK_ONLY_BEFORE_DATE = False
SHOULD_LINEMERGE = true
BREAK_ONLY_BEFORE = --- begin_request ---
MUST_NOT_BREAK_AFTER = --- begin_request ---
MUST_BREAK_AFTER = --- end_request ---
MAX_EVENTS = 10000
TRUNCATE = 100000
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;App that produces my_log uses a buffered logger so there are random and periodic delays of how log data is flushed into the log file and it is definitely possible that a single event will not be flushed to disk as a whole. That might be the cause for the strange event splits  but I'm not sure.&lt;/P&gt;

&lt;P&gt;Any ideas?&lt;/P&gt;</description>
      <pubDate>Fri, 21 Jan 2011 07:52:45 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/multiline-events-using-line-merge-weird-splitting-issue/m-p/30014#M5207</guid>
      <dc:creator>cppforlife</dc:creator>
      <dc:date>2011-01-21T07:52:45Z</dc:date>
    </item>
    <item>
      <title>Re: multiline events using line merge weird splitting issue</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/multiline-events-using-line-merge-weird-splitting-issue/m-p/30015#M5208</link>
      <description>&lt;P&gt;No solution to the breaking problem, but have you considered using transactions to achieve the same thing? Have Splunk index the events line by line but group them together as transactions using startswith="--- begin_request ---" and endswith="--- end_request ---".&lt;/P&gt;</description>
      <pubDate>Fri, 21 Jan 2011 19:24:49 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/multiline-events-using-line-merge-weird-splitting-issue/m-p/30015#M5208</guid>
      <dc:creator>Ayn</dc:creator>
      <dc:date>2011-01-21T19:24:49Z</dc:date>
    </item>
    <item>
      <title>Re: multiline events using line merge weird splitting issue</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/multiline-events-using-line-merge-weird-splitting-issue/m-p/30016#M5209</link>
      <description>&lt;P&gt;If you are using a buffered logger where there may be delays, this is likely the problem. Splunk will flush the file with a _doneKey after a certain time:&lt;/P&gt;

&lt;P&gt;&lt;A href="http://www.splunk.com/base/Documentation/latest/Admin/Inputsconf" rel="nofollow"&gt;http://www.splunk.com/base/Documentation/latest/Admin/Inputsconf&lt;/A&gt;&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;time_before_close = &amp;lt;integer&amp;gt;
* Modtime delta required before Splunk can close a file on EOF.
* Tells the system not to close files that have been updated in past &amp;lt;integer&amp;gt; seconds.
* Defaults to 3.
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;By the way, the below should be sufficient:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[my_log]
BREAK_ONLY_BEFORE = \-\-\-\sbegin
MAX_EVENTS = 10000
TRUNCATE = 100000
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Sat, 22 Jan 2011 04:38:42 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/multiline-events-using-line-merge-weird-splitting-issue/m-p/30016#M5209</guid>
      <dc:creator>araitz</dc:creator>
      <dc:date>2011-01-22T04:38:42Z</dc:date>
    </item>
  </channel>
</rss>

