<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Why is Splunk Truncating Multi-line Events? in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145660#M29690</link>
    <description>&lt;P&gt;Here is my props.conf entry for log4j:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[log4j] 
TIME_FORMAT = %Y-%m-%d %H:%M:%S 
TIME_PREFIX = ^ 
MAX_TIMESTAMP_LOOKAHEAD = 25
BREAK_ONLY_BEFORE=^\d{4}-\d{1,2}-\d{1,2} \d{1,2}:\d{1,2}:\d{1,2},\d{3} 
NO_BINARY_CHECK = 1 
pulldown_type = 
true maxDist = 75
&lt;/CODE&gt;&lt;/PRE&gt;</description>
    <pubDate>Thu, 17 Jul 2014 14:51:42 GMT</pubDate>
    <dc:creator>sjnorman</dc:creator>
    <dc:date>2014-07-17T14:51:42Z</dc:date>
    <item>
      <title>Why is Splunk Truncating Multi-line Events?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145652#M29682</link>
      <description>&lt;P&gt;I'm indexing some Java application log files that use the log4j framework to output log messages. The log files are intermixed with CXF logging interceptor statements that log inbound/outbound SOAP messages that have the following format:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;2014-07-16 10:25:13,812 INFO  WebContainer : 16 - Inbound Message
---------------------------- 
ID: 15231
Response-Code: 200
Encoding: UTF-8
Content-Type: text/xml;charset=UTF-8
Headers: {Content-Length=[5612], content-type=[text/xml;charset=UTF-8], Date=[Wed, 16 Jul 2014 15:25:13 GMT], Server=[Jetty(7.1.6.v20100715)]}
Payload: &amp;lt;soap:Envelope xmlns:soap="http://schemas.xmlsoap.org/soap/envelope/"&amp;gt;&amp;lt;soap:Header&amp;gt;&amp;lt;/soap:Header&amp;gt;&amp;lt;soap:Body&amp;gt;&amp;lt;MyXmlMessage&amp;gt;&amp;lt;/MyXmlMessage&amp;gt;&amp;lt;/soap:Body&amp;gt;&amp;lt;/soap:Envelope&amp;gt;
----------------------------
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;I'd like to log these statements as single, multi-line events but Splunk seems to be randomly truncating the events after the following line: "Content-Type: text/xml;charset=UTF-8"&lt;/P&gt;

&lt;P&gt;i.e. some events include the full context (including the payload), whereas others only include up to the content-type.&lt;/P&gt;

&lt;P&gt;Here's what my props.conf looks like:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[default]
CHARSET = UTF-8
LINE_BREAKER_LOOKBEHIND = 100
TRUNCATE = 0
DATETIME_CONFIG = /etc/datetime.xml
ANNOTATE_PUNCT = True
HEADER_MODE =
MAX_DAYS_HENCE=2
MAX_DAYS_AGO=2000
MAX_DIFF_SECS_AGO=3600
MAX_DIFF_SECS_HENCE=604800
MAX_TIMESTAMP_LOOKAHEAD = 128
SHOULD_LINEMERGE = True
BREAK_ONLY_BEFORE = 
BREAK_ONLY_BEFORE_DATE = True
MAX_EVENTS = 20000 
MUST_BREAK_AFTER = 
MUST_NOT_BREAK_AFTER = 
MUST_NOT_BREAK_BEFORE = 


[log4j]
TIME_FORMAT = %Y-%m-%d %H:%M:%S
TIME_PREFIX = ^
MAX_TIMESTAMP_LOOKAHEAD = 30
#BREAK_ONLY_BEFORE = \d\d?:\d\d:\d\d
BREAK_ONLY_BEFORE=^\d{4}-\d{1,2}-\d{1,2} \d{1,2}:\d{1,2}:\d{1,2},\d{3}
NO_BINARY_CHECK = true
pulldown_type = true 
maxDist = 75
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;Can anyone explain why Splunk would be truncating the events prematurely?&lt;/P&gt;</description>
      <pubDate>Wed, 16 Jul 2014 15:46:33 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145652#M29682</guid>
      <dc:creator>sjnorman</dc:creator>
      <dc:date>2014-07-16T15:46:33Z</dc:date>
    </item>
    <item>
      <title>Re: Why is Splunk Truncating Multi-line Events?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145653#M29683</link>
      <description>&lt;P&gt;The props seems correct, especially the BREAK_ONLY_BEFORE.&lt;/P&gt;

&lt;UL&gt;
&lt;LI&gt;Try  to add BREAK_ONLY_BEFORE_DATE = false&lt;/LI&gt;
&lt;LI&gt;and make sure that the props.conf is deployed on the &lt;EM&gt;indexers&lt;/EM&gt; and &lt;EM&gt;heavy forwarders&lt;/EM&gt; (if any), because they are the instances parsing the events.&lt;/LI&gt;
&lt;/UL&gt;</description>
      <pubDate>Mon, 28 Sep 2020 17:05:47 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145653#M29683</guid>
      <dc:creator>yannK</dc:creator>
      <dc:date>2020-09-28T17:05:47Z</dc:date>
    </item>
    <item>
      <title>Re: Why is Splunk Truncating Multi-line Events?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145654#M29684</link>
      <description>&lt;P&gt;Thanks for the suggestion, but it didn't seem to have any effect -- the behaviour is still the same.&lt;/P&gt;

&lt;P&gt;FYI, yes I've made the changes to props.conf on my universal forwarders and re-started them afterwards.&lt;/P&gt;</description>
      <pubDate>Wed, 16 Jul 2014 17:36:08 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145654#M29684</guid>
      <dc:creator>sjnorman</dc:creator>
      <dc:date>2014-07-16T17:36:08Z</dc:date>
    </item>
    <item>
      <title>Re: Why is Splunk Truncating Multi-line Events?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145655#M29685</link>
      <description>&lt;P&gt;The parsing is no happening at the universal/lightweight forwarder level, so it should not make a difference.&lt;/P&gt;</description>
      <pubDate>Wed, 16 Jul 2014 17:54:22 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145655#M29685</guid>
      <dc:creator>yannK</dc:creator>
      <dc:date>2014-07-16T17:54:22Z</dc:date>
    </item>
    <item>
      <title>Re: Why is Splunk Truncating Multi-line Events?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145656#M29686</link>
      <description>&lt;P&gt;I think the timestamp in the payload line in combination with some of your other options is tripping it up. &lt;/P&gt;

&lt;P&gt;I made a small sample file and got proper breaking with something as simple as this for the sourcetype:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;# chanfoli's settings
MAX_TIMESTAMP_LOOKAHEAD=25
NO_BINARY_CHECK=1
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Wed, 16 Jul 2014 18:13:30 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145656#M29686</guid>
      <dc:creator>chanfoli</dc:creator>
      <dc:date>2014-07-16T18:13:30Z</dc:date>
    </item>
    <item>
      <title>Re: Why is Splunk Truncating Multi-line Events?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145657#M29687</link>
      <description>&lt;P&gt;I applied the changes and still suffer from the same problem...somewhere between 25% and 50% of the events for the CXF log statements are being cut off after the "Content-Type: text/xml; charset=UTF-8" line. I really don't know what's tripping it up there.&lt;/P&gt;</description>
      <pubDate>Thu, 17 Jul 2014 14:25:28 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145657#M29687</guid>
      <dc:creator>sjnorman</dc:creator>
      <dc:date>2014-07-17T14:25:28Z</dc:date>
    </item>
    <item>
      <title>Re: Why is Splunk Truncating Multi-line Events?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145658#M29688</link>
      <description>&lt;P&gt;FYI, I've checked the log files manually and there are no special characters that would be tripping up Splunk -- all lines end with a line feed character.&lt;/P&gt;</description>
      <pubDate>Thu, 17 Jul 2014 14:35:08 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145658#M29688</guid>
      <dc:creator>sjnorman</dc:creator>
      <dc:date>2014-07-17T14:35:08Z</dc:date>
    </item>
    <item>
      <title>Re: Why is Splunk Truncating Multi-line Events?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145659#M29689</link>
      <description>&lt;P&gt;FYI, I manually checked the log files and the lines all end with line feeds...&lt;/P&gt;</description>
      <pubDate>Thu, 17 Jul 2014 14:50:41 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145659#M29689</guid>
      <dc:creator>sjnorman</dc:creator>
      <dc:date>2014-07-17T14:50:41Z</dc:date>
    </item>
    <item>
      <title>Re: Why is Splunk Truncating Multi-line Events?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145660#M29690</link>
      <description>&lt;P&gt;Here is my props.conf entry for log4j:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[log4j] 
TIME_FORMAT = %Y-%m-%d %H:%M:%S 
TIME_PREFIX = ^ 
MAX_TIMESTAMP_LOOKAHEAD = 25
BREAK_ONLY_BEFORE=^\d{4}-\d{1,2}-\d{1,2} \d{1,2}:\d{1,2}:\d{1,2},\d{3} 
NO_BINARY_CHECK = 1 
pulldown_type = 
true maxDist = 75
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Thu, 17 Jul 2014 14:51:42 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145660#M29690</guid>
      <dc:creator>sjnorman</dc:creator>
      <dc:date>2014-07-17T14:51:42Z</dc:date>
    </item>
    <item>
      <title>Re: Why is Splunk Truncating Multi-line Events?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145661#M29691</link>
      <description>&lt;P&gt;Doh! I was applying the configuration to the forwarders. I applied the update to the &lt;EM&gt;indexer&lt;/EM&gt; and it seems to be working now, thanks!&lt;/P&gt;</description>
      <pubDate>Thu, 17 Jul 2014 15:04:43 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Why-is-Splunk-Truncating-Multi-line-Events/m-p/145661#M29691</guid>
      <dc:creator>sjnorman</dc:creator>
      <dc:date>2014-07-17T15:04:43Z</dc:date>
    </item>
  </channel>
</rss>

