<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Record Event Break in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Record-Event-Break/m-p/122317#M25331</link>
    <description>&lt;P&gt;This seemed to work.&lt;/P&gt;

&lt;H1&gt;splunk clean eventdata -index &amp;lt;index&amp;gt;&lt;/H1&gt;

&lt;P&gt;Entered the following in prop.cond&lt;/P&gt;

&lt;P&gt;&amp;lt;source::/auditserverDAT/audit*&amp;gt;&lt;BR /&gt;
BREAK_ONLY_BEFORE=&amp;lt;record version&lt;/P&gt;</description>
    <pubDate>Mon, 28 Sep 2020 20:02:51 GMT</pubDate>
    <dc:creator>rjsteele</dc:creator>
    <dc:date>2020-09-28T20:02:51Z</dc:date>
    <item>
      <title>Record Event Break</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Record-Event-Break/m-p/122314#M25328</link>
      <description>&lt;P&gt;It does not look like Splunk is breaking my XML correctly. I’d like to break it before each ‘&amp;lt;record version=’ tag in the XML. I am trying to set the props.conf with the following entry, but it does not seem to affect the way Splunk breaks the already indexed data, even after I stop and start Splunk. Can anyone assist?&lt;/P&gt;

&lt;P&gt;&amp;lt;source::/auditserverDAT/audit*&amp;gt;&lt;BR /&gt;
LINE_BREAKER=&amp;lt;record version&lt;/P&gt;

&lt;P&gt;Here is a sample XML file with a few records:&lt;/P&gt;

&lt;P&gt;&amp;lt;?xml version='1.0' encoding='UTF-8' ?&amp;gt;&lt;BR /&gt;
&amp;lt;?xml-stylesheet type='text/xsl' href='file:///usr/share/lib/xml/style/adt_record.xsl.1' ?&amp;gt;&lt;BR /&gt;
&amp;lt;!DOCTYPE audit PUBLIC '-//Sun Microsystems, Inc.//DTD Audit V1//EN' 'file:///usr/share/lib/xml/dtd/adt_record.dtd.1'&amp;gt;&lt;BR /&gt;
&amp;lt;audit&amp;gt;&lt;BR /&gt;
&amp;lt;file iso8601="2007-08-21 10:35:46.111 +08:00"&amp;gt;/var/audit/20070821023545.20070821023546.chihung&amp;lt;/file&amp;gt;&lt;BR /&gt;
&amp;lt;record version="2" event="system booted" modifier="na" iso8601="2007-08-21 10:34:49.704 +08:00"&amp;gt;&lt;BR /&gt;
&amp;lt;text&amp;gt;booting kernel&amp;lt;/text&amp;gt;&lt;BR /&gt;
&amp;lt;/record&amp;gt;&lt;BR /&gt;
&amp;lt;record version="2" event="stat(2)" host="chihung" iso8601="2007-08-21 10:39:22.700 +08:00"&amp;gt;&lt;BR /&gt;
&amp;lt;path&amp;gt;/usr/lib/pt_chmod&amp;lt;/path&amp;gt;&lt;BR /&gt;
&amp;lt;attribute mode="104511" uid="root" gid="bin" fsid="85" nodeid="623" device="0"/&amp;gt;&lt;BR /&gt;
&amp;lt;subject audit-uid="chihung" uid="root" gid="staff" ruid="chihung" rgid="staff" pid="704" sid="2477344358" tid="756 65558 ftpl_2_207"/&amp;gt;&lt;BR /&gt;
&amp;lt;return errval="success" retval="0"/&amp;gt;&lt;BR /&gt;
&amp;lt;/record&amp;gt;&lt;BR /&gt;
&amp;lt;record version="2" event="stat(2)" modifier="fe" host="chihung" iso8601="2007-08-21 10:39:22.727 +08:00"&amp;gt;&lt;BR /&gt;
&amp;lt;path&amp;gt;/platform/SUNW,UltraSPARC-IIi-cEngine/lib&amp;lt;/path&amp;gt;&lt;BR /&gt;
&amp;lt;subject audit-uid="chihung" uid="root" gid="staff" ruid="chihung" rgid="staff" pid="704" sid="2477344358" tid="756 65558 ftpl_2_207"/&amp;gt;&lt;BR /&gt;
&amp;lt;return errval="failure: No such file or directory&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2020 20:02:48 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Record-Event-Break/m-p/122314#M25328</guid>
      <dc:creator>rjsteele</dc:creator>
      <dc:date>2020-09-28T20:02:48Z</dc:date>
    </item>
    <item>
      <title>Re: Record Event Break</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Record-Event-Break/m-p/122315#M25329</link>
      <description>&lt;P&gt;Breaking a stream of data into lines, and aggregating that data back into events happens only once, at index time. If you have events already indexed by Splunk, these will not be affected by changes to line breaking or aggregation settings. You could hide or remove existing data and &lt;A href="http://answers.splunk.com/answers/72562/how-to-reindex-data-from-a-forwarder.html" target="_blank"&gt;reindex&lt;/A&gt; them to apply new settings.&lt;/P&gt;

&lt;P&gt;Now regarding your &lt;A href="http://docs.splunk.com/Documentation/Splunk/6.2.3/Admin/propsconf" target="_blank"&gt;props.conf&lt;/A&gt;, if you're wanting to use LINE_BREAKER to split events, you must include a capturing group which is where the division of the line actually occurs, and you should also set SHOULD_LINEMERGE = false so that Splunk doesn't attempt to merge these "lines" back into larger events. You could also consider  instead using BREAK_ONLY_BEFORE and other attributes to determine event boundaries. &lt;/P&gt;

&lt;P&gt;An excellent reference on at what part of indexing and parsing each attribute is referenced can be found &lt;A href="http://wiki.splunk.com/Community:HowIndexingWorks" target="_blank"&gt;on the splunk wiki&lt;/A&gt;. The splunk docs contain a &lt;A href="http://docs.splunk.com/Documentation/Splunk/6.2.3/Data/Overviewofeventprocessing" target="_blank"&gt;bunch of pages on event processing&lt;/A&gt; as well. There is also a guide on &lt;A href="http://docs.splunk.com/Documentation/Splunk/6.2.3/Indexer/RemovedatafromSplunk" target="_blank"&gt;how to remove already indexed data&lt;/A&gt; from Splunk with options ranging from &lt;A href="http://docs.splunk.com/Documentation/Splunk/6.2.3/SearchReference/delete" target="_blank"&gt;hiding individual events&lt;/A&gt; to destroying entire indexes. &lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2020 20:04:31 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Record-Event-Break/m-p/122315#M25329</guid>
      <dc:creator>acharlieh</dc:creator>
      <dc:date>2020-09-28T20:04:31Z</dc:date>
    </item>
    <item>
      <title>Re: Record Event Break</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Record-Event-Break/m-p/122316#M25330</link>
      <description>&lt;P&gt;Thanks acharlieh. I will look into the references. I do not want to split event, sorry for the confusion. I simply want Splunk to use '&lt;/P&gt;</description>
      <pubDate>Mon, 25 May 2015 18:21:59 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Record-Event-Break/m-p/122316#M25330</guid>
      <dc:creator>rjsteele</dc:creator>
      <dc:date>2015-05-25T18:21:59Z</dc:date>
    </item>
    <item>
      <title>Re: Record Event Break</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Record-Event-Break/m-p/122317#M25331</link>
      <description>&lt;P&gt;This seemed to work.&lt;/P&gt;

&lt;H1&gt;splunk clean eventdata -index &amp;lt;index&amp;gt;&lt;/H1&gt;

&lt;P&gt;Entered the following in prop.cond&lt;/P&gt;

&lt;P&gt;&amp;lt;source::/auditserverDAT/audit*&amp;gt;&lt;BR /&gt;
BREAK_ONLY_BEFORE=&amp;lt;record version&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2020 20:02:51 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Record-Event-Break/m-p/122317#M25331</guid>
      <dc:creator>rjsteele</dc:creator>
      <dc:date>2020-09-28T20:02:51Z</dc:date>
    </item>
  </channel>
</rss>

