<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: What is the effect of annotate_punct on indexing time? in Deployment Architecture</title>
    <link>https://community.splunk.com/t5/Deployment-Architecture/What-is-the-effect-of-annotate-punct-on-indexing-time/m-p/459839#M20254</link>
    <description>&lt;P&gt;Our sales engineer said -&lt;/P&gt;

&lt;P&gt;PUNCT is exactly like it sounds; it’s an index-time field containing an ordered list of punctuations in an event. This is extremely useful for finding “patterns” of events; like a windows event where the service name and IP address would change but the event &lt;EM&gt;structure&lt;/EM&gt; would remain the same.&lt;/P&gt;

&lt;P&gt;It’s used in the background by Splunk sometimes. Very useful for eventtype, tagging, etc.&lt;/P&gt;

&lt;P&gt;ANNOTATE_PUNCT in particular is a toggling switch for this setting. It’s on by default, but if you have;&lt;BR /&gt;
1.  Extremely long events&lt;BR /&gt;
2.  Extremely frequent events&lt;BR /&gt;
3.  Events all of the same PUNCT pattern&lt;BR /&gt;
4.  Events of all different PUNCT patterns&lt;/P&gt;

&lt;P&gt;Than turning it off will reduce indexer CPU load on the parsing queue in the indexing pipeline.&lt;/P&gt;</description>
    <pubDate>Wed, 07 Nov 2018 14:25:37 GMT</pubDate>
    <dc:creator>ddrillic</dc:creator>
    <dc:date>2018-11-07T14:25:37Z</dc:date>
    <item>
      <title>What is the effect of annotate_punct on indexing time?</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/What-is-the-effect-of-annotate-punct-on-indexing-time/m-p/459838#M20253</link>
      <description>&lt;P&gt;The architecting Splunk 7.1 Enterprise Deployments class empathizes that setting &lt;CODE&gt;annotate_punct = false&lt;/CODE&gt; in &lt;CODE&gt;props.conf&lt;/CODE&gt; at indexer-level can improve significantly the indexing time.&lt;/P&gt;

&lt;P&gt;I wonder why setting it like this can improve indexing time and in which cases we should keep the punctuations field. &lt;/P&gt;</description>
      <pubDate>Tue, 06 Nov 2018 23:47:39 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/What-is-the-effect-of-annotate-punct-on-indexing-time/m-p/459838#M20253</guid>
      <dc:creator>ddrillic</dc:creator>
      <dc:date>2018-11-06T23:47:39Z</dc:date>
    </item>
    <item>
      <title>Re: What is the effect of annotate_punct on indexing time?</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/What-is-the-effect-of-annotate-punct-on-indexing-time/m-p/459839#M20254</link>
      <description>&lt;P&gt;Our sales engineer said -&lt;/P&gt;

&lt;P&gt;PUNCT is exactly like it sounds; it’s an index-time field containing an ordered list of punctuations in an event. This is extremely useful for finding “patterns” of events; like a windows event where the service name and IP address would change but the event &lt;EM&gt;structure&lt;/EM&gt; would remain the same.&lt;/P&gt;

&lt;P&gt;It’s used in the background by Splunk sometimes. Very useful for eventtype, tagging, etc.&lt;/P&gt;

&lt;P&gt;ANNOTATE_PUNCT in particular is a toggling switch for this setting. It’s on by default, but if you have;&lt;BR /&gt;
1.  Extremely long events&lt;BR /&gt;
2.  Extremely frequent events&lt;BR /&gt;
3.  Events all of the same PUNCT pattern&lt;BR /&gt;
4.  Events of all different PUNCT patterns&lt;/P&gt;

&lt;P&gt;Than turning it off will reduce indexer CPU load on the parsing queue in the indexing pipeline.&lt;/P&gt;</description>
      <pubDate>Wed, 07 Nov 2018 14:25:37 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/What-is-the-effect-of-annotate-punct-on-indexing-time/m-p/459839#M20254</guid>
      <dc:creator>ddrillic</dc:creator>
      <dc:date>2018-11-07T14:25:37Z</dc:date>
    </item>
    <item>
      <title>Re: What is the effect of annotate_punct on indexing time?</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/What-is-the-effect-of-annotate-punct-on-indexing-time/m-p/459840#M20255</link>
      <description>&lt;P&gt;It seems to me that most log files would fall under the 3 category - &lt;EM&gt;Events all of the same PUNCT pattern&lt;/EM&gt;.&lt;/P&gt;</description>
      <pubDate>Wed, 07 Nov 2018 14:41:51 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/What-is-the-effect-of-annotate-punct-on-indexing-time/m-p/459840#M20255</guid>
      <dc:creator>ddrillic</dc:creator>
      <dc:date>2018-11-07T14:41:51Z</dc:date>
    </item>
  </channel>
</rss>

