<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Describe the pattern matching syntax used for 'punct'? in Splunk Search</title>
    <link>https://community.splunk.com/t5/Splunk-Search/Describe-the-pattern-matching-syntax-used-for-punct/m-p/101621#M182884</link>
    <description>&lt;P&gt;punct is a field just like any other.  The content is punct is the same as the event stripping all letters and number, and replacing whitespace with the underscore.  Leaving just the PUNCTuation.&lt;/P&gt;

&lt;P&gt;punct is useful for finding similar messages that have varying some varying text in them.  Process ids, host names, and times could be different in the content of several events, but the actual message that you care about could be the same.  In may cases like this the punct will be the same across those events.&lt;/P&gt;</description>
    <pubDate>Tue, 27 Mar 2012 01:30:34 GMT</pubDate>
    <dc:creator>sam</dc:creator>
    <dc:date>2012-03-27T01:30:34Z</dc:date>
    <item>
      <title>Describe the pattern matching syntax used for 'punct'?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Describe-the-pattern-matching-syntax-used-for-punct/m-p/101620#M182883</link>
      <description>&lt;P&gt;I am trying to determine how I can use 'punct' to match certain patterns and set eventtypes for my data.&lt;/P&gt;

&lt;P&gt;I see &lt;CODE&gt;punct&lt;/CODE&gt; described in the documentation at &lt;A href="http://docs.splunk.com/Documentation/Splunk/4.3.1/User/UseDefaultAndInternalFields#punct"&gt;UseDefaultAndInternalFields&lt;/A&gt; and &lt;A href="http://docs.splunk.com/Documentation/Splunk/4.3.1/User/ClassifyAndGroupSimilarEvents#Identify_similar_events_with_punct"&gt;ClassifyAndGroupSimilarEvents&lt;/A&gt;, but I don't see any description on how to read the syntax.&lt;/P&gt;

&lt;P&gt;What does something like &lt;CODE&gt;punct="&amp;lt;&amp;gt;__::_..._[]:_=_=___=\"=,=,=,=,=,=\""&lt;/CODE&gt; mean? It's obviously some sort of pattern matching behavior like regular expressions or globbing, but I don't see this defined anywhere.&lt;/P&gt;

&lt;P&gt;Does punct support wildcards?&lt;/P&gt;

&lt;P&gt;Is there an easy way to experiment with different punct patterns and see if they correctly match my data? I need a way to quickly compare one pattern vs. another, so I can determine if a particular punct is too narrow or too broad.&lt;/P&gt;

&lt;P&gt;On any given search, Splunk will suggest over 50 different puncts, which are very difficult to compare. My logs are all sent via syslog, and follow the standard formats defined in RFC 5424 (RFC 3164).&lt;/P&gt;</description>
      <pubDate>Mon, 26 Mar 2012 23:57:55 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Describe-the-pattern-matching-syntax-used-for-punct/m-p/101620#M182883</guid>
      <dc:creator>stefanlasiewski</dc:creator>
      <dc:date>2012-03-26T23:57:55Z</dc:date>
    </item>
    <item>
      <title>Re: Describe the pattern matching syntax used for 'punct'?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Describe-the-pattern-matching-syntax-used-for-punct/m-p/101621#M182884</link>
      <description>&lt;P&gt;punct is a field just like any other.  The content is punct is the same as the event stripping all letters and number, and replacing whitespace with the underscore.  Leaving just the PUNCTuation.&lt;/P&gt;

&lt;P&gt;punct is useful for finding similar messages that have varying some varying text in them.  Process ids, host names, and times could be different in the content of several events, but the actual message that you care about could be the same.  In may cases like this the punct will be the same across those events.&lt;/P&gt;</description>
      <pubDate>Tue, 27 Mar 2012 01:30:34 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Describe-the-pattern-matching-syntax-used-for-punct/m-p/101621#M182884</guid>
      <dc:creator>sam</dc:creator>
      <dc:date>2012-03-27T01:30:34Z</dc:date>
    </item>
    <item>
      <title>Re: Describe the pattern matching syntax used for 'punct'?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Describe-the-pattern-matching-syntax-used-for-punct/m-p/101622#M182885</link>
      <description>&lt;P&gt;Deep in the Knowledge Manager Manual, I found more details about &lt;CODE&gt;punct&lt;/CODE&gt;. In the section "&lt;A href="http://docs.splunk.com/Documentation/Splunk/4.3.1/Knowledge/Abouteventtypes#Use_the_punct_field_to_search_on_similar_events"&gt;Use the punct field to search on similar events &lt;/A&gt;" says:&lt;/P&gt;

&lt;BLOCKQUOTE&gt;
&lt;P&gt;The punct field stores the first 30 punctuation characters in the first line of the event. This field is useful for finding similar events quickly.&lt;/P&gt;

&lt;P&gt;When you use punct, keep in mind:&lt;/P&gt;

&lt;UL&gt;
&lt;LI&gt;Quotes and backslashes are escaped.&lt;/LI&gt;
&lt;LI&gt;Spaces are replaced with an underscore (_).&lt;/LI&gt;
&lt;LI&gt;Tabs are replaced with a "t".&lt;/LI&gt;
&lt;LI&gt;Dashes that follow alphanumeric characters are ignored.&lt;/LI&gt;
&lt;LI&gt;&lt;P&gt;Interesting punctuation characters are:&lt;/P&gt;

&lt;P&gt;",;-#$%&amp;amp;+./:=?@\'|*\n\r\"(){}&amp;lt;&amp;gt;[]^!" &lt;/P&gt;&lt;/LI&gt;
&lt;/UL&gt;
&lt;/BLOCKQUOTE&gt;

&lt;P&gt;In addition, wildcards &lt;EM&gt;are&lt;/EM&gt; supported, according to "&lt;A href="http://docs.splunk.com/Documentation/Splunk/4.3.1/User/ClassifyAndGroupSimilarEvents#Identify_similar_events_with_punct"&gt;Identify similar events with punct&lt;/A&gt;". However, this description is vague. &lt;/P&gt;

&lt;BLOCKQUOTE&gt;
&lt;P&gt;You may want to consider wildcarding the punctuation to match insignificant variations (for example, "punct=::[]&lt;EM&gt;/&lt;/EM&gt;"). &lt;/P&gt;
&lt;/BLOCKQUOTE&gt;</description>
      <pubDate>Tue, 27 Mar 2012 04:16:14 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Describe-the-pattern-matching-syntax-used-for-punct/m-p/101622#M182885</guid>
      <dc:creator>stefanlasiewski</dc:creator>
      <dc:date>2012-03-27T04:16:14Z</dc:date>
    </item>
    <item>
      <title>Re: Describe the pattern matching syntax used for 'punct'?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Describe-the-pattern-matching-syntax-used-for-punct/m-p/101623#M182886</link>
      <description>&lt;P&gt;I cannot edit this answer anymore, but I thought I would provide an update. The 6.2.0 manual has improved this documentation somewhat. See &lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/Knowledge/Abouteventtypes#Use_the_punct_field_to_search_on_similar_events"&gt;http://docs.splunk.com/Documentation/Splunk/latest/Knowledge/Abouteventtypes#Use_the_punct_field_to_search_on_similar_events&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 16 Dec 2014 19:10:22 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Describe-the-pattern-matching-syntax-used-for-punct/m-p/101623#M182886</guid>
      <dc:creator>stefanlasiewski</dc:creator>
      <dc:date>2014-12-16T19:10:22Z</dc:date>
    </item>
    <item>
      <title>Re: Describe the pattern matching syntax used for 'punct'?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Describe-the-pattern-matching-syntax-used-for-punct/m-p/693781#M236015</link>
      <description>&lt;P&gt;punct can be super helpful for trying to diagnose issues by looking at the kinds of events in your logs. I use this pretty regularly on Windows machines when there is an issue going on.&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;index=wineventlog
| stats latest(_raw) as _raw count by source, punct
| sort - count&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 18 Jul 2024 18:31:40 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Describe-the-pattern-matching-syntax-used-for-punct/m-p/693781#M236015</guid>
      <dc:creator>fredclown</dc:creator>
      <dc:date>2024-07-18T18:31:40Z</dc:date>
    </item>
  </channel>
</rss>

