<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Is the Enterprise Security 'ECSU - Remote Desktop Network Bruteforce - Rule' correlation search incorrect? in Splunk Search</title>
    <link>https://community.splunk.com/t5/Splunk-Search/Is-the-Enterprise-Security-ECSU-Remote-Desktop-Network/m-p/468781#M195720</link>
    <description>&lt;P&gt;Hi Splunkers,&lt;/P&gt;

&lt;P&gt;I was wading through some of the Enterprise Security correlation searches and I noticed that the &lt;EM&gt;Remote Desktop Network Bruteforce&lt;/EM&gt; search (defined in &lt;EM&gt;$SPLUNK_HOME/etc/apps/DA-ESS-ContentUpdate/default/savedsearches.conf&lt;/EM&gt;) appears to be attempting to identify an anomalous count of RDP network connections by getting a &lt;EM&gt;count&lt;/EM&gt; from &lt;EM&gt;tstats&lt;/EM&gt;, then checking if 'count&amp;gt;(stdev*2)'.&lt;/P&gt;

&lt;P&gt;Now I've never been particularly good at statistics, but I thought that a common method for detecting outliers was to check for values that were more than 2 (or 3) standard deviations &lt;STRONG&gt;from the mean&lt;/STRONG&gt;, rather than more than 2 (or 3) standard deviations from zero?&lt;/P&gt;

&lt;P&gt;Most of the other outlier detection searches that I've seen do 'avg + (2 * stdev)' type constructs (like the &lt;EM&gt;ESCU - SMB Traffic Spike - Rule&lt;/EM&gt; correlation search in that same file for instance), so I tried to find some statistics background information and found &lt;EM&gt;How to Use Statistics to Identify Outliers in Data&lt;/EM&gt;&lt;A href="https://machinelearningmastery.com/how-to-use-statistics-to-identify-outliers-in-data/"&gt;1&lt;/A&gt;, which mentions the 'Standard Deviation Method'. That goes on to say that the data can be 'normalised' so that the mean is zero, which I believe would explain the expression in the correlation search not taking &lt;EM&gt;avg&lt;/EM&gt; in to account, but I can't see anything in that search query to 'normalise' the data (not that I'd know what that looked like, but it is just getting a straight &lt;EM&gt;count&lt;/EM&gt; from &lt;EM&gt;tstats&lt;/EM&gt; so I'm assuming it isn't normalised?).&lt;/P&gt;

&lt;P&gt;Also, to further back up my theory, the &lt;EM&gt;description&lt;/EM&gt; field for the correlation search in the above-mentioned &lt;EM&gt;savedsearches.conf&lt;/EM&gt; file states:&lt;/P&gt;

&lt;BLOCKQUOTE&gt;
&lt;P&gt;This search looks for RDP application network traffic and filters any source/destination pair generating more than twice the standard deviation &lt;STRONG&gt;of the average traffic&lt;/STRONG&gt;&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;

&lt;P&gt;So, thinking that this may actually be a bug, I checked for a later version of the &lt;EM&gt;ES Content Updates&lt;/EM&gt; app (I'm running v1.0.38) and found v1.0.41. Downloading and checking that shows the same potential problem in v1.0.41 too.&lt;/P&gt;

&lt;P&gt;The following UNIX command will show any search string mentioning 'stdev' along with the stanza name (for the search name), for comparison -- some take the &lt;EM&gt;avg&lt;/EM&gt; in to account and some don't:&lt;/P&gt;

&lt;BLOCKQUOTE&gt;
&lt;P&gt;grep "[|[=|][^=|]*stdev" "$SPLUNK_HOME/etc/apps/DA-ESS-ContentUpdate/default/savedsearches.conf" |grep -B 1 "stdev"&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;

&lt;P&gt;This isn't really a problem as such, because I can just redo the correlation search and add the calculated &lt;EM&gt;avg&lt;/EM&gt; field. I'm just after some sort of confirmation as to whether or not the existing search string is correct, as it produces more notable events without using &lt;EM&gt;avg&lt;/EM&gt; than with.&lt;/P&gt;

&lt;P&gt;Thanks,&lt;BR /&gt;
Karl&lt;/P&gt;</description>
    <pubDate>Thu, 29 Aug 2019 05:17:03 GMT</pubDate>
    <dc:creator>grashupfer</dc:creator>
    <dc:date>2019-08-29T05:17:03Z</dc:date>
    <item>
      <title>Is the Enterprise Security 'ECSU - Remote Desktop Network Bruteforce - Rule' correlation search incorrect?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Is-the-Enterprise-Security-ECSU-Remote-Desktop-Network/m-p/468781#M195720</link>
      <description>&lt;P&gt;Hi Splunkers,&lt;/P&gt;

&lt;P&gt;I was wading through some of the Enterprise Security correlation searches and I noticed that the &lt;EM&gt;Remote Desktop Network Bruteforce&lt;/EM&gt; search (defined in &lt;EM&gt;$SPLUNK_HOME/etc/apps/DA-ESS-ContentUpdate/default/savedsearches.conf&lt;/EM&gt;) appears to be attempting to identify an anomalous count of RDP network connections by getting a &lt;EM&gt;count&lt;/EM&gt; from &lt;EM&gt;tstats&lt;/EM&gt;, then checking if 'count&amp;gt;(stdev*2)'.&lt;/P&gt;

&lt;P&gt;Now I've never been particularly good at statistics, but I thought that a common method for detecting outliers was to check for values that were more than 2 (or 3) standard deviations &lt;STRONG&gt;from the mean&lt;/STRONG&gt;, rather than more than 2 (or 3) standard deviations from zero?&lt;/P&gt;

&lt;P&gt;Most of the other outlier detection searches that I've seen do 'avg + (2 * stdev)' type constructs (like the &lt;EM&gt;ESCU - SMB Traffic Spike - Rule&lt;/EM&gt; correlation search in that same file for instance), so I tried to find some statistics background information and found &lt;EM&gt;How to Use Statistics to Identify Outliers in Data&lt;/EM&gt;&lt;A href="https://machinelearningmastery.com/how-to-use-statistics-to-identify-outliers-in-data/"&gt;1&lt;/A&gt;, which mentions the 'Standard Deviation Method'. That goes on to say that the data can be 'normalised' so that the mean is zero, which I believe would explain the expression in the correlation search not taking &lt;EM&gt;avg&lt;/EM&gt; in to account, but I can't see anything in that search query to 'normalise' the data (not that I'd know what that looked like, but it is just getting a straight &lt;EM&gt;count&lt;/EM&gt; from &lt;EM&gt;tstats&lt;/EM&gt; so I'm assuming it isn't normalised?).&lt;/P&gt;

&lt;P&gt;Also, to further back up my theory, the &lt;EM&gt;description&lt;/EM&gt; field for the correlation search in the above-mentioned &lt;EM&gt;savedsearches.conf&lt;/EM&gt; file states:&lt;/P&gt;

&lt;BLOCKQUOTE&gt;
&lt;P&gt;This search looks for RDP application network traffic and filters any source/destination pair generating more than twice the standard deviation &lt;STRONG&gt;of the average traffic&lt;/STRONG&gt;&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;

&lt;P&gt;So, thinking that this may actually be a bug, I checked for a later version of the &lt;EM&gt;ES Content Updates&lt;/EM&gt; app (I'm running v1.0.38) and found v1.0.41. Downloading and checking that shows the same potential problem in v1.0.41 too.&lt;/P&gt;

&lt;P&gt;The following UNIX command will show any search string mentioning 'stdev' along with the stanza name (for the search name), for comparison -- some take the &lt;EM&gt;avg&lt;/EM&gt; in to account and some don't:&lt;/P&gt;

&lt;BLOCKQUOTE&gt;
&lt;P&gt;grep "[|[=|][^=|]*stdev" "$SPLUNK_HOME/etc/apps/DA-ESS-ContentUpdate/default/savedsearches.conf" |grep -B 1 "stdev"&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;

&lt;P&gt;This isn't really a problem as such, because I can just redo the correlation search and add the calculated &lt;EM&gt;avg&lt;/EM&gt; field. I'm just after some sort of confirmation as to whether or not the existing search string is correct, as it produces more notable events without using &lt;EM&gt;avg&lt;/EM&gt; than with.&lt;/P&gt;

&lt;P&gt;Thanks,&lt;BR /&gt;
Karl&lt;/P&gt;</description>
      <pubDate>Thu, 29 Aug 2019 05:17:03 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Is-the-Enterprise-Security-ECSU-Remote-Desktop-Network/m-p/468781#M195720</guid>
      <dc:creator>grashupfer</dc:creator>
      <dc:date>2019-08-29T05:17:03Z</dc:date>
    </item>
  </channel>
</rss>

