<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic strip sensitive data before indexing in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/strip-sensitive-data-before-indexing/m-p/313219#M58709</link>
    <description>&lt;P&gt;Hi all, &lt;/P&gt;

&lt;P&gt;We run several tools in our environment for network inspection and the logging it provides logs things like NTLM creds, HTTP Basic Auth etc. We'd like to strip this data off before indexing, or at least 'sanitize' it so we dont index usernames/passwords. Can anyone point me in the right direction to solve this?&lt;/P&gt;

&lt;P&gt;Example data: &lt;/P&gt;

&lt;P&gt;{&lt;BR /&gt;
   "timestamp":"2018-04-04T09:00:08.085563-0600",&lt;BR /&gt;
   "flow_id":151014950299099,&lt;BR /&gt;
   "in_iface":"asdfasdf",&lt;BR /&gt;
   "event_type":"alert",&lt;BR /&gt;
   "vlan":10,&lt;BR /&gt;
   "src_ip":"x.x.x.x",&lt;BR /&gt;
   "src_port":60130,&lt;BR /&gt;
   "dest_ip":"166.70.63.169",&lt;BR /&gt;
   "dest_port":443,&lt;BR /&gt;
   "proto":"TCP",&lt;BR /&gt;
   "tx_id":0,&lt;BR /&gt;
   "alert":{&lt;BR /&gt;
      "action":"allowed",&lt;BR /&gt;
      "gid":1,&lt;BR /&gt;
      "signature_id":2013928,&lt;BR /&gt;
      "rev":4,&lt;BR /&gt;
      "signature":"ET POLICY HTTP traffic on port 443 (PROPFIND)",&lt;BR /&gt;
      "category":"Potentially Bad Traffic",&lt;BR /&gt;
      "severity":2&lt;BR /&gt;
   },&lt;BR /&gt;
   "http":{&lt;BR /&gt;
      "hostname":"&lt;A href="http://www.somesite.org" target="_blank"&gt;www.somesite.org&lt;/A&gt;",&lt;BR /&gt;
      "url":"\/things\/remote.php\/webdav\/",&lt;BR /&gt;
      "http_user_agent":"Mozilla\/5.0 (Linux) mirall\/2.3.3",&lt;BR /&gt;
      "http_content_type":"application\/xml",&lt;BR /&gt;
      "http_method":"PROPFIND",&lt;BR /&gt;
      "protocol":"HTTP\/1.1",&lt;BR /&gt;
      "status":207,&lt;BR /&gt;
      "length":382&lt;BR /&gt;
   },&lt;BR /&gt;
   "payload_printable":"PROPFIND \/owncloud\/remote.php\/webdav\/ HTTP\/1.1\r\n&lt;BR /&gt;
Depth: 0\r\n&lt;BR /&gt;
Authorization: Basic REDACTEDBASE64PASSWORDHERE==\r\n&lt;BR /&gt;
User-Agent: Mozilla\/5.0 (Linux) mirall\/2.3.3\r\n&lt;BR /&gt;
Accept: &lt;EM&gt;\/&lt;/EM&gt;\r\n&lt;BR /&gt;
Content-Type: text\/xml; charset=utf-8\r\nCookie: oc_sessionPassphrase=redacted \r\nContent-Length: 105\r\n&lt;BR /&gt;
Connection: Keep-Alive\r\n&lt;BR /&gt;
Accept-Encoding: gzip, deflate\r\nAccept-Language: en-US,*\r\n&lt;BR /&gt;
Host: &lt;A href="http://www.somesite.org%5Cr%5Cn%5Cr%5Cn%5Cn%5Cn" target="_blank"&gt;www.somesite.org\r\n\r\n\n\n&lt;/A&gt; \n \n &amp;lt;\/d:prop&amp;gt;\n&amp;lt;\/d:propfind&amp;gt;\n",&lt;BR /&gt;
   "stream":1&lt;BR /&gt;
}&lt;/P&gt;</description>
    <pubDate>Tue, 29 Sep 2020 18:52:17 GMT</pubDate>
    <dc:creator>zhatsispgx</dc:creator>
    <dc:date>2020-09-29T18:52:17Z</dc:date>
    <item>
      <title>strip sensitive data before indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/strip-sensitive-data-before-indexing/m-p/313219#M58709</link>
      <description>&lt;P&gt;Hi all, &lt;/P&gt;

&lt;P&gt;We run several tools in our environment for network inspection and the logging it provides logs things like NTLM creds, HTTP Basic Auth etc. We'd like to strip this data off before indexing, or at least 'sanitize' it so we dont index usernames/passwords. Can anyone point me in the right direction to solve this?&lt;/P&gt;

&lt;P&gt;Example data: &lt;/P&gt;

&lt;P&gt;{&lt;BR /&gt;
   "timestamp":"2018-04-04T09:00:08.085563-0600",&lt;BR /&gt;
   "flow_id":151014950299099,&lt;BR /&gt;
   "in_iface":"asdfasdf",&lt;BR /&gt;
   "event_type":"alert",&lt;BR /&gt;
   "vlan":10,&lt;BR /&gt;
   "src_ip":"x.x.x.x",&lt;BR /&gt;
   "src_port":60130,&lt;BR /&gt;
   "dest_ip":"166.70.63.169",&lt;BR /&gt;
   "dest_port":443,&lt;BR /&gt;
   "proto":"TCP",&lt;BR /&gt;
   "tx_id":0,&lt;BR /&gt;
   "alert":{&lt;BR /&gt;
      "action":"allowed",&lt;BR /&gt;
      "gid":1,&lt;BR /&gt;
      "signature_id":2013928,&lt;BR /&gt;
      "rev":4,&lt;BR /&gt;
      "signature":"ET POLICY HTTP traffic on port 443 (PROPFIND)",&lt;BR /&gt;
      "category":"Potentially Bad Traffic",&lt;BR /&gt;
      "severity":2&lt;BR /&gt;
   },&lt;BR /&gt;
   "http":{&lt;BR /&gt;
      "hostname":"&lt;A href="http://www.somesite.org" target="_blank"&gt;www.somesite.org&lt;/A&gt;",&lt;BR /&gt;
      "url":"\/things\/remote.php\/webdav\/",&lt;BR /&gt;
      "http_user_agent":"Mozilla\/5.0 (Linux) mirall\/2.3.3",&lt;BR /&gt;
      "http_content_type":"application\/xml",&lt;BR /&gt;
      "http_method":"PROPFIND",&lt;BR /&gt;
      "protocol":"HTTP\/1.1",&lt;BR /&gt;
      "status":207,&lt;BR /&gt;
      "length":382&lt;BR /&gt;
   },&lt;BR /&gt;
   "payload_printable":"PROPFIND \/owncloud\/remote.php\/webdav\/ HTTP\/1.1\r\n&lt;BR /&gt;
Depth: 0\r\n&lt;BR /&gt;
Authorization: Basic REDACTEDBASE64PASSWORDHERE==\r\n&lt;BR /&gt;
User-Agent: Mozilla\/5.0 (Linux) mirall\/2.3.3\r\n&lt;BR /&gt;
Accept: &lt;EM&gt;\/&lt;/EM&gt;\r\n&lt;BR /&gt;
Content-Type: text\/xml; charset=utf-8\r\nCookie: oc_sessionPassphrase=redacted \r\nContent-Length: 105\r\n&lt;BR /&gt;
Connection: Keep-Alive\r\n&lt;BR /&gt;
Accept-Encoding: gzip, deflate\r\nAccept-Language: en-US,*\r\n&lt;BR /&gt;
Host: &lt;A href="http://www.somesite.org%5Cr%5Cn%5Cr%5Cn%5Cn%5Cn" target="_blank"&gt;www.somesite.org\r\n\r\n\n\n&lt;/A&gt; \n \n &amp;lt;\/d:prop&amp;gt;\n&amp;lt;\/d:propfind&amp;gt;\n",&lt;BR /&gt;
   "stream":1&lt;BR /&gt;
}&lt;/P&gt;</description>
      <pubDate>Tue, 29 Sep 2020 18:52:17 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/strip-sensitive-data-before-indexing/m-p/313219#M58709</guid>
      <dc:creator>zhatsispgx</dc:creator>
      <dc:date>2020-09-29T18:52:17Z</dc:date>
    </item>
    <item>
      <title>Re: strip sensitive data before indexing</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/strip-sensitive-data-before-indexing/m-p/313220#M58710</link>
      <description>&lt;P&gt;See &lt;A href="http://docs.splunk.com/Documentation/Splunk/7.0.3/Data/Anonymizedata"&gt;http://docs.splunk.com/Documentation/Splunk/7.0.3/Data/Anonymizedata&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 04 Apr 2018 16:29:07 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/strip-sensitive-data-before-indexing/m-p/313220#M58710</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2018-04-04T16:29:07Z</dc:date>
    </item>
  </channel>
</rss>

