<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Anonymous based on scripting in Reporting</title>
    <link>https://community.splunk.com/t5/Reporting/Anonymous-based-on-scripting/m-p/60503#M1317</link>
    <description>&lt;P&gt;Thanks, that is what I guessed. I've opened an enhancement request for this.&lt;/P&gt;</description>
    <pubDate>Tue, 31 Jan 2012 14:36:11 GMT</pubDate>
    <dc:creator>ruisantos</dc:creator>
    <dc:date>2012-01-31T14:36:11Z</dc:date>
    <item>
      <title>Anonymous based on scripting</title>
      <link>https://community.splunk.com/t5/Reporting/Anonymous-based-on-scripting/m-p/60499#M1313</link>
      <description>&lt;P&gt;Is there a way to anonymize data based on a script/function? I want to anonymize data but would like to have a hash that I can still use to run valid reports on it.&lt;/P&gt;

&lt;P&gt;To expand on what I would like to have:&lt;/P&gt;

&lt;P&gt;Currently Splunk allows me to anonymize data like this: &lt;STRONG&gt;e.g. replace 123456789 with XXXXXX789.&lt;/STRONG&gt;&lt;/P&gt;

&lt;P&gt;What I would like is something more like: &lt;STRONG&gt;e.g. replace 123456789 with the result of function md5(123456789)=jf430fj490fj4&lt;/STRONG&gt;&lt;/P&gt;

&lt;P&gt;This would guarantee anonymity and uniqueness for reporting.&lt;/P&gt;</description>
      <pubDate>Mon, 30 Jan 2012 15:22:21 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Reporting/Anonymous-based-on-scripting/m-p/60499#M1313</guid>
      <dc:creator>ruisantos</dc:creator>
      <dc:date>2012-01-30T15:22:21Z</dc:date>
    </item>
    <item>
      <title>Re: Anonymous based on scripting</title>
      <link>https://community.splunk.com/t5/Reporting/Anonymous-based-on-scripting/m-p/60500#M1314</link>
      <description>&lt;P&gt;Yes you can: &lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/Data/Anonymizedatausingconfigurationfiles"&gt;http://docs.splunk.com/Documentation/Splunk/latest/Data/Anonymizedatausingconfigurationfiles&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 30 Jan 2012 20:41:44 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Reporting/Anonymous-based-on-scripting/m-p/60500#M1314</guid>
      <dc:creator>Kate_Lawrence-G</dc:creator>
      <dc:date>2012-01-30T20:41:44Z</dc:date>
    </item>
    <item>
      <title>Re: Anonymous based on scripting</title>
      <link>https://community.splunk.com/t5/Reporting/Anonymous-based-on-scripting/m-p/60501#M1315</link>
      <description>&lt;P&gt;I saw that document. But that document performs a general replacement of characters.&lt;/P&gt;

&lt;P&gt;e.g. replace 123456789 with XXXXXX789.&lt;/P&gt;

&lt;P&gt;What I would like is something more like:&lt;/P&gt;

&lt;P&gt;e.g. replace 123456789 with the result of md5(123456789)=jf430fj490fj4&lt;/P&gt;

&lt;P&gt;This would guarantee anonymity and uniqueness for reporting.&lt;/P&gt;</description>
      <pubDate>Tue, 31 Jan 2012 09:46:45 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Reporting/Anonymous-based-on-scripting/m-p/60501#M1315</guid>
      <dc:creator>ruisantos</dc:creator>
      <dc:date>2012-01-31T09:46:45Z</dc:date>
    </item>
    <item>
      <title>Re: Anonymous based on scripting</title>
      <link>https://community.splunk.com/t5/Reporting/Anonymous-based-on-scripting/m-p/60502#M1316</link>
      <description>&lt;P&gt;Hmm...I don't think that is something you can do natively in Splunk. The anonymize data function is limited to replacement/character substitution through either SED or REGEX.&lt;/P&gt;
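
&lt;P&gt;To make the limitation concrete, here is a rough sketch of what that kind of regex substitution amounts to, written in plain Python rather than Splunk configuration. The function name and the nine-digit pattern are assumptions made up for illustration; they are not taken from the Splunk docs.&lt;/P&gt;

```python
import re


def mask_leading_digits(text: str) -> str:
    """Mask the first six digits of every nine-digit run, keeping the last three."""
    # Assumed pattern: a standalone run of exactly nine digits is sensitive.
    return re.sub(r"\b\d{6}(\d{3})\b", r"XXXXXX\1", text)


if __name__ == "__main__":
    print(mask_leading_digits("id=123456789"))  # id=XXXXXX789
```

&lt;P&gt;The replacement is a fixed string plus a captured group, so there is no place to call a function such as md5 on the matched value.&lt;/P&gt;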

&lt;P&gt;The closest 3rd party app I see uploaded is: &lt;BR /&gt;
&lt;A href="http://splunk-base.splunk.com/apps/22403/adds-support-for-anonymizing-log-files-at-index-time"&gt;http://splunk-base.splunk.com/apps/22403/adds-support-for-anonymizing-log-files-at-index-time&lt;/A&gt;, but I think that it's probably just character substitutions based on common fields found in data. &lt;/P&gt;

&lt;P&gt;It sounds like you actually want to randomize the data with a hash or some kind of seed so that it's completely unique.&lt;/P&gt;

&lt;P&gt;I &lt;EM&gt;think&lt;/EM&gt; the best bet for this would be a custom Python command that accepts the raw data, applies a specific function to it, and then spits out a new field based on logic external to Splunk. &lt;/P&gt;
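
&lt;P&gt;As a minimal sketch of the hashing logic such a command might wrap: the code below is plain Python only and leaves out the Splunk search command plumbing entirely. The names md5_token and anonymize_line, the optional salt, and the nine-digit pattern are all hypothetical and chosen just for this example.&lt;/P&gt;

```python
import hashlib
import re

# Assumed pattern: treat any standalone run of exactly nine digits as sensitive.
SENSITIVE = re.compile(r"\b\d{9}\b")


def md5_token(value: str, salt: str = "") -> str:
    """Return the hex MD5 digest of salt + value."""
    return hashlib.md5((salt + value).encode("utf-8")).hexdigest()


def anonymize_line(line: str, salt: str = "") -> str:
    """Replace each nine-digit run in the line with its MD5 token."""
    return SENSITIVE.sub(lambda m: md5_token(m.group(0), salt), line)


if __name__ == "__main__":
    # The same input value always maps to the same token,
    # so counts and group-bys on the hashed field still line up.
    print(anonymize_line("user=123456789 action=login"))
    print(anonymize_line("user=123456789 action=logout"))
```

&lt;P&gt;Because the mapping is deterministic, reports that group on the hashed value should still work. The flip side is that anyone who can guess the original value can compute the same token, so a secret salt (or a keyed hash) would be worth considering.&lt;/P&gt;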

&lt;P&gt;Here is the link to the Splunk doc on this:&lt;/P&gt;

&lt;P&gt;&lt;A href="http://docs.splunk.com/Documentation/Splunk/4.3/SearchReference/WriteaPythonsearchcommand#Examples"&gt;http://docs.splunk.com/Documentation/Splunk/4.3/SearchReference/WriteaPythonsearchcommand#Examples&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 31 Jan 2012 14:32:37 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Reporting/Anonymous-based-on-scripting/m-p/60502#M1316</guid>
      <dc:creator>Kate_Lawrence-G</dc:creator>
      <dc:date>2012-01-31T14:32:37Z</dc:date>
    </item>
    <item>
      <title>Re: Anonymous based on scripting</title>
      <link>https://community.splunk.com/t5/Reporting/Anonymous-based-on-scripting/m-p/60503#M1317</link>
      <description>&lt;P&gt;Thanks, that is what I guessed. I've opened an enhancement request for this.&lt;/P&gt;</description>
      <pubDate>Tue, 31 Jan 2012 14:36:11 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Reporting/Anonymous-based-on-scripting/m-p/60503#M1317</guid>
      <dc:creator>ruisantos</dc:creator>
      <dc:date>2012-01-31T14:36:11Z</dc:date>
    </item>
    <item>
      <title>Re: Anonymous based on scripting</title>
      <link>https://community.splunk.com/t5/Reporting/Anonymous-based-on-scripting/m-p/60504#M1318</link>
      <description>&lt;P&gt;I disagree that it would "guarantee" anonymity. Uniqueness, perhaps (as long as you don't manage to create a hash collision), but anonymity? It's just a matter of finding the correct string that produces the given MD5 sum, and for a short numeric value like the one in the example, someone could simply hash every candidate until one matches. The masking approach taken by default in Splunk, on the other hand, alters the string in a way that guarantees that the original data cannot be recreated.&lt;/P&gt;</description>
      <pubDate>Tue, 31 Jan 2012 14:40:53 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Reporting/Anonymous-based-on-scripting/m-p/60504#M1318</guid>
      <dc:creator>Ayn</dc:creator>
      <dc:date>2012-01-31T14:40:53Z</dc:date>
    </item>
  </channel>
</rss>

