<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Is it possible to pseudonymize incoming data in Splunk? in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/236393#M45950</link>
    <description>&lt;P&gt;i think, the "Anonymize data" splunk document produces the exact output..&lt;BR /&gt;
&lt;A href="https://docs.splunk.com/Documentation/Splunk/7.1.2/Data/Anonymizedata"&gt;https://docs.splunk.com/Documentation/Splunk/7.1.2/Data/Anonymizedata&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;For example, if you have a log file called accounts.log that contains Social Security and credit card numbers:&lt;/P&gt;

&lt;P&gt;...&lt;CODE&gt;&lt;BR /&gt;
ss=123456789, cc=1234-5678-9012-3456&lt;BR /&gt;
ss=123456790, cc=2234-5678-9012-3457&lt;BR /&gt;
ss=123456791, cc=3234-5678-9012-3458&lt;BR /&gt;
ss=123456792, cc=4234-5678-9012-3459&lt;BR /&gt;
...&lt;/CODE&gt;&lt;BR /&gt;
And you want to mask the fields, so that they appear like this:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;...
ss=XXXXX6789, cc=XXXX-XXXX-XXXX-3456
ss=XXXXX6790, cc=XXXX-XXXX-XXXX-3457
ss=XXXXX6791, cc=XXXX-XXXX-XXXX-3458
ss=XXXXX6792, cc=XXXX-XXXX-XXXX-3459
... 
&lt;/CODE&gt;&lt;/PRE&gt;</description>
    <pubDate>Tue, 21 Aug 2018 09:09:26 GMT</pubDate>
    <dc:creator>inventsekar</dc:creator>
    <dc:date>2018-08-21T09:09:26Z</dc:date>
    <item>
      <title>Is it possible to pseudonymize incoming data in Splunk?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/236389#M45946</link>
      <description>&lt;P&gt;Hi forum,&lt;/P&gt;

&lt;P&gt;I would like to know if and how it is possible to pseudonymise incoming data in Splunk. I know that I can anonymize data by applying a regex for an incoming sourcetype. &lt;/P&gt;

&lt;P&gt;This procedure is removing information from the data. I would need something like applying a hash function to a certain type of data at parsing/index time.&lt;/P&gt;

&lt;P&gt;Thanks for your help in advance,&lt;/P&gt;

&lt;P&gt;Andreas&lt;/P&gt;</description>
      <pubDate>Tue, 23 Aug 2016 19:48:05 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/236389#M45946</guid>
      <dc:creator>schose</dc:creator>
      <dc:date>2016-08-23T19:48:05Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to pseudonymize incoming data in Splunk?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/236390#M45947</link>
      <description>&lt;P&gt;This sounds like a job for SEDCMD in props.conf. I don't have an exact answer for you, but here are some breadcrumbs. &lt;/P&gt;

&lt;P&gt;&lt;A href="https://answers.splunk.com/answers/210096/how-to-configure-sedcmd-in-propsconf.html"&gt;https://answers.splunk.com/answers/210096/how-to-configure-sedcmd-in-propsconf.html&lt;/A&gt;&lt;BR /&gt;
&lt;A href="https://answers.splunk.com/answers/323853/masking-ip-in-propsconf-using-sedcmd.html"&gt;https://answers.splunk.com/answers/323853/masking-ip-in-propsconf-using-sedcmd.html&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 24 Aug 2016 16:10:02 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/236390#M45947</guid>
      <dc:creator>JDukeSplunk</dc:creator>
      <dc:date>2016-08-24T16:10:02Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to pseudonymize incoming data in Splunk?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/236391#M45948</link>
      <description>&lt;P&gt;No such a built-in feature in Splunk as of now. I recommend to file an enhancement request .&lt;BR /&gt;&lt;BR /&gt;
It is good to provide good use case when you file an enhancement request. &lt;/P&gt;</description>
      <pubDate>Thu, 01 Sep 2016 01:00:08 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/236391#M45948</guid>
      <dc:creator>Masa</dc:creator>
      <dc:date>2016-09-01T01:00:08Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to pseudonymize incoming data in Splunk?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/236392#M45949</link>
      <description>&lt;P&gt;Hi there,&lt;/P&gt;

&lt;P&gt;I face the same issue/requirement. A good use case is nowadays when we use Splunk on sensitive incoming data that needs pseudonymisation, in order to be compliant with the European General Data Protection Regulation (GDPR).&lt;/P&gt;

&lt;P&gt;Regards,&lt;/P&gt;</description>
      <pubDate>Tue, 21 Aug 2018 08:52:16 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/236392#M45949</guid>
      <dc:creator>fbourel</dc:creator>
      <dc:date>2018-08-21T08:52:16Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to pseudonymize incoming data in Splunk?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/236393#M45950</link>
      <description>&lt;P&gt;i think, the "Anonymize data" splunk document produces the exact output..&lt;BR /&gt;
&lt;A href="https://docs.splunk.com/Documentation/Splunk/7.1.2/Data/Anonymizedata"&gt;https://docs.splunk.com/Documentation/Splunk/7.1.2/Data/Anonymizedata&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;For example, if you have a log file called accounts.log that contains Social Security and credit card numbers:&lt;/P&gt;

&lt;P&gt;...&lt;CODE&gt;&lt;BR /&gt;
ss=123456789, cc=1234-5678-9012-3456&lt;BR /&gt;
ss=123456790, cc=2234-5678-9012-3457&lt;BR /&gt;
ss=123456791, cc=3234-5678-9012-3458&lt;BR /&gt;
ss=123456792, cc=4234-5678-9012-3459&lt;BR /&gt;
...&lt;/CODE&gt;&lt;BR /&gt;
And you want to mask the fields, so that they appear like this:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;...
ss=XXXXX6789, cc=XXXX-XXXX-XXXX-3456
ss=XXXXX6790, cc=XXXX-XXXX-XXXX-3457
ss=XXXXX6791, cc=XXXX-XXXX-XXXX-3458
ss=XXXXX6792, cc=XXXX-XXXX-XXXX-3459
... 
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Tue, 21 Aug 2018 09:09:26 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/236393#M45950</guid>
      <dc:creator>inventsekar</dc:creator>
      <dc:date>2018-08-21T09:09:26Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to pseudonymize incoming data in Splunk?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/236394#M45951</link>
      <description>&lt;P&gt;The need here is to pseudonymize and not anonymise which is different. Therefore the need is to be able to trace someone uniquely regardless of who he is namely. Anonymisation will lose traceability between events by replacing valuable information with "just" XXXX characters.&lt;/P&gt;

&lt;P&gt;Regards,&lt;/P&gt;</description>
      <pubDate>Tue, 21 Aug 2018 09:32:45 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/236394#M45951</guid>
      <dc:creator>fbourel</dc:creator>
      <dc:date>2018-08-21T09:32:45Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to pseudonymize incoming data in Splunk?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/236395#M45952</link>
      <description>&lt;P&gt;how you can pseudonymize?!?! i mean, you want to pseudonymize only one string (only one ip address or SSN number, etc) or multiple strings?!?! i think you need to create "tokens" manually and using this token, do anonymize manually.. &lt;/P&gt;

&lt;P&gt;For other readers, this will help others to understand pseudonymization VS anonymization - &lt;BR /&gt;
&lt;A href="https://www.protegrity.com/pseudonymization-vs-anonymization-help-gdpr/"&gt;https://www.protegrity.com/pseudonymization-vs-anonymization-help-gdpr/&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 21 Aug 2018 10:00:52 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/236395#M45952</guid>
      <dc:creator>inventsekar</dc:creator>
      <dc:date>2018-08-21T10:00:52Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to pseudonymize incoming data in Splunk?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/236396#M45953</link>
      <description>&lt;P&gt;Thanks for the link and its clarity.&lt;/P&gt;

&lt;P&gt;Pseudonymisation in Splunk is not built-in, so one must rely on external programs to pseudonymise incoming raw data (one or several strings). I have found a Splunk app related to that issue: &lt;A href="https://splunkbase.splunk.com/app/282/"&gt;https://splunkbase.splunk.com/app/282/&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;I have also found a talk at the Splunk Conf 2017 clearly addressing the problem and the possible solutions :&lt;/P&gt;

&lt;UL&gt;
&lt;LI&gt;link: &lt;A href="http://conf.splunk.com/sessions/2017-sessions.html#search=obfuscation"&gt;http://conf.splunk.com/sessions/2017-sessions.html#search=obfuscation&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;pdf: &lt;A href="https://conf.splunk.com/files/2017/slides/data-obfuscation-and-field-protection-in-splunk.pdf"&gt;https://conf.splunk.com/files/2017/slides/data-obfuscation-and-field-protection-in-splunk.pdf&lt;/A&gt;&lt;/LI&gt;
&lt;/UL&gt;

&lt;P&gt;Personally, I have the possibility to pseudonymize the input data before any Splunk indexation, so maybe I'll head that way for now.&lt;/P&gt;</description>
      <pubDate>Tue, 21 Aug 2018 12:00:49 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/236396#M45953</guid>
      <dc:creator>fbourel</dc:creator>
      <dc:date>2018-08-21T12:00:49Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to pseudonymize incoming data in Splunk?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/236397#M45954</link>
      <description>&lt;P&gt;Use INGEST_EVAL and cryptographic functions to create a hash at index time.&lt;/P&gt;</description>
      <pubDate>Mon, 06 May 2019 04:54:24 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/236397#M45954</guid>
      <dc:creator>jpass</dc:creator>
      <dc:date>2019-05-06T04:54:24Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to pseudonymize incoming data in Splunk?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/512993#M86989</link>
      <description>&lt;P&gt;Is it still the same in 2020,&amp;nbsp; has the capability been enabled in Splunk for pseudonymization ?&amp;nbsp;&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 07 Aug 2020 12:46:29 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/512993#M86989</guid>
      <dc:creator>ankitsync</dc:creator>
      <dc:date>2020-08-07T12:46:29Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to pseudonymize incoming data in Splunk?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/513018#M86993</link>
      <description>&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;you should add the idea here&amp;nbsp;&lt;A href="https://ideas.splunk.com/ideas" target="_blank"&gt;https://ideas.splunk.com/ideas&lt;/A&gt;&lt;/P&gt;&lt;P&gt;r. Ismo&lt;/P&gt;</description>
      <pubDate>Fri, 07 Aug 2020 14:46:14 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/513018#M86993</guid>
      <dc:creator>isoutamo</dc:creator>
      <dc:date>2020-08-07T14:46:14Z</dc:date>
    </item>
    <item>
      <title>Re: Is it possible to pseudonymize incoming data in Splunk?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/513115#M87008</link>
      <description>&lt;P&gt;&lt;A href="https://community.splunk.com/t5/Getting-Data-In/Anonymize-data-from-JSON-File/m-p/502197/highlight/true#M85571" target="_blank"&gt;https://community.splunk.com/t5/Getting-Data-In/Anonymize-data-from-JSON-File/m-p/502197/highlight/true#M85571&lt;/A&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;INGEST_EVAL can work for it.&lt;/P&gt;</description>
      <pubDate>Sat, 08 Aug 2020 00:15:03 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Is-it-possible-to-pseudonymize-incoming-data-in-Splunk/m-p/513115#M87008</guid>
      <dc:creator>to4kawa</dc:creator>
      <dc:date>2020-08-08T00:15:03Z</dc:date>
    </item>
  </channel>
</rss>

