<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic UTF-8 characters - how to remove them automatically in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/668852#M112112</link>
    <description>&lt;P&gt;Hi,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am looking for a solution to remove UTF-8 character encoding from the logs&lt;/P&gt;
&lt;P&gt;I have a regular expression that works in the search field, but I would like to find an automated solution for Splunk Cloud.&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;| rex mode=sed "s/\x1B\[[0-9;]*[mK]//g"&lt;/LI-CODE&gt;
&lt;P&gt;Sample log line:&lt;/P&gt;
&lt;P&gt;2023-11-15 11:47:21,605 backend_2023.2.8: &amp;#27;[36mINFO &amp;#27;[0;39m [-dispatcher-7] &amp;#27;[36mvip.service.northbound.MrpServiceakkaAddress=akka://backend, akkaUid=2193530468036521242&amp;#27;[0;39m &amp;#27;[39mMRP Service is alive and active.&amp;#27;[0;39m&lt;/P&gt;
&lt;P&gt;Any idea?&lt;/P&gt;
&lt;P&gt;Thanks for help.&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Thu, 16 Nov 2023 13:06:20 GMT</pubDate>
    <dc:creator>dabroma5</dc:creator>
    <dc:date>2023-11-16T13:06:20Z</dc:date>
    <item>
      <title>UTF-8 characters - how to remove them automatically</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/668852#M112112</link>
      <description>&lt;P&gt;Hi,&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I am looking for a solution to remove UTF-8 character encoding from the logs&lt;/P&gt;
&lt;P&gt;I have a regular expression that works in the search field, but I would like to find an automated solution for Splunk Cloud.&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;| rex mode=sed "s/\x1B\[[0-9;]*[mK]//g"&lt;/LI-CODE&gt;
&lt;P&gt;Sample log line:&lt;/P&gt;
&lt;P&gt;2023-11-15 11:47:21,605 backend_2023.2.8: &amp;#27;[36mINFO &amp;#27;[0;39m [-dispatcher-7] &amp;#27;[36mvip.service.northbound.MrpServiceakkaAddress=akka://backend, akkaUid=2193530468036521242&amp;#27;[0;39m &amp;#27;[39mMRP Service is alive and active.&amp;#27;[0;39m&lt;/P&gt;
&lt;P&gt;Any idea?&lt;/P&gt;
&lt;P&gt;Thanks for help.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 16 Nov 2023 13:06:20 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/668852#M112112</guid>
      <dc:creator>dabroma5</dc:creator>
      <dc:date>2023-11-16T13:06:20Z</dc:date>
    </item>
    <item>
      <title>[Splunk Cloud] UTF-8 characters in a log file - how to remove it automatically</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/668735#M112170</link>
      <description>&lt;P&gt;Hi&lt;/P&gt;
&lt;P&gt;I have a log file which contains&amp;nbsp;UTF-8 characters&lt;/P&gt;
&lt;P&gt;"&lt;SPAN&gt;[&lt;/SPAN&gt;&lt;SPAN class=""&gt;1&lt;/SPAN&gt;&lt;SPAN&gt;;&lt;/SPAN&gt;&lt;SPAN class=""&gt;33mWARN&lt;/SPAN&gt;&lt;SPAN&gt; &amp;#27;[&lt;/SPAN&gt;&lt;SPAN class=""&gt;0&lt;/SPAN&gt;&lt;SPAN&gt;;&lt;/SPAN&gt;&lt;SPAN class=""&gt;39m&lt;/SPAN&gt;&lt;SPAN&gt; [&lt;/SPAN&gt;&lt;SPAN class=""&gt;-dispatcher-6&lt;/SPAN&gt;&lt;SPAN&gt;] &amp;#27;[&lt;/SPAN&gt;&lt;SPAN class=""&gt;&lt;SPAN class=""&gt;36m" and so on.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;Below regex works perfectly, but how to automate this solution&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;| rex mode=sed "s/\x1B\[[0-9;]*[mK]//g"&lt;/LI-CODE&gt;
&lt;P&gt;Thanks for your help.&lt;/P&gt;</description>
      <pubDate>Wed, 15 Nov 2023 15:07:25 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/668735#M112170</guid>
      <dc:creator>dabroma5</dc:creator>
      <dc:date>2023-11-15T15:07:25Z</dc:date>
    </item>
    <item>
      <title>Re: UTF-8 characters - how to remove them automatically</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/668873#M112118</link>
      <description>&lt;P&gt;1. UTF-8 includes normal ASCII range. I don't think that's what you meant by "remove UTF-8 characters". UTF-8 is just an encoding.&lt;/P&gt;&lt;P&gt;2. What you're presenting are so called ANSI escape sequences.&lt;/P&gt;&lt;P&gt;3. Are you sure they are literarily in your logs or do you have them rendered and filtered already?&lt;/P&gt;&lt;P&gt;4. Ugh. Where are you getting those events from? It seems like capturing some terminal input instead of sending events as such. (BTW, you could try setting some dumb terminal type before starting your process so the service doesn't produce such ugly codes).&lt;/P&gt;</description>
      <pubDate>Thu, 16 Nov 2023 13:30:02 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/668873#M112118</guid>
      <dc:creator>PickleRick</dc:creator>
      <dc:date>2023-11-16T13:30:02Z</dc:date>
    </item>
    <item>
      <title>Re: [Splunk Cloud] UTF-8 characters in a log file - how to remove it automatically</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/668894#M112171</link>
      <description>&lt;P&gt;Use that regex in &lt;FONT face="courier new,courier"&gt;SEDCMD&lt;/FONT&gt; in props.conf.&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;[mysourcetype]
SEDCMD-no_UTF-8 = s/\x1B\[[0-9;]*[mK]//g&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 16 Nov 2023 15:54:27 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/668894#M112171</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2023-11-16T15:54:27Z</dc:date>
    </item>
    <item>
      <title>Re: [Splunk Cloud] UTF-8 characters in a log file - how to remove it automatically</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/668970#M112172</link>
      <description>&lt;P&gt;Unfortunately, this is not an option for Splunk Cloud&lt;/P&gt;</description>
      <pubDate>Fri, 17 Nov 2023 08:47:07 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/668970#M112172</guid>
      <dc:creator>dabroma5</dc:creator>
      <dc:date>2023-11-17T08:47:07Z</dc:date>
    </item>
    <item>
      <title>Re: UTF-8 characters - how to remove them automatically</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/668972#M112138</link>
      <description>&lt;P&gt;This is what it looks like straight from the log file:&lt;/P&gt;&lt;P&gt;2023-11-15 11:47:21,605 backend_2023.2.8: &amp;#27;[36mINFO &amp;#27;[0;39m [-dispatcher-7] &amp;#27;[36mvip.service.northbound.MrpService.serverakkaAddress=akka://backend, akkaUid=2193530468036521242&amp;#27;[0;39m &amp;#27;[39mServer is alive - num conns = 0&amp;#27;[0;39m&lt;/P&gt;&lt;P&gt;of course it looks better from the terminal&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="dabroma5_0-1700211280882.png" style="width: 999px;"&gt;&lt;img src="https://community.splunk.com/t5/image/serverpage/image-id/28152i5228806CC7E9D74F/image-size/large?v=v2&amp;amp;px=999" role="button" title="dabroma5_0-1700211280882.png" alt="dabroma5_0-1700211280882.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Fri, 17 Nov 2023 08:57:24 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/668972#M112138</guid>
      <dc:creator>dabroma5</dc:creator>
      <dc:date>2023-11-17T08:57:24Z</dc:date>
    </item>
    <item>
      <title>Re: UTF-8 characters - how to remove them automatically</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/668977#M112140</link>
      <description>&lt;P&gt;To be honest, I'm not fully sure at which step of the pipeline (if any) those non-printable characters are escaped. I'll have to verify it.&lt;/P&gt;&lt;P&gt;But still - it would be best if you could make the source generate logs without the formating codes - they don't belong there. It's a presentation layer, those codes shouldn't be in the log entries.&lt;/P&gt;</description>
      <pubDate>Fri, 17 Nov 2023 09:57:31 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/668977#M112140</guid>
      <dc:creator>PickleRick</dc:creator>
      <dc:date>2023-11-17T09:57:31Z</dc:date>
    </item>
    <item>
      <title>Re: [Splunk Cloud] UTF-8 characters in a log file - how to remove it automatically</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/669009#M112173</link>
      <description>&lt;P&gt;Splunk Cloud fully supports &lt;FONT face="courier new,courier"&gt;SEDCMD&lt;/FONT&gt;.&lt;/P&gt;</description>
      <pubDate>Fri, 17 Nov 2023 13:23:47 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/669009#M112173</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2023-11-17T13:23:47Z</dc:date>
    </item>
    <item>
      <title>Re: [Splunk Cloud] UTF-8 characters in a log file - how to remove it automatically</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/669018#M112174</link>
      <description>&lt;P&gt;which one should I move to /opt/splunkforwarder/etc/system/local , and edit:&lt;/P&gt;&lt;P&gt;/opt/splunkforwarder/etc/system/default/props.conf&lt;BR /&gt;/opt/splunkforwarder/etc/apps/search/default/props.conf&lt;BR /&gt;/opt/splunkforwarder/etc/apps/splunk_internal_metrics/default/props.conf&lt;BR /&gt;/opt/splunkforwarder/etc/apps/learned/local/props.conf&lt;BR /&gt;/opt/splunkforwarder/etc/apps/SplunkUniversalForwarder/default/props.conf&lt;BR /&gt;/opt/splunkforwarder/var/run/splunk/confsnapshot/baseline_local/apps/learned/local/props.conf&lt;BR /&gt;/opt/splunkforwarder/var/run/splunk/confsnapshot/baseline_default/system/default/props.conf&lt;BR /&gt;/opt/splunkforwarder/var/run/splunk/confsnapshot/baseline_default/apps/search/default/props.conf&lt;BR /&gt;/opt/splunkforwarder/var/run/splunk/confsnapshot/baseline_default/apps/splunk_internal_metrics/default/props.conf&lt;BR /&gt;/opt/splunkforwarder/var/run/splunk/confsnapshot/baseline_default/apps/SplunkUniversalForwarder/default/props.conf&lt;/P&gt;</description>
      <pubDate>Fri, 17 Nov 2023 14:03:49 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/669018#M112174</guid>
      <dc:creator>dabroma5</dc:creator>
      <dc:date>2023-11-17T14:03:49Z</dc:date>
    </item>
    <item>
      <title>Re: [Splunk Cloud] UTF-8 characters in a log file - how to remove it automatically</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/669029#M112175</link>
      <description>&lt;P&gt;None of those.&amp;nbsp; The &lt;FONT face="courier new,courier"&gt;SEDCMD&lt;/FONT&gt; setting must be on the indexer(s) and/or heavy forwarders.&amp;nbsp; It should go in the stanza where the sourcetype it goes with resides (if the file is in a default stanza then put the setting in the associated local directory).&lt;/P&gt;</description>
      <pubDate>Fri, 17 Nov 2023 15:39:59 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/669029#M112175</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2023-11-17T15:39:59Z</dc:date>
    </item>
    <item>
      <title>Re: [Splunk Cloud] UTF-8 characters in a log file - how to remove it automatically</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/669153#M112193</link>
      <description>&lt;P&gt;I can't make it work.&lt;/P&gt;&lt;P&gt;I found some explanation here:&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.splunk.com/t5/Getting-Data-In/How-to-replace-characters-in-logs-using-SEDCMD-in-props-conf-in/m-p/392306" target="_blank"&gt;https://community.splunk.com/t5/Getting-Data-In/How-to-replace-characters-in-logs-using-SEDCMD-in-props-conf-in/m-p/392306&lt;/A&gt;&lt;/P&gt;&lt;P&gt;but they said the change should be made in HF props.conf&lt;/P&gt;&lt;P&gt;I need to make it work on UF for Splunk Cloud&lt;/P&gt;</description>
      <pubDate>Mon, 20 Nov 2023 08:31:07 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/UTF-8-characters-how-to-remove-them-automatically/m-p/669153#M112193</guid>
      <dc:creator>dabroma5</dc:creator>
      <dc:date>2023-11-20T08:31:07Z</dc:date>
    </item>
  </channel>
</rss>

