Splunk Search

How does splunk handle UTF8 and non UTF8 in the same event?

BobM
Builder

My client has a conversion program that takes ISO8859 text from round the world and converts it to UTF-8. Another one does the opposite. It is possible to get an event something like

01/02/12 01:23:45 converted "ISO-8859-8 characters" to "UTF-8 equivalent" 

Where "ISO-8859-8 characters" could be Chinese or any foreign characters. How would splunk handle this mixed character set data?
And before you say try it, I don't have any example data to test yet.

Tags (2)

gkanapathy
Splunk Employee
Splunk Employee

I don't know, but probably you're going to get one or both strings mis-decoded. Splunk can read different character sets, but I suspect it is going to try to determine one character set for each source file.

0 Karma
Get Updates on the Splunk Community!

Fun with Regular Expression - multiples of nine

Fun with Regular Expression - multiples of nineThis challenge was first posted on Slack #regex channel ...

[Live Demo] Watch SOC transformation in action with the reimagined Splunk Enterprise ...

Overwhelmed SOC? Splunk ES Has Your Back Tool sprawl, alert fatigue, and endless context switching are making ...

What’s New & Next in Splunk SOAR

Security teams today are dealing with more alerts, more tools, and more pressure than ever.  Join us on ...