Splunk Search

How does splunk handle UTF8 and non UTF8 in the same event?

BobM
Builder

My client has a conversion program that takes ISO8859 text from round the world and converts it to UTF-8. Another one does the opposite. It is possible to get an event something like

01/02/12 01:23:45 converted "ISO-8859-8 characters" to "UTF-8 equivalent" 

Where "ISO-8859-8 characters" could be Chinese or any foreign characters. How would splunk handle this mixed character set data?
And before you say try it, I don't have any example data to test yet.

Tags (2)

gkanapathy
Splunk Employee
Splunk Employee

I don't know, but probably you're going to get one or both strings mis-decoded. Splunk can read different character sets, but I suspect it is going to try to determine one character set for each source file.

0 Karma
Get Updates on the Splunk Community!

Building Reliable Asset and Identity Frameworks in Splunk ES

 Accurate asset and identity resolution is the backbone of security operations. Without it, alerts are ...

Cloud Monitoring Console - Unlocking Greater Visibility in SVC Usage Reporting

For Splunk Cloud customers, understanding and optimizing Splunk Virtual Compute (SVC) usage and resource ...

Automatic Discovery Part 3: Practical Use Cases

If you’ve enabled Automatic Discovery in your install of the Splunk Distribution of the OpenTelemetry ...