Splunk Search

How does splunk handle UTF8 and non UTF8 in the same event?

BobM
Builder

My client has a conversion program that takes ISO8859 text from round the world and converts it to UTF-8. Another one does the opposite. It is possible to get an event something like

01/02/12 01:23:45 converted "ISO-8859-8 characters" to "UTF-8 equivalent" 

Where "ISO-8859-8 characters" could be Chinese or any foreign characters. How would splunk handle this mixed character set data?
And before you say try it, I don't have any example data to test yet.

Tags (2)

gkanapathy
Splunk Employee
Splunk Employee

I don't know, but probably you're going to get one or both strings mis-decoded. Splunk can read different character sets, but I suspect it is going to try to determine one character set for each source file.

0 Karma
Get Updates on the Splunk Community!

Upcoming Webinar: Unmasking Insider Threats with Slunk Enterprise Security’s UEBA

Join us on Wed, Dec 10. at 10AM PST / 1PM EST for a live webinar and demo with Splunk experts! Discover how ...

.conf25 technical session recap of Observability for Gen AI: Monitoring LLM ...

If you’re unfamiliar, .conf is Splunk’s premier event where the Splunk community, customers, partners, and ...

A Season of Skills: New Splunk Courses to Light Up Your Learning Journey

There’s something special about this time of year—maybe it’s the glow of the holidays, maybe it’s the ...