Getting Data In

Splunk throws error message when data input tar.gz file contains Simplified Chinese characters(GB2312): Input is not proper UTF-8, indicate encoding!

rsimmons
Splunk Employee
Splunk Employee

Splunk throws error message when data input tar.gz file contains Simplified Chinese characters(GB2312): Input is not proper UTF-8, indicate encoding!

Tags (1)
1 Solution

zliu
Splunk Employee
Splunk Employee

This is a known issue. Bug#SPL-38488.

Workaround: manually extract the CSV files from the tar.gz file and put them in the same data input file path. Splunk will recognize all the CSV files with Chinese file names and all events will be read into Splunk correctly.

View solution in original post

zliu
Splunk Employee
Splunk Employee

This is a known issue. Bug#SPL-38488.

Workaround: manually extract the CSV files from the tar.gz file and put them in the same data input file path. Splunk will recognize all the CSV files with Chinese file names and all events will be read into Splunk correctly.

Get Updates on the Splunk Community!

Data Management Digest – December 2025

Welcome to the December edition of Data Management Digest! As we continue our journey of data innovation, the ...

Index This | What is broken 80% of the time by February?

December 2025 Edition   Hayyy Splunk Education Enthusiasts and the Eternally Curious!    We’re back with this ...

Unlock Faster Time-to-Value on Edge and Ingest Processor with New SPL2 Pipeline ...

Hello Splunk Community,   We're thrilled to share an exciting update that will help you manage your data more ...