Getting Data In

Splunk throws error message when data input tar.gz file contains Simplified Chinese characters(GB2312): Input is not proper UTF-8, indicate encoding!

rsimmons
Splunk Employee
Splunk Employee

Splunk throws error message when data input tar.gz file contains Simplified Chinese characters(GB2312): Input is not proper UTF-8, indicate encoding!

Tags (1)
1 Solution

zliu
Splunk Employee
Splunk Employee

This is a known issue. Bug#SPL-38488.

Workaround: manually extract the CSV files from the tar.gz file and put them in the same data input file path. Splunk will recognize all the CSV files with Chinese file names and all events will be read into Splunk correctly.

View solution in original post

zliu
Splunk Employee
Splunk Employee

This is a known issue. Bug#SPL-38488.

Workaround: manually extract the CSV files from the tar.gz file and put them in the same data input file path. Splunk will recognize all the CSV files with Chinese file names and all events will be read into Splunk correctly.

Get Updates on the Splunk Community!

Index This | What is broken 80% of the time by February?

December 2025 Edition   Hayyy Splunk Education Enthusiasts and the Eternally Curious!    We’re back with this ...

Unlock Faster Time-to-Value on Edge and Ingest Processor with New SPL2 Pipeline ...

Hello Splunk Community,   We're thrilled to share an exciting update that will help you manage your data more ...

Splunk MCP & Agentic AI: Machine Data Without Limits

Discover how the Splunk Model Context Protocol (MCP) Server can revolutionize the way your organization uses ...