Splunk throws an error message when a tar.gz data input file contains Simplified Chinese (GB2312) characters: "Input is not proper UTF-8, indicate encoding!"
This is a known issue, tracked as bug SPL-38488.
Workaround: manually extract the CSV files from the tar.gz file and place them in the same data input path. Splunk then recognizes all the CSV files with Chinese file names and reads all events correctly.
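If there are many archives, the manual extraction step could be scripted. Below is a minimal Python sketch, not an official Splunk tool: the function name extract_csvs, its parameters, and the paths in the usage note are hypothetical, and it assumes the member names in the archive are GB2312-encoded. It copies only the .csv members, flattened into the destination directory, and leaves the file contents byte-for-byte unchanged.

```python
import os
import shutil
import tarfile


def extract_csvs(archive_path, dest_dir, name_encoding="gb2312"):
    """Copy every .csv member of a tar.gz archive into dest_dir.

    name_encoding is an assumption about how member names were stored;
    adjust it if the archive was created with a different encoding.
    """
    os.makedirs(dest_dir, exist_ok=True)
    extracted = []
    with tarfile.open(archive_path, "r:gz", encoding=name_encoding) as archive:
        for member in archive:
            # Skip directories, links, and anything that is not a CSV file.
            if not member.isfile() or not member.name.lower().endswith(".csv"):
                continue
            # Keep only the base name so nothing is written outside dest_dir.
            target = os.path.join(dest_dir, os.path.basename(member.name))
            with archive.extractfile(member) as src, open(target, "wb") as dst:
                shutil.copyfileobj(src, dst)
            extracted.append(target)
    return extracted
```

For example, extract_csvs("/data/incoming/export.tar.gz", "/data/incoming") would place the CSV files next to the archive, where the existing data input can pick them up. Both paths are placeholders for your own input path.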