Getting Data In

Sharepoint logs are coming in Hex

mikelanghorst
Motivator

Attempting to Splunk Sharepoint 2010 logs but it's unreadable in the UI

0\x004\x00/\x001\x007\x00/\x002\x000\x001...

Using the file command on linux, it says the file is UTF-16 Little Endian. Trying to set that charset on the sourcetype doesn't have any effect. In fact seems to conflict in the server, as I get messages that monitor detects UTF-8. Looking at a very old wiki page, mentions
splunk cmd classify

But that classifier is wrong, saying it is UTF-8 binary.

Output of classify:
WARN FileClassifierManager - The file 'FSHPTP02-20130408-1404.log' is invalid. Reason: binary
PROPERTIES OF FSHPTP02-20130408-1404.log
PropertiesMap: {
CHARSET -> UTF-8
invalid_cause -> binary
is_valid -> False
sourcetype -> unknown
}

But the linux file command says otherwise:
[mlanghor@mlanghor-wkstn U]$ file FSHPTP02-20130408-1404.log
FSHPTP02-20130408-1404.log: Little-endian UTF-16 Unicode English text, with very long lines, with CRLF line terminators

mcs24
Explorer

See http://wiki.splunk.com/Community:WindowsCharacterEncoding, which provides a solution for
- Logs coming in as hex
- Logs not monitored with the messages: "TailReader - Ignoring file '' due to: binary" and "FileClassifierManager - The file '' is invalid. Reason: binary"

0 Karma

jmeyers_splunk
Splunk Employee
Splunk Employee
0 Karma

hvandenb
Path Finder

Any resolution on this I have the same issue.

0 Karma
Get Updates on the Splunk Community!

Index This | Why did the turkey cross the road?

November 2025 Edition  Hayyy Splunk Education Enthusiasts and the Eternally Curious!   We’re back with this ...

Enter the Agentic Era with Splunk AI Assistant for SPL 1.4

  🚀 Your data just got a serious AI upgrade — are you ready? Say hello to the Agentic Era with the ...

Feel the Splunk Love: Real Stories from Real Customers

Hello Splunk Community,    What’s the best part of hearing how our customers use Splunk? Easy: the positive ...