PDF files as source input

vcorace
New Member

I downloaded and installed Splunk version 4.3.4, build 136012, successfully on my Windows XP system for evaluation. I
then tried to build an index from readable PDF files and text files, but have not been successful. I must be doing something
wrong, because the PDF is not being ingested properly. Below is part of an example of what I get from Splunk when I use the PDF as
source input.

I also have not figured out yet how to delete/remove an index and then create a new one.

3:42:04.000 PM \xFDq6\xE9\xF\xFW\xCC`\xEF\xFA\x98\x8Cnj\x11\x1F\x8D\x2\xCF\xDD\xB\x8CQ\x1A\x8\x8ER\xE8G\x99\xDC\x17
".sourcetype=Radiology PDF Options| source=C:\Cognition Result Files\Radiographics PDFs\234_1_109.pdf


MarioM
Motivator

Splunk won't index PDF files, only plain-text (ASCII) data such as .txt files. PDF is a binary format and needs a PDF reader to get at the text.
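
If it helps, here is a rough sketch of a pre-check you could run over a folder before pointing Splunk at it, so that only plain-text files get used as inputs. It is not part of Splunk; the function name `classify` and the 4096-byte sample size are my own choices for illustration. It relies on the fact that PDF files begin with the bytes `%PDF-`, and otherwise uses a simple heuristic (NUL bytes or invalid UTF-8 mean "binary").

```python
import codecs
import sys
from pathlib import Path

PDF_MAGIC = b"%PDF-"  # PDF files start with this header


def classify(path: Path, sample_size: int = 4096) -> str:
    """Return 'pdf', 'binary', or 'text' based on the first bytes of a file.

    Hypothetical helper: a heuristic only, not how Splunk itself decides.
    """
    with path.open("rb") as fh:
        head = fh.read(sample_size)
    if head.startswith(PDF_MAGIC):
        return "pdf"
    if b"\x00" in head:
        # NUL bytes practically never appear in plain-text files
        return "binary"
    try:
        # Incremental decoder tolerates a multi-byte character cut off
        # at the end of the sample (final=False).
        codecs.getincrementaldecoder("utf-8")().decode(head, final=False)
    except UnicodeDecodeError:
        return "binary"
    return "text"


if __name__ == "__main__":
    # Usage: python classify.py <folder>
    folder = Path(sys.argv[1]) if len(sys.argv) > 1 else Path(".")
    for item in sorted(folder.iterdir()):
        if item.is_file():
            print(f"{classify(item):6}  {item}")
```

Anything reported as `pdf` would need its text extracted with a separate PDF-to-text tool first, and the resulting .txt file is what you would give Splunk as the source.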

To delete an index, see http://docs.splunk.com/Documentation/Splunk/4.3.4/Admin/RemovedatafromSplunk#Delete_an_index_entirel...
