Getting Data In

pdf files source input

vcorace
New Member

I downloaded and installed Splunk version 4.3.4, build 136012 successfully on my Windows XP system for evaluation. I
then tried to build an index with readable pdf files and text files, but have not been sucessful. Must be doing something
wrong because the pdf is not being inputed properly. Below is part of an example of what I get from Splunk when I use the pdf for
source input.

I also have not figured out yet how to delete/remove an index and then create a new one.

3:42:04.000 PM \xFDq6\xE9\xF\xFW\xCC`\xEF\xFA\x98\x8Cnj\x11\x1F\x8D\x2\xCF\xDD\xB\x8CQ\x1A\x8\x8ER\xE8G\x99\xDC\x17
".sourcetype=Radiology PDF Options| source=C:\Cognition Result Files\Radiographics PDFs\234_1_109.pdf

Tags (3)
0 Karma

MarioM
Motivator

Splunk won't index PDF only ascii data ie txt. PDF is binary and need pdf reader.

to delete an index http://docs.splunk.com/Documentation/Splunk/4.3.4/Admin/RemovedatafromSplunk#Delete_an_index_entirel...

0 Karma
Get Updates on the Splunk Community!

Data Management Digest – December 2025

Welcome to the December edition of Data Management Digest! As we continue our journey of data innovation, the ...

Index This | What is broken 80% of the time by February?

December 2025 Edition   Hayyy Splunk Education Enthusiasts and the Eternally Curious!    We’re back with this ...

Unlock Faster Time-to-Value on Edge and Ingest Processor with New SPL2 Pipeline ...

Hello Splunk Community,   We're thrilled to share an exciting update that will help you manage your data more ...