Getting Data In

pdf files source input

vcorace
New Member

I downloaded and successfully installed Splunk version 4.3.4, build 136012, on my Windows XP system for evaluation. I
then tried to build an index from readable PDF files and text files, but have not been successful. I must be doing something
wrong, because the PDF is not being ingested properly. Below is part of what Splunk shows when I use the PDF as
source input.

I also have not figured out yet how to delete/remove an index and then create a new one.

3:42:04.000 PM \xFDq6\xE9\xF\xFW\xCC`\xEF\xFA\x98\x8Cnj\x11\x1F\x8D\x2\xCF\xDD\xB\x8CQ\x1A\x8\x8ER\xE8G\x99\xDC\x17
".sourcetype=Radiology PDF Options| source=C:\Cognition Result Files\Radiographics PDFs\234_1_109.pdf

MarioM
Motivator

Splunk won't index PDFs, only plain-text (ASCII) data such as .txt files. A PDF is a binary format, so the text has to be extracted with a PDF reader before Splunk can index it.
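As a rough illustration of that pre-processing step, here is a minimal Python sketch. It assumes the external `pdftotext` tool (part of Poppler) is installed and on the PATH. The function names `is_pdf` and `pdf_to_text` are made up for this example and are not part of Splunk. The idea is to detect PDFs by their file signature, convert them to .txt, and point the Splunk input at the text output instead of the PDFs.

```python
import subprocess
from pathlib import Path

# Most PDF files begin with this signature, e.g. b"%PDF-1.4"
PDF_MAGIC = b"%PDF-"


def is_pdf(path):
    """Return True if the file starts with the PDF signature."""
    with open(path, "rb") as f:
        return f.read(len(PDF_MAGIC)) == PDF_MAGIC


def pdf_to_text(pdf_path, out_dir):
    """Convert one PDF to a .txt file in out_dir.

    Assumption: the external 'pdftotext' command is available.
    Raises CalledProcessError if the conversion fails.
    """
    out_path = Path(out_dir) / (Path(pdf_path).stem + ".txt")
    subprocess.run(
        ["pdftotext", "-layout", str(pdf_path), str(out_path)],
        check=True,
    )
    return out_path
```

You would run `pdf_to_text` over the PDF folder and then configure Splunk to monitor the output folder. Scanned PDFs that contain only images would still need OCR, which this sketch does not cover.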

To delete an index, see http://docs.splunk.com/Documentation/Splunk/4.3.4/Admin/RemovedatafromSplunk#Delete_an_index_entirel...
