Splunk Search

How to convert a binary MS Word doc into text so I can Splunk it?

maverick
Splunk Employee

I have some medical records in .doc form, the binary format created by Microsoft Office Word.

I want to chart the dates, times, doctors, and patient visits in a Splunk dashboard for the doctors.

Is there a way to convert my binary documents into text-based documents so I can Splunk them?

Is this possible?

lguinn2
Legend

It is trivial to write a Word VBA macro that re-saves a document under the same name, but in Unicode text format with a .txt extension. In fact, here it is:

Public Sub SaveAsUnicodeFile()
    Dim newName As String
    Dim pos As Integer
    ' Strip the extension (everything from the last "." onward) from the document name.
    pos = InStrRev(ActiveDocument.Name, ".")
    If pos > 0 Then
        newName = Left$(ActiveDocument.Name, pos - 1)
    Else
        newName = ActiveDocument.Name
    End If
    ' Save next to the original, with a .txt extension, in Unicode text format.
    newName = ActiveDocument.Path & "\" & newName & ".txt"
    Call ActiveDocument.SaveAs(FileName:=newName, FileFormat:=wdFormatUnicodeText)
End Sub

This version of the macro re-saves the active document, but you could change it so that the name of the file to save is supplied as an argument...
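
A minimal sketch of that argument-taking variant follows. It is an untested illustration, not part of the original macro: the name SaveFileAsUnicodeText and the parameter docPath are made up here, and it assumes the macro runs inside Word so that Documents.Open and the wdFormatUnicodeText constant are available.

```vba
' Hypothetical variant of SaveAsUnicodeFile: the document to convert is
' passed in as a full path instead of being taken from ActiveDocument.
Public Sub SaveFileAsUnicodeText(ByVal docPath As String)
    Dim doc As Document
    Dim newName As String
    Dim dotPos As Long
    Dim slashPos As Long

    ' Only treat a "." as the start of an extension if it comes after the
    ' last "\", so a dot in a folder name is not mistaken for one.
    dotPos = InStrRev(docPath, ".")
    slashPos = InStrRev(docPath, "\")
    If dotPos > slashPos Then
        newName = Left$(docPath, dotPos - 1) & ".txt"
    Else
        newName = docPath & ".txt"
    End If

    ' Open the document, re-save it as Unicode text, then close it
    ' without prompting to save again.
    Set doc = Documents.Open(FileName:=docPath, AddToRecentFiles:=False)
    doc.SaveAs FileName:=newName, FileFormat:=wdFormatUnicodeText
    doc.Close SaveChanges:=wdDoNotSaveChanges
End Sub
```

It could then be called once per file, for example SaveFileAsUnicodeText "C:\records\visit1.doc" (a made-up path), or from a loop over a folder of documents.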

yannK
Splunk Employee

What about exporting to RTF?

maverick
Splunk Employee

Not sure if this is what you are asking for, but it appears to me to be worth trying:

http://www.cometdocs.com/
