Splunk Search

How to convert binary ms word doc into text so I can splunk it?

maverick
Splunk Employee
Splunk Employee

I currently have some medical records in doc form that are binary text created in ms office word.

I want to create dates, times, doctor, patient visits into charts for the doctors within a dashboard in Splunk.

Is there a way that I can convert my binary document into a text based document so I can splunk it?

Is this possible?

0 Karma

lguinn2
Legend

It would be trivial to write a Microsoft VBA macro that simply re-saved a file with the same name, but in Unicode text format with a .txt extension. In fact, here it is

Public Sub SaveAsUnicodeFile()
    Dim newName As String
    Dim pos As Integer  
    pos = InStrRev(ActiveDocument.Name, ".")
    If pos > 0 Then
        newName = Left$(ActiveDocument.Name, pos - 1)
    Else
        newName = ActiveDocument.Name
    End If
    newName = ActiveDocument.Path & "\" & newName & ".txt"  
    Call ActiveDocument.SaveAs(FileName:=newName, FileFormat:=wdFormatUnicodeText)
End Sub

This version of the macro re-saves the active document, but you could change it so that the name of the file to save was supplied as an argument...

0 Karma

yannK
Splunk Employee
Splunk Employee

What about exporting to RTF ?

0 Karma

maverick
Splunk Employee
Splunk Employee

Not sure if this is what you are asking for, but it appears to me to be worth trying:

http://www.cometdocs.com/

0 Karma
Get Updates on the Splunk Community!

Building Reliable Asset and Identity Frameworks in Splunk ES

 Accurate asset and identity resolution is the backbone of security operations. Without it, alerts are ...

Cloud Monitoring Console - Unlocking Greater Visibility in SVC Usage Reporting

For Splunk Cloud customers, understanding and optimizing Splunk Virtual Compute (SVC) usage and resource ...

Automatic Discovery Part 3: Practical Use Cases

If you’ve enabled Automatic Discovery in your install of the Splunk Distribution of the OpenTelemetry ...