<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Ingesting Word document in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Ingesting-Word-document/m-p/517454#M87562</link>
    <description>&lt;P&gt;Thanks for the reply.&amp;nbsp; I am, however, still confused.&amp;nbsp; There are a number of questions about how to ingest with the correct format, e.g.&amp;nbsp;&lt;A href="https://community.splunk.com/t5/Archive/How-to-ingest-doc-format-file-into-splunk-with-correct-format/td-p/488474" target="_blank"&gt;https://community.splunk.com/t5/Archive/How-to-ingest-doc-format-file-into-splunk-with-correct-format/td-p/488474&lt;/A&gt;&lt;/P&gt;&lt;P&gt;As I have stated, I am not concerned with the format within the doc; only the filename is of importance.&lt;/P&gt;</description>
    <pubDate>Wed, 02 Sep 2020 10:03:52 GMT</pubDate>
    <dc:creator>timrich66</dc:creator>
    <dc:date>2020-09-02T10:03:52Z</dc:date>
    <item>
      <title>Ingesting Word document</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Ingesting-Word-document/m-p/516254#M87369</link>
      <description>&lt;P&gt;Hello all,&lt;/P&gt;&lt;P&gt;My latest challenge is to ingest a Word doc into our environment.&amp;nbsp; According to everything I have read so far, this should be straightforward as Splunk can ingest 'any' file.&amp;nbsp; At this point I should point out that I am not concerned about the contents of the file (as this all needs to be obfuscated).&amp;nbsp; I only need to ingest the file to get its name.&amp;nbsp; I am not concerned about whether or not Splunk can read the 'Word' type formatting.&lt;/P&gt;&lt;P&gt;The file is created daily with a name in the format "My Word Doc ddmmyyyy hh mm.doc"&lt;/P&gt;&lt;P&gt;I am only interested in the "ddmmyyyy hh mm" part to ensure that it has been created today.&lt;/P&gt;&lt;P&gt;I cannot get the doc file to ingest at all.&amp;nbsp; Not even in an unformatted state.&amp;nbsp; If I save the file as a ".txt" file, then it is ingested.&amp;nbsp; Unfortunately, the 'save as' option is not an option in production.&lt;/P&gt;&lt;P&gt;I have tried using the 'whitelist=' option without any success.&lt;/P&gt;&lt;P&gt;Can anyone suggest a solution?&amp;nbsp; Is there something in my installation that is stopping Word docs from being ingested?&amp;nbsp; Has anyone else had a similar experience?&lt;/P&gt;&lt;P&gt;Thanks&lt;/P&gt;</description>
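The naming scheme described in this post can be checked without opening the document at all. The following is only a minimal sketch, assuming the name always follows the quoted format "My Word Doc ddmmyyyy hh mm.doc" exactly; the names NAME_RE, parse_stamp and created_today are hypothetical and exist purely for illustration.

```python
import re
from datetime import date, datetime
from typing import Optional

# Assumed pattern, taken from the format quoted in the post:
# "My Word Doc ddmmyyyy hh mm.doc"
NAME_RE = re.compile(r"^My Word Doc (\d{8}) (\d{2}) (\d{2})\.doc$")


def parse_stamp(filename: str) -> Optional[datetime]:
    """Return the timestamp embedded in the file name, or None if it does not match."""
    match = NAME_RE.match(filename)
    if match is None:
        return None
    day_part, hour, minute = match.groups()
    try:
        return datetime.strptime(f"{day_part} {hour} {minute}", "%d%m%Y %H %M")
    except ValueError:
        # The digits were present but do not form a valid date or time.
        return None


def created_today(filename: str, today: Optional[date] = None) -> bool:
    """True when the name carries today's date (or the date passed in)."""
    stamp = parse_stamp(filename)
    if stamp is None:
        return False
    return stamp.date() == (today or date.today())
```

Calling created_today with the file name alone compares against the current local date; passing an explicit date makes the check repeatable.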
      <pubDate>Wed, 26 Aug 2020 12:09:28 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Ingesting-Word-document/m-p/516254#M87369</guid>
      <dc:creator>timrich66</dc:creator>
      <dc:date>2020-08-26T12:09:28Z</dc:date>
    </item>
    <item>
      <title>Re: Ingesting Word document</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Ingesting-Word-document/m-p/516262#M87370</link>
      <description>Your initial assumption is faulty. Splunk cannot ingest *any* file. It can, however, ingest any *text* file. Word files are not text.&lt;BR /&gt;Consider writing a Python script to test for the presence of the file and making it a scripted input.</description>
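The scripted-input idea suggested in this reply could look roughly like the sketch below. It is an illustration under stated assumptions, not a tested production script: the directory /data/reports, the key=value event layout, and the names WATCH_DIR and build_events are all hypothetical. The script only lists file names and never reads the Word file contents.

```python
import time
from pathlib import Path
from typing import Iterable, List

# Hypothetical directory; replace with wherever the daily file is written.
WATCH_DIR = Path("/data/reports")


def build_events(names: Iterable[str], now: float) -> List[str]:
    """Build one key=value event line per .doc file name; contents are never read."""
    events = []
    for name in sorted(names):
        if name.lower().endswith(".doc"):
            events.append(f'{int(now)} check=word_doc file_name="{name}"')
    return events


def main() -> None:
    now = time.time()
    if not WATCH_DIR.is_dir():
        # Report the problem as an event so it is visible in search results.
        print(f'{int(now)} check=word_doc error="directory not found"')
        return
    names = [entry.name for entry in WATCH_DIR.iterdir() if entry.is_file()]
    for line in build_events(names, now):
        print(line)


if __name__ == "__main__":
    main()
```

A scripted input indexes whatever the script writes to standard output, so each printed line would become an event; the script would typically be run on an interval from a script:// stanza in inputs.conf. The date in file_name could then be extracted at search time to confirm that today's file exists.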
      <pubDate>Wed, 26 Aug 2020 12:37:44 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Ingesting-Word-document/m-p/516262#M87370</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2020-08-26T12:37:44Z</dc:date>
    </item>
    <item>
      <title>Re: Ingesting Word document</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Ingesting-Word-document/m-p/517454#M87562</link>
      <description>&lt;P&gt;Thanks for the reply.&amp;nbsp; I am, however, still confused.&amp;nbsp; There are a number of questions about how to ingest with the correct format, e.g.&amp;nbsp;&lt;A href="https://community.splunk.com/t5/Archive/How-to-ingest-doc-format-file-into-splunk-with-correct-format/td-p/488474" target="_blank"&gt;https://community.splunk.com/t5/Archive/How-to-ingest-doc-format-file-into-splunk-with-correct-format/td-p/488474&lt;/A&gt;&lt;/P&gt;&lt;P&gt;As I have stated, I am not concerned with the format within the doc; only the filename is of importance.&lt;/P&gt;</description>
      <pubDate>Wed, 02 Sep 2020 10:03:52 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Ingesting-Word-document/m-p/517454#M87562</guid>
      <dc:creator>timrich66</dc:creator>
      <dc:date>2020-09-02T10:03:52Z</dc:date>
    </item>
    <item>
      <title>Re: Ingesting Word document</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Ingesting-Word-document/m-p/517469#M87564</link>
      <description>&lt;P&gt;Ingesting an entire Word package, possibly several MB, just to find out if a file exists seems wasteful to me.&lt;/P&gt;&lt;P&gt;As I suggested earlier, consider a script to test for the presence of the file and report to Splunk.&lt;/P&gt;</description>
      <pubDate>Wed, 02 Sep 2020 12:46:42 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Ingesting-Word-document/m-p/517469#M87564</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2020-09-02T12:46:42Z</dc:date>
    </item>
    <item>
      <title>Re: Ingesting Word document</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Ingesting-Word-document/m-p/517505#M87572</link>
      <description>&lt;P&gt;The file is tiny.&amp;nbsp; I will look at what other options are available. Thanks&lt;/P&gt;</description>
      <pubDate>Wed, 02 Sep 2020 15:23:26 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Ingesting-Word-document/m-p/517505#M87572</guid>
      <dc:creator>timrich66</dc:creator>
      <dc:date>2020-09-02T15:23:26Z</dc:date>
    </item>
  </channel>
</rss>

