<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Does Splunk have OCR capability? in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Does-Splunk-have-OCR-capability/m-p/359017#M65510</link>
    <description>&lt;P&gt;Apparently Splunk handles just text/character data. The following 2015 thread says "ASCII", but that's obviously an archaic reference to some old-school, last-century version of UTF-8 that only old fogies like me (and the never-forgetting Internet) have heard of...&lt;/P&gt;

&lt;P&gt;&lt;A href="https://answers.splunk.com/answers/263997/db-connect-blob-object-search.html"&gt;https://answers.splunk.com/answers/263997/db-connect-blob-object-search.html&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;There are some useful suggestions and opinions there, well worth reviewing.&lt;/P&gt;</description>
    <pubDate>Tue, 02 May 2017 19:51:29 GMT</pubDate>
    <dc:creator>DalJeanis</dc:creator>
    <dc:date>2017-05-02T19:51:29Z</dc:date>
    <item>
      <title>Does Splunk have OCR capability?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Does-Splunk-have-OCR-capability/m-p/359016#M65509</link>
      <description>&lt;P&gt;I am trying to use Splunk for entity extraction or text mining. I have a huge number of PDF, TIFF, and HTML files that I need to upload to Splunk, parse, and extract useful text information from. I then need to retrieve specific parts of that information.&lt;/P&gt;

&lt;P&gt;Does Splunk have OCR (Optical Character Recognition) capability? How do I upload these files into Splunk and extract desired tags or text from the file?&lt;/P&gt;</description>
      <pubDate>Tue, 02 May 2017 16:30:33 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Does-Splunk-have-OCR-capability/m-p/359016#M65509</guid>
      <dc:creator>vsabbis</dc:creator>
      <dc:date>2017-05-02T16:30:33Z</dc:date>
    </item>
    <item>
      <title>Re: Does Splunk have OCR capability?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Does-Splunk-have-OCR-capability/m-p/359017#M65510</link>
      <description>&lt;P&gt;Apparently Splunk handles just text/character data. The following 2015 thread says "ASCII", but that's obviously an archaic reference to some old-school, last-century version of UTF-8 that only old fogies like me (and the never-forgetting Internet) have heard of...&lt;/P&gt;

&lt;P&gt;&lt;A href="https://answers.splunk.com/answers/263997/db-connect-blob-object-search.html"&gt;https://answers.splunk.com/answers/263997/db-connect-blob-object-search.html&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;There are some useful suggestions and opinions there, well worth reviewing.&lt;/P&gt;</description>
      <pubDate>Tue, 02 May 2017 19:51:29 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Does-Splunk-have-OCR-capability/m-p/359017#M65510</guid>
      <dc:creator>DalJeanis</dc:creator>
      <dc:date>2017-05-02T19:51:29Z</dc:date>
    </item>
    <item>
      <title>Re: Does Splunk have OCR capability?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Does-Splunk-have-OCR-capability/m-p/359018#M65511</link>
      <description>&lt;P&gt;PDF, no&lt;BR /&gt;
TIFF, no&lt;BR /&gt;
HTML, yes &lt;/P&gt;

&lt;P&gt;The best you could do with PDF and TIFF is store them in binary format in Splunk. That is not OCR by any means, but you can use regular expressions to parse data out of the HTML files, which are most likely encoded as ANSI or UTF.&lt;/P&gt;</description>
      <pubDate>Tue, 02 May 2017 23:48:26 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Does-Splunk-have-OCR-capability/m-p/359018#M65511</guid>
      <dc:creator>jkat54</dc:creator>
      <dc:date>2017-05-02T23:48:26Z</dc:date>
    </item>
    <item>
      <title>Re: Does Splunk have OCR capability?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Does-Splunk-have-OCR-capability/m-p/359019#M65512</link>
      <description>&lt;P&gt;Hey, can you contact me @daljeanis?  We need to connect somehow.&lt;/P&gt;</description>
      <pubDate>Tue, 02 May 2017 23:50:16 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Does-Splunk-have-OCR-capability/m-p/359019#M65512</guid>
      <dc:creator>jkat54</dc:creator>
      <dc:date>2017-05-02T23:50:16Z</dc:date>
    </item>
    <item>
      <title>Re: Does Splunk have OCR capability?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Does-Splunk-have-OCR-capability/m-p/359020#M65513</link>
      <description>&lt;P&gt;I'm always open to connecting with anyone real on LinkedIn at &lt;A href="http://linkedin.com/in/daljeanis"&gt;http://linkedin.com/in/daljeanis&lt;/A&gt;.&lt;/P&gt;

&lt;P&gt;Invitation sent.  &lt;/P&gt;</description>
      <pubDate>Wed, 03 May 2017 01:26:07 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Does-Splunk-have-OCR-capability/m-p/359020#M65513</guid>
      <dc:creator>DalJeanis</dc:creator>
      <dc:date>2017-05-03T01:26:07Z</dc:date>
    </item>
  </channel>
</rss>

