<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Charset Encoding in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Charset-Encoding/m-p/25354#M4100</link>
    <description>&lt;P&gt;Here is a list of supported character sets, and instructions on how to apply them to data:&lt;/P&gt;

&lt;P&gt;&lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/data/Configurecharactersetencoding"&gt;http://docs.splunk.com/Documentation/Splunk/latest/data/Configurecharactersetencoding&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Thu, 12 Apr 2012 14:35:37 GMT</pubDate>
    <dc:creator>jbsplunk</dc:creator>
    <dc:date>2012-04-12T14:35:37Z</dc:date>
    <item>
      <title>Charset Encoding</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Charset-Encoding/m-p/25353#M4099</link>
      <description>&lt;P&gt;Hi guys,&lt;/P&gt;

&lt;P&gt;I have installed Splunk 4.3 on Mac OS X 10.7.&lt;/P&gt;

&lt;P&gt;I am trying to index data in a non-UTF encoding. I have tried pretty much every encoding available in Splunk without any luck: the non-ASCII characters get replaced with other symbols.&lt;/P&gt;

&lt;P&gt;Example:&lt;/P&gt;

&lt;P&gt;In my log files I have "DAVOR ĆORIĆ", and it gets indexed as "DAVOR žORIž" or with some other symbol, depending on which charset I use with this sourcetype. I never get the correct data indexed.&lt;/P&gt;

&lt;P&gt;Has anyone had a similar problem, and possibly a simple solution?&lt;/P&gt;</description>
      <pubDate>Thu, 12 Apr 2012 12:33:02 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Charset-Encoding/m-p/25353#M4099</guid>
      <dc:creator>kenchisho</dc:creator>
      <dc:date>2012-04-12T12:33:02Z</dc:date>
    </item>
    <item>
      <title>Re: Charset Encoding</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Charset-Encoding/m-p/25354#M4100</link>
      <description>&lt;P&gt;Here is a list of supported character sets, and instructions on how to apply them to data:&lt;/P&gt;

&lt;P&gt;&lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/data/Configurecharactersetencoding"&gt;http://docs.splunk.com/Documentation/Splunk/latest/data/Configurecharactersetencoding&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 12 Apr 2012 14:35:37 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Charset-Encoding/m-p/25354#M4100</guid>
      <dc:creator>jbsplunk</dc:creator>
      <dc:date>2012-04-12T14:35:37Z</dc:date>
    </item>
    <item>
      <title>Re: Charset Encoding</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Charset-Encoding/m-p/25355#M4101</link>
      <description>&lt;P&gt;Hi jbsplunk,&lt;/P&gt;

&lt;P&gt;Thanks for the quick reply.&lt;/P&gt;

&lt;P&gt;I have tried setting the charset manually, but Splunk still garbles the data when indexing. I have tried pretty much all the charsets available in Splunk. Usually I use CP1250 for this type of data and all goes well, but this data set does not index correctly with any charset config.&lt;/P&gt;
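
&lt;P&gt;One way to check the bytes outside Splunk is to decode a sample with a few candidate charsets and compare the results against the expected text. The following is only a rough Python sketch: it uses Python's own codecs rather than Splunk's decoder, the helper name decode_with_candidates is made up for illustration, and the sample bytes are built on the assumption that the text is CP1250, which may not hold for the real log file.&lt;/P&gt;

```python
# Sketch only: decode the same raw bytes with several candidate charsets
# to see which one reproduces the expected text. This runs outside Splunk
# and relies on Python's codecs, not on Splunk's decoder.

def decode_with_candidates(raw, candidates):
    """Map each charset name to the decoded text, or None if decoding fails."""
    results = {}
    for name in candidates:
        try:
            results[name] = raw.decode(name)
        except UnicodeDecodeError:
            # The byte sequence is not valid in this charset.
            results[name] = None
    return results

# Assumption: the sample is CP1250-encoded. It is built here from the
# example string in this thread instead of being read from the real log.
sample = "DAVOR \u0106ORI\u0106".encode("cp1250")

# Hypothetical candidate list; extend it with whatever charsets are suspected.
results = decode_with_candidates(sample, ["cp1250", "iso-8859-2", "latin-1", "utf-8"])
for name, text in results.items():
    print(name, repr(text))
```

&lt;P&gt;To test the real data, the sample could be replaced with bytes read from the log file in binary mode. If none of the candidates reproduces the expected text, the file may use a different charset or a mix of encodings.&lt;/P&gt;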

&lt;P&gt;I have also tried this with a Linux install of Splunk, thinking it might be an OS X related issue, and I get the same results.&lt;/P&gt;

&lt;P&gt;I am guessing this might be a bug, but I am not quite sure yet.&lt;/P&gt;</description>
      <pubDate>Thu, 12 Apr 2012 14:44:04 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Charset-Encoding/m-p/25355#M4101</guid>
      <dc:creator>kenchisho</dc:creator>
      <dc:date>2012-04-12T14:44:04Z</dc:date>
    </item>
    <item>
      <title>Re: Charset Encoding</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Charset-Encoding/m-p/25356#M4102</link>
      <description>&lt;P&gt;If you open the file with a tool like TextWrangler, what does it detect as the charset? I've found that to be pretty reliable when troubleshooting these kinds of issues.&lt;/P&gt;</description>
      <pubDate>Thu, 12 Apr 2012 14:54:31 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Charset-Encoding/m-p/25356#M4102</guid>
      <dc:creator>jbsplunk</dc:creator>
      <dc:date>2012-04-12T14:54:31Z</dc:date>
    </item>
  </channel>
</rss>

