<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Json ingest with weird characters or binary in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396085#M70619</link>
    <description>&lt;P&gt;It would be most useful to show us these &lt;EM&gt;weird_characters&lt;/EM&gt;.&lt;/P&gt;</description>
    <pubDate>Mon, 19 Nov 2018 02:55:40 GMT</pubDate>
    <dc:creator>ddrillic</dc:creator>
    <dc:date>2018-11-19T02:55:40Z</dc:date>
    <item>
      <title>Json ingest with weird characters or binary</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396084#M70618</link>
      <description>&lt;P&gt;Hi all, &lt;/P&gt;

&lt;P&gt;I was trying to ingest some json files however the json seems to have some weird characters or binary and parsing failed. &lt;/P&gt;

&lt;P&gt;Example of JSON:&lt;/P&gt;

&lt;P&gt;{&lt;BR /&gt;
     "abc": "&lt;EM&gt;weird_characters&lt;/EM&gt;"&lt;BR /&gt;
}&lt;/P&gt;

&lt;P&gt;I got this error : ERROR JsonLineBreaker - JSON Stream ID: xxxxxxxxxxxxxxxxxxxxxx had parsing error: Unexpected character while parsing backslash escape: 'x' &lt;/P&gt;

&lt;P&gt;I had experimented on a lot of prof.conf including setting binary to false. I suspect this is something to do with encoding. &lt;/P&gt;

&lt;P&gt;How do i solved this? &lt;/P&gt;

&lt;P&gt;Thanks in advance&lt;/P&gt;</description>
      <pubDate>Mon, 19 Nov 2018 02:24:19 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396084#M70618</guid>
      <dc:creator>quahfamili</dc:creator>
      <dc:date>2018-11-19T02:24:19Z</dc:date>
    </item>
    <item>
      <title>Re: Json ingest with weird characters or binary</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396085#M70619</link>
      <description>&lt;P&gt;It would be most useful to show us these &lt;EM&gt;weird_characters&lt;/EM&gt;.&lt;/P&gt;</description>
      <pubDate>Mon, 19 Nov 2018 02:55:40 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396085#M70619</guid>
      <dc:creator>ddrillic</dc:creator>
      <dc:date>2018-11-19T02:55:40Z</dc:date>
    </item>
    <item>
      <title>Re: Json ingest with weird characters or binary</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396086#M70620</link>
      <description>&lt;P&gt;check the CHARSET value may be adjust it to UCS-2LE&lt;/P&gt;</description>
      <pubDate>Mon, 19 Nov 2018 02:58:59 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396086#M70620</guid>
      <dc:creator>Kosanam</dc:creator>
      <dc:date>2018-11-19T02:58:59Z</dc:date>
    </item>
    <item>
      <title>Re: Json ingest with weird characters or binary</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396087#M70621</link>
      <description>&lt;P&gt;@ddrillic  I cannot paste it over. It looks like characters that are forced UTF or something&lt;/P&gt;

&lt;P&gt;@Kosanam  How do I check the CHARSET?&lt;/P&gt;</description>
      <pubDate>Mon, 19 Nov 2018 03:00:52 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396087#M70621</guid>
      <dc:creator>quahfamili</dc:creator>
      <dc:date>2018-11-19T03:00:52Z</dc:date>
    </item>
    <item>
      <title>Re: Json ingest with weird characters or binary</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396088#M70622</link>
      <description>&lt;P&gt;you can edit in props.conf or when you add the sample file to set the sourcetype  check the advanced settings&lt;/P&gt;</description>
      <pubDate>Mon, 19 Nov 2018 03:04:59 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396088#M70622</guid>
      <dc:creator>Kosanam</dc:creator>
      <dc:date>2018-11-19T03:04:59Z</dc:date>
    </item>
    <item>
      <title>Re: Json ingest with weird characters or binary</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396089#M70623</link>
      <description>&lt;P&gt;It is not set, I thought it is automatically set to UTF-8 if it is not defined. From the document it is documented as ACSII As default.&lt;/P&gt;</description>
      <pubDate>Mon, 19 Nov 2018 03:18:48 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396089#M70623</guid>
      <dc:creator>quahfamili</dc:creator>
      <dc:date>2018-11-19T03:18:48Z</dc:date>
    </item>
    <item>
      <title>Re: Json ingest with weird characters or binary</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396090#M70624</link>
      <description>&lt;P&gt;Does the JSON string (Assuming you have the correct CHARSET in props.conf) actually contain &lt;CODE&gt;\x&lt;/CODE&gt;? If so, you may have invalid JSON... check out the grammar on &lt;A href="https://json.org"&gt;https://json.org&lt;/A&gt; The only characters that can follow a backslash in a string are slash, backslash, double quote, b, f, n, r, t, OR u (when immediately followed by 4 hex digits).&lt;/P&gt;</description>
      <pubDate>Mon, 19 Nov 2018 05:54:01 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396090#M70624</guid>
      <dc:creator>acharlieh</dc:creator>
      <dc:date>2018-11-19T05:54:01Z</dc:date>
    </item>
    <item>
      <title>Re: Json ingest with weird characters or binary</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396091#M70625</link>
      <description>&lt;P&gt;Hi, &lt;/P&gt;

&lt;P&gt;I manually removed the weird_characters and the JSON file can be ingested. However, these character are housed in the double quotes. &lt;/P&gt;

&lt;P&gt;@acharlieh The file does not actually contain \x. However, I thought due to the encoding of these weird_characters, splunk might had recognized it as \x. I had set CHARSET to UTF-8 and the files continue to get the same error. &lt;/P&gt;

&lt;P&gt;Anyone can help? &lt;/P&gt;</description>
      <pubDate>Mon, 19 Nov 2018 06:05:06 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396091#M70625</guid>
      <dc:creator>quahfamili</dc:creator>
      <dc:date>2018-11-19T06:05:06Z</dc:date>
    </item>
    <item>
      <title>Re: Json ingest with weird characters or binary</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396092#M70626</link>
      <description>&lt;P&gt;Then take a screenshot and upload that somewhere to share it with us. Without understand what "weird characters" you're seeing it is a bit shooting in the dark.&lt;/P&gt;</description>
      <pubDate>Mon, 19 Nov 2018 10:35:29 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396092#M70626</guid>
      <dc:creator>FrankVl</dc:creator>
      <dc:date>2018-11-19T10:35:29Z</dc:date>
    </item>
    <item>
      <title>Re: Json ingest with weird characters or binary</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396093#M70627</link>
      <description>&lt;P&gt;Where did you set the CHARSET? Just to double check this is on the Forwarder or other node performing ingestion yes? (Being an ingestion time thing). And you restarted the forwarder before trying ingesting one of these files again?&lt;/P&gt;

&lt;P&gt;Is the source system actually producing the whole file as UTF-8 encoded JSON? How do you know?&lt;/P&gt;

&lt;P&gt;Have you looked at your input in a good hex editor? If you're on Mac, I like &lt;A href="https://ridiculousfish.com/hexfiend/"&gt;HexFiend&lt;/A&gt; but there are many other good ones out there. The goal of this exercise is to know the actual bytes that are being ingested, and try to determine for certain what encoding is actually in place. A good editor will let you try out interpreting the bytes as a few different encodings, and see what is there when you do so. Using the output of this, and possibly a site like &lt;A href="https://fileformat.info/info/unicode/"&gt;https://fileformat.info/info/unicode/&lt;/A&gt; you can actually figure out what these "weird" characters actually are and reason about them.&lt;/P&gt;</description>
      <pubDate>Mon, 19 Nov 2018 14:42:01 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396093#M70627</guid>
      <dc:creator>acharlieh</dc:creator>
      <dc:date>2018-11-19T14:42:01Z</dc:date>
    </item>
    <item>
      <title>Re: Json ingest with weird characters or binary</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396094#M70628</link>
      <description>&lt;P&gt;Hi all, &lt;/P&gt;

&lt;P&gt;I checked the weird_characters are chinese character. I had set the encoding at UTF-8. I even try to modify my data to "abc": "\weird_characters". However, to no avail. I still cannot parse the data. &lt;/P&gt;

&lt;P&gt;Need help&lt;/P&gt;</description>
      <pubDate>Tue, 29 Sep 2020 23:52:27 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396094#M70628</guid>
      <dc:creator>quahfamili</dc:creator>
      <dc:date>2020-09-29T23:52:27Z</dc:date>
    </item>
    <item>
      <title>Re: Json ingest with weird characters or binary</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396095#M70629</link>
      <description>&lt;P&gt;If its Chinese, have you tried with UTF-16?&lt;/P&gt;</description>
      <pubDate>Tue, 26 Mar 2019 11:32:04 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Json-ingest-with-weird-characters-or-binary/m-p/396095#M70629</guid>
      <dc:creator>nickhills</dc:creator>
      <dc:date>2019-03-26T11:32:04Z</dc:date>
    </item>
  </channel>
</rss>

