<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: follow tail returning jumbled mess in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14923#M1595</link>
    <description>&lt;P&gt;"the source type equals the file path" ... did you mean sourcetype or source equals file path?&lt;/P&gt;</description>
    <pubDate>Sat, 05 Jun 2010 03:33:44 GMT</pubDate>
    <dc:creator>Vishal_Patel</dc:creator>
    <dc:date>2010-06-05T03:33:44Z</dc:date>
    <item>
      <title>follow tail returning jumbled mess</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14917#M1589</link>
      <description>&lt;P&gt;I’m currently getting a new log source ready for production, and I almost have it except for one issue.  I’m forwarding email logs, which the email application appends each entry to.  I’m using the followTail directive, which works but the appended data is coming in to the indexer all jumbled and cooked looking while the original data in the file is not.  Below is the inputs.conf file from the indexer and a screen shot of what I’m seeing. Please help……….thanks!&lt;/P&gt;

&lt;P&gt;[default]&lt;BR /&gt;
host = TEST_SERVER&lt;/P&gt;

&lt;P&gt;[monitor:///home/dcarmack/myLogs2]&lt;BR /&gt;
disabled = false&lt;BR /&gt;
host = TEST&lt;BR /&gt;
sourcetype = TEST&lt;BR /&gt;
index = default&lt;BR /&gt;
followTail = 1&lt;/P&gt;

&lt;P&gt;&lt;IMG src="http://24.209.7.146/probImg.bmp" alt="alt text" /&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 04 Jun 2010 21:35:20 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14917#M1589</guid>
      <dc:creator>carmackd</dc:creator>
      <dc:date>2010-06-04T21:35:20Z</dc:date>
    </item>
    <item>
      <title>Re: follow tail returning jumbled mess</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14918#M1590</link>
      <description>&lt;P&gt;&lt;CODE&gt;followTail&lt;/CODE&gt; has little to do with this. It seems to me that you are sending data to a standard TCP port, not a Splunk TCP port. Is that your intention? If you're using a Splunk forwarder, you should not do that. Standard TCP ports are for raw TCP log streams.&lt;/P&gt;</description>
      <pubDate>Fri, 04 Jun 2010 21:48:13 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14918#M1590</guid>
      <dc:creator>gkanapathy</dc:creator>
      <dc:date>2010-06-04T21:48:13Z</dc:date>
    </item>
    <item>
      <title>Re: follow tail returning jumbled mess</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14919#M1591</link>
      <description>&lt;P&gt;No, I'm using the [splunktcp:] stanza on my indexer.&lt;/P&gt;</description>
      <pubDate>Fri, 04 Jun 2010 23:10:52 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14919#M1591</guid>
      <dc:creator>carmackd</dc:creator>
      <dc:date>2010-06-04T23:10:52Z</dc:date>
    </item>
    <item>
      <title>Re: follow tail returning jumbled mess</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14920#M1592</link>
      <description>&lt;P&gt;Output like this means that the data isn't valid UTF-8 when it arrives at the indexer.&lt;/P&gt;

&lt;P&gt;I find it very odd that "source" is not properly set for this data. When forwarding and receiving, we typically expect source, sourcetype and host to be properly set by the forwarder. What does this directory structure look like?&lt;/P&gt;

&lt;P&gt;As a side note, followTail is rarely a desired setting. Splunk will automatically start reading where it left off in a file. This setting is used to tell Splunk to reset this point to the end of the file, not where we last read up to.&lt;/P&gt;

&lt;P&gt;This setting could possibly be related if there's a bad interaction with archived files (that don't look like text) or files with a character set that requires some long history to decode (this doesn't seem to be the case here).&lt;/P&gt;</description>
      <pubDate>Fri, 04 Jun 2010 23:50:07 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14920#M1592</guid>
      <dc:creator>Stephen_Sorkin</dc:creator>
      <dc:date>2010-06-04T23:50:07Z</dc:date>
    </item>
    <item>
      <title>Re: follow tail returning jumbled mess</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14921#M1593</link>
      <description>&lt;P&gt;When I don't use followTail, the entire file gets re-indexed. One other thing I should mention, when the original file is indexed, the source type equals the file path, when the data that's appended to the file gets indexed, the source equals tcp:5000. As far as the directory structure, the forwarder is sitting in /home/dcarmack and the log files are located at /home/dcarmack/myLogs2&lt;/P&gt;</description>
      <pubDate>Sat, 05 Jun 2010 02:30:28 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14921#M1593</guid>
      <dc:creator>carmackd</dc:creator>
      <dc:date>2010-06-05T02:30:28Z</dc:date>
    </item>
    <item>
      <title>Re: follow tail returning jumbled mess</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14922#M1594</link>
      <description>&lt;P&gt;I'm more curious about the files within the directory /home/dcarmack/myLogs2. I'm also wondering why we're reindex the files as that should not happen. Do they share a common header? What does the data inside the files look like?&lt;/P&gt;</description>
      <pubDate>Sat, 05 Jun 2010 03:14:49 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14922#M1594</guid>
      <dc:creator>Stephen_Sorkin</dc:creator>
      <dc:date>2010-06-05T03:14:49Z</dc:date>
    </item>
    <item>
      <title>Re: follow tail returning jumbled mess</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14923#M1595</link>
      <description>&lt;P&gt;"the source type equals the file path" ... did you mean sourcetype or source equals file path?&lt;/P&gt;</description>
      <pubDate>Sat, 05 Jun 2010 03:33:44 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14923#M1595</guid>
      <dc:creator>Vishal_Patel</dc:creator>
      <dc:date>2010-06-05T03:33:44Z</dc:date>
    </item>
    <item>
      <title>Re: follow tail returning jumbled mess</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14924#M1596</link>
      <description>&lt;P&gt;It looks to me, because you're data is showing &lt;CODE&gt;tcp:5000&lt;/CODE&gt; that it's being sent to and received on a plain TCP port number 5000. I'm not sure where that would come from. Perhaps you have some rogue conf files around.&lt;/P&gt;

&lt;P&gt;The followTail behavior may be an artifact of how your files are being written? Perhaps they are being modified near the top of the file when they are appended?&lt;/P&gt;</description>
      <pubDate>Sat, 05 Jun 2010 10:05:59 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14924#M1596</guid>
      <dc:creator>gkanapathy</dc:creator>
      <dc:date>2010-06-05T10:05:59Z</dc:date>
    </item>
    <item>
      <title>Re: follow tail returning jumbled mess</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14925#M1597</link>
      <description>&lt;P&gt;It will show as tcp:5000 either if it's raw TCP in or if the forwarder isn't properly applying the source at input time. I don't suspect that it's raw TCP. I'm more curious about the file reading code. Are these archive files?&lt;/P&gt;</description>
      <pubDate>Sun, 06 Jun 2010 09:29:03 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14925#M1597</guid>
      <dc:creator>Stephen_Sorkin</dc:creator>
      <dc:date>2010-06-06T09:29:03Z</dc:date>
    </item>
    <item>
      <title>Re: follow tail returning jumbled mess</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14926#M1598</link>
      <description>&lt;P&gt;yes, sorry to confuse, source=path&lt;/P&gt;</description>
      <pubDate>Mon, 07 Jun 2010 20:32:50 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14926#M1598</guid>
      <dc:creator>carmackd</dc:creator>
      <dc:date>2010-06-07T20:32:50Z</dc:date>
    </item>
    <item>
      <title>Re: follow tail returning jumbled mess</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14927#M1599</link>
      <description>&lt;P&gt;The data is xml and comes from an email security appliance. Each entry has a common header. Yes, the files are archived using gzip&lt;/P&gt;</description>
      <pubDate>Mon, 07 Jun 2010 20:37:56 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/follow-tail-returning-jumbled-mess/m-p/14927#M1599</guid>
      <dc:creator>carmackd</dc:creator>
      <dc:date>2010-06-07T20:37:56Z</dc:date>
    </item>
  </channel>
</rss>

