<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic long delay /missing data from sourcetypes? in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/long-delay-missing-data-from-sourcetypes/m-p/441035#M76876</link>
    <description>&lt;P&gt;Hello all,&lt;/P&gt;

&lt;P&gt;I have 4 SH, 2 indexer's, 1 Deployment Server in one of my environments (windows). &lt;/P&gt;

&lt;P&gt;I'm now noticing that there's a long delay in some of my data showing up when searched on. This is a BIG issue for me as with operations you need to catch thing near real time. &lt;/P&gt;

&lt;P&gt;Some items i'm not able to search on until the next day. for example my IIS logs, if i search on the last 15 minutes, maybe 4 out of the 8 Web Servers show as producing logs. If i perform the same search maybe an hour later i'll get 7/8 servers, and hour after that maybe 2/8 servers (so it's sporadic and various). if i search for IIS before 6 hours ago, all is well.&lt;/P&gt;

&lt;P&gt;For my IIS indexer&lt;BR /&gt;
12 CPU, 24GB memory&lt;BR /&gt;
Indexing rate: around 250 KB/s (status = normal)&lt;BR /&gt;
Indexing rate every 5 minutes is around 394 KB's&lt;/P&gt;

&lt;P&gt;props.conf on indexer&lt;BR /&gt;
[iis]&lt;BR /&gt;
TZ = GMT&lt;/P&gt;

&lt;P&gt;Index size= 700GB&lt;BR /&gt;
Max size of Hot/Warm/Cold Bucket set to: auto&lt;BR /&gt;
Homepath 263/ unlimited &lt;BR /&gt;
cold 436/ unlimited &lt;/P&gt;

&lt;P&gt;The highest host IIS Log Event Count: 343,166,069&lt;BR /&gt;
by sourcetype (iis) 1,74,31,09,978&lt;/P&gt;

&lt;P&gt;Maxdatasize auto&lt;BR /&gt;
maxhotbuckets 3&lt;BR /&gt;
maxwarmdbcount 300&lt;/P&gt;

&lt;P&gt;Splunk Data Piple line is 0% across the board and show's no delays.&lt;/P&gt;

&lt;P&gt;I noticed under the index Detail: instance my cold buckets size was much larger than my hot/warm buckets also &lt;/P&gt;</description>
    <pubDate>Thu, 08 Aug 2019 21:31:25 GMT</pubDate>
    <dc:creator>Jarohnimo</dc:creator>
    <dc:date>2019-08-08T21:31:25Z</dc:date>
    <item>
      <title>long delay /missing data from sourcetypes?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/long-delay-missing-data-from-sourcetypes/m-p/441035#M76876</link>
      <description>&lt;P&gt;Hello all,&lt;/P&gt;

&lt;P&gt;I have 4 SH, 2 indexer's, 1 Deployment Server in one of my environments (windows). &lt;/P&gt;

&lt;P&gt;I'm now noticing that there's a long delay in some of my data showing up when searched on. This is a BIG issue for me as with operations you need to catch thing near real time. &lt;/P&gt;

&lt;P&gt;Some items i'm not able to search on until the next day. for example my IIS logs, if i search on the last 15 minutes, maybe 4 out of the 8 Web Servers show as producing logs. If i perform the same search maybe an hour later i'll get 7/8 servers, and hour after that maybe 2/8 servers (so it's sporadic and various). if i search for IIS before 6 hours ago, all is well.&lt;/P&gt;

&lt;P&gt;For my IIS indexer&lt;BR /&gt;
12 CPU, 24GB memory&lt;BR /&gt;
Indexing rate: around 250 KB/s (status = normal)&lt;BR /&gt;
Indexing rate every 5 minutes is around 394 KB's&lt;/P&gt;

&lt;P&gt;props.conf on indexer&lt;BR /&gt;
[iis]&lt;BR /&gt;
TZ = GMT&lt;/P&gt;

&lt;P&gt;Index size= 700GB&lt;BR /&gt;
Max size of Hot/Warm/Cold Bucket set to: auto&lt;BR /&gt;
Homepath 263/ unlimited &lt;BR /&gt;
cold 436/ unlimited &lt;/P&gt;

&lt;P&gt;The highest host IIS Log Event Count: 343,166,069&lt;BR /&gt;
by sourcetype (iis) 1,74,31,09,978&lt;/P&gt;

&lt;P&gt;Maxdatasize auto&lt;BR /&gt;
maxhotbuckets 3&lt;BR /&gt;
maxwarmdbcount 300&lt;/P&gt;

&lt;P&gt;Splunk Data Piple line is 0% across the board and show's no delays.&lt;/P&gt;

&lt;P&gt;I noticed under the index Detail: instance my cold buckets size was much larger than my hot/warm buckets also &lt;/P&gt;</description>
      <pubDate>Thu, 08 Aug 2019 21:31:25 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/long-delay-missing-data-from-sourcetypes/m-p/441035#M76876</guid>
      <dc:creator>Jarohnimo</dc:creator>
      <dc:date>2019-08-08T21:31:25Z</dc:date>
    </item>
    <item>
      <title>Re: long delay /missing data from sourcetypes?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/long-delay-missing-data-from-sourcetypes/m-p/441036#M76877</link>
      <description>&lt;P&gt;Have you verified all of the IIS servers have the correct time and time zone?&lt;BR /&gt;
When you compare _time to _indextime, what do you see?&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;| tstats latest(_time) AS _time latest(_indextime) AS _indextime where index=iis by host 
| eval delta=_indextime - _time 
| where delta != 0 
| eval indexTime=_indextime 
| fields delta indexTime _time host
| sort - delta 
| eval indexTime=strftime(indexTime, "%F %T") 
| eval Time=strftime(_time, "%F %T")
| table delta indexTime Time host
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Thu, 08 Aug 2019 22:40:35 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/long-delay-missing-data-from-sourcetypes/m-p/441036#M76877</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2019-08-08T22:40:35Z</dc:date>
    </item>
    <item>
      <title>Re: long delay /missing data from sourcetypes?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/long-delay-missing-data-from-sourcetypes/m-p/441037#M76878</link>
      <description>&lt;P&gt;Yes, the timestamp on all the IIS servers look fine. They are in UTC and as stated in the OP I've added a props.conf entry for that sourcetype that normalized the data. If I do a search on future logs nothing is returned so I'm not of the impression it's a timestamp issue. &lt;/P&gt;

&lt;P&gt;One thing I meant to mention i discovered leaving out work, another log source is also delayed. Both of these logs are the biggest logs source I'm pulling&lt;/P&gt;

&lt;P&gt;However smaller logs and sources still come through&lt;/P&gt;

&lt;P&gt;I'm starting to think I'm hitting my limit in limits.conf. &lt;/P&gt;</description>
      <pubDate>Fri, 09 Aug 2019 01:19:11 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/long-delay-missing-data-from-sourcetypes/m-p/441037#M76878</guid>
      <dc:creator>Jarohnimo</dc:creator>
      <dc:date>2019-08-09T01:19:11Z</dc:date>
    </item>
    <item>
      <title>Re: long delay /missing data from sourcetypes?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/long-delay-missing-data-from-sourcetypes/m-p/441038#M76879</link>
      <description>&lt;P&gt;Maybe I'm ignorant to the idea of me hitting any limits as I'm only ingesting 250gb daily and I know of plenty who pull TB's of data a day. Perhaps they've adjusted their limits.conf to allow the data to flow or perhaps they are pulling from 1,000 devices to = that 1tb and no individual node is reaching the default limit in limits.conf where I'm only pulling from 84 devices = 250gb's? &lt;/P&gt;

&lt;P&gt;I definitely need to fix this problem asap!&lt;/P&gt;</description>
      <pubDate>Fri, 09 Aug 2019 01:19:40 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/long-delay-missing-data-from-sourcetypes/m-p/441038#M76879</guid>
      <dc:creator>Jarohnimo</dc:creator>
      <dc:date>2019-08-09T01:19:40Z</dc:date>
    </item>
    <item>
      <title>Re: long delay /missing data from sourcetypes?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/long-delay-missing-data-from-sourcetypes/m-p/441039#M76880</link>
      <description>&lt;P&gt;Putting &lt;CODE&gt;TZ = GMT&lt;/CODE&gt; in props.conf does not normalize data.  It's merely information to help indexers parse timestamps.  If the timestamp is not in UTC, &lt;CODE&gt;TZ = GMT&lt;/CODE&gt; will result in events being out of sequence.&lt;/P&gt;

&lt;P&gt;Are the logs being sent by a forwarder?  If so, consider increasing the &lt;CODE&gt;maxKBps&lt;/CODE&gt; setting in the forwarder's limits.conf file.&lt;/P&gt;

&lt;P&gt;Depending on what else the indexer is doing, 250GB/day is near the limit of what can be expected from a single indexer.  If you can't increase the storage I/O rate then consider adding an indexer.&lt;/P&gt;</description>
      <pubDate>Fri, 09 Aug 2019 12:30:39 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/long-delay-missing-data-from-sourcetypes/m-p/441039#M76880</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2019-08-09T12:30:39Z</dc:date>
    </item>
    <item>
      <title>Re: long delay /missing data from sourcetypes?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/long-delay-missing-data-from-sourcetypes/m-p/441040#M76881</link>
      <description>&lt;P&gt;I meant normalize the data in respect to the timestamp, I should of been clearer. Generally I do my Field extractions at search time on the search heads only.&lt;/P&gt;

&lt;P&gt;You may have missed that I have 2 indexers currently so one indxer is getting half this amount so I don't think it's the indexers.. no issues with the data pipeline.. i'm thinking limit.conf is probably where I need to concentrate.&lt;/P&gt;

&lt;P&gt;Today I evaluated my actual logs and see someone doing something crazy with web Api calls that have more than quadrupled the log size. So I'll have them stop what they are doing first and look into the limit.conf at the same time&lt;/P&gt;

&lt;P&gt;Thanks for your help&lt;/P&gt;</description>
      <pubDate>Fri, 09 Aug 2019 17:14:49 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/long-delay-missing-data-from-sourcetypes/m-p/441040#M76881</guid>
      <dc:creator>Jarohnimo</dc:creator>
      <dc:date>2019-08-09T17:14:49Z</dc:date>
    </item>
    <item>
      <title>Re: long delay /missing data from sourcetypes?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/long-delay-missing-data-from-sourcetypes/m-p/522615#M88270</link>
      <description>&lt;P&gt;How have you solved it?&lt;/P&gt;</description>
      <pubDate>Fri, 02 Oct 2020 01:46:07 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/long-delay-missing-data-from-sourcetypes/m-p/522615#M88270</guid>
      <dc:creator>splunkcol</dc:creator>
      <dc:date>2020-10-02T01:46:07Z</dc:date>
    </item>
  </channel>
</rss>

