<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Indexer missing events from forwarder in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Indexer-missing-events-from-forwarder/m-p/86861#M18023</link>
    <description>&lt;P&gt;maybe use "persistentqueue"?&lt;/P&gt;

&lt;P&gt;&lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/Data/Usepersistentqueues"&gt;http://docs.splunk.com/Documentation/Splunk/latest/Data/Usepersistentqueues&lt;/A&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 29 Jun 2012 13:38:16 GMT</pubDate>
    <dc:creator>jfraiberg</dc:creator>
    <dc:date>2012-06-29T13:38:16Z</dc:date>
    <item>
      <title>Indexer missing events from forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Indexer-missing-events-from-forwarder/m-p/86857#M18019</link>
      <description>&lt;P&gt;Scenario: 1x load balancer, 2x light forwarders, 1x indexer.  &lt;/P&gt;

&lt;P&gt;&lt;IMG src="http://i49.tinypic.com/s2fdqv.jpg" alt="alt text" /&gt;&lt;/P&gt;

&lt;P&gt;The goal is to make it possible to reboot a single Splunk instance and not lose events.  Here is the output.conf on the forwarders:&lt;/P&gt;

&lt;P&gt;[tcpout:x.x.x.x_x]&lt;BR /&gt;
autoLB = false&lt;BR /&gt;
server = x.x.x.x_x&lt;BR /&gt;
useACK = true&lt;BR /&gt;
maxQueueSize = 400MB&lt;BR /&gt;
sendCookedData = true&lt;/P&gt;

&lt;P&gt;To test this scenario I produce a controlled number of events and then verify they are indexed.  Restarting either forwarder while sending events does not incur loss.  Though, if I restart the indexer I always lose a small number of events.  &lt;/P&gt;

&lt;P&gt;I have verified that the events are received by the forwarders.&lt;/P&gt;

&lt;P&gt;It is my understanding that using maxQueueSize and autoLB together should eliminate lost events.  (&lt;A href="http://docs.splunk.com/Documentation/Splunk/4.3.3/Deploy/Protectagainstlossofin-flightdata" target="_blank"&gt;http://docs.splunk.com/Documentation/Splunk/4.3.3/Deploy/Protectagainstlossofin-flightdata&lt;/A&gt;)  &lt;/P&gt;

&lt;P&gt;Here is what the splunk.log looks like:&lt;/P&gt;

&lt;P&gt;06-28-2012 10:17:12.818 -0900 WARN  TcpOutputFd - Connect to x.x.x.x:x failed. Connection refused&lt;BR /&gt;
06-28-2012 10:17:12.818 -0900 ERROR TcpOutputFd - Connection to host=x.x.x.x:x failed&lt;BR /&gt;
06-28-2012 10:17:12.818 -0900 WARN  TcpOutputFd - Connect to x.x.x.x:x failed. Connection refused&lt;BR /&gt;
06-28-2012 10:17:12.818 -0900 ERROR TcpOutputFd - Connection to host=x.x.x.x:x failed&lt;BR /&gt;
06-28-2012 10:17:12.818 -0900 WARN  TcpOutputProc - Applying quarantine to idx=x.x.x.x:x numberOfFailures=4&lt;BR /&gt;
06-28-2012 10:17:12.818 -0900 WARN  TcpOutputProc - Applying quarantine to idx=x.x.x.x:x numberOfFailures=5&lt;BR /&gt;
06-28-2012 10:18:42.847 -0900 INFO  TcpOutputProc - Removing quarantine from idx=x.x.x.x:x&lt;BR /&gt;
06-28-2012 10:18:42.850 -0900 INFO  TcpOutputProc - Connected to idx=x.x.x.x:x using ACK.&lt;/P&gt;

&lt;P&gt;In this particular test I sent 150 events, 144 were indexed&lt;/P&gt;

&lt;P&gt;What am I missing?&lt;/P&gt;

&lt;P&gt;EDIT:&lt;/P&gt;

&lt;P&gt;The indexer log is shown below.  The log begins as I issue the restart command.  In this particular test 67 of 80 events were indexed.&lt;/P&gt;

&lt;HR /&gt;

&lt;P&gt;06-28-2012 17:11:10.296 -0400 INFO  IndexProcessor - handleSignal 2: Disabling streaming searches.&lt;BR /&gt;
06-28-2012 17:11:10.296 -0400 INFO  IndexProcessor - request state change RUN to SHUTDOWN_SIGNALED&lt;BR /&gt;
06-28-2012 17:11:10.296 -0400 WARN  HTTPServer - Disabling all acceptors due to shutdown&lt;BR /&gt;
06-28-2012 17:11:10.951 -0400 INFO  IndexProcessor - active realtime streams have hit 0 during shutdown&lt;BR /&gt;
06-28-2012 17:11:11.288 -0400 WARN  HTTPServer - Shutdown triggered via callback.&lt;BR /&gt;
06-28-2012 17:11:11.288 -0400 WARN  HTTPServer - HTTPServer shutting down. Terminating eloop.&lt;BR /&gt;
06-28-2012 17:11:11.288 -0400 WARN  HTTPServer - Closing all acceptors due to shutdown&lt;BR /&gt;
06-28-2012 17:11:11.288 -0400 INFO  loader - Shutdown HTTPDispatchThread&lt;BR /&gt;
06-28-2012 17:11:11.289 -0400 INFO  ShutdownHandler - Shutting down splunkd&lt;BR /&gt;
06-28-2012 17:11:11.289 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_Thruput"&lt;BR /&gt;
06-28-2012 17:11:11.289 -0400 INFO  ShutdownHandler - Component 'ThruputProcessor' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:11:11.289 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_TcpInput"&lt;BR /&gt;
06-28-2012 17:11:11.289 -0400 INFO  TcpInputProc - Stopping IPv4 port x&lt;BR /&gt;
06-28-2012 17:11:11.289 -0400 INFO  TcpInputProc - Stopping IPv4 port x&lt;BR /&gt;
06-28-2012 17:11:11.289 -0400 INFO  TcpInputProc - Stopping IPv4 port x&lt;BR /&gt;
06-28-2012 17:11:11.289 -0400 INFO  TcpInputProc - Stopping IPv4 port x&lt;BR /&gt;
06-28-2012 17:11:11.289 -0400 INFO  TcpInputProc - Stopping IPv4 port x&lt;BR /&gt;
06-28-2012 17:11:11.289 -0400 INFO  TcpInputProc - Stopping IPv4 port x&lt;BR /&gt;
06-28-2012 17:11:12.084 -0400 INFO  TcpInputProc - Waiting for connection from src=x to close before shutting down TcpInputProcessor.&lt;BR /&gt;
06-28-2012 17:11:12.278 -0400 WARN  ProcessTracker - Failed to start process, PGID 4587; will try again&lt;BR /&gt;
06-28-2012 17:11:14.279 -0400 WARN  ProcessTracker - Failed to start process, PGID 4588; will try again&lt;BR /&gt;
06-28-2012 17:11:17.278 -0400 WARN  ProcessTracker - Failed to start process, PGID 4589; will try again&lt;BR /&gt;
06-28-2012 17:11:21.280 -0400 WARN  ProcessTracker - Failed to start process, PGID 4590; will try again&lt;BR /&gt;
06-28-2012 17:11:22.280 -0400 WARN  ProcessTracker - Failed to start process, PGID 4591; will try again&lt;BR /&gt;
06-28-2012 17:11:24.283 -0400 WARN  ProcessTracker - Failed to start process, PGID 4592; will try again&lt;BR /&gt;
06-28-2012 17:11:26.174 -0400 ERROR DispatchCommand - Failed to start the search process.&lt;BR /&gt;
06-28-2012 17:11:28.279 -0400 WARN  ProcessTracker - Failed to start process, PGID 4593; will try again&lt;BR /&gt;
06-28-2012 17:11:36.283 -0400 WARN  ProcessTracker - Failed to start process, PGID 4594; will try again&lt;BR /&gt;
06-28-2012 17:11:40.281 -0400 WARN  ProcessTracker - Failed to start process, PGID 4595; will try again&lt;BR /&gt;
06-28-2012 17:11:41.278 -0400 WARN  ProcessTracker - Failed to start process, PGID 4596; will try again&lt;BR /&gt;
06-28-2012 17:11:43.278 -0400 WARN  ProcessTracker - Failed to start process, PGID 4597; will try again&lt;BR /&gt;
06-28-2012 17:11:45.278 -0400 WARN  ProcessTracker - Failed to start process, PGID 4598; will try again&lt;BR /&gt;
06-28-2012 17:11:45.279 -0400 WARN  ProcessTracker - Failed to start process, PGID 4599; will try again&lt;BR /&gt;
06-28-2012 17:11:52.279 -0400 WARN  ProcessTracker - Failed to start process, PGID 4600; will try again&lt;BR /&gt;
06-28-2012 17:11:52.287 -0400 WARN  ProcessTracker - Failed to start process, PGID 4601; will try again&lt;BR /&gt;
06-28-2012 17:11:54.278 -0400 WARN  ProcessTracker - Failed to start process, PGID 4602; will try again&lt;BR /&gt;
06-28-2012 17:11:57.278 -0400 WARN  ProcessTracker - Failed to start process, PGID 4603; will try again&lt;BR /&gt;
06-28-2012 17:11:57.279 -0400 WARN  ProcessTracker - Failed to start process, PGID 4604; will try again&lt;BR /&gt;
06-28-2012 17:11:57.280 -0400 WARN  ProcessTracker - Failed to start process, PGID 4605; will try again&lt;BR /&gt;
06-28-2012 17:11:57.280 -0400 WARN  ProcessTracker - Failed to start process, PGID 4606; will try again&lt;BR /&gt;
06-28-2012 17:11:57.281 -0400 WARN  ProcessTracker - Failed to start process, PGID 4607; will try again&lt;BR /&gt;
06-28-2012 17:12:00.278 -0400 WARN  ProcessTracker - Failed to start process, PGID 4608; will try again&lt;BR /&gt;
06-28-2012 17:12:02.278 -0400 WARN  ProcessTracker - Failed to start process, PGID 4609; will try again&lt;BR /&gt;
06-28-2012 17:12:02.279 -0400 WARN  ProcessTracker - Failed to start process, PGID 4610; will try again&lt;BR /&gt;
06-28-2012 17:12:04.278 -0400 WARN  ProcessTracker - Failed to start process, PGID 4611; will try again&lt;BR /&gt;
06-28-2012 17:12:06.278 -0400 WARN  ProcessTracker - Failed to start process, PGID 4612; will try again&lt;BR /&gt;
06-28-2012 17:12:06.280 -0400 WARN  ProcessTracker - Failed to start process, PGID 4613; will try again&lt;BR /&gt;
06-28-2012 17:12:06.281 -0400 WARN  ProcessTracker - Failed to start process, PGID 4614; will try again&lt;BR /&gt;
06-28-2012 17:12:06.281 -0400 WARN  ProcessTracker - Failed to start process, PGID 4615; will try again&lt;BR /&gt;
06-28-2012 17:12:06.281 -0400 WARN  ProcessTracker - Failed to start process, PGID 4616; will try again&lt;BR /&gt;
06-28-2012 17:12:06.281 -0400 WARN  ProcessTracker - Failed to start process, PGID 4617; will try again&lt;BR /&gt;
06-28-2012 17:12:06.281 -0400 WARN  ProcessTracker - Failed to start process, PGID 4618; will try again&lt;BR /&gt;
06-28-2012 17:12:06.282 -0400 WARN  ProcessTracker - Failed to start process, PGID 4619; will try again&lt;BR /&gt;
06-28-2012 17:12:06.282 -0400 WARN  ProcessTracker - Failed to start process, PGID 4620; will try again&lt;BR /&gt;
06-28-2012 17:12:06.282 -0400 WARN  ProcessTracker - Failed to start process, PGID 4621; will try again&lt;BR /&gt;
06-28-2012 17:12:06.282 -0400 WARN  ProcessTracker - Failed to start process, PGID 4622; will try again&lt;BR /&gt;
06-28-2012 17:12:06.303 -0400 WARN  ProcessTracker - Failed to start process, PGID 4623; will try again&lt;BR /&gt;
06-28-2012 17:12:11.279 -0400 WARN  ProcessTracker - Failed to start process, PGID 4624; will try again&lt;BR /&gt;
06-28-2012 17:12:18.283 -0400 WARN  ProcessTracker - Failed to start process, PGID 4625; will try again&lt;BR /&gt;
06-28-2012 17:12:18.284 -0400 WARN  ProcessTracker - Failed to start process, PGID 4626; will try again&lt;BR /&gt;
06-28-2012 17:12:20.278 -0400 WARN  ProcessTracker - Failed to start process, PGID 4627; will try again&lt;BR /&gt;
06-28-2012 17:12:23.279 -0400 WARN  ProcessTracker - Failed to start process, PGID 4628; will try again&lt;BR /&gt;
06-28-2012 17:12:23.283 -0400 WARN  ProcessTracker - Failed to start process, PGID 4629; will try again&lt;BR /&gt;
06-28-2012 17:12:23.287 -0400 WARN  ProcessTracker - Failed to start process, PGID 4630; will try again&lt;BR /&gt;
06-28-2012 17:12:23.291 -0400 WARN  ProcessTracker - Failed to start process, PGID 4631; will try again&lt;BR /&gt;
06-28-2012 17:12:23.294 -0400 WARN  ProcessTracker - Failed to start process, PGID 4632; will try again&lt;BR /&gt;
06-28-2012 17:12:25.278 -0400 WARN  timeinvertedIndex - splunk-optimize is already running for this index /opt/splunk/var/lib/splunk/_internaldb/db/hot_v1_677.&lt;BR /&gt;
06-28-2012 17:12:25.278 -0400 WARN  timeinvertedIndex - splunk-optimize is already running for this index /opt/splunk/var/lib/splunk/syslog/db/hot_v1_358.&lt;BR /&gt;
06-28-2012 17:12:26.278 -0400 WARN  ProcessTracker - Failed to start process, PGID 4633; will try again&lt;BR /&gt;
06-28-2012 17:12:26.443 -0400 ERROR DispatchCommand - Failed to start the search process.&lt;BR /&gt;
06-28-2012 17:12:26.454 -0400 ERROR DispatchCommand - Failed to start the search process.&lt;BR /&gt;
06-28-2012 17:12:27.278 -0400 WARN  ProcessTracker - Failed to start process, PGID 4634; will try again&lt;BR /&gt;
06-28-2012 17:12:28.278 -0400 WARN  timeinvertedIndex - splunk-optimize is already running for this index /opt/splunk/var/lib/splunk/perfmon/db/hot_v1_12.&lt;BR /&gt;
06-28-2012 17:12:32.280 -0400 WARN  ProcessTracker - Failed to start process, PGID 4635; will try again&lt;BR /&gt;
06-28-2012 17:12:32.280 -0400 WARN  ProcessTracker - Failed to start process, PGID 4636; will try again&lt;BR /&gt;
06-28-2012 17:12:32.280 -0400 WARN  ProcessTracker - Failed to start process, PGID 4637; will try again&lt;BR /&gt;
06-28-2012 17:12:32.280 -0400 WARN  ProcessTracker - Failed to start process, PGID 4638; will try again&lt;BR /&gt;
06-28-2012 17:12:32.280 -0400 WARN  ProcessTracker - Failed to start process, PGID 4639; will try again&lt;BR /&gt;
06-28-2012 17:12:32.280 -0400 WARN  ProcessTracker - Failed to start process, PGID 4640; will try again&lt;BR /&gt;
06-28-2012 17:12:32.281 -0400 WARN  ProcessTracker - Failed to start process, PGID 4641; will try again&lt;BR /&gt;
06-28-2012 17:12:32.281 -0400 WARN  ProcessTracker - Failed to start process, PGID 4642; will try again&lt;BR /&gt;
06-28-2012 17:12:32.281 -0400 WARN  ProcessTracker - Failed to start process, PGID 4643; will try again&lt;BR /&gt;
06-28-2012 17:12:32.281 -0400 WARN  ProcessTracker - Failed to start process, PGID 4644; will try again&lt;BR /&gt;
06-28-2012 17:12:32.281 -0400 WARN  timeinvertedIndex - splunk-optimize is already running for this index /opt/splunk/var/lib/splunk/nagios/db/hot_v1_27.&lt;BR /&gt;
06-28-2012 17:12:37.278 -0400 WARN  ProcessTracker - Failed to start process, PGID 4645; will try again&lt;BR /&gt;
06-28-2012 17:12:41.288 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_TcpOutput"&lt;BR /&gt;
06-28-2012 17:12:41.288 -0400 INFO  ShutdownHandler - Component 'TcpOutputProcessor' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:41.288 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_UdpInput"&lt;BR /&gt;
06-28-2012 17:12:41.288 -0400 INFO  ShutdownHandler - Component 'UDPInputProcessor' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:41.288 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_FifoInput"&lt;BR /&gt;
06-28-2012 17:12:41.288 -0400 INFO  UDPInputProcessor - Unregistering metrics callback for: udpin_connections&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_WinEventLogInput"&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_Scheduler"&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  ShutdownHandler - Component 'SavedSearchScheduler' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  ShutdownHandler - Component 'DispatchReaper' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_Tailing"&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_SyslogOutput"&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  ShutdownHandler - Component 'SyslogOutputProcessor' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_HTTPOutput"&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  ShutdownHandler - Component 'HTTPOutputProcessor' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_TailingXP"&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  TailingProcessor - Received shutdown signal.&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  TailingProcessor - Will reconfigure input.&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  TailingProcessor -   Calling addFromAnywhere in TailWatcher=0x7fb26886ab40.&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  TailingProcessor -   Shutting down with TailingShutdownActor=0x878fef0 and TailWatcher=0x7fb26886ab40.&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  TailingProcessor -   Pausing BatchReader module...&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  BatchReader - State transitioning from 0 to 1 (pseudoPause).&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  BatchReader - State transitioning from 1 to 2 (pseudoPause).&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  TailingProcessor -   Removing TailWatcher from eventloop...&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  TailingProcessor -     ...removed.&lt;BR /&gt;
06-28-2012 17:12:41.289 -0400 INFO  TailingProcessor -   Eventloop terminated successfully.&lt;BR /&gt;
06-28-2012 17:12:41.292 -0400 INFO  TailingProcessor - Signalling shutdown complete.&lt;BR /&gt;
06-28-2012 17:12:41.292 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_BatchReader"&lt;BR /&gt;
06-28-2012 17:12:41.292 -0400 INFO  BatchReader - State transitioning from 2 to 3 (signalShutdown).&lt;BR /&gt;
06-28-2012 17:12:41.292 -0400 INFO  BatchReader - Terminating threadpool.&lt;BR /&gt;
06-28-2012 17:12:41.293 -0400 INFO  ShutdownHandler - Component 'BatchReader' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:41.293 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_Archive"&lt;BR /&gt;
06-28-2012 17:12:41.943 -0400 WARN  pipeline - Exiting pipeline udp gracefully: got eExit from processor udp&lt;BR /&gt;
06-28-2012 17:12:42.067 -0400 WARN  pipeline - Exiting pipeline fifo gracefully: got eExit from processor fifo&lt;BR /&gt;
06-28-2012 17:12:42.085 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_AuditTrailManager"&lt;BR /&gt;
06-28-2012 17:12:42.085 -0400 INFO  ShutdownHandler - Component 'AuditTrailManager' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.085 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_AuditTrailQueueServiceThread"&lt;BR /&gt;
06-28-2012 17:12:42.085 -0400 INFO  ShutdownHandler - Component 'QueueServiceThread' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.085 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_FSChangeMonitor"&lt;BR /&gt;
06-28-2012 17:12:42.085 -0400 WARN  pipeline - Exiting pipeline archivePipe gracefully: got eExit from processor archiveProcessor&lt;BR /&gt;
06-28-2012 17:12:42.106 -0400 INFO  TcpInputProc - Closing splunk 2 splunk IPv4 port x&lt;BR /&gt;
06-28-2012 17:12:42.106 -0400 INFO  TcpInputProc - Closing splunk 2 splunk IPv4 port x&lt;BR /&gt;
06-28-2012 17:12:42.106 -0400 INFO  TcpInputProc - Closing splunk 2 splunk IPv4 port x&lt;BR /&gt;
06-28-2012 17:12:42.106 -0400 INFO  TcpInputProc - Closing splunk 2 splunk IPv4 port x&lt;BR /&gt;
06-28-2012 17:12:42.106 -0400 INFO  TcpInputProc - Closing splunk 2 splunk IPv4 port x&lt;BR /&gt;
06-28-2012 17:12:42.106 -0400 INFO  TcpInputProc - Closing splunk 2 splunk IPv4 port x&lt;BR /&gt;
06-28-2012 17:12:42.106 -0400 ERROR TcpInputProc - Error encountered for connection from src=10.1.120.48:38420. Local side shutting down&lt;BR /&gt;
06-28-2012 17:12:42.713 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_FSChangeManagerProcessor"&lt;BR /&gt;
06-28-2012 17:12:42.714 -0400 INFO  ShutdownHandler - Component 'FSChangeManagerProcessor' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.714 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_Select"&lt;BR /&gt;
06-28-2012 17:12:42.714 -0400 INFO  ShutdownHandler - Component 'PipeFlusher' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.714 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_Database1"&lt;BR /&gt;
06-28-2012 17:12:42.714 -0400 INFO  IndexProcessor - request state change SHUTDOWN_SIGNALED to SHUTDOWN_IN_PROGRESS&lt;BR /&gt;
06-28-2012 17:12:42.714 -0400 INFO  PipeFlusher - Flushing pipelines...&lt;BR /&gt;
06-28-2012 17:12:42.714 -0400 INFO  PipeFlusher - Finished triggering pipeline flush.&lt;BR /&gt;
06-28-2012 17:12:42.714 -0400 INFO  IndexProcessor - shutting down: start&lt;BR /&gt;
06-28-2012 17:12:42.714 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: _audit&lt;BR /&gt;
06-28-2012 17:12:42.716 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: _internal&lt;BR /&gt;
06-28-2012 17:12:42.721 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: _thefishbucket&lt;BR /&gt;
06-28-2012 17:12:42.721 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: blackberry&lt;BR /&gt;
06-28-2012 17:12:42.722 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: email&lt;BR /&gt;
06-28-2012 17:12:42.722 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: history&lt;BR /&gt;
06-28-2012 17:12:42.722 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: main&lt;BR /&gt;
06-28-2012 17:12:42.729 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: modsecurity&lt;BR /&gt;
06-28-2012 17:12:42.729 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: msexchange&lt;BR /&gt;
06-28-2012 17:12:42.778 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: nagios&lt;BR /&gt;
06-28-2012 17:12:42.781 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: os&lt;BR /&gt;
06-28-2012 17:12:42.782 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: perfmon&lt;BR /&gt;
06-28-2012 17:12:42.785 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: rss&lt;BR /&gt;
06-28-2012 17:12:42.786 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: summary&lt;BR /&gt;
06-28-2012 17:12:42.786 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: summary_forwarders&lt;BR /&gt;
06-28-2012 17:12:42.788 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: summary_hosts&lt;BR /&gt;
06-28-2012 17:12:42.789 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: summary_indexers&lt;BR /&gt;
06-28-2012 17:12:42.790 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: summary_pools&lt;BR /&gt;
06-28-2012 17:12:42.790 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: summary_sources&lt;BR /&gt;
06-28-2012 17:12:42.793 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: summary_sourcetypes&lt;BR /&gt;
06-28-2012 17:12:42.796 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: syslog&lt;BR /&gt;
06-28-2012 17:12:42.799 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: test&lt;BR /&gt;
06-28-2012 17:12:42.799 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: windows&lt;BR /&gt;
06-28-2012 17:12:42.799 -0400 INFO  databasePartitionPolicy - Handling shutdown or signal in DatabasePartitionPolicy: _blocksignature&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  BucketMover - 0 jobs canceled from BucketMoverExecutor&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  IndexProcessor - request state change SHUTDOWN_IN_PROGRESS to SHUTDOWN_COMPLETE&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  IndexProcessor - shutting down: end&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_LoadLDAPUsers"&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'LoadLDAPUsersThread' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_MetricsManager"&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'MetricsManager' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_Pipeline"&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'Pipeline:dev-null' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'Pipeline:indexerPipe' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'Pipeline:exec' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'Pipeline:fschangemanager' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'Pipeline:tcp' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'Pipeline:merging' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'Pipeline:typing' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'Pipeline:stashparsing' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'Pipeline:parsing' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_Queue"&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'parsingQueue' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'parsingQueue' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'parsingQueue' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'parsingQueue' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'parsingQueue' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'indexQueue' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'parsingQueue' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'aggQueue' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'typingQueue' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - Component 'indexQueue' completed shutdown immediately&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 INFO  ShutdownHandler - shutting down level "ShutdownLevel_Exec"&lt;BR /&gt;
06-28-2012 17:12:42.805 -0400 WARN  pipeline - Exiting pipeline indexerPipe gracefully: got eExit from processor indexer&lt;BR /&gt;
06-28-2012 17:12:42.807 -0400 INFO  ShutdownHandler - Shutdown complete in 91.52 seconds&lt;BR /&gt;
06-28-2012 17:12:42.831 -0400 WARN  pipeline - Exiting pipeline exec gracefully: got eExit from processor exec&lt;BR /&gt;
06-28-2012 17:12:43.075 -0400 INFO  TcpInputProc - Finishing shutdown. Timed out in 90 seconds&lt;BR /&gt;
06-28-2012 17:12:43.075 -0400 WARN  pipeline - Exiting pipeline tcp gracefully: got eExit from processor tcp&lt;BR /&gt;
06-28-2012 17:12:43.526 -0400 WARN  pipeline - Exiting pipeline stashparsing forcefully&lt;BR /&gt;
06-28-2012 17:12:43.581 -0400 WARN  pipeline - Exiting pipeline dev-null forcefully&lt;BR /&gt;
06-28-2012 17:12:43.713 -0400 WARN  pipeline - Exiting pipeline fschangemanager gracefully: got eExit from processor fschangemanager&lt;BR /&gt;
06-28-2012 17:12:43.714 -0400 WARN  pipeline - Exiting pipeline parsing forcefully&lt;BR /&gt;
06-28-2012 17:12:43.714 -0400 WARN  pipeline - Exiting pipeline merging forcefully&lt;BR /&gt;
06-28-2012 17:12:43.714 -0400 WARN  pipeline - Exiting pipeline typing forcefully&lt;BR /&gt;
06-28-2012 17:12:43.714 -0400 WARN  loader - All pipelines finished.&lt;/P&gt;

&lt;H2&gt;06-28-2012 17:12:52.291 -0400 INFO  loader - Splunkd starting (build 123586).&lt;/H2&gt;

&lt;P&gt;EDIT # 2 &lt;/P&gt;

&lt;P&gt;Added the following to the inputs.conf:&lt;/P&gt;

&lt;P&gt;queueSize=50KB&lt;BR /&gt;
persistentQueueSize=400MB&lt;/P&gt;

&lt;P&gt;Doing this resulted in duplication: 100 events triggered, 169 indexed &lt;/P&gt;

&lt;P&gt;Screenshot:&lt;/P&gt;

&lt;P&gt;&lt;IMG src="http://i48.tinypic.com/35c1vh2.jpg" alt="alt text" /&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2020 12:00:18 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Indexer-missing-events-from-forwarder/m-p/86857#M18019</guid>
      <dc:creator>crob6281</dc:creator>
      <dc:date>2020-09-28T12:00:18Z</dc:date>
    </item>
    <item>
      <title>Re: Indexer missing events from forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Indexer-missing-events-from-forwarder/m-p/86858#M18020</link>
      <description>&lt;P&gt;Do you see corresponding events on the indexer that might give you clues about why it is refusing connections?&lt;/P&gt;</description>
      <pubDate>Thu, 28 Jun 2012 20:16:14 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Indexer-missing-events-from-forwarder/m-p/86858#M18020</guid>
      <dc:creator>Ayn</dc:creator>
      <dc:date>2012-06-28T20:16:14Z</dc:date>
    </item>
    <item>
      <title>Re: Indexer missing events from forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Indexer-missing-events-from-forwarder/m-p/86859#M18021</link>
      <description>&lt;P&gt;Umm, the portions of splunkd.log I was referring to was rather what happens BEFORE you issue a restart command - that is, in what is supposed to be normal operations, do you see errors on the indexer side with regards to TCP connections?&lt;/P&gt;</description>
      <pubDate>Fri, 29 Jun 2012 05:36:47 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Indexer-missing-events-from-forwarder/m-p/86859#M18021</guid>
      <dc:creator>Ayn</dc:creator>
      <dc:date>2012-06-29T05:36:47Z</dc:date>
    </item>
    <item>
      <title>Re: Indexer missing events from forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Indexer-missing-events-from-forwarder/m-p/86860#M18022</link>
      <description>&lt;P&gt;@Ayn - No, there are no TCP errors during normal operation.  I have not seen events lost during normal operation - only during restarts.&lt;/P&gt;</description>
      <pubDate>Fri, 29 Jun 2012 12:48:48 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Indexer-missing-events-from-forwarder/m-p/86860#M18022</guid>
      <dc:creator>crob6281</dc:creator>
      <dc:date>2012-06-29T12:48:48Z</dc:date>
    </item>
    <item>
      <title>Re: Indexer missing events from forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Indexer-missing-events-from-forwarder/m-p/86861#M18023</link>
      <description>&lt;P&gt;maybe use "persistentqueue"?&lt;/P&gt;

&lt;P&gt;&lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/Data/Usepersistentqueues"&gt;http://docs.splunk.com/Documentation/Splunk/latest/Data/Usepersistentqueues&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 29 Jun 2012 13:38:16 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Indexer-missing-events-from-forwarder/m-p/86861#M18023</guid>
      <dc:creator>jfraiberg</dc:creator>
      <dc:date>2012-06-29T13:38:16Z</dc:date>
    </item>
    <item>
      <title>Re: Indexer missing events from forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Indexer-missing-events-from-forwarder/m-p/86862#M18024</link>
      <description>&lt;P&gt;I get a weird duplication issue when I try that.  Triggered 100 events, see 169 on the indexer. (see screenshot above)&lt;/P&gt;</description>
      <pubDate>Fri, 29 Jun 2012 13:54:45 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Indexer-missing-events-from-forwarder/m-p/86862#M18024</guid>
      <dc:creator>crob6281</dc:creator>
      <dc:date>2012-06-29T13:54:45Z</dc:date>
    </item>
    <item>
      <title>Re: Indexer missing events from forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Indexer-missing-events-from-forwarder/m-p/86863#M18025</link>
      <description>&lt;P&gt;I believe that if you just use "useACK =true" in your forwarder's outputs.conf (no need to touch persistentQueueSize in your forwarder's inputs.conf), you would not suffer from any events lost even when restarting your indexers.&lt;/P&gt;

&lt;P&gt;When using that "useACK=true", your indexer get some duplicates events due to the momentary discontinuity between your forwarder and indexer, but this is far better than losing some events. And to deal with those duplicates, you could just use dedup to get the unique events out of it. This should just solve it.&lt;/P&gt;

&lt;P&gt;useACK=true guarantees full delivery, try that to see how it goes&lt;/P&gt;</description>
      <pubDate>Tue, 16 Oct 2012 07:29:00 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Indexer-missing-events-from-forwarder/m-p/86863#M18025</guid>
      <dc:creator>mic</dc:creator>
      <dc:date>2012-10-16T07:29:00Z</dc:date>
    </item>
    <item>
      <title>Re: Indexer missing events from forwarder</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Indexer-missing-events-from-forwarder/m-p/86864#M18026</link>
      <description>&lt;P&gt;I got an answer from Splunk support.  It turns out that duplication is &lt;STRONG&gt;expected&lt;/STRONG&gt; behavior.  &lt;/P&gt;

&lt;BLOCKQUOTE&gt;
&lt;P&gt;Chad,&lt;/P&gt;

&lt;P&gt;Unfortunately, there is no solution to&lt;BR /&gt;
the duplicated events.  With the&lt;BR /&gt;
"useACK" functionality, there's times&lt;BR /&gt;
where the forwarder doesn't get the&lt;BR /&gt;
ack message after the indexer has&lt;BR /&gt;
written the event to the log.  This is&lt;BR /&gt;
expected behavior.&lt;/P&gt;

&lt;P&gt;Brian&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;

&lt;P&gt;Here is the results of Splunk support's test:&lt;/P&gt;

&lt;BLOCKQUOTE&gt;
&lt;PRE&gt;&lt;CODE&gt;24460 Messages Sent
24444 Messages received by the forwarder (tcpdump)
25276 Messages received by the indexer
832 Duplicated messages
&lt;/CODE&gt;&lt;/PRE&gt;
&lt;/BLOCKQUOTE&gt;</description>
      <pubDate>Tue, 16 Oct 2012 12:13:11 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Indexer-missing-events-from-forwarder/m-p/86864#M18026</guid>
      <dc:creator>crob6281</dc:creator>
      <dc:date>2012-10-16T12:13:11Z</dc:date>
    </item>
  </channel>
</rss>

