<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Splunk logging Driver Bringing Down the Entire Docker Swarm Cluster in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-logging-Driver-Bringing-Down-the-Entire-Docker-Swarm/m-p/356924#M65188</link>
    <description>&lt;P&gt;Do you run your Splunk Indexer at the same Docker Swarm from where you are sending logs? Possible you want to separate infra and prod clusters.&lt;/P&gt;

&lt;P&gt;It is unexpected that after Splunk Indexer restart you see crashes or hangs. This behavior is not expected and should be reported on docker repository &lt;A href="http://github.com/moby/moby"&gt;http://github.com/moby/moby&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;If you have only one Indexer - I would suggest you create a fleet of Splunk Heavy Weight Forwarders, see &lt;A href="http://dev.splunk.com/view/event-collector/SP-CAAAE73"&gt;http://dev.splunk.com/view/event-collector/SP-CAAAE73&lt;/A&gt;, that way when you will need to restart Splunk Cluster - you will be able to restart it one by one.&lt;/P&gt;

&lt;P&gt;If you don’t mind paid solutions, I can suggest to use our solution for Monitoring and Logs Forwarding &lt;A href="https://www.outcoldsolutions.com/"&gt;https://www.outcoldsolutions.com/&lt;/A&gt;, where we implemented logs forwarding on top of default JSON logging driver, so we have no affect on Docker Swarm. Plus to that you will get application monitoring. You can find how to install our solution here &lt;A href="https://www.outcoldsolutions.com/docs/monitoring-docker/"&gt;https://www.outcoldsolutions.com/docs/monitoring-docker/&lt;/A&gt;  you can try it for free, as our images have a built-in trial license.&lt;/P&gt;</description>
    <pubDate>Tue, 24 Apr 2018 01:15:18 GMT</pubDate>
    <dc:creator>outcoldman</dc:creator>
    <dc:date>2018-04-24T01:15:18Z</dc:date>
    <item>
      <title>Splunk logging Driver Bringing Down the Entire Docker Swarm Cluster</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-logging-Driver-Bringing-Down-the-Entire-Docker-Swarm/m-p/356923#M65187</link>
      <description>&lt;P&gt;Hello, &lt;/P&gt;

&lt;P&gt;We implemented collecting Docker logs using &lt;CODE&gt;splunk&lt;/CODE&gt; logging driver,  It pushes the docker logs very well and good. But we have a bigger problem now.&lt;/P&gt;

&lt;P&gt;Let's consider my Splunk-Indexor is down while spinning up docker containers, those containers will not be able to establish the connection with Splunk-indexor machine. Now that's going to crash entire docker engine on the system and you will not be able to execute any of the &lt;CODE&gt;docker&lt;/CODE&gt; commands in those machines, this will hang up the entire docker engine in the machine. To fix this I had to restart the VM, docker service restart is not helping.&lt;/P&gt;

&lt;P&gt;How can I mitigate this error? &lt;/P&gt;

&lt;P&gt;Is this the docker issue or the Splunk one? &lt;/P&gt;

&lt;P&gt;Here is the swarm-stack file I'm using &lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;version: '3'
services:
  worker:
    image: "${DOCKER_IMAGE_PATH}/worker:${RELEASE_TAG}"
    deploy:
      replicas: 3
    build:
      context: ../../
      dockerfile: ../Dockerfile-worker
    environment:
    ports:
      - "8083:3000"
    logging:
       driver: splunk
       options:
          splunk-url: "${SPLUNK_URL}"
          splunk-token: "${SPLUNK_TOKEN}"
          splunk-insecureskipverify: "true"
          tag: "{{.Name}}/{{.ID}}"
          labels: "NEurope"
          env: "${TARGET_NAME}"
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;If the Splunk driver works like this, then I need to rebuild/restart &lt;CODE&gt;Docker Containers&lt;/CODE&gt; each and every time if there is a &lt;CODE&gt;restart&lt;/CODE&gt; on the Splunk server(Indexor)&lt;/P&gt;

&lt;P&gt;Thanks,&lt;BR /&gt;
Kiran&lt;/P&gt;</description>
      <pubDate>Mon, 23 Apr 2018 23:20:48 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Splunk-logging-Driver-Bringing-Down-the-Entire-Docker-Swarm/m-p/356923#M65187</guid>
      <dc:creator>eygtmbot</dc:creator>
      <dc:date>2018-04-23T23:20:48Z</dc:date>
    </item>
    <item>
      <title>Re: Splunk logging Driver Bringing Down the Entire Docker Swarm Cluster</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-logging-Driver-Bringing-Down-the-Entire-Docker-Swarm/m-p/356924#M65188</link>
      <description>&lt;P&gt;Do you run your Splunk Indexer at the same Docker Swarm from where you are sending logs? Possible you want to separate infra and prod clusters.&lt;/P&gt;

&lt;P&gt;It is unexpected that after Splunk Indexer restart you see crashes or hangs. This behavior is not expected and should be reported on docker repository &lt;A href="http://github.com/moby/moby"&gt;http://github.com/moby/moby&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;If you have only one Indexer - I would suggest you create a fleet of Splunk Heavy Weight Forwarders, see &lt;A href="http://dev.splunk.com/view/event-collector/SP-CAAAE73"&gt;http://dev.splunk.com/view/event-collector/SP-CAAAE73&lt;/A&gt;, that way when you will need to restart Splunk Cluster - you will be able to restart it one by one.&lt;/P&gt;

&lt;P&gt;If you don’t mind paid solutions, I can suggest to use our solution for Monitoring and Logs Forwarding &lt;A href="https://www.outcoldsolutions.com/"&gt;https://www.outcoldsolutions.com/&lt;/A&gt;, where we implemented logs forwarding on top of default JSON logging driver, so we have no affect on Docker Swarm. Plus to that you will get application monitoring. You can find how to install our solution here &lt;A href="https://www.outcoldsolutions.com/docs/monitoring-docker/"&gt;https://www.outcoldsolutions.com/docs/monitoring-docker/&lt;/A&gt;  you can try it for free, as our images have a built-in trial license.&lt;/P&gt;</description>
      <pubDate>Tue, 24 Apr 2018 01:15:18 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Splunk-logging-Driver-Bringing-Down-the-Entire-Docker-Swarm/m-p/356924#M65188</guid>
      <dc:creator>outcoldman</dc:creator>
      <dc:date>2018-04-24T01:15:18Z</dc:date>
    </item>
    <item>
      <title>Re: Splunk logging Driver Bringing Down the Entire Docker Swarm Cluster</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-logging-Driver-Bringing-Down-the-Entire-Docker-Swarm/m-p/356925#M65189</link>
      <description>&lt;P&gt;Hello, &lt;BR /&gt;
No. I'm not running Splunk indexer machine on the swarm cluster, that is a stand-alone machine sitting outside the cluster. &lt;/P&gt;

&lt;P&gt;I believe this is happening because we have some timeouts on the Splunk-indexer machine. &lt;/P&gt;

&lt;P&gt;I noticed that I can see some timeout error on the docker engine logs, Is the docker is going to hang on each and every timeout? &lt;/P&gt;

&lt;P&gt;Even if you set up a cluster with multiple heavy forwarders, that is not going to help, Because you may have timeout because of the network. &lt;/P&gt;

&lt;P&gt;Please let me know if you have any thoughts...! &lt;/P&gt;

&lt;P&gt;We are already in a process to procure Splunk, at this moment we don't have direct support. &lt;/P&gt;

&lt;P&gt;Thanks,&lt;BR /&gt;
Kiran&lt;/P&gt;</description>
      <pubDate>Tue, 24 Apr 2018 22:19:26 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Splunk-logging-Driver-Bringing-Down-the-Entire-Docker-Swarm/m-p/356925#M65189</guid>
      <dc:creator>eygtmbot</dc:creator>
      <dc:date>2018-04-24T22:19:26Z</dc:date>
    </item>
    <item>
      <title>Re: Splunk logging Driver Bringing Down the Entire Docker Swarm Cluster</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-logging-Driver-Bringing-Down-the-Entire-Docker-Swarm/m-p/356926#M65190</link>
      <description>&lt;P&gt;Having multiple indexer will help with the indexer availability, but will not solve the networking problem. You can also have Heavy Weight Forwarders installed on the same node, so you will not have networking issues anymore. And that forwarders will send data to indexers, when they are available.&lt;/P&gt;

&lt;P&gt;The hang you are experiencing is unexpected, and I assume that it is possible that Splunk Logging Driver does not set the read timeout, and the connection is just getting disconnected from one end but does not close it on Splunk Logging Driver, so it indefinitely waits for a response. It does not seem like Splunk Logging Driver sets the ReadTimeout to the http.Client &lt;A href="https://github.com/moby/moby/blob/master/daemon/logger/splunk/splunk.go#L223"&gt;https://github.com/moby/moby/blob/master/daemon/logger/splunk/splunk.go#L223&lt;/A&gt;, so you can send a PR to add a timeout &lt;A href="https://golang.org/pkg/net/http/#Client"&gt;https://golang.org/pkg/net/http/#Client&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;That should solve this problem partially. &lt;/P&gt;

&lt;P&gt;But again, I will suggest you take a look on our solution, as our log forwarding does not depend on Splunk log driver, you will write the logs in JSON, our collector tails JSON logs and forwards them to Splunk. We have a free trial for 30 days. Give a try, send us an email to &lt;A href="mailto:sales@outcoldsolutions.com"&gt;sales@outcoldsolutions.com&lt;/A&gt; to learn more, we can schedule a call and discuss all the issues you experience. &lt;/P&gt;</description>
      <pubDate>Wed, 25 Apr 2018 02:19:03 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Splunk-logging-Driver-Bringing-Down-the-Entire-Docker-Swarm/m-p/356926#M65190</guid>
      <dc:creator>outcoldman</dc:creator>
      <dc:date>2018-04-25T02:19:03Z</dc:date>
    </item>
  </channel>
</rss>

