<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to search which forwarder is sending the most data to an indexer? in Splunk Search</title>
    <link>https://community.splunk.com/t5/Splunk-Search/How-to-search-which-forwarder-is-sending-the-most-data-to-an/m-p/243852#M72576</link>
    <description>&lt;P&gt;You may also get what you need from something like this:&lt;BR /&gt;
    | metadata type=hosts index=&lt;YOURINDEXNAME&gt;&lt;/YOURINDEXNAME&gt;&lt;/P&gt;

&lt;P&gt;It will return a totalCount column per host. If your forwarders are the original source of your log events, the event count should accurately reflect what's coming from each forwarder.&lt;/P&gt;</description>
    <pubDate>Mon, 28 Sep 2015 17:15:45 GMT</pubDate>
    <dc:creator>s2_splunk</dc:creator>
    <dc:date>2015-09-28T17:15:45Z</dc:date>
    <item>
      <title>How to search which forwarder is sending the most data to an indexer?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/How-to-search-which-forwarder-is-sending-the-most-data-to-an/m-p/243849#M72573</link>
      <description>&lt;P&gt;How can I determine which forwarder is impacting the indexer the most?&lt;BR /&gt;
I have an index taking up 53 gigs of space with an event count of 296 million.&lt;BR /&gt;
There are multiple forwarders feeding into this index.&lt;BR /&gt;
The forwarders with the most events have directories that are less than 2 gigs in size.&lt;BR /&gt;
I am manually going server to server to try and determine 'what is using all the space?'&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2015 15:14:14 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/How-to-search-which-forwarder-is-sending-the-most-data-to-an/m-p/243849#M72573</guid>
      <dc:creator>MikeBertelsen</dc:creator>
      <dc:date>2015-09-28T15:14:14Z</dc:date>
    </item>
    <item>
      <title>Re: How to search which forwarder is sending the most data to an indexer?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/How-to-search-which-forwarder-is-sending-the-most-data-to-an/m-p/243850#M72574</link>
      <description>&lt;P&gt;Hi, the _internal index should have this information. You can try something like:&lt;BR /&gt;
&lt;CODE&gt;index=_internal metrics "group=tcpin_connections"&lt;/CODE&gt;&lt;/P&gt;

&lt;P&gt;"sourceHost" would be the forwarder... you extract per your requirement. Eg&lt;BR /&gt;
&lt;CODE&gt;index=_internal earliest=-15m metrics "group=tcpin_connections"|stats sum(tcp_Kprocessed) by sourceHost&lt;/CODE&gt;&lt;BR /&gt;
or tcp_eps (check out the &lt;A href="http://docs.splunk.com/Documentation/Splunk/6.2.0/Troubleshooting/Aboutmetricslog" target="_blank"&gt;docs&lt;/A&gt; for additional options)&lt;/P&gt;</description>
      <pubDate>Tue, 29 Sep 2020 07:22:58 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/How-to-search-which-forwarder-is-sending-the-most-data-to-an/m-p/243850#M72574</guid>
      <dc:creator>Yasaswy</dc:creator>
      <dc:date>2020-09-29T07:22:58Z</dc:date>
    </item>
    <item>
      <title>Re: How to search which forwarder is sending the most data to an indexer?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/How-to-search-which-forwarder-is-sending-the-most-data-to-an/m-p/243851#M72575</link>
      <description>&lt;P&gt;If you are using 6.2 or higher you can use DMC (Distributed Management Console).   Here is the raw search you may want&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;index=_internal host=lyn-del-spl-101 source="*metrics.log" sourcetype=splunkd  group=per_host_thruput | timechart per_second(kb) as per_second sum(kb) as kb by series useother=false limit=15
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Mon, 28 Sep 2015 17:12:30 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/How-to-search-which-forwarder-is-sending-the-most-data-to-an/m-p/243851#M72575</guid>
      <dc:creator>bmacias84</dc:creator>
      <dc:date>2015-09-28T17:12:30Z</dc:date>
    </item>
    <item>
      <title>Re: How to search which forwarder is sending the most data to an indexer?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/How-to-search-which-forwarder-is-sending-the-most-data-to-an/m-p/243852#M72576</link>
      <description>&lt;P&gt;You may also get what you need from something like this:&lt;BR /&gt;
    | metadata type=hosts index=&lt;YOURINDEXNAME&gt;&lt;/YOURINDEXNAME&gt;&lt;/P&gt;

&lt;P&gt;It will return a totalCount column per host. If your forwarders are the original source of your log events, the event count should accurately reflect what's coming from each forwarder.&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2015 17:15:45 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/How-to-search-which-forwarder-is-sending-the-most-data-to-an/m-p/243852#M72576</guid>
      <dc:creator>s2_splunk</dc:creator>
      <dc:date>2015-09-28T17:15:45Z</dc:date>
    </item>
    <item>
      <title>Re: How to search which forwarder is sending the most data to an indexer?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/How-to-search-which-forwarder-is-sending-the-most-data-to-an/m-p/243853#M72577</link>
      <description>&lt;P&gt;To clarify I'm interested in the data versus the events. Can a filepath containing 2 gigs spawn 10 gigs of disk space on an index?&lt;/P&gt;

&lt;P&gt;In this case 12 hosts are involved. Each apparently has files that total about 2 gigs. So at best I would expect that the index size on disk would be about 16 gigs on a daily basis. &lt;/P&gt;

&lt;P&gt;Putting it another way, SoS reports that this index is consuming on average 30 gigs of data per day. Where is the other 14 gigs coming from?&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2015 19:27:15 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/How-to-search-which-forwarder-is-sending-the-most-data-to-an/m-p/243853#M72577</guid>
      <dc:creator>MikeBertelsen</dc:creator>
      <dc:date>2015-09-28T19:27:15Z</dc:date>
    </item>
    <item>
      <title>Re: How to search which forwarder is sending the most data to an indexer?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/How-to-search-which-forwarder-is-sending-the-most-data-to-an/m-p/243854#M72578</link>
      <description>&lt;P&gt;Mike,&lt;BR /&gt;
it is technically possible, but seems a bit out of the ordinary. Raw data usually compresses very nicely on disk, we frequently see compression rates beyond 75%.&lt;BR /&gt;
The remaining disk space is used for the indexes and metadata that go along with the raw data. If you, for example, configured INDEXED_EXTRACTIONS = json/xml/etc. and you have very high cardinality in your source data, the size of the index files can quickly exceed the raw data size on disk.&lt;/P&gt;

&lt;P&gt;In other words: We need a bit more details on how you have your inputs configured for these 12 hosts.&lt;BR /&gt;
You can also look at the directory structure on the indexer to see if you have multiple large .tsidx files.&lt;/P&gt;

&lt;P&gt;Hope this helps.&lt;BR /&gt;
Stefan  &lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2015 19:35:17 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/How-to-search-which-forwarder-is-sending-the-most-data-to-an/m-p/243854#M72578</guid>
      <dc:creator>s2_splunk</dc:creator>
      <dc:date>2015-09-28T19:35:17Z</dc:date>
    </item>
    <item>
      <title>Re: How to search which forwarder is sending the most data to an indexer?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/How-to-search-which-forwarder-is-sending-the-most-data-to-an/m-p/243855#M72579</link>
      <description>&lt;P&gt;Turns out the individual that set up the monitoring on these twelve servers didn't exclude log rolling.&lt;BR /&gt;
I'm going to clean up the monitoring and if the issue persists i will seek assistance another day.&lt;BR /&gt;
Thanks for the answers and support.&lt;/P&gt;

&lt;P&gt;Mike&lt;/P&gt;</description>
      <pubDate>Tue, 29 Sep 2015 21:22:40 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/How-to-search-which-forwarder-is-sending-the-most-data-to-an/m-p/243855#M72579</guid>
      <dc:creator>MikeBertelsen</dc:creator>
      <dc:date>2015-09-29T21:22:40Z</dc:date>
    </item>
  </channel>
</rss>

