<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Which method allows the best performance for Splunk to ingest custom scripts: monitor, SDK, or HTTP Event Collector? in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Which-method-allows-the-best-performance-for-Splunk-to-ingest/m-p/244024#M47224</link>
    <description>&lt;P&gt;We're having to write some custom scripts to read/tail binary data, format them into something Splunk-able (k1=v1 k2=v2 k3=v3), and get them into Splunk. This will be running on a machine that will have a UF....&lt;/P&gt;

&lt;P&gt;At this point, I see three options for the "get them into Splunk" end:&lt;/P&gt;

&lt;OL&gt;
&lt;LI&gt;write a scratch file into a directory that Splunk is monitoring&lt;/LI&gt;
&lt;LI&gt;write against one of the SDKs to push the events into Splunk&lt;/LI&gt;
&lt;LI&gt;use the HTTP Event Collector&lt;/LI&gt;
&lt;/OL&gt;

&lt;P&gt;I know that #1 performs well (and is easy to troubleshoot and test), but leaves me with a small scratch file management problem (which is very manageable). Since I am so lazy that I don't even want to solve that problem, I was wondering if anyone had any experience as to how well #2 and #3 hold up when looking at 7 million events/1.1 Gb a day...&lt;/P&gt;</description>
    <pubDate>Wed, 12 Oct 2016 15:51:07 GMT</pubDate>
    <dc:creator>wegscd</dc:creator>
    <dc:date>2016-10-12T15:51:07Z</dc:date>
    <item>
      <title>Which method allows the best performance for Splunk to ingest custom scripts: monitor, SDK, or HTTP Event Collector?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Which-method-allows-the-best-performance-for-Splunk-to-ingest/m-p/244024#M47224</link>
      <description>&lt;P&gt;We're having to write some custom scripts to read/tail binary data, format them into something Splunk-able (k1=v1 k2=v2 k3=v3), and get them into Splunk. This will be running on a machine that will have a UF....&lt;/P&gt;

&lt;P&gt;At this point, I see three options for the "get them into Splunk" end:&lt;/P&gt;

&lt;OL&gt;
&lt;LI&gt;write a scratch file into a directory that Splunk is monitoring&lt;/LI&gt;
&lt;LI&gt;write against one of the SDKs to push the events into Splunk&lt;/LI&gt;
&lt;LI&gt;use the HTTP Event Collector&lt;/LI&gt;
&lt;/OL&gt;

&lt;P&gt;I know that #1 performs well (and is easy to troubleshoot and test), but leaves me with a small scratch file management problem (which is very manageable). Since I am so lazy that I don't even want to solve that problem, I was wondering if anyone had any experience as to how well #2 and #3 hold up when looking at 7 million events/1.1 Gb a day...&lt;/P&gt;</description>
      <pubDate>Wed, 12 Oct 2016 15:51:07 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Which-method-allows-the-best-performance-for-Splunk-to-ingest/m-p/244024#M47224</guid>
      <dc:creator>wegscd</dc:creator>
      <dc:date>2016-10-12T15:51:07Z</dc:date>
    </item>
    <item>
      <title>Re: Which method allows the best performance for Splunk to ingest custom scripts: monitor, SDK, or HTTP Event Collector?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Which-method-allows-the-best-performance-for-Splunk-to-ingest/m-p/244025#M47225</link>
      <description>&lt;P&gt;I don't know your scripts but probably you could directly send script output to Splunk launching your script in inputs.conf&lt;BR /&gt;
[script://yourscript]&lt;BR /&gt;
....&lt;BR /&gt;
...&lt;/P&gt;

&lt;P&gt;In this way you redirect script output directly in Splunk.&lt;BR /&gt;
You have only to correctly set script permissions.&lt;BR /&gt;
This is better then files.&lt;BR /&gt;
Bye.&lt;BR /&gt;
Giuseppe &lt;/P&gt;</description>
      <pubDate>Wed, 12 Oct 2016 16:14:05 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Which-method-allows-the-best-performance-for-Splunk-to-ingest/m-p/244025#M47225</guid>
      <dc:creator>gcusello</dc:creator>
      <dc:date>2016-10-12T16:14:05Z</dc:date>
    </item>
    <item>
      <title>Re: Which method allows the best performance for Splunk to ingest custom scripts: monitor, SDK, or HTTP Event Collector?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Which-method-allows-the-best-performance-for-Splunk-to-ingest/m-p/244026#M47226</link>
      <description>&lt;P&gt;why is it better than files?&lt;/P&gt;</description>
      <pubDate>Wed, 12 Oct 2016 17:12:43 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Which-method-allows-the-best-performance-for-Splunk-to-ingest/m-p/244026#M47226</guid>
      <dc:creator>wegscd</dc:creator>
      <dc:date>2016-10-12T17:12:43Z</dc:date>
    </item>
    <item>
      <title>Re: Which method allows the best performance for Splunk to ingest custom scripts: monitor, SDK, or HTTP Event Collector?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Which-method-allows-the-best-performance-for-Splunk-to-ingest/m-p/244027#M47227</link>
      <description>&lt;P&gt;Because writing script output in a file and monitoring it requests more time to execute.&lt;BR /&gt;
Bye.&lt;BR /&gt;
Giuseppe &lt;/P&gt;</description>
      <pubDate>Thu, 13 Oct 2016 05:55:58 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Which-method-allows-the-best-performance-for-Splunk-to-ingest/m-p/244027#M47227</guid>
      <dc:creator>gcusello</dc:creator>
      <dc:date>2016-10-13T05:55:58Z</dc:date>
    </item>
    <item>
      <title>Re: Which method allows the best performance for Splunk to ingest custom scripts: monitor, SDK, or HTTP Event Collector?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Which-method-allows-the-best-performance-for-Splunk-to-ingest/m-p/244028#M47228</link>
      <description>&lt;P&gt;Any of those methods will work fine at the scale of GB/day.  Writing files at larger scales will run into normal universal forwarder issues such as ulimits, race conditions of reading large files before you log rotate them out of the UF observation etc. I am a fan of HTTP Event Collector (HEC) if you are already working in something like Python where your data is likely in a JSON payload format already.  I have a simple threaded python class for it already. There customers with HEC up in the TB/day.&lt;/P&gt;

&lt;P&gt;&lt;A href="http://blogs.splunk.com/2015/12/11/http-event-collect-a-python-class/"&gt;http://blogs.splunk.com/2015/12/11/http-event-collect-a-python-class/&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 16 Oct 2016 14:15:30 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Which-method-allows-the-best-performance-for-Splunk-to-ingest/m-p/244028#M47228</guid>
      <dc:creator>starcher</dc:creator>
      <dc:date>2016-10-16T14:15:30Z</dc:date>
    </item>
  </channel>
</rss>

