Wow, those are really big numbers! I would definitely not recommend setting them that high. It's hard to say exactly what the issue is without digging further into logs or pcaps (best to do that with your SE, if necessary), but here are a few possibilities:
You are running Stream as a modular input via Splunk. There is a fairly low performance ceiling with this architecture, which you would likely hit at 2 Gbps. It creates back-pressure in the event queue, which normally would just cause events to drop with errors. But with these settings you're going to blow out memory, slow everything down, and cause everything to start failing pretty quickly. The only way to make this work is to do more filtering/aggregation at the edge (so far fewer events go to Splunk) OR to use an independent agent configuration. We've tested the latter upwards of 10 Gbps, and it is absolutely a requirement for scaling Stream.
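If you do go the independent agent route, the forwarder is configured through streamfwd.conf instead of the modular input. A rough sketch of the relevant stanza is below; the parameter names are from memory, so verify them against the Stream forwarder docs for your version, and the host/token values are placeholders:

    [streamfwd]
    # send events straight to an indexer's HTTP Event Collector
    # (placeholder URI and token -- substitute your own)
    indexer.0.uri = https://your-indexer:8088
    httpEventCollectorToken = <your-hec-token>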
Your aggregator isn't sending all the packets necessary. For example, it may be dropping FIN packets, causing flows to never close out in reassembly. Lowering tcpConnectionTimeout (and the corresponding UDP timeout values) may help work around this. Unless you really can't tolerate premature termination of those flows, I'd recommend lowering them regardless; values as low as 10 are perfectly reasonable since this is an inactivity timeout. Another thing I've seen a lot is configs that only forward ingress packets, or only forward egress packets, so the flows sit indefinitely waiting for the other side of the "conversation" to arrive. This is easy to check/diagnose with something like tcpdump (see the example below).
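For the tcpdump check, the idea is to confirm you actually see both sides of the conversation, and FIN packets, on the interface the forwarder is capturing from. The interface, host, and port here are just examples:

    # do you see traffic in BOTH directions for a given host/port?
    tcpdump -nn -i eth0 -c 50 host 10.0.0.5 and port 443

    # are FIN packets making it to this box at all?
    tcpdump -nn -i eth0 -c 20 'tcp[tcpflags] & tcp-fin != 0'

If every packet has the same source (or you never see a FIN), the aggregator is only mirroring one direction or stripping teardown packets, and flows will pile up in reassembly exactly as described above.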
You are doing a lot of decryption, or something else that requires a lot of processing early in the pipeline, and need more processorThreads. Since you have 20, you could try increasing that value to see if things improve.
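Putting the timeout and thread suggestions together, the relevant streamfwd.conf settings would look roughly like this. The values are purely illustrative, and you should check the setting names and units against the streamfwd.conf spec for your Stream version:

    [streamfwd]
    # inactivity timeouts -- values as low as 10 are fine, per above
    tcpConnectionTimeout = 10
    udpConnectionTimeout = 10

    # bump this up from 20 if decryption / early-pipeline work is the bottleneck
    processorThreads = 24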