This should be fixed in Splunk Python SDK version 1.6.15 or later. Upgrading the SDK bundled with your app should be enough for all supported Splunk versions; there is no need to upgrade splunkd.

The problem was indeed related to the flush() method. The SDK's support for the chunked protocol appears to have been written assuming the protocol would support "partial chunks", allowing the response to one input message to be split across multiple output messages, with a partial: true flag indicating that the response continues in the next message. The code in the SDK that marked chunks as partial had been commented out, but the SDK still sent a partial chunk whenever a response reached maxresults rows (50,000 by default), just with no indication that it was to be interpreted as a partial response. This was a problem even for commands that simply returned the same number of rows they received, because the split happened whenever the limit was reached, even if it was never exceeded.

As a result, every time the script produced 50,000 records, the expected response was followed by an additional chunk, which, per the protocol, was taken as the response to the next request. (The protocol expects each request to have exactly one response.) Since the script produced these responses before reading the corresponding requests, it drifted further and further out of sync with the protocol: unread requests piled up in the stdin pipe and unmatched responses piled up in the stdout pipe, until both buffers were full and writes started to block or fail.

I considered adding a workaround to splunkd so that apps wouldn't need to update the SDK they use, but there was no reliable way to determine which commands needed the workaround, or which commands would be broken by it.

Anyway, if you're curious, the full fix (and a tiny bit of related cleanup) is in https://github.com/splunk/splunk-sdk-python/pull/301/files

Kudos to @kulick and @cpride_splunk for their early analysis of this bug!
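To make the desync concrete, here is a toy simulation of the mechanism described above. This is illustrative only, not the SDK's actual code; the names (buggy_responses, MAXRESULTS as a row count per chunk) are made up for the sketch. It models a protocol that pairs exactly one response chunk with each request, and a responder that emits an unflagged extra chunk every time a response reaches the limit:

```python
# Illustrative sketch (not the SDK's actual code) of why an unflagged
# extra chunk at the maxresults boundary desynchronizes a protocol that
# expects exactly one response per request.

from collections import deque

MAXRESULTS = 50_000  # SDK default row limit per output chunk

def buggy_responses(num_rows):
    """Yield response chunks (as row counts) for one request.

    The bug: when the row count reaches MAXRESULTS, the rows are flushed
    as one full chunk *and* a second chunk follows, with no partial flag
    to tell the reader that the two chunks form one logical response.
    """
    full_chunks, remainder = divmod(num_rows, MAXRESULTS)
    for _ in range(full_chunks):
        yield MAXRESULTS  # flush at the limit...
        yield 0           # ...followed by a spurious extra chunk (the bug)
    if remainder:
        yield remainder

requests = [10, 50_000, 100, 50_000]  # rows produced per request
pending = deque()                     # chunks buffered in the "stdout pipe"
answers = []
for rows in requests:
    pending.extend(buggy_responses(rows))
    answers.append(pending.popleft())  # reader pairs ONE chunk per request

print(answers)       # [10, 50000, 0, 100] -- third answer is the bogus chunk
print(len(pending))  # 2 chunks left over with no matching request
```

After the second request hits the limit, every later answer is off by one (the reader takes the spurious empty chunk as the answer to the third request), and leftover chunks accumulate in the output buffer, which is exactly how the real pipes filled up and writes began to block.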