About burwell

burwell

Have a look at this solution: https://community.splunk.com/t5/Splunk-Search/Can-you-create-modify-a-lookup-file-via-REST-API/m-p/193699 @mthcht wrote a script that works. I modified it a little for my use, but it is basically the same solution and works on a single search head or on a SHC. The gist is that it reads the contents of the lookup file in Python, loops through the rows, and then uploads everything as one big string.

burwell

Hi. What version of Splunk are you running? I ran into a bad bug with both Splunk Enterprise 9.3.7 and 9.4.5: heavy forwarders sending to DNS load-balanced indexers get TCPOUT blocked. This bug does not appear on the known issues page despite my many attempts to get it added there. It does not happen with 9.2.4.

The Splunk JIRA that was opened is SPL-288904. The bug is said to be fixed in the upcoming releases 9.3.9, 9.4.7, 10.0.3, and 10.1. Hopefully soon.

A workaround is to set dnsResolutionInterval in outputs.conf. From the spec:

dnsResolutionInterval = <integer>
* The base time interval, in seconds, at which indexer Domain Name Server (DNS) names are resolved to IP addresses.
* This is used to compute runtime dnsResolutionInterval as follows: Runtime interval = 'dnsResolutionInterval' + (number of indexers in server settings - 1) * 30.
* The DNS resolution interval is extended by 30 seconds for each additional indexer in the server setting.
* Default: 300 seconds (5 minutes)

Splunk had recommended we set dnsResolutionInterval = 480 (tcpout was still blocked). I tried 1000 (also blocked). I have now set it to 10000 (i.e. 10,000) and after ~3 days this seems to be working.
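To make the runtime formula from the spec text concrete, here is a minimal Python sketch of the arithmetic. The function name and the example count of 3 indexers are my own assumptions for illustration; only the formula itself comes from the outputs.conf description quoted above.

```python
def runtime_dns_resolution_interval(base_seconds: int, num_indexers: int) -> int:
    """Runtime interval = base + (number of indexers in server setting - 1) * 30."""
    if num_indexers < 1:
        raise ValueError("num_indexers must be at least 1")
    return base_seconds + (num_indexers - 1) * 30


# Values tried in the post, with a hypothetical 3 entries in the server setting.
for base in (300, 480, 1000, 10000):
    print(base, runtime_dns_resolution_interval(base, num_indexers=3))
```

With a single DNS name in the server setting, the runtime interval is simply the configured base value.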

burwell · ‎10-26-2025

Looks like you can find the quarantined lookup files as described here: https://splunk.my.site.com/customer/s/article/Monitor-quarantined-lookup-files

burwell · ‎10-25-2025

Hi @yazeedallabadi2, did you see this? https://splunk.my.site.com/customer/s/article/HF-paused-data-flow-due-to-connection-timeout-but-didn-t-resume-data-transfer-after-the-connection-is-established I see that it says it affects UF as well as HF.

burwell · ‎07-06-2025

SPL-268481 is a bug we encountered in Enterprise 9.1; it is also in 9.2. We have a very large SHC with 6 indexer clusters and a total of > 1500 indexers across these 6 clusters.

The issue:
- we would add an indexer back to an indexer cluster (e.g. after it had hardware fixed)
- the indexer would join the cluster again
- the search heads would briefly REMOVE ALL/almost all indexers (not just the ones in the SAME indexer cluster as the indexer being added back)
- then each SHC head would add the indexers back
- most or all of the SHC heads would repeat this process, so over a period of many minutes you could have searches that were not searching all possible indexers

For each head, the time period where all indexers were removed was less than a minute, BUT it meant that searches would run and find NO indexers or fewer indexers to search.

The solution provided by Splunk that worked is to add a setting to distsearch.conf (btw, the setting is not documented and not in distsearch.conf.spec, so I am told you would get a btool warning):

[distributedSearch]
useIPAddrAsHost = false

I am sharing this solution in case you encounter the issue.

burwell · ‎04-23-2025

We are a big customer. We hit a big issue in upgrading from 9.1.7 to 9.2.4 and it took a long time for the issue to be resolved.

We have a large stack with many indexers. Our current operating system is RedHat 7; we are in the process of migrating to RedHat 8.

On upgrade from 9.1.7 to 9.2.4, the indexer cluster that ingests the most data suddenly had its aggregation and parsing queues filled at 100% during our peak logging hours. The indexers were not using much more CPU or memory; it's just that the queues were very full.

It turns out that Splunk enabled profiling starting in 9.2, specifically CPU time profiling. These settings are controlled in limits.conf: https://docs.splunk.com/Documentation/Splunk/9.2.4/Admin/Limitsconf. There are 6 new profiling metrics and these are all enabled by default. In addition, agg_cpu_profiling makes a lot of time-of-day calls. A lot.

There are several choices for clocksource in RedHat: https://docs.redhat.com/en/documentation/red_hat_enterprise_linux_for_real_time/7/html/reference_guide/chap-Timestamping#sect-Hardware_clocks It turns out that we had set our clock source to "hpet" some number of years ago. This clocksource, while high precision, is much slower than "tsc". Once we switched to tsc, the problem with our aggregation and parsing queues at 100% during peak hours was fixed.

Even if you don't have the clock source issue, the change in profiling is something to be aware of in the upgrade to 9.2.
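If you want to check which clocksource a Linux host is using before you upgrade, here is a minimal Python sketch. It assumes the standard sysfs location /sys/devices/system/clocksource/clocksource0; the function names are my own and this is just one way to do the check, not anything Splunk ships.

```python
from pathlib import Path

# Standard sysfs location on Linux; adjust if your platform differs.
CLOCKSOURCE_DIR = Path("/sys/devices/system/clocksource/clocksource0")


def read_clocksource(base_dir: Path = CLOCKSOURCE_DIR) -> dict:
    """Return the current clocksource and the list of available ones."""
    current = (base_dir / "current_clocksource").read_text().strip()
    available_file = base_dir / "available_clocksource"
    available = available_file.read_text().split() if available_file.exists() else []
    return {"current": current, "available": available}


def uses_hpet(info: dict) -> bool:
    """True when the host is on the slower hpet clocksource described in the post."""
    return info["current"] == "hpet"


if __name__ == "__main__" and (CLOCKSOURCE_DIR / "current_clocksource").exists():
    info = read_clocksource()
    print("current:", info["current"], "available:", info["available"])
    if uses_hpet(info):
        print("hpet in use; consider whether tsc is available and appropriate")
```

You could run something like this across your indexers to find any host still on hpet.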

burwell · ‎02-24-2025

Hi @jcorcorans, I haven't discovered any great way to parse the chef-client.log. A few things that can help:

1) Look for the log_level when it isn't INFO/WARN, e.g.

[2025-02-24T19:06:07+00:00] FATAL: Please provide the contents of the stacktrace.out file if you file a bug report

2) For log rotation, I see we have these directives in /etc/logrotate.d/chef-client:

"/var/log/chef/client.log" {
  weekly
  rotate 12
  compress
  postrotate
    systemctl reload chef-client.service >/dev/null || :
  endscript
}

3) If you have a number of servers, run chef a lot, and want to know when to truly spend time debugging (we find a chef operation can fail due to timeout or load), check over a time period and see if in the end things are running okay. So we have something like this: if the chef run is still not good after 3 times, then investigate.

index=your_index sourcetype=chef:client ("FATAL: Chef::Exceptions::ChildConvergeError:" OR "FATAL: Chef::Exceptions::ValidationFailed" OR "Chef run process exited unsuccessfully" OR "INFO: Chef Run complete" OR "INFO: Report handlers complete")
| eval chef_status=if(searchmatch("ERROR") OR searchmatch("FATAL"), "failed", "succeeded")
| stats count(eval(chef_status="failed")) AS num_failed, count(eval(chef_status="succeeded")) AS num_succeeded, latest(chef_status) AS latest_chef_status by host
| search num_failed > 3 AND latest_chef_status!="succeeded"

To monitor the logs, use a simple monitoring stanza in your inputs:

[monitor:///var/log/chef/client.log]
sourcetype=yourchefsourcetype
index=your_index
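Since the original question was about a Python logger script, here is a minimal sketch of point 1 in Python: pull the timestamp, level, and message out of a line and flag anything that isn't INFO/WARN. The regex assumes lines look like the FATAL example above; the function names are hypothetical.

```python
import re
from typing import Optional

# Assumes the "[timestamp] LEVEL: message" layout seen in the sample line.
LINE_RE = re.compile(
    r"^\[(?P<timestamp>[^\]]+)\]\s+(?P<level>[A-Z]+):\s+(?P<message>.*)$"
)


def parse_chef_line(line: str) -> Optional[dict]:
    """Return timestamp/level/message for a chef-client log line, or None."""
    match = LINE_RE.match(line.strip())
    return match.groupdict() if match else None


def needs_attention(line: str) -> bool:
    """True when the line parses and its level is something other than INFO/WARN."""
    parsed = parse_chef_line(line)
    return parsed is not None and parsed["level"] not in {"INFO", "WARN"}


sample = ("[2025-02-24T19:06:07+00:00] FATAL: Please provide the contents of "
          "the stacktrace.out file if you file a bug report")
print(parse_chef_line(sample)["level"], needs_attention(sample))
```

Continuation lines such as stack traces won't match the pattern and are returned as None, so you would need to decide how to attach them to the preceding event.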

burwell · ‎02-05-2025

Hi @secure, as @gcusello stated, you can have only one base search. What would it mean to have 2 in a panel? The base search just returns the results, so how could you use 2 together? I am not sure if this helps, but you can have a base search use another base search.

burwell · ‎11-01-2024

Here's some good information about the dispatch directory: https://docs.splunk.com/Documentation/Splunk/9.3.1/Search/Dispatchdirectoryandsearchartifacts Splunk normally does age things out, but read the doc above. Perhaps the disk is full for other reasons? See https://community.splunk.com/t5/Splunk-Search/Splunk-says-dispatch-directory-is-full-but-when-I-go-to-the/m-p/370243 One thing that can cause your dispatch directory to grow is adjusting the time to live (TTL) of jobs.

burwell · ‎08-12-2024

This answer https://community.splunk.com/t5/Dashboards-Visualizations/How-to-use-token-in-a-multi-select-form-input/m-p/480570 is close to what you want. You would end up with a token that expands to sourcetype=data1 OR sourcetype=data2 etc. You can initialize the default value with comma-separated values as shown in https://community.splunk.com/t5/Dashboards-Visualizations/choose-all-Multiselect-values-by-default-without-using/m-p/357860
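To show what the expanded token ends up looking like, here is a small Python sketch that turns a comma-separated default into the OR clause. This only mimics the string the dashboard builds; the function name is made up, and the field name and values are the placeholders from above.

```python
from typing import List


def build_or_clause(field: str, values: List[str]) -> str:
    """Join selected values into 'field=v1 OR field=v2 ...'."""
    return " OR ".join(f"{field}={value}" for value in values)


# Comma-separated default values, as in the second linked answer.
defaults = "data1,data2".split(",")
print(build_or_clause("sourcetype", defaults))
```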

burwell · ‎07-09-2024

Sounds like you might want to use two bin commands. First bin by time:

| bin _time span=1h

Then bin netPerf.netOriginLatency into 5 (?) bins, e.g.

| bin netPerf.netOriginLatency bins=5

See the bin command: https://docs.splunk.com/Documentation/Splunk/9.2.2/SearchReference/Bin

Finally, you could do a timechart with your bins (you will have to do your percentage etc. calculation).
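If it helps to picture what the time binning does, here is a rough Python sketch of the idea behind span=1h: each timestamp is snapped down to the start of its hour. This is only an illustration of the concept on epoch seconds, not Splunk's implementation, and the function name is my own.

```python
from datetime import datetime, timezone


def floor_to_span(epoch_seconds: float, span_seconds: int = 3600) -> int:
    """Snap an epoch timestamp down to the start of its span (default 1 hour)."""
    return int(epoch_seconds // span_seconds) * span_seconds


event_time = datetime(2024, 7, 9, 14, 37, 12, tzinfo=timezone.utc).timestamp()
bucket = floor_to_span(event_time)
print(datetime.fromtimestamp(bucket, tz=timezone.utc).isoformat())
```

All events that land in the same bucket value are then grouped together by whatever stats or timechart you run afterwards.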

burwell · ‎06-21-2024

What I tend to do to get all the results in email or Slack is to use stats, as described here: https://community.splunk.com/t5/Reporting/Using-result-fieldname-in-email-text-body-splunk-email-alert/m-p/399711

burwell · ‎05-31-2024

Splunk has finally added the issue to their known issues pages: https://docs.splunk.com/Documentation/Splunk/9.2.0/ReleaseNotes/KnownIssues and https://docs.splunk.com/Documentation/Splunk/9.2.1/ReleaseNotes/KnownIssues

burwell · ‎05-21-2024

If you always want to ignore the same hosts each time, you could create a lookup file with the names of the hosts and use a search as described in this post: https://community.splunk.com/t5/Splunk-Search/How-to-search-for-all-IP-s-not-in-a-lookup-table/m-p/371170

Something like:

index=myindex NOT [|inputlookup mylookup.csv | fields host]
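Conceptually, the subsearch turns the lookup rows into a list of host values and the NOT drops any event whose host is in that list. A minimal Python sketch of that same logic, assuming a CSV with a host column (the sample hosts and function names are made up):

```python
import csv
import io
from typing import Dict, List, Set


def load_excluded_hosts(csv_text: str) -> Set[str]:
    """Read the 'host' column of a lookup-style CSV into a set."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return {row["host"] for row in reader if row.get("host")}


def exclude_hosts(events: List[Dict[str, str]], excluded: Set[str]) -> List[Dict[str, str]]:
    """Keep only events whose host is not in the excluded set."""
    return [event for event in events if event.get("host") not in excluded]


lookup_csv = "host\nweb01\nweb02\n"
events = [{"host": "web01"}, {"host": "db01"}, {"host": "web02"}]
print(exclude_hosts(events, load_excluded_hosts(lookup_csv)))
```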

burwell · ‎05-21-2024

Hi, can you say a little more about the field values you are trying to achieve?

burwell · ‎05-21-2024

Hi. We have Splunk case 3421789 opened for this bug. For us it is installing from rpm that fails. It is not an option to install from tar.

burwell · ‎05-19-2024

So there's a bug with installing Splunk Enterprise 9.2.x and the universal forwarder on the same server, something that should work. I have opened a case with Splunk and asked them to document the issue in the known issues. They have not done that yet.

burwell · ‎05-03-2024

Thanks @hrawat. The logs are as expected then:

05-03-2024 17:46:52.999 +0000 WARN AutoLoadBalancedConnectionStrategy [24761 TcpOutEloop] - Current dest host connection 1.2.3.4:5678, oneTimeClient=0, _events.size()=993, _refCount=1, _waitingAckQ.size()=0, _supportsACK=0, _lastHBRecvTime=Fri May 3 17:46:48 2024 is using 475826 bytes. Total tcpout queue size is 512000. Warningcount=2001
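For anyone who wants to sanity check these warnings, here is a small Python sketch that pulls the two byte counts out of the message and works out how full the tcpout queue is. The regex is written against the wording of the line above; the function name is my own.

```python
import re
from typing import Optional, Tuple

# Matches the "is using N bytes. Total tcpout queue size is M." part of the warning.
USAGE_RE = re.compile(
    r"is using (?P<used>\d+) bytes\. Total tcpout queue size is (?P<total>\d+)\."
)


def queue_usage(line: str) -> Optional[Tuple[int, int, float]]:
    """Return (used_bytes, total_bytes, used/total) or None if the line doesn't match."""
    match = USAGE_RE.search(line)
    if not match:
        return None
    used, total = int(match.group("used")), int(match.group("total"))
    if total == 0:
        return None
    return used, total, used / total


warning = ("Current dest host connection 1.2.3.4:5678, oneTimeClient=0, "
           "_events.size()=993, _refCount=1, _waitingAckQ.size()=0, _supportsACK=0, "
           "_lastHBRecvTime=Fri May 3 17:46:48 2024 is using 475826 bytes. "
           "Total tcpout queue size is 512000. Warningcount=2001")
used, total, ratio = queue_usage(warning)
print(f"{used} of {total} bytes ({ratio:.1%})")
```

For the line above, the connection is using roughly 93% of the configured queue.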

burwell · ‎04-30-2024

Hi, in addition to using tokens as was suggested, you can template the body of the message by configuring alert_actions.conf. For example, you could have an alert_actions.conf (don't update the etc/system/default one) and add:

[email]
message.alert = Please fill this report in as follows\
Step 1\
Step 2\
\
And finally\
Step 3

I use the \ because you want the message.alert value to span multiple lines. That should get you something like the attached screenshot. More details here: https://docs.splunk.com/Documentation/Splunk/latest/Admin/Alertactionsconf

burwell · ‎04-29-2024

Hi. I would thoroughly read this doc: https://docs.splunk.com/Documentation/Splunk/9.1.4/Installation/AboutupgradingREADTHISFIRST I would make sure you migrate your kvstore before going to 9.x. If you aren't a member of Splunk's Slack usergroups instance, now is a good time to join. There's a Slack channel just for 9.x upgrade issues. Good luck!

burwell · ‎04-24-2024

Hi. We just upgraded from 9.0.6 to 9.1.4 and are seeing these same warnings. Do we know that this was fixed in 9.1.4?

burwell · ‎04-11-2024

Hi. Have a look at https://community.splunk.com/t5/Splunk-Search/Why-is-Lookup-definition-in-transforms-conf-not-returning/td-p/589334 Sounds like there's a lookup definition in a transforms.conf and the corresponding lookup file does not exist. You can use btool on the Splunk head to locate the setting. For example:

/opt/splunk/bin/splunk btool transforms list --debug | grep file

This shows you all the lookup file definitions.

burwell · ‎03-22-2024

Thanks @isoutamo

burwell · ‎03-18-2024

How about | eval MyNewField=Host+Domain

burwell · ‎03-11-2024

Hi. So you tried:

| makeresults ns=project*
| eval _raw="\"totalTimeTaken\":4"
| rex field=_raw "\"totalTimeTaken\":+(?<Response_Time>\d+)"
| stats avg(response_time)

And there are two problems.

1) The first makeresults: I don't know what the ns=project* is. Here's the reference: https://docs.splunk.com/Documentation/Splunk/latest/SearchReference/Makeresults

2) Your rex is extracting the value into the field Response_Time, but then you do stats avg on response_time in lowercase. Case matters in Splunk field names.

Here's what seems to work:

| makeresults count=1
| eval _raw="\"totalTimeTaken\":4"
| rex field=_raw "\"totalTimeTaken\":+(?<Response_Time>\d+)"
| stats avg(Response_Time)
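If you want to test the regex outside of Splunk, here is the same pattern as a minimal Python sketch. Note that Python's re module spells named groups (?P<name>...) rather than (?<name>...); everything else is the pattern from the search above. It also shows why the lowercase name finds nothing.

```python
import re

# Same pattern as the rex above, with Python's (?P<name>...) group syntax.
PATTERN = re.compile(r'"totalTimeTaken":+(?P<Response_Time>\d+)')


def extract_fields(raw: str) -> dict:
    """Return the named groups captured from raw, or an empty dict."""
    match = PATTERN.search(raw)
    return match.groupdict() if match else {}


fields = extract_fields('"totalTimeTaken":4')
print(fields.get("Response_Time"))   # the extracted value
print(fields.get("response_time"))   # lowercase name was never extracted
```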

Posts	466
Solutions	50
Karma Given	138
Karma Received	165
Member Since	‎05-01-2014
