About paimonsoror

paimonsoror · ‎06-26-2017

Thanks for the quick response. And after thinking about it, I agree that the extra Var isn't needed. Especially because that means now if i stand up a new indexer, i need to remember to add that var to the conf file. Regarding your second point, would there be a better alternative so that I can make sure that the indexer points to the right place for the network data when i start it back up, but before i push out a new bundle?

paimonsoror · ‎06-26-2017

Hi Folks; As our network indexes has grown rapidly over time, I am looking to preserve data and splunk performance, while making sure that we have the capacity to store the network data. In doing so, I have requested a second LUN for our network index. I have performed the following steps in my non-Prod environment, and it seems like everything was successful, but I do want to make sure that I didn't miss a step: Set maintenance mode on the cluster For each individual indexer Stop indexer edit etc/splunk-launch.conf to add a new 'SPLUNK_NETWORK_DB' variable edit etc/slave-apps/all_indexes/local/indexes.conf to update the network db/thaweddb/colddb reference to use new var mv var/lib/splunk/network/*db /opt/splunk_network_data start indexer disable maintenance mode update master index file deploy master index.conf to cluster to make sure all indexers are in sync

paimonsoror · ‎05-01-2017

Hi Folks; I came across this post on github https://github.com/kubernetes/kubernetes/issues/24677 and it had some fantastic options for pulling data from K8s/Docker into Splunk. It seems that the 'easy' approach here is to leverage the integration of K8S/Redhat with Fluentd, and then push the data into splunk. I was hoping to pick the brain of some of our Splunk experts to see if there is also a way to do a direct to splunk integration. Ideally, our goal is to make sure that the data that comes into splunk is 'containerized' so that it can easily be organized. I see the docker Splunk logging driver is available, but seems to be the less trusted approach since it doesn't integrate well with K8s.

paimonsoror · ‎02-20-2017

Interesting. We will give it a shot. Also testing the same summary indexing using 'collect' vs the summary indexing built into scheduled reporting to see if it makes any difference. (@cjmckenna)

paimonsoror · ‎02-20-2017

The interesting thing is that all other values are correct, the only value that is 'doubled' is the "Count" value that is inserted into the summary index. it is as if the "count" command has a glitch that is doubling the value before it inserts to the summary index. Problem is that it doesn't happen every time. And just to be sure, we checked the summary index, and for each of the hours, the data is coming from a single search head.

paimonsoror · ‎02-20-2017

Main Index index="app_silayer7" "~HTTP://" NOT "~ERR~" |dedup _raw|bucket _time span=5m |rex field=_raw "^[^~\n]~(?P\w+)" |rex field=_raw "^(?:[^~\n]~){2}(?P[^~]+)" |rex field=_raw "^(?:[^~\n]~){3}(?P[^~]+)" |rex field=_raw "^(?:[^~\n]~){4}(?P[^~]+)" |rex field=_raw "^(?:[^~\n]~){5}(?P[^~]+)" |rex field=_raw "^(?:[^~\n]~){6}(?P[^~]+)" |rex field=_raw "^(?:[^~\n]*~){7}(?P[^~]+)" |rex field=NameSpace "(?[^\/])(|\/)$" |eval Version="V".Version |stats count as Count by _time, Consumer, Domain, Service, Operation, Version Summary Index search index="summary_eu" | stats sum(Count) as Count_summary by _time, Consumer, Domain, Service, Operation, Version Here is how we can see that the counts are dupilcated

paimonsoror · ‎02-20-2017

Yep, so Detail count is from the actual source index, and SummCount is the counts within the summary index

paimonsoror · ‎02-17-2017

Hi @aaraneta , I would have to say that @bmacias84 answered the core question i had 🙂 Thanks!

paimonsoror · ‎02-17-2017

Perfect, thanks to you both for the advice!

paimonsoror · ‎02-17-2017

That makes sense, for sure. I guess the other question would be, does it make sense to have these 3 on a single piece of hardware? Like you said, our cluster master isn't doing any indexing of external data, and the license master and DMC are very quiet servers

paimonsoror · ‎02-17-2017

So are you saying, to clear out our summary index (no big deal), and force our query to produce results from 8am-9am each time? My assumption is that you are trying to remove time out of the equation and to see if the same exact result set experiences different behaviors over time? To rehash: The actual result set never has duplicates, but it seems like the path between 'i got my results' and 'im putting my results in the index', some duplication occurs Also FYI, different hours were 'duplicated' in the summary index yesterday, just incase we were thinking something strange was happening with time parsing.

paimonsoror · ‎02-17-2017

I am having a VERY strange problem with my summary indexing. I have the following search running every hour at 20 minutes past the however, doing a summary of -1h@h to @h index="app_silayer7" "*~HTTP://*" NOT "*~ERR~*" |dedup _raw|bucket _time span=5m |rex field=_raw "^[^~\n]*~(?P<Domain>\w+)" |rex field=_raw "^(?:[^~\n]*~){2}(?P<Service>[^~]+)" |rex field=_raw "^(?:[^~\n]*~){3}(?P<Operation>[^~]+)" |rex field=_raw "^(?:[^~\n]*~){4}(?P<NameSpace>[^~]+)" |rex field=_raw "^(?:[^~\n]*~){5}(?P<Consumer>[^~]+)" |rex field=_raw "^(?:[^~\n]*~){6}(?P<BackendTime>[^~]+)" |rex field=_raw "^(?:[^~\n]*~){7}(?P<TotalTime>[^~]+)" |rex field=NameSpace "(?<Version>[^\/])(|\/)$" |eval Version="V".Version |stats count as Count, avg(TotalTime) as TotalTime, min(TotalTime) as MinTime, max(TotalTime) as MaxTime, stdev(TotalTime) as STDDEV, perc95(TotalTime) as 95_Percentile ,by _time, Consumer, Domain, Service, Operation, Version Every once in a while however, I get some duplicate events. Some hours there are no duplicates, but some hours there are. The interesting thing is, on the hours that I see duplicates, I review the job results in the job inspector, and the results look clean! Has anyone run into this issue before? The raw events don't have any duplication in them, but it almost seems like when Splunk is stashing the results in my summary index, it hiccups and adds a few extra duplicate rows. For example, here are my results from the 9:00 hour: Service DetailCount SummCount delta Processor 36 72 36 Profile 55 110 55 ProfileAnd 185 370 185 It is exactly doubling the rows that were entered.

paimonsoror · ‎02-17-2017

All; When our Splunk environment was set up by a consultant, he had used 3 different servers to host the DMC (Distributed Management Console), LM (License Master), and the CM (Cluster Master). I am trying to consolidate our environment and be a 'good-neighbor' by releasing servers back to our pool that are unnecessary. After taking a look at a few implementations at other places, it seems that typically the DMC, LM, and CM are hosted on a single server. It seems to make sense as these three components each do not require very much compute power. I was wondering what the risk involved is in performing this consolidation? My current Cluster Master is a VM while the DMC and LM are cloud boxes, so ideally i would be moving everything to the CM server. I am not seeing much of a risk, and minus updating the cluster's license master server (we use an 'app' deployed to indexers and forwarders to configure the LM), this should be "easy". Am I missing something obvious?

paimonsoror · ‎02-13-2017

For those who have done some SNMP trap integrations with other monitoring tools, have you solved the issue of sending 'clear traps' when the condition is no longer met? For example, I have created a custom action that will send an SNMP trap to another one of my monitoring tools using trapgen. The integration works great, the only issue is that I need a way to send a clear trap for that alert when the condition is no longer met so that I dont have stale alarms in my monitoring tools. Wondering if anyone has done that yet?

paimonsoror · ‎02-13-2017

I am almost positive that we are on dedicated LUNs for our Splunk servers, but I will certainly validate. Also, the screenshot above is my production environment, which was not part of the outage that I mentioned in my first post. Sorry for the confusion.

paimonsoror · ‎02-13-2017

Not sure if this helps in telling anymore of the story, but our performance team came back with the following data showing 4 of my 5 production indexers;

paimonsoror · ‎02-11-2017

Couldn't add more attachments to my original post @ddrillic so hopefully this works: Test Environment (Using about 200GB of license / day) Prod Environment (Using about 1TB of license / day)

paimonsoror · ‎02-11-2017

Thanks for the quick response as always!! I have updated my original post with the query results for both my production environment and my test environment.

paimonsoror · ‎02-10-2017

Hi Folks; Wondering if someone could help me out here. I just had a big issue with Splunk. 3 of my Indexers just crashed for a bit (replication factor of 3). One of the services crashed with a bucket replication error (i fixed this), server 2 the service crashed and was simply restarted, server 3 completely got hung and required a reboot. After taking a quick peek, all of the stats are looking 'normal' including cpu/ physical/ storage, however, there was something that jumped out at me which was the iostats: Any particular reason this would start to happen? I just checked my forwarders and I dont see anything out of the ordinary with a large ramp in data ingestion I am working with my Linux team to restore one of my servers and they are stating that there was a "kernel level CPU soft lockup" Any Advice would be helpful in triaging this!

paimonsoror · ‎02-02-2017

Thanks for the response. What I am ideally trying to do is this: User creates an alert User decides "i want this alert to the enterprise command center" User uses my custom alert action called 'spectrum_alert' Our best practice is to have the user pick a meaninful title for the alert, and description The JSON payload is great, and it includes the title but it doesn't include the alert description. Ideally I would like to also send in the alert type Those two additional things from #4 are what I am looking to add to my payload

paimonsoror · ‎01-31-2017

Hi Folks; I was wondering how to add some of the details that a user has put in for defining an Alert into the payload that gets sent to my custom alert. For example: Here is a sample alert that I am using. I have a custom app on my search head, and within the local folder there is an alert_actions.conf defined like so: [spectrum_alert] disabled=0 payload_format=json is_custom=1 icon_path=alerticon.png label=Enterprise Alert description=Dispatch Alerts to Command Center For Escalation within my app, there is a bin directory with a python script called 'spectrum_alert.py'. It looks like when the alert is triggered, two things are passed in, one being the '--execute' command, and second is the json payload that is passed in. There are however a few things missing that I would like to have, like the 'description', and the 'event count' for example. How would one add that? I know that with the out of the box command you can add things like $counttype$ $relation$ $quantity$, but is that still possible here with a custom alert? If so, could someone guide me? Thanks!

paimonsoror · ‎01-25-2017

@woodcock To my rescue again, thanks!

paimonsoror · ‎01-25-2017

I was wondering how enterprises were handling this situation. I know within my organization, the /var/log subdirectories, especially teh system ones are locked down to root access only. How is your enterprising handling the Splunking of these logs. We run Splunk with a service account that we deploy across the Infrastructure that has a non-expiry password, but doesn't have sudo rights. I have spent some time undoing a lot of installations that were accidentally done by root because I know it is against best practices. So I was looking for some advice on how to best handle this one 🙂

paimonsoror · ‎01-24-2017

YOU ROCK!!

paimonsoror · ‎01-24-2017

Wasn't able to find a solid answer on this one, but I am using Splunk 6.x, and was wondering if I could have a sourcetype, that essentially "inherits" another sourcetype. For example [monitor:///var/log/httpd/access.log] index = app_cp sourcetype = cp:httpd:access #souretype = access_combined ignoreOlderThan = 1d Ideally I would like the team to be able to leverage a sourcetype called cp:httpd:access so that they only get the access logs that pertain to their particular logs files, but i also want it to inherit the extractions defined by access_combined . So essentially, can cp:httpd:access inherit from access_combined?

Posts	171
Solutions	4
Karma Given	31
Karma Received	27
Member Since	‎10-03-2016

Online Status	Offline
Date Last Visited	‎06-05-2020 02:04 AM

Datamodel To Accelerate Billing Data: Help With Be...

Is there a way to hide trial license warnings from...

Stats sum command experiencing strange behavior af...

[Bug Report - Reported To Devs] If You Are Having ...

Would any of our AWS experts be able to assist wit...

AWS Addon Not Showing Personal Health

App for Infrastructure AWS Entities Disappeared

Is there a way to limit data acceleration time opt...

Is anyone having issues with base searches in 7.2?

Please help me identify why Splunk is omitting ext...

Re: Tested with success, but looking for validatio...

Tested with success, but looking for validation to...

How can we log and containerize the logs using Kub...

Re: Why I am seeing data duplication in my summary...

Re: Why I am seeing data duplication in my summary...

Re: Why I am seeing data duplication in my summary...

Re: Why I am seeing data duplication in my summary...

Re: Currently my DMC, License Master, and Cluster ...

Re: Currently my DMC, License Master, and Cluster ...

Re: Currently my DMC, License Master, and Cluster ...

Re: Why I am seeing data duplication in my summary...

Why I am seeing data duplication in my summary ind...

Currently my DMC, License Master, and Cluster Mast...

Has anyone built a SNMP trap and clear-trap integr...

Re: Why did my indexers have a large spike in io?

Re: Why did my indexers have a large spike in io?

Re: Why did my indexers have a large spike in io?

Re: Why did my indexers have a large spike in io?

Why did my indexers have a large spike in io?

Re: How can I get some additional alert details in...

How can I get some additional alert details into m...

Re: How do you handle Splunking Linux logs through...

How do you handle Splunking Linux logs through the...

Re: Can a sourcetype be aliased, meaning, can it i...

Can a sourcetype be aliased, meaning, can it inher...

Join the Conversation