thomastaylor,
Let's break this down a bit...
HTTP Event Collector:
A Heavy Forwarder is a great option here. You can manage the token and receive HEC inputs on the HWF without needing the main Splunk install to do anything. As the data is JSON, you'll also get your field extracts "for free" from autokv.
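To make that concrete, here's a minimal sketch of what the HEC input could look like in inputs.conf on the HWF. The stanza name, token value, index, and port are all placeholders — swap in your own:

```ini
# inputs.conf on the Heavy Forwarder
# Enable the HTTP Event Collector endpoint (default port 8088)
[http]
disabled = 0
port = 8088

# One stanza per token; the GUID below is a placeholder
[http://my_app_token]
token = 11111111-2222-3333-4444-555555555555
sourcetype = _json
index = main
```

Clients then POST JSON to https://your-hwf:8088/services/collector with an "Authorization: Splunk &lt;token&gt;" header, and the HWF forwards the events on to the indexers.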
Transforming data:
Yes you can use a Heavy Forwarder for this. I must caution that there are a number of pitfalls that come with using a HWF to "pre-parse" data before it hits the indexers.
Cooked data is larger on the network than uncooked data: https://www.splunk.com/blog/2016/12/12/universal-or-heavy-that-is-the-question.html - Some have theorized that unless you're doing a massive amount of index-time operations, the CPU load on the indexers is actually higher too (still an open argument in the community, so take this with a grain of salt).
Heavy Forwarders tend to cause data imbalance on Indexers (they get "sticky" to whichever indexer they're sending to, because there's no break in the incoming traffic to trigger a switch - a common problem for syslog boxes that use an HWF).
The Indexers are not given a second chance to parse the data - this means if your main Splunk install needs to do sourcetyping, index renaming, or host renaming, it will be unable to (well, there are some special tricks to cheat here, but it's a bad idea).
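To illustrate that "parsed once" point: index-time props/transforms run wherever the data is first cooked. If the HWF applies a stanza like the sketch below (sourcetype and stanza names are placeholders), the indexers will never re-run their own parsing rules against that data:

```ini
# props.conf on the HWF - index-time transforms fire here, and ONLY here
[my_sourcetype]
TRANSFORMS-set_host = set_host_from_event

# transforms.conf on the HWF
# Rewrites the host metadata field from a value inside the raw event
[set_host_from_event]
REGEX = host=(\S+)
DEST_KEY = MetaData:Host
FORMAT = host::$1
```

If the same stanzas exist on the indexers but not the HWF, they'll silently do nothing for HWF-cooked data - which is exactly the "no second chance" trap.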
Creating field extracts:
You are unable to create "search time" field extracts with a Heavy Forwarder. The vast majority of TAs you'll find on Splunkbase are search time. Additionally, creating "index time" field extracts comes with a whole list of caveats (NOTE THE CAUTION WARNING: http://docs.splunk.com/Documentation/Splunk/7.1.1/Data/Configureindex-timefieldextraction). While possible, you're opening yourself up to a massive list of potential issues. To name a few:
Greater storage requirements (index time fields are stored in the TSIDX files, uncompressed)
Lack of flexibility (Once a field is written, it's "burnt" into the index)
Potentially extreme CPU overhead at the HWF level
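For anyone who decides to accept those caveats anyway, an index-time extraction on the HWF looks roughly like this (field name, regex, and stanza names are placeholders, per the docs page linked above):

```ini
# transforms.conf on the HWF
# WRITE_META = true writes the field into the TSIDX files at index time
[extract_txn_id]
REGEX = txn_id=(\w+)
FORMAT = txn_id::$1
WRITE_META = true

# props.conf on the HWF
[my_sourcetype]
TRANSFORMS-txn = extract_txn_id

# fields.conf on the search head - tells search the field is indexed
[txn_id]
INDEXED = true
```

Note the fields.conf piece: without it, searches against the indexed field can return wrong results - one more moving part that search-time extractions simply don't have.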
Also, no, the HWF will not let you use the field extraction (regex) tool - that's for search-time field extracts. You'd need a dev search head / indexer to build them on, then lift the extracts AND convert them to index time. DO NOT RECOMMEND.
TLDR:
For HEC, I think it's a great use case for you. For everything else, I'd advise against it. I'd recommend attempting to fix the relationship with whoever owns your Splunk install. You're setting your team and the Splunk owners up for potential issues down the road (and a bunch of up-front work for yourself, as nothing on Splunkbase will be plug and play).