About jrodman

jrodman · ‎03-23-2010

There's not any exposed controls to filter the so-called bulletin board messages. I think internally some messages are filtered, but it's not important because we obviously haven't filtered everything you would tend to want. There's an ongoing conversation internally around improving both the bulletin board and license messaging. I will point people to this answers link, but explicit bugs/ERs around this topic are a good input.

jrodman · ‎03-23-2010

Rough first take: In the <indexname>/db directory, delete the file .bucketmanifest In the <indexname>/db directory, create the file (0 bytes works) meta.dirty If we get into goat sacrifice territory, try also deleting .metamanifest. Step 2 should render that unnecessary. These files and their associated data should get rebuilt on need by search activity.

jrodman · ‎03-23-2010

Mostly, search-time fields have superior performance to parse-time (indexed) fields, regardless of whether they are explicitly configured. When running a search that includes a term such as fieldname=value , Splunk will treat this as a search-time field by default, unless fieldname is explicitly configured as an indexed field in fields.conf. This is true both for configured fields (delimiters, regular expressions) as well as for automatically identified fields where, eg you have fieldname:value in the text of your event. We call this automatic handling code auto-kv for automatic key-value extraction. The Splunk search machinery presumes that value will be present in the events as an indexed string, and will apply the same mechanics to filter the events as if you entered the string directly without the fieldname or equals sign. For most patterns, this offers all the performance advantage of a parse-time field, and none of the penalty. The tradeoffs are discussed in more detail in "About indexed field extraction" in the Getting Data In Manual. In all cases, the post-filtering is applied to the (hopefully) small set of events that actually contain the value string, by applying any extraction mechanisms, then testing to see if the field has been created containing the desired value. Ideally the index-based filtering is the most important factor in the speed of your search, but there are cases when search-time extraction must be applied to a large percentage of events. For example if almost all of your events have the word xml but only a small portion have this value in the storage_format field, the speed of extraction becomes important. Delim-based extractions are quite fast. Auto-kv are quite fast. Regex-based extractions are slower. Sourcetypes with a very large number of regexes or very inefficient regexes can be slower still.

jrodman · ‎03-23-2010

Are search-time fields slow? Can I rely on them to efficiently sort through my data? Are there significant differences in searching on automatically created fields from the text of my events, vs fields that I configure in manually? Are some types of extractions faster than others?

jrodman · ‎03-23-2010

The behavior for csv is pretty much identical to iis with the same cause. It's how our AutoHeader / CHECK_FOR_HEADER logic works. Again, you can mitigate with rename=original_souretype in the autogenerated sourcetypes.

jrodman · ‎03-23-2010

I was positing that the search was created in manager. IF that is not the case, it is not the cause of the problem.

jrodman · ‎03-15-2010

I don't think the requisite flag, current_only, is exposed in the WMI interface. You will need to do one of: Manually (possibly by script) tweak the inputs.conf post-install, but this will not prevent splunk from starting to pull the eventlog when it's first started. Since you are setting LAUNCHSPLUNK=0, this should be achivable before the first start. Alternatively, leave all the inputs disabled, but configure your hosts as deployment clients. Then you can deliver an inputs.conf tailored to your needs via that method. Unfortunately deployment client configuration isn't triggerable via the MSI commandline.

jrodman · ‎03-15-2010

This happens when an alert is created in manager, instead of in the search app. We neglect to create the viewstate ID (vsid) that the UI wants to figure out how to show the search. As a result, the UI fails saying "hey, I don't know how to show you this search" in its own inscrutible way. Workaround, create a vsid for the search (someone will have to provide some more detail here.. I don't know the steps). Solution, use Splunk 4.0.10 or later which doesn't muck this up.

jrodman · ‎03-14-2010

Here's a starting point: http://www.splunk.com/wiki/Community:TroubleshootingForwarding

jrodman · ‎03-13-2010

On a reasonable OS and filesystem, I think you can get pretty reasoanble behavior for a file as well with small (under 4k) writes where you set the file to append mode, unbuffered and flush after each write. And your apps should really be doing that with logfiles anyway, if you want to find out when they crash what happened. So I'm not sure the big deal with FIFOs is atomicity, though if you're sure of the behavior, go for it.. that sort of thing is pretty well outside the Splunk boundary. Where I've foud FIFOs useful is when writing auomated inputs, like scripts. The FIFO acts as a flow control for your program, which lets you have a pretty good idea of when and how fast that data is getting into Splunk. It also allows you to be pretty lazy in your script authoring without much of a problem. The most obvious downside is crashes. If the system crashes, or if splunk crashes, some data will be lost. Splunk can't put the data on disk from the FIFO before it reads it, and the OS isn't going to provide a backing disk store. Even if the source app has the data, there's no generic protocol for the program and splunk to renegotiate the position. The second problem is debuggability. If something's going fishy with your datastream, the FIFO offers no clues. It's hidden from view.

jrodman · ‎03-13-2010

In 3.x it was every few minutes. I'll have to do some code Splunking to find the current information.

jrodman · ‎03-13-2010

A LWF doesn't have an _internal index, gk. You could probably delete various binaries that are used for testing purposes and so on, but who knows when you'll want them again, and they'll come back on upgrade. You could erase the code that creates the webui, and so on... But all of that is just going to make a huge unmaintainable mess. Disk isn't that expensive. Tell engineering what you want to see in a future truly tiny forwarder.

jrodman · ‎03-13-2010

Splunk index bucket management is usually not something you want to poke at manually. The hot-to-warm case is somewhat interesting because it comes up for backup purposes, and can be useful to force bucket sizing in time or in space on your own terms instead of with Splunk's pre-packaged logic. The warm-to-cold case is only interesting when dealing with multiple datastores (multiple filesystems). However, this does become a point of interest when first setting splunk up, in order to validate behavior and operation. There's no easy way to force it, so the general method is to simply constrict the allowed number of warm buckets to force some to reach cold. In indexes.conf (generally set up in etc/system/local/indexes.conf) you can set the maxWarmDBCount on a index-by-index basis. maxWarmDBCount = <integer> * The maximum number of warm DB_N_N_N directories. * All warm DBs are in the <homePath> for the index. * Warm DBs are kept in open state. * Defaults to 300. This means you can temporarily configure your main index (say in the initial setup case) or you could configure a test index to try things with.

jrodman · ‎03-13-2010

I'm not aware of any reason to use multiple receiving ports for splunk-forwarded data. It's possible that at very large numbers of forwarders (say several thousand and up) there may be scalability issues that are mitigated with multiple receiving ports. Historically we have identified some problems of this shape, but the known issues have been addressed by design. When sending non-splunk data, such as syslog, into a UDP or TCP port, there are convenient reasons to use multiple inputs, because Splunk can apply default to the data at the input level, such as host, sourcetype, and so on. However, this should not be necessary and is not really possible in the forwarded case; the forwarder has already labelled the data. If you wanted to have both SSL and non-SSL forwarders connecting, that would require two ports.

jrodman · ‎03-13-2010

The "simialr file to this that was longer" message typically means you've got files with headers, so more than one looks the same, and when the log rolls, splunk is a bit unsure what the story is. The "hit EOF while computing CRC: 0/256" means that, as you say, we failed to grab a CRC from the file, which could be because it's too short (under 256 bytes), or sometimes it can indicate weird read errors that splunk didn't expect. If you're seeing these within milliseconds of each other, you may want to take a closer look at what's going on with the files before taking action.

jrodman · ‎03-12-2010

Optionally, both output groups can be autoLB groups, though obviously your usecase was the above.

jrodman · ‎03-11-2010

I think there's been some optimization to the merged_lexicon files. They're currently under 5% for me.

jrodman · ‎03-11-2010

Incidentally, a variety of things were not replicated to the search nodes correctly in versions of 4.0.x, for example lookup scripts didn't make it across until 4.0.7 or so. Still wish i knew what happens in case of conflict. Search head says the transform uses REGEX1, the indexer says it uses REGEX2....

jrodman · ‎03-11-2010

Heehee sending splunk a SIGHUP causes it to shut down. So you probably don't want to do this.

jrodman · ‎03-11-2010

We store the timestamp values in UTC always, and just display them in the localtime where the server is configured, so changing the timezone of splunk should not damage the accuracy of the timestamps in any way. What might be trouble is if the incoming log datastream does not declare its timezone, and thus the newly arriving data is interpreted differently from the historical data (one of them is probably wrong!)

jrodman · ‎03-11-2010

A pause in metrics.log while splunkd.log continues is symptomatic of a bug fixed around 4.0.5 or 4.0.6 timeframe. I do not have access to the defect system right now, but the version of splunk you are running is probably informative. Generally speaking, metrics.log should always continue chatting when splunkd is up, provided that there is disk space available and suchlike.

jrodman · ‎03-11-2010

It's certainly true that if splunk encounters 'someapp.log' without configuration, likely to create a new sourcetype called 'someapp'. Later, as the file rolls, splunk may not be able to correctly guess that the new rolled files are the same, and create a new sourcetype 'someapp-1', and then 2 and so on, as you say. However, IIS gets these sourcetypes for another reason. IIS is a sourcetype with positional field names in a header at the top of the file. However, since each file lists the fields present, Splunk assumes that not all files of this type will necessarily have the same list of fields. Therefore a new sourcetype is generated whenever the list of fields must be stored, and the list is inserted into a field extraction configuration for each sourcetype in turn. This works fine in a simple splunk environment, although it does look a bit confusing. However, because it creates configuration at index time intended to be used at search time it can break in distributed search environments, or in situations where data is forwarded after it is parsed. Incidentally we have a proposal to make searches work for all 'sourcetype=iis' which is to add the configuration 'rename=iis' to each of the autogenerated sourcetypes. This can be done manually for now, but I hope this starts happening automatically in a release in the near future.

jrodman · ‎03-11-2010

What always springs to my mind for this kind of goal is: run a search that gives the list of hosts sending syslog run a search that gives the list of hosts sendind splunkd compare the two lists 3 is a bit clumsy. You can do it with the set command, but it is the clumsy part. The Search & Indexing team is much more fond of a declarative sql-like style, and may have a more clever variation. There's always the simplistic approach: For the last 24 hours: sourcetype=splunkd OR sourcetype=syslog | dedup host, sourcetype Then review the data manually If you wanted to get very fancy you could filter with something like: sourcetype=splunkd OR sourcetype=syslog | dedup host, sourcetype | transaction host | search linecount=2

jrodman · ‎03-11-2010

4.0 doesn't have terribly good log events for alerting. You can see that the search was run, but not that it was run by the scheduler, so you cannot differentiate between manually-initiated and schedule-initiated searches. You can see the python event if the search eventually fires the email sending command sendemail.py, but that only will catch searches whose conditions were met, and which were configured to send email. In 4.1, all scheduled searches are explicitly logged, as well as the result (conditions met / not met). If a search would have run but was not for some reason, this is also logged. There are some built-in status views that try to give useful reporting on this data, but you can build your own slicings of it.

jrodman · ‎03-11-2010

If I have more than one splunk user interface that users log into, either for regional goals, or for load balancing, how do I ensure that the configuration data created by users in the interface is available on all my nodes?

Posts	949
Solutions	172
Karma Given	397
Karma Received	987
Member Since	‎01-15-2010

Online Status	Offline
Date Last Visited	‎06-05-2020 02:02 AM

Why is copy-truncate a low-quality log-rotation st...

In LDAP integration for user authentication, what ...

Can I limit the total memory used by Splunk on my ...

After upgrading to Splunk 6.1, I have searches ret...

What is a splunk search in "zombie" state? What d...

How can I run a windowed realtime seach from the c...

Changes to search configuration (field extractions...

I've updated to the latest version of the PDF Serv...

Why doesn't the upload image feature of answers wo...

How can I install a splunk 4.2+ license from the c...

Re: How can I supress certain Splunk web error/war...

Re: How can you add/move a bucket without restarti...

Re: Do search-time fields have performance conside...

Do search-time fields have performance considerati...

Re: Why do variations in sourcetype appear?

Re: Error in retriving job result

Re: start tail monitoring windows event log upon S...

Re: Error in retriving job result

Re: I've set up a forwarder but I'm not receving a...

Re: When should I use a FIFO for an input?

Re: How often does Splunk check to apply retention...

Re: Any way to reduce the storage footprint of a W...

Re: How can I trigger migration of buckets from Wa...

Re: What are the reasons to use multiple receiving...

Re: I'm getting an error for some files I am monit...

Re: Is it possible to configure cloning AND autoLB...

Re: What are the merged_lexicon.lex files in my bu...

Re: In a distributed search environment, where do ...

Re: Splunk configuration changes - SIGHUP or resta...

Re: Our server's configured timezone offset (TZ) h...

Re: Why is there a gap in my metrics.log?

Re: Why do variations in sourcetype appear?

Re: Search to find hosts sending syslog AND splunk...

Re: How to search recent alerts fired by Splunk?

How do I ensure my user-created data is coherent a...

Are you a member of the Splunk Community?