There is a bug in 4.1.3 and earlier builds that is triggered by slow NFS stat() calls - that much we know for sure. If your Splunk instance just stops indexing data, or stops picking up new files, then it's highly likely you are hitting this bug. Splunk should never stop indexing data from monitored files and discovering new files as long as new data is available.
Increasing the max_fd setting increases the number of FDs we can keep open for files we've finished reading. Those files remain open until they have been idle for time_before_close seconds. This doesn't mean that Splunk will run faster if you set it to a huge number; it just means that files stay open longer, so we may pick up new data from them more quickly. If your files aren't updated once they've been read, this is of little use to you.
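For illustration only, here is roughly where those two settings live - a minimal sketch, assuming a Unix-style install under $SPLUNK_HOME; the monitored path and the numeric values are made-up examples, not recommendations:

    # $SPLUNK_HOME/etc/system/local/limits.conf
    [inputproc]
    # allow more already-read files to stay open at once (example value)
    max_fd = 256

    # $SPLUNK_HOME/etc/system/local/inputs.conf
    [monitor:///var/log/myapp]
    # wait 30 seconds after the last data is seen before closing the file (example value)
    time_before_close = 30

As noted above, raising these only pays off if the files you've already read are likely to receive new data before they are closed.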
Theoretically, there should be no limit to the number of files that Splunk can monitor. We depend on the OS to tell us which files have changed, so if the OS knows, Splunk will know too. An important distinction here is a 'live' file vs. a 'static' file. Live files are currently being updated/written to, and Splunk will keep picking up new data from them as long as data keeps being added. Static files should be indexed once and then ignored indefinitely.
The concept of 'real-time' can also play a major role here. How quickly do you want Splunk to pick up data once it's written to a file? If your requirement is that Splunk should display the data as quickly as possible, then you want to limit the number of live files that Splunk is monitoring. If you have 20,000 live files, then Splunk will not be able to keep every single file up to date at the same time. However, if you have 20,000 files and only 50 of them are live, then Splunk should easily be able to keep up - are we starting to make sense yet?
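If you do need to shrink the set of files a monitor stanza watches, one common approach (sketched here with a made-up path and patterns - adapt them to your own layout) is to filter it with whitelist/blacklist regexes in inputs.conf so rotated or archived copies never count as live files:

    # $SPLUNK_HOME/etc/system/local/inputs.conf
    [monitor:///var/log/myapp]
    # only tail files ending in .log
    whitelist = \.log$
    # skip compressed or rotated copies, which are static once written
    blacklist = \.(gz|zip|bz2)$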
Our testing has shown that when tailing 10,000 live files, you can expect somewhere between 30 seconds and 1 minute of lag. That lag should increase roughly linearly with the number of files, so at 20,000 files you can expect a 1 - 2 minute delay.
There's no hard and fast rule here; the speed of Splunk depends on the hardware resources available and how the instance is tuned - number of CPU cores, number of indexing threads, number of FDs, speed of the disk Splunk is writing to, data segmentation, etc. The faster Splunk can write to the index, the faster it can pull new data from files. Testing is the only true way to gauge the expected performance of your hardware, with your data.