About Simeon

goelli · ‎12-10-2019

Hi, I have the same problem with rotated logfiles. I'm using Universal Forwarder in version 6.4.5 to monitor a log file and it's rotated versions. There was a network outage and the UF was not able to send it's data for some time. In the meanwhile the logs were rotated and zipped. The files it never began to read were read fine after the Network problem was resolved - even the zipped ones. But the file it was reading at the beginning of the outage was only unzipped and then commented with "already read, so skipped". When I manually unpacked the file and put it in place the UF started reading where it stopped because of the outage. So I think UF is skipping the check of the seekCRC at seekAdress as mentioned here: https://docs.splunk.com/Documentation/Splunk/6.4.5/Data/HowLogFileRotationIsHandled Does anyone know, if this is resolved in any Version?

Simeon · ‎05-12-2011

In 4.2.1, you do not need to account for bucket sizes to figure out how to partition. You can easily set the hot/warm and cold with specific parameters: [volume:volume_name] maxVolumeDataSizeMB = <integer> In general, auto_high_volume works well and you should not be required to perform manual bucket sizing. Review the indexes.conf.spec or example file for more information. To answer the setting question, "auto_high_volume" sets the size to 10GB on 64-bit, and 1GB on 32-bit systems.

DUThibault · ‎11-07-2017

The "root directory of the index" is e.g. $SPLUNK_DB/defaultdb/db/ ($SPLUNK_DB/defaultdb/ will NOT work). With Splunk 7, meta.dirty is deleted from db/ upon restart but the index is not rebuilt. I found the following method on https://answers.splunk.com/answers/72562/how-to-reindex-data-from-a-forwarder.html (dating back to 2013): 1) # splunk stop 2) # splunk clean eventdata -index main This sort of worked, except older data did not get re-indexed. My horizon shrunk from several days to about 5 hours. It ended up easier to remove the data sources (which were directories under surveillance anyway) and add them back in.

tpsplunk · ‎07-25-2011

if i upgrade to 4.2.2, do I still need to run the rebuild/repair operations?

jamesdon · ‎05-18-2011

OK, I will keep it simple and pre-parse.

Simeon · ‎04-19-2011

The multikv command is likely occurring prior to the lookup. Therefore, you can manually specify the lookup to occur after the multikv: source=top | multikv | lookup

Simeon · ‎12-28-2010

If you have a user that exists in both the scripted authentication system and the Splunk authentication system, the Splunk authenticated user/password will take priority. An example of this situation might be the "admin" user. Note that in 4.1.x and later, Splunk will authenticate users in both systems. Similarly, if you are running LDAP auth and Splunk auth, the LDAP user's credentials will be used instead of Splunk auth.

mattbrowncitrix · ‎03-21-2013

I signed up just to be able to upvote this question, since it has been so helpful to me!

gnovak · ‎12-21-2010

Chart generated nicely! Thanks for the help as I missed a few minor details as usual!

Simeon · ‎12-21-2010

Recover padding is simply a line in the metadata file that gets created when Splunk needs to fix/recover metadata. Host values are an example of metadata, which are contained in a file called Hosts.data. You can simply ignore these names, although they can be a bit annoying when looking at your summary page. To fix the summary page, you can modify the default summary dashboard to include the following additional query terms for the metadata search: | search host!=*recover-padding* So, any time you use a metadata search you will need to append the above, similar to this complete search: | metadata type=hosts | search host!=*recover-padding*

nbcohen · ‎11-16-2010

Much obliged - nbc

my2ndhead · ‎03-08-2018

A few new examples... Asynchronous search: $ curl -u admin:changeit -k https://localhost:8089/services/search/jobs -d search="search index=_internal" <?xml version="1.0" encoding="UTF-8"?> <response> <sid>1520569635.358</sid> </response> Fetching results: $ curl -G -u admin:changeit -k https://localhost:8089/services/search/jobs/1520569635.358/results -d output_mode=csv Synchronous search: $ curl -u admin:changeit -k https://localhost:8089/services/search/jobs/export -d output_mode=csv -d search="search index=_internal |head 10" Getting authentication token: $ curl -k https://localhost:8089/services/auth/login --data-urlencode username=admin --data-urlencode password=changeit <response> <sessionKey>lTsi0Gyhadou77kplKboa8_4DBsMbRB1gpu6sCEvIXIFotnMqNLOJyXQgCLdwM^uhDSRgxpfg_dG0gSbtRIkObpkWrbF2TisTo</sessionKey> </response> Running synchronous search with authentication token: $ curl -k -H "Authorization: Splunk lTsi0Gyhadou77kplKboa8_4DBsMbRB1gpu6sCEvIXIFotnMqNLOJyXQgCLdwM^uhDSRgxpfg_dG0gSbtRIkObpkWrbF2TisTo" \ https://localhost:8089/services/search/jobs/export \ -d output_mode=csv \ -d search="search index=_internal |head 10"

Paolo_Prigione · ‎11-11-2010

I think that's due to Splunk trying to match the 12 against the indexed words rather than the raw event: the _raw contains 12ms which is not segmented in two blocks, it has been indexed as a single term, being it a single word without any major/minor breaking character into it (ref: segmenters.conf) This instead could work (but would return more results than expected): sourcetype=ruby ruby_call_completed=12* because Splunk shoud try to find indexed tokens starting with 12 (so 12ms, 123ms, ... would be found in the index) On the other side, your second example: sourcetype=ruby | search ruby_call_completed=12 first acts on the indexed data matching sourcetype=ruby, then fields are extracted, THEN the secondary search is executed. I think this is due to the map-reduce paradigm: "map" is executed on the distributed servers, and it is just a search for matching events based on the precomputed index of the logs "reduce" extracts fields and applyes the secondary search, but this is only executed on the node where the search was first launched. However, that's only my two cents... Paolo

Simeon · ‎11-08-2010

You can do this via configuration files or search-time "kv" (aka extract command). Specifically, for your situation you want to delimit based on the "=>" and ", ". You can use the extract command as follows: ... | extract pairdelim=", }{", kvdelim="=>", auto=f This will turn off auto extraction, break the key value pairs based on the =>, and break the pairs based on the "," whitespace, or either curly bracket. So your extracted fields would be: item1=food item2=drink item3=water

gkanapathy · ‎11-05-2010

You can set frozenTimePeriodInSecs , but it does not necessarily guarantee that data will be removed when it reaches that age. What it does specify is that data may be rolled out when it hits that age, provided that everything in its data bucket is also aged enough to be rolled out. You can control to some degree by setting maxHotSpanSecs , but this setting can have significant impact on search performance and changing it probably requires changing other index configurations, and generally should not be done without official Splunk recommendations.

kmattern · ‎11-03-2010

This is in the log. How do I get my data back? 10-25-2010 10:10:08.452 INFO databasePartitionPolicy - Moving db with id of 43: /opt/splunk/var/lib/splunk/_internaldb/db/hot_v1_43 to warm: size exceeded: maxDataSize=104857600 bytes, bucketSize=106525084 bytes 10-25-2010 10:10:08.452 WARN databasePartitionPolicy - About to move db at /opt/splunk/var/lib/splunk/_internaldb/db/hot_v1_43 to warm

jaxjohnny2000 · ‎01-14-2019

Correct - https://docs.splunk.com/Documentation/Splunk/7.2.3/ReleaseNotes/RunningSplunkalongsideWindowsantivirusproducts

cfoleydivert · ‎09-27-2017

A simple and direct answer for this is to use "Format Visualization". I found this in the Search Tutorial manual (after looking in several other manuals, experimenting unsuccessfully in my case using the log() function with the timechart command; also found only way in Splunk Light Cloud to edit the formatting xml to be in the Dashboard feature, not right in Visualization). Working in Splunk Light Cloud, display results with Visualization tab; in the upper left there are three links - click the Format one: In that dialog, pick the Y-Axis button on the left and change the Scale from Linear to Log:

mmletzko · ‎11-01-2010

Thanks for the reply Simeon. I figured out the problem. Somehow my inputs.conf file got poplulated with a bunch of things that shouldn't have been in there, and missing what should have been in there. Once I got that fixed, the licensing information was OK.

xabidh · ‎06-22-2016

For me worked, renaming the file locate in $Splunk_Home\etc\licenses\download-trial\enttrial.lic and restart Splunk services.

dwaddle · ‎10-20-2010

'randomly' is a little unfair. The OS will choose an ephemeral port number and use that. How the OS determines the ephemeral port is OS dependent and also is related to how many ephemeral ports have been used so far. On Linux the ephemeral port range is controlled by the sysctl net.ipv4.ip_local_port_range, and on Windows it's a registry setting.

justinhart · ‎10-21-2010

the c_ip field contains the external IP addresses of the client upon connection. I would rather not post exact examples since they contain secure data. I can say however that I'm not getting any fields that contain lat,long for the ip addresses when doing: host=" " | geoip I do get client_lat,client_lon when doing: host=" " | lookup geoip clientip as c_ip | geonormalize This does not show any results on the map when in the Google Maps search.

Simeon · ‎10-15-2010

The dedup command will return the first key value found for that particular field. This means the most recent in time, as splunk searches from latest to earliest. For example, in the search that dedups the ip_address value for your firewall log, you will see the most recent ip_address that has been logged to that source. For more detail: http://www.splunk.com/base/Documentation/latest/SearchReference/Dedup

Simeon · ‎10-05-2010

Splunk will track the top 10 inputs based on source and host. To retrieve that information, you could run the following search: index=_internal source=*metrics.log* per_host_thruput | timechart sum(kb) by series To increase the number of tracked inputs, you can set that in your limits.conf file for metrics tracking.

blee_i365 · ‎06-08-2011

Assuming your list of events is in chronological order and belongs to a single user, you can try this: *| delta _time as timeSpentOnPreviousPage | accum timeSpentOnPreviousPage as totalTime From your 2nd event on you will get for each event a timeSpentOnPreviousPage and totalTime field containing running time difference between events, and running total time, respectively.

Posts	366
Solutions	72
Karma Given	69
Karma Received	470
Member Since	‎11-08-2009

Online Status	Offline
Date Last Visited	‎06-05-2020 02:02 AM

What is the difference between passing Appinspect ...

How to find common packages installed on many host...

how do I find the different packages installed on ...

Can I index WMI from a Splunk instance running on ...

Does multikv work with lookup tables?

Does a scripted authentication user take priority ...

How can I run searches against the Splunk API?

Why do I get different results when I search my ex...

How do I extract Key Value pairs from Ruby on Rail...

What do I need to do to run Anti Virus software wi...

Re: Rolled logs compressed immediately

Re: auto_high_volume vs hardset to 10GB ?

Re: Rebuilding index level .data files

Re: Missing data - Splunk is showing random gaps i...

Re: Extracting fields from a multi line log, with ...

Re: Does multikv work with lookup tables?

Re: Does a scripted authentication user take prior...

Re: How to show events per second in timechart reg...

Re: Chart data from 2 saved searches

Re: wierd hosts: "recover-padding-#" listed?

Re: New User - how to ask a follow-on question?

Re: How can I run searches against the Splunk API?

Re: Why do I get different results when I search m...

Re: How do I extract Key Value pairs from Ruby on ...

Re: Is there a method for rolling data completely ...

Re: Hole in my data

Re: What do I need to do to run Anti Virus softwar...

Re: Change y axis scale to logarithmic scale

Re: Splunk License Usage showing everything by hos...

Re: Your Splunk license expired or you have exceed...

Re: What ports does a forwarder bind to for sendin...

Re: Google Maps App Not Showing Results

Re: Which value does the dedup command keep?

Re: examples of searches to capture network thrupu...

Re: How do I get the amount of time between event ...

Are you a member of the Splunk Community?