Hi there,
I have been testing Hunk and noticed that, because it does not pre-index data, it relies heavily on well-crafted regexes and other filters to speed up searches.
An example of this is the use of the vix.input.1.path, vix.input.1.et.*, and vix.input.1.lt.* settings, as illustrated below:
[hunktest]
vix.input.1.accept = \.gz$
vix.input.1.path = /test/logs/${environmentid}/...
vix.provider = test-hadoop-cluster
vix.input.1.et.format = yyyyMMddHHmmssSSSS
vix.input.1.et.offset = -3600
vix.input.1.et.regex = .*/logs/\d+/data\.(\d+).*
vix.input.1.et.timezone = GMT
vix.input.1.lt.format = yyyyMMddHHmmssSSSS
vix.input.1.lt.offset = 0
vix.input.1.lt.regex = .*/logs/\d+/data\.(\d+).*
vix.input.1.lt.timezone = GMT
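To illustrate how I understand those et/lt settings to work, here is a small sketch of the timestamp extraction Hunk presumably performs on each path before deciding whether to read it. The file name is made up, and the milliseconds portion (SSSS) is ignored for simplicity; the regex and the -3600 offset come straight from the stanza above.

```python
import re
from datetime import datetime, timedelta, timezone

# Hypothetical file path; 18 digits matching the yyyyMMddHHmmssSSSS format above.
path = "/test/logs/123/data.202401021730450000.gz"

# Mirror of vix.input.1.et.regex: the first capture group is the timestamp.
m = re.match(r".*/logs/\d+/data\.(\d+).*", path)
stamp = m.group(1)

# Parse the first 14 digits (yyyyMMddHHmmss) as GMT; the 4-digit SSSS
# fraction is dropped here for simplicity.
base = datetime.strptime(stamp[:14], "%Y%m%d%H%M%S").replace(tzinfo=timezone.utc)

# vix.input.1.et.offset = -3600 widens the window by an hour on the early side,
# while the lt offset of 0 leaves the late bound at the parsed time.
earliest = base + timedelta(seconds=-3600)
print(earliest.isoformat())  # → 2024-01-02T16:30:45+00:00
```

Files whose derived [earliest, latest] window falls outside the search's time range can then be skipped without being read.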
While the above works great, I am facing a small complication: ${environmentid} is a numeric value that means very little to the people who will be using the search heads.
I know I can use a lookup and I have configured one:
[preprocess-gzip]
LOOKUP-env_to_ids = environment_name environmentid OUTPUTNEW environment_name
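For reference, the backing CSV is shaped like this (the values here are made up; the real file maps each environmentid subfolder to a human-readable name):

```csv
environmentid,environment_name
123,Test
```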
I also tested the lookup and it seems to be working: when I run a search like index=hunktest environmentid=123, I can step through the matches and see that the environment_name field has been created and matches the CSV contents. I can also see that only one subfolder (123) produced matches.
However, if I run index=hunktest environment_name=Test or index=hunktest environment_name="Test", then upon inspecting the search.log it appears that Hunk crawled the whole HDFS store instead of just /logs/123/.
Is it possible to define a lookup so that it acts as a filter at search time?
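To be concrete, the effect I am after is what I would get by reversing the lookup by hand with a subsearch, something like the sketch below (assuming the lookup table is also reachable via inputlookup under a name like env_to_ids, which is hypothetical here):

```
index=hunktest [| inputlookup env_to_ids | search environment_name=Test | fields environmentid]
```

That would hand Hunk a literal environmentid=123 term it could substitute into vix.input.1.path, but I would prefer the lookup itself to do this transparently for users.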