Setting _time with collect not working

nonno_pinto
Engager

I have a small query that splits each event on a multivalue field. The n-th date in the multivalue field needs to become the _time of the n-th "collected" row.

 

index=test source=test
| eval fooDates=coalesce(fooDates, foo2), fooTrip=mvsort(mvdedup(split(fooDates, ", "))), fooCount=mvcount(fooTrip), fooValue=fooValue/fooCount
| mvexpand fooTrip
| fields - _raw
| eval _time=strptime(fooTrip, "%F")
| table _time VARIOUS FIELDS
| collect index=test source="fooTest" addtime=true

 

The output table view is exactly what I'm expecting, but when I search for these events on the new source, they have today's time (or, with addtime=false, the earliest time from the time picker).

With testmode=true, the results also look as they should.

What's wrong? Thanks 


bowesmana
SplunkTrust

collect can write events with timestamps in the future; the issue is how the collect command handles _time.

It will NOT use the _time field as _time, and it behaves differently depending on whether it runs as a scheduled saved search or as an ad hoc search. The docs on collect are really bad and buggy.

Using addtime is also problematic.

We use this process, via a macro, whenever we use collect and need specific control over _time:

| eval _raw=printf("_time=%d", _time)
| foreach "*" 
    [| eval _raw=_raw.case(isnull('<<FIELD>>'),"",    ``` Ignore null fields ```
                           mvcount('<<FIELD>>')>1,", <<FIELD>>=\"".mvjoin('<<FIELD>>',"###")."\"", ``` Handle MV fields just in case ```
                           ``` Concatenate the field with a quoted value and remove the original field ```
                           !isnum('<<FIELD>>') AND match('<<FIELD>>', "[\[\]<>\(\){\}\|\!\;\,\'\"\*\n\r\s\t\&\?\+]"),", <<FIELD>>=\"".replace('<<FIELD>>',"\"","\\\"")."\"",
                           ``` if no breakers, then dont quote the field ```
                           true(), ", <<FIELD>>=".'<<FIELD>>') 
    | fields - "<<FIELD>>" ] 
| fields _raw
| collect index=bla addtime=f testmode=f

This ignores null fields, writes fields unquoted when they contain no major breakers (which allows more performant searching with TERM()), and joins multivalue fields with ###.
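As a rough, untested sketch, the same idea applied by hand to the query from the question could look like the search below. It assumes fooTrip and fooValue are the fields to keep (they stand in for VARIOUS FIELDS), that neither contains breakers that would need quoting, and it leaves testmode=t on so nothing is written until the preview looks right.

index=test source=test
| eval fooDates=coalesce(fooDates, foo2), fooTrip=mvsort(mvdedup(split(fooDates, ", "))), fooCount=mvcount(fooTrip), fooValue=fooValue/fooCount
| mvexpand fooTrip
| eval _time=strptime(fooTrip, "%F")
``` Assumption: fooTrip and fooValue are the only fields to keep; the epoch time goes first in _raw ```
| eval _raw=printf("_time=%d", _time).", fooTrip=".fooTrip.", fooValue=".tostring(fooValue)
| fields _raw
``` testmode=t only previews the rows; switch to testmode=f to actually write them ```
| collect index=test source="fooTest" addtime=f testmode=t

Writing the fields out by hand is only practical for a few fields; the foreach version above does the same for every field and also handles quoting and multivalue fields.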

You can also use this similar but slightly different approach, which builds _raw as a JSON object instead:

| foreach * 
    [ eval _raw = if(isnull('<<FIELD>>'), _raw, json_set(coalesce(_raw, json_object()), "<<FIELD>>",'<<FIELD>>'))]
| table _time _raw 

Alternatively, you can use output_format=hec, which I believe will get the time correct.
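A minimal, untested sketch of that last option, reusing the query from the question. It assumes a Splunk version whose collect command accepts the HEC format (the documented argument name is output_format), and that fooTrip and fooValue are the fields to keep. How the source argument is treated in HEC mode is worth checking against the collect documentation.

index=test source=test
| eval fooDates=coalesce(fooDates, foo2), fooTrip=mvsort(mvdedup(split(fooDates, ", "))), fooCount=mvcount(fooTrip), fooValue=fooValue/fooCount
| mvexpand fooTrip
| eval _time=strptime(fooTrip, "%F")
``` Assumption: fooTrip and fooValue stand in for VARIOUS FIELDS ```
| table _time fooTrip fooValue
``` testmode=true previews the rows without writing to the index ```
| collect index=test source="fooTest" output_format=hec testmode=true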

 


nonno_pinto
Engager

After a few tries, I changed the test case and saw that the problem was that I was asking Splunk to save an event "in the future", and apparently that's not possible.
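One thing worth ruling out when checking for future-dated events: a search whose time range ends at "now" will not return them, even if they were indexed with the intended _time. A minimal check, assuming the index=test and source="fooTest" values from the question, widens the range in both directions and compares event time with index time:

``` latest=+10y is an arbitrary bound chosen only to include future-dated events ```
index=test source="fooTest" earliest=0 latest=+10y
| eval event_time=strftime(_time, "%F %T"), indexed_at=strftime(_indextime, "%F %T")
| table event_time indexed_at _raw

If event_time matches indexed_at for every row, the timestamp from _raw was not picked up; if event_time shows the intended dates, the events were simply outside the time picker range.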
