About nrohbock

nrohbock · ‎08-28-2018

Sorry for the false lead. I hope this is more helpfull. index=common_index sourcetype=common_sourcetype | streamstats count as order by GroupID | eval myEventName=if(order=1 and somethingElse=whatever, EventName, null()), myKeyField=if(order=2, keyField, null()) | stats values(myEventName) as EventName, values(myKeyField) as keyField by GroupID | search EventName=* keyField=*

nrohbock · ‎08-28-2018

Since all of your sources are already indexed, I think it should be as simple as: index=common_index sourcetype=common_sourcetype ((EventName=First somethingElse=whatever) or EventName=Second) | stats values(keyField) as keyField by GroupID, EventName | fields - GroupID | mvexpand keyField You may also want to dedup the table, but technically, I think this should give you the same result.

nrohbock · ‎08-27-2018

Sorry for the delayed response. After reading through all parts and links in your response, I am still using the 'map' solution. I think you may be right, that ultimately the solution will be to import the actual statsmodel OLS using the method outlined in your last link.

nrohbock · ‎08-06-2018

I have a predicament that keeps recurring. I have a large dataset with a categorical variable. I want to fit a regression and output what the model's predicted value is out to a single column. Currently, I can do this by iteratively subsetting on each level of the categorical variable, fitting the model, then mapping the results back to the output column: | inputlookup test_generic.csv | stats values(x1) as x1 | mvexpand x1 | map search="inputlookup test_generic.csv | search x1=$x1$ | fit LinearRegression response from x2" I would attach the data I prepared for this question, but I don't have the karma. My question is this: Q: Is there a way to do this by how the | fit LinearRegression ... is specified? I have to think there's a better way. If it helps, this would be fit in R as: dat <- read.csv("test_generic.csv",header=T) mod <- lm(response ~ -1 + x1*x2, data=dat) It could also be fit in python as: import pandas import statsmodels.formula.api as sm dat = pandas.read_csv('test_generic.csv') mod = sm.ols(formula="response ~ -1 + x1*x2", data=dat).fit() Thanks in advance! PS: Here's some data for the test_generic.csv lookup: "response","x1","x2" 3084,"Alt-Control",221 5623,"Alt-Control",237.8 4957,"Alt-Control",381.5 4019,"Alt-Control",196.8 3283,"Alt-Control",356.45 7365,"Clinical",381.5 3099,"Clinical",483.9 6144,"Clinical",162.6 5499,"Clinical",277.06 3211,"Clinical",422.1 8448,"Control",319.2 14243,"Control",242.5 15917,"Control",229.6 11399,"Control",335.5 6960,"Control",196.9

nrohbock · ‎12-07-2017

So, I like your idea. My challenge is that the distinct count of eventtype in the eventstats line returns the count for all events not each event. I'm sure this could be fixed with the appropriate by statement... but I don't know how to make a by statement that is unique to each event. It looks like using: | eval dc_etype=mvcount(eventtype) accomplished what the eventstats command was intended to do. Thank you!

nrohbock · ‎12-07-2017

I'm going to go mad trying to get splunk to return only field values that are a given value and don't start or contain the value I give. Here's my example: index=myindex host=a_server | where match(eventtype, "^dataflow(^-|$)") index=myindex host=a_server | where match(eventtype, "^dataflow$") index=myindex host=a_server | where eventtype="dataflow") index=myindex host=a_server eventtype=dataflow index=myindex host=a_server eventtype=TERM(dataflow) All five searches return items like: dataflow-end dataflow-start dataflow-cache ... etc. I ONLY want events with eventtype of dataflow. Any guidance on how to have a less greedy search would be great!

Posts	6
Solutions	0
Karma Given	6
Karma Received	1
Member Since	‎12-01-2017

Online Status	Offline
Date Last Visited	‎06-05-2020 02:03 AM

Can I fit independent slopes and intercepts in a s...

Search for fields that match a value versus fields...

Re: How to use stats as a filtered self join?

Re: How to use stats as a filtered self join?

Re: Can I fit independent slopes and intercepts in...

Can I fit independent slopes and intercepts in a s...

Re: Search for fields that match a value versus fi...

Search for fields that match a value versus fields...

Join the Conversation