About Graham_Hanningt

Graham_Hanningt · ‎05-09-2016

@gblock, Re: Using a TCP Input from the browser. Yes, for all the reasons you cite, that's pretty much just a party trick; certainly, not something I'd want to implement in a production environment. (That party trick works from curl, too.) The unwanted event, containing the lines I'd told LINE_BREAKER to discard, irks me. This morning, I used a TCP client to send an HTTP request - and also text that "looks" like an HTTP request, with \r\n -delimited preamble lines - to the Splunk TCP input. I still get that unwanted event. I've just asked about this in a separate question, "Why do the contents of the first capturing group in this LINE_BREAKER regex appear as a separate event?".

Graham_Hanningt · ‎05-09-2016

Thanks for converting the comment, Patrick 🙂

Graham_Hanningt · ‎05-09-2016

I have defined a TCP input in inputs.conf with the following corresponding stanza in props.conf (Splunk Enterprise 6.4):

    [source::tcp:6067]
    KV_MODE = json
    LINE_BREAKER = ((^[^{][^\r]*\r\n)*)\{\"[^}]+\}
    SHOULD_LINEMERGE = false

If I send the following text to that port:

    Preamble lines
    That I do not want
    To appear in the event
    The following line is intentionally blank

    {"myfield": "some_value"}

(with \r\n at the end of each line) I get two events in Splunk:

- The event I want, {"myfield": "some_value"}, with myfield correctly presented as a field (so, KV_MODE = json is working).
- An unwanted event, with a time stamp that is the same or earlier, consisting of the "preamble" lines that I thought I'd told LINE_BREAKER to discard!

According to the props.conf documentation:

    "The contents of the first capturing group are discarded, and will not be present in any event."

Yes, the contents of the first capturing group are discarded from the event I want... but they are present in that unwanted (and unexpected) event. Why do I get that unwanted event? How do I prevent it?

I'm deliberately using the descriptive term "preamble" here, because I have previously attempted to do the same thing (discard those "preamble" lines) using PREAMBLE_REGEX instead of LINE_BREAKER:

    [source::tcp:6067]
    KV_MODE = json
    HEADER_FIELD_LINE_NUMBER = 1
    PREAMBLE_REGEX = ^[^{].*

but I cannot get PREAMBLE_REGEX to work, no matter what combination of regex and preamble test cases I use; at least, not for a TCP input. I wonder whether PREAMBLE_REGEX only applies to, say, file inputs, not TCP (or other network) inputs. The props.conf documentation hints at this with the word "files":

    "Some files contain preamble lines."

but if it's true, I'd prefer that the documentation was more explicit (and this makes me wonder about the implicit limitations of other settings).
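One way to see what the first capturing group actually matches is to run the same pattern offline. The following is only a sketch: Python's re module is not Splunk's regex engine, the re.MULTILINE flag is an assumption about how ^ is treated, and the "intentionally blank" line is assumed to be a bare \r\n. The helper name first_group is made up for illustration.

```python
import re

# LINE_BREAKER pattern copied from the props.conf stanza in the post.
# re.MULTILINE is an assumption: it lets ^ match at the start of every line.
LINE_BREAKER = re.compile(r'((^[^{][^\r]*\r\n)*)\{\"[^}]+\}', re.MULTILINE)

PREAMBLE = (
    "Preamble lines\r\n"
    "That I do not want\r\n"
    "To appear in the event\r\n"
    "The following line is intentionally blank\r\n"
)
EVENT = '{"myfield": "some_value"}\r\n'


def first_group(text):
    """Return what the first capturing group matched, or None if no match."""
    match = LINE_BREAKER.search(text)
    return match.group(1) if match else None


# Same preamble, with and without the blank line (assumed to be a bare CRLF).
without_blank = PREAMBLE + EVENT
with_blank = PREAMBLE + "\r\n" + EVENT

print(repr(first_group(without_blank)))  # the four preamble lines
print(repr(first_group(with_blank)))     # '' (empty string)
```

In this sketch the blank line changes the result: a line consisting only of \r\n cannot be consumed by [^{][^\r]*\r\n, so the pattern can only match at the opening brace, with an empty first group. Whether Splunk's line breaker behaves the same way is not established here; it may be worth testing the TCP input with and without the blank line.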

Graham_Hanningt · ‎05-09-2016

Hi @gblock,

Re: "you can load balance TCP but it is expensive"

Any reason not to use HAProxy for TCP load balancing of Splunk TCP inputs? From the HAProxy website:

    "HAProxy is a free, very fast and reliable solution offering high availability, load balancing, and proxying for TCP and HTTP-based applications. It is particularly suited for very high traffic web sites and powers quite a number of the world's most visited ones. Over the years it has become the de-facto standard opensource load balancer, is now shipped with most mainstream Linux distributions, and is often deployed by default in cloud platforms. Since it does not advertise itself, we only know it's used when the admins report it 🙂"

Graham_Hanningt · ‎05-09-2016

Re: Does Splunk support receiving a continual stream of input via an HTTP POST? No. Not a continual (endless) stream. You can "batch" (send multiple) events in a single HTTP POST, but there is a maximum limit to the size of an HTTP request. The limit is set by max_content_length in limits.conf . The default value, as of Splunk 6.4, is 1000000 bytes (~ 1 MB). Exceeding that limit results in the HTTP response error code 413 (request entity too large). The Splunk documentation that describes batching events (such as "About the JSON event protocol in HTTP Event Collector") does not mention this limit (at least, I can't find any such mention). I think it should.
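A sender can stay under that limit by splitting events across several requests. The sketch below is a minimal illustration in Python, assuming that the limit applies to the byte length of the request body and that events are sent as newline-separated JSON objects; the function name batch_events is hypothetical, and only the 1000000 default comes from the post.

```python
import json

# Default max_content_length cited in the post for Splunk 6.4, in bytes.
MAX_CONTENT_LENGTH = 1_000_000


def batch_events(events, limit=MAX_CONTENT_LENGTH):
    """Group JSON-serializable events into request bodies of at most `limit` bytes."""
    batches, current, size = [], [], 0
    for event in events:
        line = json.dumps(event).encode("utf-8")
        if len(line) > limit:
            raise ValueError("a single event exceeds the request size limit")
        # +1 accounts for the newline that joins events within one body.
        extra = len(line) + (1 if current else 0)
        if current and size + extra > limit:
            # Adding this event would overflow: close the current body.
            batches.append(b"\n".join(current))
            current, size = [], 0
            extra = len(line)
        current.append(line)
        size += extra
    if current:
        batches.append(b"\n".join(current))
    return batches
```

Each returned item would be the body of one HTTP POST. A real sender might leave some headroom below the limit rather than filling each request to the last byte.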

Graham_Hanningt · ‎05-09-2016

@gblock, thanks very much for weighing in on this question, much appreciated. I'd hoped to catch your attention.

I'm asking this question primarily on behalf of some developer colleagues who will soon be turning their attention to sending events to Splunk over an IP network. In particular, I want to present them with information to help them decide whether to use HEC or a TCP input. As mentioned in my question, I've already done some research, and have hands-on experience using both HEC and TCP inputs (albeit currently only on a small scale, on a single Splunk instance).

To recap: my question is "Why would I use HEC when I can use TCP?". That is, as further clarified in the details of the question, why would I choose to use HEC in situations where I can use either HEC or TCP? I'm interpreting your answer in the context of that question. Point by point:

"There are many clients where TCP is not a viable option, such as sending from the browser."

Yes, fair point. However - I sincerely don't mean to be adversarial or otherwise annoy you; I'm grateful for your time, and hope for more advice from you on your subsequent points - this point is not relevant to the specific context of this question, where TCP is a viable option.

Incidentally, and more or less just for fun, this morning I played around sending events from a web browser (Chrome) to a Splunk TCP input. Yes, really (and, yes, I do have better things to do ;-):

    xhr = new XMLHttpRequest()
    xhr.open("POST", "http://localhost:6067")
    xhr.send("{\"my_field\": \"some_value\"}")

with the following stanza in props.conf:

    [source::tcp:6067]
    KV_MODE = json
    LINE_BREAKER = ((^[^{][^\n]*\r\n)*)\{\"[^}]+\}
    SHOULD_LINEMERGE = false

The LINE_BREAKER is intended to ditch the multiline HTTP request header. It kinda works: for each xhr.send, I get two events in Splunk:

- The event I want, {"my_field": "some_value"}, with my_field correctly presented as a field.
- An unwanted event, with a time stamp 10 seconds earlier (!), consisting only of the multiline HTTP header (which I thought I told LINE_BREAKER to discard!)

I spent some time Googling about automatic HTTP request retries, and whether I can set an Ajax request to use HTTP 1.0 instead of 1.1, but gave up. Maybe I just specified an inappropriate regex? Interesting, but academic, thanks to HEC. Moving on.

"Scale. HEC is stateless and designed to easily scale out across a pool of instances behind a LB."

Again, fair point. But, again, the point of this question is to decide between using HEC and a Splunk TCP input. A Splunk TCP input is also stateless. Right? And a Splunk TCP input easily scales out across a pool of instances behind a load balancer (LB), too. Or am I missing something here? The Splunk dev topic "High volume HTTP Event Collector data collection using distributed deployment" describes using a network traffic load balancer (such as NGINX) in front of several Splunk Enterprise indexers. Is there any reason why I can't do the same thing - use a TCP LB, such as NGINX or HAProxy - for Splunk TCP traffic?

"Performance. We've heavily optimized HEC to handle 100K events or more per instance."

How does that compare with the performance of a Splunk TCP input? HTTP involves processing that TCP does not, such as parsing an HTTP request header and returning a response with a header (and, in the case of HEC, a JSON-format body). This is one reason for my original question: if I don't want or need the processing overhead of HTTP versus TCP, why use HEC?

Outside of this processing that is specific to HTTP - and so, an overhead, when compared to TCP - I would have thought that the remainder of the event processing would be common to both HEC and Splunk TCP inputs. Or could be, if it isn't: that's one reason why I recently asked the question "Can I use the HTTP Event Collector JSON event protocol for TCP inputs?".

As you mention in the next point, HEC has rich support for JSON out of the box. Does that "protocol" - for example, specifying the time in the metadata as a Unix Epoch value - improve the performance of HEC versus a Splunk TCP input? If so, why not offer that same JSON structure for TCP inputs? (Or are you deliberately deprecating TCP inputs in favor of HEC?)

Aside: It occurred to me that perhaps you deliberately chose "EC" as the official abbreviation for HEC for this very reason: that you had plans to "roll out" the JSON-based EC metadata/data protocol across other input methods, including TCP. But nope, I was wrong, because you've recently clarified the official abbreviation as being HEC, not EC.

"Ease of use. HEC has really rich support for JSON out of the box, you don't have to mess with sourcetypes or bending over backwards with your JSON."

Yes. I describe some of that "bending over" in my question. Much nicer with HEC, thanks. However, as I mentioned, I don't find this (ease of use) a compelling enough reason to choose HEC over TCP. Unless the "rich support for JSON" comes with a performance benefit (that you don't plan to make available to TCP inputs).

"Security...."

Yes. However, in the use cases I expect to see - I didn't mention this in my question - I suspect (although I don't know for sure) that all of this traffic will occur behind a firewall on an intranet or over a VPN.

I look forward to hearing more from you, especially regarding performance. Not wishing to put words in your mouth, but framing your answer in the context of my question, I think what you're telling me is:

    "There's a bunch of reasons [why you would use HEC when you can use TCP]: ... Performance"

That is, in a nutshell: HEC offers better performance than using TCP inputs. I'd like to hear more about that. And if that's true, then perhaps the Splunk docs recommendation I cited in my question needs revisiting (or at least, qualifying):

    "TCP ... is the recommended protocol for sending data from any remote host to your Splunk Enterprise server"

Graham_Hanningt · ‎05-06-2016

@martin_mueller mentioned the following new visualization in a comment on a related question: Horizon Chart - Custom Visualization

Graham_Hanningt · ‎05-06-2016

Thanks very much for the link, Martin. Yes, that looks very useful: I'll try it out on Monday (I'm writing this on Friday night). Apologies for this belated acknowledgement. I don't know why I missed your comment; I just stumbled on it now as I was reviewing some of my old questions. I guess I must have overlooked the email notification, or deleted it by mistake.

Graham_Hanningt · ‎05-06-2016

Thanks! I'll wait until Monday to see if anyone pitches in with a different, and more compelling, answer (I would be surprised), but if not, I'll accept your answer. Thanks again, and have a good weekend.

Graham_Hanningt · ‎05-06-2016

I sent two events in JSON format to Splunk (Enterprise 6.4) via TCP. The second event was deliberately malformed: a string value was missing its closing quote. The first event was successfully indexed. As expected, the second wasn't.

How do I troubleshoot this? For example, which Splunk log records the failure to ingest the second event?

If I send similarly malformed event data to the HTTP Event Collector (EC) as two events batched in a single request:

    {"time":1459241926.498019000,"sourcetype":"my_test","index":"test","event":{"myfield":"good"}}
    {"time":1459241926.498019000,"sourcetype":"my_test","index":"test","event":{"myfield":"bad}}

(note the deliberately missing closing quote after the bad value) then, again, as expected, only the first event gets indexed. Unexpectedly, though, EC responds with:

    {"text":"Success","code":0}

whereas, if I reverse the order of the JSON lines (putting the event with the bad value first), I get:

    {"text":"Invalid data format","code":6,"invalid-event-number":0}

(For JSON parsing errors in EC input, I've seen that the data.num_of_parser_errors metric in the _introspection index for that time period gets incremented. But that's all the evidence I can see: I don't see the specific error details logged anywhere.)
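Until the server-side logging question is answered, a sender-side check can at least show which event in a batch is malformed before it leaves the sending platform. This is a minimal sketch in Python, assuming one JSON object per line as in the example above; the function name first_invalid_event is hypothetical, and the check says nothing about how Splunk itself parses or logs the data.

```python
import json


def first_invalid_event(body):
    """Return the zero-based index of the first event that is not valid JSON, or None."""
    # One event per non-empty line, matching the batched example in the post.
    lines = [line for line in body.splitlines() if line.strip()]
    for index, line in enumerate(lines):
        try:
            json.loads(line)
        except json.JSONDecodeError:
            return index
    return None


GOOD = '{"time":1459241926.498019000,"sourcetype":"my_test","index":"test","event":{"myfield":"good"}}'
BAD = '{"time":1459241926.498019000,"sourcetype":"my_test","index":"test","event":{"myfield":"bad}}'

print(first_invalid_event(GOOD + "\n" + BAD))  # 1
print(first_invalid_event(BAD + "\n" + GOOD))  # 0
```

Unlike the response quoted above for the good-then-bad ordering, this check reports the bad event regardless of its position in the batch.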

Graham_Hanningt · ‎05-06-2016

(This question encompasses single-instance Splunk installations and multisite indexer clusters.) I'm working on a platform that does not have a Splunk Universal Forwarder. I want to send events to Splunk over an IP network. I don't want to use UDP. I am successfully sending events in JSON format to a single Splunk instance via the HTTP Event Collector (EC) and TCP. So I'm already familiar with some of the differences between EC and TCP inputs. For example, the EC protocol enables you to specify event time and source type as metadata, whereas using TCP involves configuring timestamp recognition and overriding source type per event (in .conf files). So, that's one answer: EC separates metadata from data. Whereas, with TCP, you have to embed the time stamp and (if you want to send multiple source types to the same TCP port) source type as fields in the event data. (I've already bleated about this in the question "Can I use the HTTP Event Collector JSON event protocol for TCP inputs?".) Another answer: using EC - HTTP - means you get a response (in JSON) that reports the success or failure of the request. However, neither of these answers is compelling to me. In fact, while I want to know that there's a Splunk server listening - and I know that when I attempt to open a connection (which is why I don't want to use UDP) - I do not want to spend CPU time on the "sending" platform handling errors reported by Splunk. I'd prefer to capture and handle those errors via Splunk's own logging. (I have questions about that, that I might ask - in a separate question - here on Splunk Answers. As far as I can tell, Splunk does not log the details of individual EC request errors. For example, when I deliberately send badly formed JSON to EC, the data.num_of_parser_errors in the _introspection index for that time period has a value of 1, but I cannot find specific details of that error in any Splunk log... 
perhaps I'm just not looking in the right places, or perhaps I need to enable debug logging for some category, although I'd rather not do that for ongoing "production" use.) I've read various Splunk blog posts and Splunk dev topics on EC (including "Introduction", "Walkthrough", and "Distributed deployment"), but I don't see any compelling reasons there to use EC when I can use TCP. I'd be interested in results of high volume performance benchmark testing of EC versus TCP. According to the Splunk docs topic "Getting Data In": TCP ... is the recommended protocol for sending data from any remote host to your Splunk Enterprise server While that recommendation pre-dates EC (the same text appears in pre-6.3 docs), it remains in the current (6.4) docs. Is it still true? More broadly - outside of the specific context of Splunk - I've read discussions about using HTTP versus TCP. (When I write "versus", I know that, in this context, HTTP runs over TCP: that is, I can use a TCP client to open a connection to port 80 on a computer, send a "GET / HTTP..." request, and get the response.) Here, in this question, I'm specifically interested in using HTTP (EC) versus TCP for Splunk.

Graham_Hanningt · ‎05-05-2016

I use cURL on Windows for ad hoc EC ingestion. To avoid escaping quotes, I save my JSON to a file, and refer to that file in the curl -d option by prefixing the path with an at sign (@). For example: -d @ec_input.json For details, see the curl man page. I also use a variety of homegrown PowerShell scripts (.ps1), batch files (.bat) - some of which are simply wrappers for curl - and Java programs to send JSON to EC. For example, I use Java to massage JSON lines-formatted event data with an ISO 8601-formatted time stamp field into EC "packets" with a Unix Epoch time metadata field.
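The time stamp conversion described above is done in Java in my case, but the idea can be sketched in a few lines of Python. The input field name "timestamp" and the sourcetype "my_test" are placeholders chosen for illustration, and the sketch assumes the ISO 8601 value carries an explicit UTC offset such as +00:00.

```python
import json
from datetime import datetime


def to_ec_packet(json_line, time_field="timestamp", sourcetype="my_test"):
    """Wrap one JSON-lines record in an EC-style packet with Unix epoch time metadata."""
    record = json.loads(json_line)
    # Remove the ISO 8601 field from the event data and convert it to epoch seconds.
    iso_value = record.pop(time_field)
    epoch = datetime.fromisoformat(iso_value).timestamp()
    return json.dumps({"time": epoch, "sourcetype": sourcetype, "event": record})


line = '{"timestamp": "2016-03-29T08:58:46.498019+00:00", "myfield": "some_value"}'
print(to_ec_packet(line))
```

The time stamp moves out of the event data and into the metadata, so the ingested event carries only its own fields.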

Graham_Hanningt · ‎05-05-2016

Hi @woodcock, thanks for the suggestion:

    "I would use HEC:xyz where HEC is the common name for HTTP Event Collector."

How common? The first Splunk blog post tagged http-event-collector, "HTTP Event Collector, your DIRECT event pipe to Splunk 6.3", uses the abbreviation EC:

    "HTTP Event Collector (EC) is a new, robust, token-based JSON API"

So does the Splunk dev topic "Introduction to Splunk HTTP Event Collector":

    "Welcome to Splunk HTTP Event Collector (EC)"

So does the "Walkthrough" dev topic:

    "the EC port ... an HTTP Event Collector authentication token ("EC token"). EC tokens are ... the EC event protocol ..."

But then, the latest Splunk blog post tagged http-event-collector, "There is a “LOG”! Introducing Splunk Logging Driver in Docker 1.10.0", on 10 February 2016, refers to HEC:

    "Built on the HTTP Event Collector (HEC) ... Enable HEC ... Create a New HEC Token"

And Googling for:

    "HTTP Event Collector (HEC)" site:splunk.com

returns "about 38 results", whereas:

    "HTTP Event Collector (EC)" site:splunk.com

returns "about 32 results".

If any Splunk tech writers are reading this: what's the official abbreviation: EC or HEC?

Graham_Hanningt · ‎05-05-2016

I've seen the related question "Override source key in inputs.conf". I've pretty much decided that I do want to override the source key (although I'm open to counterarguments): the question now is, to what? Here's my situation: I'm using a proprietary, platform-specific tool to extract many types of log records from various systems on that platform. I'm then sending those extracted log records to a remote Splunk instance via either HTTP (that is, to the Splunk HTTP Event Collector; EC) or TCP. For the purposes of this question, I'm going to refer to that log extraction tool as xyz . Events ingested via EC have the source field value http:xyz , where xyz is the name of the Event Collector token that I created for this purpose, deliberately matching the name of the tool. I am dimly aware of the possibility - although no use case occurs to me right now - that, in the future, I might want to create additional EC tokens for xyz ; perhaps I'll append qualifying terms with an underscore separator, I'm not sure. Events ingested via TCP have the default source field value tcp:6666 , where 6666 is the TCP port. I don't feel that comfortable with this default source value for the TCP-ingested events. I'd prefer a more "mnemonic" value that doesn't refer to a specific port number. In a multisite cluster, indexers might, for site-specific reasons, be listening on different port numbers. I think I'd prefer to have the same source value - for example, tcp:xyz - regardless of which indexer ingests an event, and what TCP port it's listening on. So, although this naming scheme is likely simplistic - hence this question about best practice; I'm hoping for advice from more experienced users - I'm leaning towards source values in the following format: input : sender where sender is, in my case, the tool xyz . So: http:xyz (as now) for the EC-ingested events, and tcp:xyz (instead of the default tcp:6666 ) for the TCP-ingested events. Thoughts, suggestions welcome. 
For example: Should I use an underscore instead of the colon as a separator? (I realize that the colon implies a protocol rather than some more generalized notion of "input type/method".) Should I reverse the order of these qualifiers: for example, xyz_http ? Why don't I use the same source value - perhaps just xyz - regardless of input (ingestion) method? Difficult to put my finger on many concrete reasons. Perhaps one: I'm sending JSON to both EC and TCP, but the JSON structure is slightly different (I wish it wasn't). If I need to debug ingestion issues, it might be helpful to be able to differentiate the events; but then, the inherent differences in the structure of the JSON payloads means I can already do that. I understand that some of this might come down to personal preference, but I'm interested in what other people are doing, and why.

Graham_Hanningt · ‎05-04-2016

Yeah, I read the Splunk docs topic on that ("Rename source types at search time") before asking this question. Problem is, that functionality is too limited to be useful in this situation: it only offers a one-to-one renaming, from the original sourcetype value to a different literal string value. Thanks for the suggestion, though. Thanks also for prodding me to try sourcetype and see what happens. I think that your answer, combined with this trail of comments, will prove useful to users with the same question, so I'm going to accept it. I'm considering asking a new question, spawned by the testing I've done here, to ask about (re)using the EC protocol for TCP inputs.

Graham_Hanningt · ‎05-04-2016

What I'd really like is to use the same JSON for TCP input as I use for the HTTP Event Collector. That is, to specify time and sourcetype as metadata keys, rather than having to write stanzas to configure timestamp recognition and override the source type per-event.

Graham_Hanningt · ‎05-04-2016

I realize that I could save myself a heap of trouble here by using a single sourcetype value for all of the different types of log records - all of which have different record structures - that are extracted by the platform-specific log extraction tool I referred to in my original question. And I could coin some new field with the unique values that would have been in sourcetype . But I think that would be a "cop out"; an un-Splunk-y thing to do; in neither the spirit nor the letter of the Splexicon definition of source type.

Graham_Hanningt · ‎05-04-2016

I'm now overriding the sourcetype. Here's my working transforms.conf stanza:

    [set_sourcetype_xyz]
    REGEX = \x22event_sourcetype\x22:\x22([^\x22]+)\x22
    FORMAT = sourcetype::$1
    DEST_KEY = MetaData:Sourcetype

(\x22 is an escaped double quote)

Using sourcetype instead of event_sourcetype as a field name in the JSON input data also works, but you end up with an ingested event with a sourcetype field that has two identical values (for example, xyz_123 and xyz_123). I'm torn, and would appreciate advice on this. On the one hand, I'd prefer not to coin my own field name; on the other, I'm not comfortable with sourcetype having two values. That just looks weird to me, and I don't have enough experience with Splunk to know whether this will bite me in the a...
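The REGEX value can be exercised outside Splunk to see which inputs it accepts. This is a rough check in Python, whose re module is not Splunk's regex engine but does understand the \x22 escape; the function name extract_sourcetype and the sample events are made up for illustration.

```python
import re

# REGEX value copied from the transforms.conf stanza in the post.
SOURCETYPE_RE = re.compile(r'\x22event_sourcetype\x22:\x22([^\x22]+)\x22')


def extract_sourcetype(raw_event):
    """Return the value captured by group 1, or None if the pattern does not match."""
    match = SOURCETYPE_RE.search(raw_event)
    return match.group(1) if match else None


# Compact JSON (no space after the colon) matches.
print(extract_sourcetype('{"event_sourcetype":"xyz_123","myfield":"some_value"}'))  # xyz_123
# JSON with a space after the colon does not match this pattern.
print(extract_sourcetype('{"event_sourcetype": "xyz_123"}'))  # None
```

The second case is worth keeping in mind: the pattern as written expects no whitespace between the colon and the value, so the sender has to emit compact JSON (or the pattern needs an optional \s* after the colon).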

Graham_Hanningt · ‎05-04-2016

Well, that was interesting. I defined a TCP input in inputs.conf:

    [tcp://:6666]
    index = test
    sourcetype = xyz

with a corresponding stanza in props.conf:

    [source::tcp:6666]
    INDEXED_EXTRACTIONS = JSON

As an initial test, I used a Windows PowerShell script to send a few events in JSON format, and confirmed in Splunk Web that the following search:

    sourcetype=xyz

displayed the events, with the field names and values extracted from the JSON. So far so good.

Then I added:

    "sourcetype":"xyz_123"

to the JSON, and sent that. The event appears in the Splunk Web Events tab with two values for the sourcetype field: xyz and xyz_123. There's no new or renamed field: just the one sourcetype field with two values. That event appears if I use the search cited above, but if I change the search to:

    sourcetype=xyz_123

I get no results.

I'm now about to try overriding the sourcetype as described in Splunk docs. Just for fun, I might try doing that using a sourcetype field, and see what happens: I wonder whether Splunk will "collapse" the two (now identical) values into one, or show them as separate values. Probably safer, though (since I don't understand the underlying code), to use a different field name.
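For readers without PowerShell, the same kind of test can be sketched in Python. This is not the script I used: the function names are hypothetical, the host and port defaults simply mirror the [tcp://:6666] input above, and ending each event with a single newline is an assumption.

```python
import json
import socket


def build_payload(events, terminator="\n"):
    """Serialize events as JSON lines; the line terminator is an assumption."""
    return "".join(json.dumps(event) + terminator for event in events).encode("utf-8")


def send_events(events, host="localhost", port=6666, timeout=5.0):
    """Open one TCP connection, send all events, and return the number of bytes sent."""
    payload = build_payload(events)
    # create_connection raises OSError if nothing is listening on the port.
    with socket.create_connection((host, port), timeout=timeout) as conn:
        conn.sendall(payload)
    return len(payload)
```

For example, send_events([{"myfield": "some_value", "sourcetype": "xyz_123"}]) would reproduce the second test described above. A refused connection surfaces as an exception on the sending side, which is the only feedback a raw TCP input gives.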

Graham_Hanningt · ‎05-03-2016

Thanks, I'll try that first and report back.

Graham_Hanningt · ‎05-03-2016

Background

Some background to this question: I'm working on a platform that does not have a Splunk Universal Forwarder. I don't mean to be coy about which platform I'm working on, but I'd prefer to leave it at that.

To get data into Splunk, I've developed a small Java application that massages JSON lines-format data (from a log extraction tool on that platform) into the similar JSON lines format required by the Splunk HTTP Event Collector (EC). The Java app sends the data from the remote platform to Splunk EC on my PC (for small-scale testing, I'm running Splunk Enterprise 6.4 with a free license on my Windows 7 PC). This works fine.

EC was an easy first choice for me because I'm familiar with HTTP-based tools. For example, I'm comfortable using cURL and writing Ajax requests in JavaScript. I'm now looking at using a TCP input to Splunk instead of EC. I understand that many experienced Splunk users will have started with TCP first.

The EC event protocol separates the contents of a "packet" into event metadata and data. The metadata can include, among other items, the event time (timestamp) and sourcetype. For TCP inputs, there is, to my knowledge, no such formalized protocol for separating metadata and data.

I want the timestamp of the events ingested via TCP to match the value from the original log data, not the time it is ingested into Splunk. With EC, I achieved this by setting the "time" key in the event metadata. For TCP, I believe I'll have to configure timestamp recognition in props.conf as described in Splunk docs.

Why I'm asking this question: I'm sending a wide variety of sourcetypes to Splunk via EC, using the "sourcetype" key in the event metadata. For TCP, I believe I'll have to override source types on a per-event basis as described in Splunk docs. I do not want to use a different TCP port for each sourcetype. So, I plan to create a stanza in transforms.conf that gets a field value from the JSON-format data received via TCP, and uses it to set the sourcetype, like this:

    [set_source_type_my_log_type]
    REGEX = \"somefieldname\"\:\"(?[^\"]+)\"
    FORMAT = sourcetype::$1
    DEST_KEY = MetaData:Sourcetype

(I've tested this regular expression using the rex command, but not yet in the context of overriding sourcetype; I don't yet know for sure whether I'll have to escape the double quotes, as done here.)

where the JSON received via TCP contains a field like this:

    "somefieldname": "xyz_123"

where "xyz_123" is the sourcetype I want the event to have.

The question

All of the above (thanks for reading this far) boils down to one simple question: what field name should I use in place of somefieldname (as per the example above)?

Thoughts on possible answers

It occurs to me that I probably shouldn't use the default field name sourcetype. "Anything you like, except for one of the default fields (so, not sourcetype)" might be a valid answer, but I'd prefer a more specific answer: an actual field name that other users in the same situation might also choose to use, as an informal convention (rather than the formalized EC protocol). Er, event_sourcetype?

It also occurs to me that, given that I can supply both the sourcetype and the _time (expressed as Unix time) as fields in the JSON data, is there some better, more direct way than using regexes to configure the timestamp recognition and override the sourcetype? Specifying a regex to extract a JSON key value seems a bit like... inserting a key into a car door and turning the key, when you've got a remote unlock button on the same keychain. (Someone is going to lecture me on the data pipeline, and parsing versus search-time field extraction, and I probably deserve it.)

I'm not thrilled by having to pass through, as a field that will appear in the _raw field of each event, a value that will also be represented in the sourcetype field. That strikes me as inelegant. My EC-ingested events don't have such a field, and I'm hoping for my EC-ingested and TCP-ingested events to cohabit in the same indexes, so I'd prefer them to be as similar as possible. I'd appreciate advice on that, too.

Graham_Hanningt · ‎05-03-2016

Thanks, @somesoni2! Yeah, I rename series in charts that display a legend. For single-series charts, I typically don't bother (renaming or displaying a legend). If you can convert your comment to an answer, I'm happy to accept it. It's very useful for me to get feedback here on what I'm doing as I'm learning Splunk, especially from someone with high karma points (which I'm roughly equating to "experience"). Thanks again.

Graham_Hanningt · ‎05-03-2016

I have a Splunk Enterprise 6.4 dashboard that displays multiple timecharts, all based on the same events in the same time range. When I first developed this dashboard, each chart had its own self-contained search string. I recently edited the dashboard to use the following base search:

    <search id="base_timechart">
      <query>sourcetype=my_log_type $blacklist$ $tran$ $response$ | timechart avg(response) sum(usercpu) avg(dispatch) avg(suspend) avg(syncproc) avg(rmielap) avg(fcwait) avg(jcwait) avg(pcstghwm) avg(tdwait)</query>
    </search>

(The tokens are set by user interface controls.)

Here is an example chart search in the updated dashboard:

    <search base="base_timechart">
      <query>fields _time avg(response)</query>
    </search>

That is, I'm using the fields command to pick a series from the results of the timechart in the base search (for some charts, I pick more than one series). This works for me, but I thought I'd ask (after searching for existing related questions here and Googling for information elsewhere): is this best practice, or is there a better way?

Graham_Hanningt · ‎05-02-2016

The solution I came up with (mentioned in my previous comment) was flawed, because xyseries does not produce the same "intelligent" X-axis labels as timechart. @Jeremiah provided the following improved solution (in response to a different question):

    sourcetype=my_log_type | bin _time | stats count by _time, conn_type | lookup connection_types.csv conn_type output description | timechart sum(count) as count by description

Graham_Hanningt · ‎05-02-2016

Thank you, @Jeremiah! That works for me. I've removed the span=1s option after reading the docs: bucket (and bin ) seem to share the same default spanning behavior as timechart . I've also replaced the bucket command name with bin , because - tell me if I'm wrong - the bin command seems to be the "primary" command (for which bucket is an alias): the Splunk docs topic for bucket refers the reader to the bin topic. I'd like to convert your comment into an answer so that I can accept it, but I can't see how to do that. I'm guessing I lack the authority - or karma points - for that option to appear in my user interface. Could you (or anyone reading this) please do that for me, or point me to where I can do that myself? So, pushing timechart to the end of the search solves my problem. I'm still curious, though: timechart seems to be "doing stuff under the covers" (perhaps: generating "internal use only fields" that Splunk "hides" from users?) that I do not (yet?) have the wit to see.

Posts	217
Solutions	3
Karma Given	18
Karma Received	55
Member Since	‎11-15-2015

Online Status	Offline
Date Last Visited	‎08-05-2023 11:58 AM

