Hi Mathias,
Which flow fields do we really need to have the "query" and "answer" field
Not sure what you mean by flow fields? The only SPL magic that you may need to do with regards to "stream:dns" events is to normalize the name, host_addr, host_type (and maybe other) fields since they may be single value as well as MV field. This is a known pain with Stream and we're working on addressing it. What do you see if you run a SPL query such as this:
sourcetype="stream:dns" dest_port=53 query_type="A" | table query, reply_code, name{}, host_addr{}, host_type{}
sourcetype="stream:dns" dest_port=53 query_type="A" | table query, reply_code, name, host_addr, host_type
Re: estimating the data volume needs. You can turn the dns stream (or streams) into "stats-only" mode to make streamfwd do pretty much everything it would do when that stream (dns, or any other one for that matter) is enabled - capture and process the data and build an event in memory - except actually sending the data to Splunk indexer. Instead, it sends the aggregated statistics about the size of the events you can see on the App for Stream's "Stream Stats" dashboard. So it's completely free and doesn't affect your license data volume at all. You'll need the setup the _internal index forwarding on your UF layer running Splunk_TA_stream though as the stats events are sent to _internal.
... View more