So here is my understanding and the way I've got our on-prem instance configured.

Hot buckets are stored on a local flash array. When a bucket closes, Splunk keeps the closed bucket on the flash array and uploads a copy to the S3 storage. The S3 copy is considered the 'master copy'. I try not to use the term 'warm bucket', and instead say 'cached bucket'. All searches are performed on either hot or cached buckets on the local flash array. Cached buckets are eligible for eviction from local storage by the cache manager. So if your search needs a bucket that is not on local storage, the cache manager evicts eligible cached buckets, retrieves the needed buckets from S3 storage, and then the search runs.

frozenTimePeriodInSecs defines our overall retention time. We use hotlist_recency_secs to control when a cached bucket becomes eligible for eviction; that is, buckets younger than hotlist_recency_secs are not eligible for eviction. Our statistics show that probably 90% of our queries have a time span of 7 days or less (research gosplunk.com for a query). Thus, by setting hotlist_recency_secs to 14 days, we are assured that the buckets those searches need are on local, searchable storage without having to reach out to the S3 storage (which is slower).

One last thing: we need 1 year of searchable retention, but we also need to keep 30 months of total retention. To accomplish this, I use ingest actions to the S3 storage. Ingest actions will write the events in compressed JSON format, partitioned by year, month, day, and sourcetype. Hope this helps.
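For anyone wanting to see what that looks like on disk, here is a minimal sketch of the relevant stanzas. The volume name, S3 bucket path, and index name are placeholders, and the numeric values just encode the 14-day eviction window and 1-year searchable retention described above; adjust to your own environment.

```ini
# server.conf -- cache manager default (value is illustrative)
[cachemanager]
# Cached buckets younger than this are protected from eviction.
# 14 days = 14 * 86400 = 1209600 seconds.
hotlist_recency_secs = 1209600

# indexes.conf -- hypothetical SmartStore volume and index
[volume:remote_store]
storageType = remote
path = s3://my-smartstore-bucket/indexes

[my_index]
homePath = $SPLUNK_DB/my_index/db
remotePath = volume:remote_store/$_index_name
# 1 year searchable retention: 365 * 86400 = 31536000 seconds.
frozenTimePeriodInSecs = 31536000
# Optional per-index override of the eviction-protection window.
hotlist_recency_secs = 1209600
```

The 30-month copy written by ingest actions lives outside these stanzas entirely; it is a separate destination configured in the ingest actions ruleset, not part of the SmartStore cache.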