Dear Splunk Community,
I am currently working on a project focused on identifying the essential data that should be collected from endpoints into Splunk, with the goal of avoiding data duplication and ensuring efficiency in both performance and storage.
Here’s what has been implemented so far:
The Splunk_TA_windows add-on has been deployed.
The inputs.conf file has been configured to include all available data.
Sysmon has been installed on the endpoints.
The Sysmon inputs.conf path has been added to be collected using the default configuration from the Splunk_TA_windows add-on.
In addition, we are currently collecting data from firewalls and network switches.
I’ve attached screenshots showing the volume of data collected from one endpoint over a 24-hour period. The data volume is quite large, especially in the following categories:
WinRegistry
Service
Upon reviewing the data, I noticed that some information gathered from endpoints may be redundant or unnecessary, especially since we are already collecting valuable data from firewalls and switches. This has led me to consider whether we can reduce the amount of endpoint data being collected without compromising visibility.
I would appreciate your input on the following:
What are Splunk's best practices for collecting data from endpoints?
What types of data are considered essential for security monitoring and analysis?
Is relying solely on Sysmon generally sufficient in most security environments?
Is there a recommended framework or guideline for collecting the minimum necessary data while maintaining effective monitoring?
I appreciate any suggestions, experiences, or insights you can share. Looking forward to learning from your expertise.
Hi @kn450
I'll try and keep this brief!
Best practice for endpoint data collection varies by organisation, industry, size, and threat profile (e.g., insider threats, advanced attackers, commodity malware). The right approach is to define your critical security use-cases (such as lateral movement, privilege escalation, or unauthorised access) and then determine what data to collect to support those cases. Relying on tuned Windows event logs provides robust coverage for most behavioral detection; additional Splunk_TA_windows inputs should only be enabled if tied directly to your specific use-cases.
Essential Windows event types commonly used for security monitoring:
The free Splunk Security Essentials app can help you map your currently collected data to known security use-cases and identify gaps, aligning your ingestion strategy with what provides actionable security value. I would highly recommend installing this and having a look!
Avoid enabling all data sources by default. Regularly review event types and volumes, filtering or disabling sources that do not support your prioritised detection requirements, to control storage and license use.
Are you using Splunk Enterprise Security, or are you planning to create your own rules? There is a lot we havent covered here such as making sure data is CIM compliant/mapped, hardware sizing etc. This is the sort of thing which would usually be architected out at the start to ensure you have the right data and resources available.
I'd recommend checking out the following links too:
Data source planning for Splunk Enterprise Security
Lantern - Getting data onboarded to Splunk Enterprise Security
🌟 Did this answer help you? If so, please consider:
Your feedback encourages the volunteers in this community to continue contributing
There is no single "best practice" here. It all depends on what you want to achieve. If you have specific use cases and detections, you might want to limit your data only to the data directly contributing to those detections and nothing else. And generally the extent of the data gethered will differ depending on your detections.
But if you want to have data for subsequent forensics or threathunting then you might want to have as much data as possible. As long as you know what your data is about (and that's the most important thing with data onboarding - don't onboard unknown data just because "it might come in handy one day"), you want as much data as you can afford with your environment size and license constraints.
Configuring Windows event logs for Enterprise Security use
Ensure that data sent from endpoints to Splunk is encrypted using SSL/TLS
Configure Splunk indexing and forwarding to use TLS certificates - Splunk Documentation
Utilize heavy forwarders (HF) to filter and route data based on event types, reducing unnecessary data ingestion
Route and filter data - Splunk Documentation
Data collection architecture - Splunk Lantern
https://lantern.splunk.com/Data_Descriptors/Firewall_data