Hey everyone! I have what I would consider a complex problem, and I was hoping to get some guidance on the best way to handle it.

We are attempting to log events from an OpenShift (Kubernetes) environment. So far, I've successfully gotten raw logs flowing from Splunk Connect for Kubernetes into our heavy forwarder via HEC, and from there into our indexer. The ingested data carries a bunch of metadata (pod name, container name, etc.) from this step. The problem is what to do with it from there.

In this particular configuration, the individual component logs are combined into a single stream, with a few fields of metadata prepended. After this metadata, the event matches exactly what I'd consider a "standard" event, i.e. something Splunk is used to processing. For example:

tomcat-access;console;test;nothing;127.0.0.1 - - [08/Dec/2021:13:25:21 -0600] "GET /idp/status HTTP/1.1" 200 3984

This is first semicolon-delimited and then space-delimited, as follows:

- "tomcat-access" is the name of the container component that generated the log
- "console" indicates the source (console or file name)
- "test" indicates the environment
- "nothing" indicates the user token
- Everything after the last semicolon is the real log. In this example, it matches a Tomcat access-log sourcetype.

Compare this to another line in the same log:

shib-idp;idp-process.log;test;nothing;2021-12-08 13:11:21,335 - 10.103.10.30 - INFO [Shibboleth-Audit.SSO:283] - 10.103.10.30|2021-12-08T19:10:57.659584Z|2021-12-08T19:11:21.335145Z|sttreic

- "shib-idp" is the name of the container component that generated the log
- "idp-process.log" is the source file in that component
- "test" is the environment
- "nothing" is the user token
- Everything after the last semicolon is the Shibboleth process log. Notably, this part uses pipes as delimiters.

The SCK components, as I have them configured now, ship all of these sources to "ocp:container:shibboleth" (or something like that).
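For reference, both example lines share the same shape - four semicolon-delimited segments, then the payload - so a single pattern can split any of them (the segment labels below are just names I made up):

```ini
# Shared prefix pattern (sketch, labels are mine):
#   ^([^;]+);([^;]+);([^;]+);([^;]+);(.*)$
#     $1 = component    (tomcat-access / shib-idp)
#     $2 = source       (console / idp-process.log)
#     $3 = environment  (test)
#     $4 = user token   (nothing)
#     $5 = the "real" log payload
```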
When they are shipped over, metadata is added for container_name, pod_name, and other CRI-based log data.

What I am aiming to do

I would like to use the semicolon-delimited parts of the event to tell the heavy forwarder which sourcetypes to apply. Ideally, I would like to cut down on writing my own sourcetypes and regexes, but I can do that if I must. So for the tomcat-access example above, I'd want:

- All the SCK / OpenShift related fields to stick with the event.
- The event to be chopped into 5 segments.
- The event type to be recognized from the first 2 fields (there is some duplication in the first field, so the second field would be the most important).
- The first 4 segments to be appended as field information (like "identifier" or "internal_source").
- The 5th segment to be handed off to another sourcetype for further processing (in this case, "tomcat_localhost_access" from "Splunk_TA_tomcat"). All the other fields would stay with the event as Splunk_TA_tomcat did its field extractions.

If this isn't possible, I could make a unique sourcetype transform for each event type - the source program has 8 potential sources - but that would involve quite a bit of duplication. Even as I type this out, I'm getting the sinking feeling that I'll need to just bite the bullet and make 8 different transforms. But one can hope, right?

Any help would be appreciated. I've gotten through Sysadmin and Data Admin training, but nothing more advanced than that. I suspect I'll need this pattern for other OpenShift logs of ours in the future, but I don't know at this stage.
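For context, here's roughly what I was imagining on the heavy forwarder - I have no idea if this is the right approach, and the stanza names and field names below are placeholders I made up (only "ocp:container:shibboleth" and "tomcat_localhost_access" come from my actual setup):

```ini
# props.conf (sketch) - applied to the sourcetype SCK assigns
[ocp:container:shibboleth]
TRANSFORMS-shib = shib_prefix_fields, shib_route_tomcat, shib_strip_prefix

# transforms.conf (sketch)
# 1) Write the first four segments as indexed fields (field names are placeholders)
[shib_prefix_fields]
REGEX = ^([^;]+);([^;]+);([^;]+);([^;]+);
FORMAT = component::$1 internal_source::$2 environment::$3 user_token::$4
WRITE_META = true

# 2) Route events whose first two segments identify Tomcat access logs
#    to the Splunk_TA_tomcat sourcetype
[shib_route_tomcat]
REGEX = ^tomcat-access;console;
DEST_KEY = MetaData:Sourcetype
FORMAT = sourcetype::tomcat_localhost_access

# 3) Strip the four-segment prefix so the TA's extractions see a clean event
[shib_strip_prefix]
REGEX = ^(?:[^;]+;){4}(.*)$
DEST_KEY = _raw
FORMAT = $1
```

If something like this is valid, I'd presumably still need a copy of the routing stanza per event type, but the field extraction and prefix stripping would only exist once.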