About eylonronen

eylonronen · ‎02-23-2021

Hello everyone I found a wierd bug in the cascading replication process. The shcluster captain says when he tries to replicate the bundle that the field content-length is too large (max is max int). I think its the replication plan thats failing. I would like to know how can i use cascading replication and also if there is a way to determine the exact size of the knowledge bundle and its contents. Thank you very much

eylonronen · ‎10-21-2020

Thanks @thellmann but I actually figured it out already... But since you came by this post it would be great if you'd look on this medium story I wrote about my experience in this very topic with emphasis on the lack of documentation and good examples and hopefully try to make it better for future developers My story: https://medium.com/@eylon.ronen/the-ordeal-of-using-js-inside-splunk-app-6fb3a441151d Thank you very much

eylonronen · ‎08-13-2020

Hi all I want to create an app that contains several dashboards, but i want to allow the user to config some things (macros, lookups, etc...) easily like the configuration page in Splunk DB Connect, but i can't find how to update config files with js. Also, i would like to know what is better: html inside a view, or using a template Thank you very much 🙂

eylonronen · ‎04-09-2018

You may want to be more specific about the index in your search. It happens quite often to me that when I search index=* and than filter by source I dont get result but when i search index= source=, i get results. Probebly splunk is doing some optimizations to the search process and when you have a lot of data it doesnt want to go over all of it if not necsessery.

eylonronen · ‎04-09-2018

Wow thats a great answer. We are going to ingest about 1TB/day spread across all indexers so about 200GB/day per indexer. We are going to retain data for as long as we can. Our amount of ingested dara may grow rapidly so we will need to change retention policy. We expect a lot of concurrent searches so it is good to know that the CPUs and RAM are not wasted. We are going to have one splunk instance on each machine, but the dillema now is how to mount the disks. You said that I/O is our limit, so we need to do everything we can to make it high enough. We dont want to mount the disks all to a single LVM, because than we may hit OS I/O limit, while we hardly reach the disks limit. However it is hard for us to have several mount points and save specific indexes on each mount, thusly distributing the data across the disks, because our indexes sizes are very dynamic and we have several houndreds indexes, so managing it would be a pain. Finally we thought about having 2 mount points, one of which we will save hot/warm buckets on, and the other will contain the cold buckets. But we are not 100% sure how to spread the disks among the mounts, because we read in indexes.conf.spec that an open file handle will be saved for each warm bucket unlike cold, so its faster to search, but all over the internet people are using much more space for the cold buckets instead of warm and hot. It would really help us to know exactly the differences in all aspects between cold and warm buckets, so we know how to mount the disks, or if someone have a better solution for the disk spreading.

eylonronen · ‎04-09-2018

Hi we have 6 physical machines with 512Gb RAM, 56 CPU, and 10 disks with 12Tb each. We want to use them for splunk indexer cluster, but we are not sure what is the best way to do this. Our problem is that we dont know how well splunk uses this large amount of memory and cpu, and also we dont want to have one logical volume that will contain all the disks space. Knowing that elasticsearch is recommended to run as multiple instances on a single machine we have thought about it, but its troublesome to configure it properly. We also thought of running a few splunk instances over docker. What is the best approach to this?

eylonronen · ‎02-21-2018

I didn't find any help in this page....

eylonronen · ‎02-21-2018

well the forwarder doesnt write log when it monitors, only with batch input for some reason. Today we indexed some logs, and we saw one of the files in the forwarders log, but we could not search it... I wonder if it has something to do with the fact that we added a few indexers recently. Is there something i should update in the search head when i add indexers?

eylonronen · ‎02-21-2018

I've already looked there... Zero warnings or errors.... Also we've tried both monitor and batch input. Both had the same problem...

eylonronen · ‎02-21-2018

Hi, lately we've been checking how many files our splunk is indexing, and we noticed that it "skips" some files... We checked by searching: index=our_index | stats dc(source) index = _internal group = per_so* series=*.ourfile | stats count And both ways we got the same results, which are not all the files we indexed.

eylonronen · ‎01-31-2018

Hi all, we have a big problem with our forwarder. We need to be able to index about 600GB/day and we have 10 indexers, 1 forwarder, and as of now we index about 260GB/day. Our license allows us to index this many. We have two problems: 1. Because we import our data from hadoop, we cant have many forwarders, because they will monitor the same directories. However, we logically split our data into two groups, and we are trying to add another forwarder, but he wont connect to the indexers. We copied and renamed the deployment app and created a serverclass in the deployment server for the new forwarder, and now aside from the different inputs, both the new and the old forwarder have the same configuration, but the new one refuses to connect. We just cant troubleshoot the problematic part. We tried using the monitoring console, and even searching the logs directly, and there are no full queues. Our max kbPerSec is 800mb which is more than enough. I believe the problem lies in our forwarder because we see the slow pace in him, its not like he's forwarding fast enough and the indexers are the problem. At the forwarder we are extracting timestamp from field, a field using regex, and index name with transforms. What we would like is to find the bottleneck so we can index as fast as we need.

eylonronen · ‎01-31-2018

Hi, I would like anyone to help me find a real decent way of importing data from Hadoop to Splunk. The methods we have found are the following: 1. Use Nifi to get the files from Hadoop, and then use the put Splunk processor to index them. I really doubt this method, because it seems like a bit overkill for a simple action. 2. Use NFS gateway to mount the hdfs on the forwarder and then use regular monitoring input. This is what we've been doing so far, however, we are looking to replace it because Hadoop's nfs is problematic, and also it is not as fast as we need it to be. 3. Hadoop Connect. A product by Splunk that really got our hopes up, and when we tested it, it showed better performance dramatically than the nfs solution on a single file, however, it was slower than the first with many small files(as we have in our production environment). Also, Hadoop connect is a modular input, and as such, it doesn't support indexing csv files, so I had to dive into the code and alter it to parse csv files to key-value pairs so they will be indexed. It still showed the same performance difference after the changes I've made. As of now, my wish is to understand Hadoop Connect's poor performance and enhance it. So if anyone can help me with this, or by giving me another indexing method this will be much appreciated. Thank you very very much 🙂

Posts	13
Solutions	0
Karma Given	3
Karma Received	3
Member Since	‎01-31-2018

Online Status	Offline
Date Last Visited	‎05-24-2021 08:06 AM

Splunk JAVA logging missing events

Cascading replication of large bundles

Configuration page in custom app

What is the best way to deploy splunk on very stro...

Why is splunk not indexing all the data?

How to improve forwarding performance when importi...

How to import data from Hadoop to Splunk?

Cascading replication of large bundles

Re: Configuration page in custom app

Configuration page in custom app

Re: Line breaking issues with source that is not i...

Re: What is the best way to deploy splunk on very ...

What is the best way to deploy splunk on very stro...

Re: Why is splunk not indexing all the data?

Re: Why is splunk not indexing all the data?

Re: Why is splunk not indexing all the data?

Why is splunk not indexing all the data?

How to improve forwarding performance when importi...

How to import data from Hadoop to Splunk?