Deployment Architecture

Optimizing indexing queue and question about the mechanism of rolling hot/warm/cold.

tungpx
Explorer

Hello all, 

I want to ask about the mechanic of rolling bucket from hot to cold. In our indexes.conf we don't setup a warm path, just hot and cold control by maxDataSizeMB. 

System team give me 1TB of SSD and 3TB of SAS to work with. So naturally, I put hot path to the SSD and cold path to the SAS. Now we are encountering the problems that the indexingQueue always fill up to 100 whenever that indexer ingest data. So my question is:

1. Does the process of rolling bucket from hot to cold affect the IOPS and the writting in indexingQueue?

2. My understanding is that, the data flow go like this  Forwarder -> indexer hot -> indexer cold, and this is a continuous process. And in case indexer hot is max out, it will roll to cold, but cold is SAS so the writing speed is < SSD. For example hot ingesting 2000 events per sec, but only push out 500 events per sec to cold, but hot is full already so it render the effective ingesting speed of hot to only 500 (since it full and can only take in the amount that it can push out). Is this correct? 

3. If my understanding is correct, how should I approach in optimizing it. I'm thinking of two option:

a) Switch our retention policy from size base to day base, setting hot retention to 1 day, cold remain size retention, since we ingested 600~800GB per day, we can ensure the hot partion will always have a buffer to ensure the smooth transition. My question in this section is when is the rolling happen, at the end of the day, or whenever the event is one day old, thus don't change anything.

b) Create a warm path as a buffer, hot->warm->cold, the warm bucket will have 1TB and retention of 1 day, so, and with how we ingest 600-800GB per day, the warm path will always have space for the hot to roll over

Is there anything else can I do?

0 Karma

isoutamo
SplunkTrust
SplunkTrust

Wait a minute. You said that you have 1TB SSD + 3TB SAS disks and you are ingesting 600-800GB per day.

How many indexers you have? I really hope that you have clusters which contains several indexers. And have you any Splunk premium apps like ES or ITSI running on your environment? So which kind of architecture you currently have to manage your workload? And what kind of nodes you have in resource point of view?

Basically your understanding is quite correct.

With this daily ingestion amount you definitely must have a cluster or at least several indexers take data in and serving searches at same time.

Do you have a MC (monitoring console) up and running? This is excellent tool do get more information what is happening in your environment.

Volumes are excellent way to manage your indexers space. I usually define one volume for hot+warm and another for cold. You must remember that you shouldn't allocate all disk space for use there must be some additional free space for disk operations. How much this is, depending on your filesystem. Some kind rule of thumb is 10-20%. Also you shouldn't allocate all filesystem space to Splunk Volume leave also there some room as splunk will need it when if flush data from warm to cold and also from cold to frozen if you have separate frozen space.

 

0 Karma

VatsalJagani
SplunkTrust
SplunkTrust

@tungpx- Your answers below:

1. Yes, in your situation because I think as you mentioned hot is already full.

2. Your understanding is mostly correct I think.

3. Solution:

  • The solution for you is to keep some of the hot bucket always empty.
  • Now there are two ways to do it:
  • 1. Use Volume for homePath & coldPath. (And allocate less storage for homePath than what is actual available)
    • I have personally not tried this approach. So please read the Volumes documentation in indexes.conf.spec first.
  • 2. Lower the maxWarmDBCount (this will move data early to coldPath, before its full)
    • This is what I use in my environments currently and it works for me!!!

 

  • This will improve performance of IOPS, which I think should eventually free the queues.

 

I hope this helps!!! Kindly upvote !!!!!

0 Karma
Career Survey
First 500 qualified respondents will receive a $20 gift card! Tell us about your professional Splunk journey.

Can’t make it to .conf25? Join us online!

Get Updates on the Splunk Community!

What Is Splunk? Here’s What You Can Do with Splunk

Hey Splunk Community, we know you know Splunk. You likely leverage its unparalleled ability to ingest, index, ...

Level Up Your .conf25: Splunk Arcade Comes to Boston

With .conf25 right around the corner in Boston, there’s a lot to look forward to — inspiring keynotes, ...

Manual Instrumentation with Splunk Observability Cloud: How to Instrument Frontend ...

Although it might seem daunting, as we’ve seen in this series, manual instrumentation can be straightforward ...