Solved: Consolidate Databases from multiple splunk instanc...

ghnwmlguy · ‎12-29-2010

I currently have two instances of splunk running on two separate hosts. I recently purchased a license so that I can consolidate the two onto one host. Is there a way to consolidate indexed logs/databases onto one host without losing data?

hulahoop · ‎12-29-2010

Yes, this is possible. However, if you have 2 separate servers it may be best to keep both and have one distribute searches to the other. This way you are effectively searching both Splunk servers and get the added bonus of 2 servers sharing the work and executing in parallel. More on distributed search if this interests you: http://www.splunk.com/base/Documentation/latest/Admin/Whatisdistributedsearch.

If, however, you are looking to re-purpose one of the servers and truly need to consolidate your datastore, then the process is similar to backing up your Splunk datastore, covered here: http://www.splunk.com/base/Documentation/latest/Admin/Backupindexeddata.

This is the skeleton process (assuming you have enough storage):

Redirect the incoming data stream to Splunk1
Shut down Splunk2
Roll the hot bucket on Splunk2 to grab the latest data
Move all buckets to Splunk1 after ensuring bucket sequence ids are unique
Repeat steps 3 and 4 for each index.

Steps 1 and 2 are self-explanatory.

For step 3, you can issue this command on the CLI:

./splunk _internal call /data/indexes/<index_name>/roll-hot-buckets –auth <admin_username>:<admin_password>

For Step 4, on Splunk1 and Splunk2, look in

$SPLUNK_HOME/var/lib/splunk/defaultdb/db
$SPLUNK_HOME/var/lib/splunk/defaultdb/colddb

The directories in these folders all have a unique sequence ID at the end of the directory name:

db_#_#_id

You need to ensure all the directories in Splunk1 and Splunk2 have a unique ID. Write a script or change the sequence ID manually if there are any duplicates between Splunk1 and Splunk2. Then move all the directories from

Splunk2: $SPLUNK_HOME/var/lib/splunk/defaultdb/db
Splunk2: $SPLUNK_HOME/var/lib/splunk/defaultdb/colddb

to

Splunk1: $SPLUNK_HOME/var/lib/splunk/defaultdb/colddb

View solution in original post

hulahoop · ‎12-29-2010

Yes, this is possible. However, if you have 2 separate servers it may be best to keep both and have one distribute searches to the other. This way you are effectively searching both Splunk servers and get the added bonus of 2 servers sharing the work and executing in parallel. More on distributed search if this interests you: http://www.splunk.com/base/Documentation/latest/Admin/Whatisdistributedsearch.

If, however, you are looking to re-purpose one of the servers and truly need to consolidate your datastore, then the process is similar to backing up your Splunk datastore, covered here: http://www.splunk.com/base/Documentation/latest/Admin/Backupindexeddata.

This is the skeleton process (assuming you have enough storage):

Redirect the incoming data stream to Splunk1
Shut down Splunk2
Roll the hot bucket on Splunk2 to grab the latest data
Move all buckets to Splunk1 after ensuring bucket sequence ids are unique
Repeat steps 3 and 4 for each index.

Steps 1 and 2 are self-explanatory.

For step 3, you can issue this command on the CLI:

./splunk _internal call /data/indexes/<index_name>/roll-hot-buckets –auth <admin_username>:<admin_password>

For Step 4, on Splunk1 and Splunk2, look in

$SPLUNK_HOME/var/lib/splunk/defaultdb/db
$SPLUNK_HOME/var/lib/splunk/defaultdb/colddb

The directories in these folders all have a unique sequence ID at the end of the directory name:

db_#_#_id

You need to ensure all the directories in Splunk1 and Splunk2 have a unique ID. Write a script or change the sequence ID manually if there are any duplicates between Splunk1 and Splunk2. Then move all the directories from

Splunk2: $SPLUNK_HOME/var/lib/splunk/defaultdb/db
Splunk2: $SPLUNK_HOME/var/lib/splunk/defaultdb/colddb

to

Splunk1: $SPLUNK_HOME/var/lib/splunk/defaultdb/colddb

yannK · ‎12-30-2015

since the index-clustering exists, the recent buckets folder do contains an extra information, the GUID of the original indexer.

Because of that, it can help to avoid bucketID collisions.
But beware that if you have hot buckets, or old buckets (prior to splunk 6 or clusterting setup), you still want to check the bucketid.

hulahoop · ‎02-09-2011

No need to apologize, just let us know how it goes when you have the time to revisit this. 🙂

ghnwmlguy · ‎01-31-2011

I apologize for not giving the thumbs up yet...I have run into space issues on the primary host and need to put in new drive. When that is done I will use this process.

Consolidate Databases from multiple splunk instances

Join the Splunk Community Slack to learn, troubleshoot, and make connections with fellow Splunk practitioners in real time!

Join Splunk User Groups to connect and learn in-person by region or remotely by topic or industry.

Index This | What travels the world but is also stuck in place?

Discover New Use Cases: Unlock Greater Value from Your Existing Splunk Data

Continue Your Journey: Join Session 2 of the Data Management and Federation Bootcamp ...

Join the Conversation