Hi Team,
I have a theoretical question about multisite indexer clustering.
As I understand it, site_replication_factor sets how many copies of the raw (unsearchable) data are kept within the cluster, and site_search_factor sets how many searchable copies (which also contain the raw data) are kept. Given that, could I set up an environment with a configuration such as:
[clustering]
mode = master
multisite = true
available_sites = site1,site2,site3,site4
site_replication_factor = origin:1,site4:1,total:2
site_search_factor = origin:1,site4:0,total:3
OR
[clustering]
mode = master
multisite = true
available_sites = site1,site2,site3,site4
site_replication_factor = origin:1,site4:1,total:2
site_search_factor = origin:1,site1:1,site2:1,site3:1,total:3
The objective is to designate one site (site4) purely as a store for the raw (unsearchable) data, so it would not be searched or used for anything else. The other three sites would be set up in a more standard configuration, where each has a copy of its own raw data and a distributed copy of the searchable data.
I can't find anything in the documentation that says whether you can specify site4:0 to prevent searchable data from being replicated to a specific site.
If the above works, it would minimize the number of copies of raw (unsearchable) data within the cluster (saving space), while ensuring there is always a site with a full backup of ALL raw data from around the cluster, which could be used to rebuild ALL indexed data in the event of an extensive disaster.
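To make the two options easier to compare side by side, here is a small Python sketch that splits the factor strings from my configs into per-site counts. It only assumes the values are comma-separated name:count pairs, as written above. The helper names parse_site_factor and summarize are made up for illustration and are not part of Splunk, and the script says nothing about whether Splunk would accept either option.

```python
def parse_site_factor(value):
    """Split a string such as 'origin:1,site4:1,total:2' into a dict of ints."""
    factors = {}
    for pair in value.split(","):
        name, count = pair.strip().split(":")
        factors[name.strip()] = int(count)
    return factors


def summarize(replication, search):
    """Pair up the replication and search counts for every name that appears."""
    rep = parse_site_factor(replication)
    srch = parse_site_factor(search)
    names = sorted((set(rep) | set(srch)) - {"total"})
    return {
        "replication": rep,
        "search": srch,
        # (replication count, search count); None means the name is not listed
        "per_site": {n: (rep.get(n), srch.get(n)) for n in names},
    }


# The two hypothetical options from this post.
option_a = summarize("origin:1,site4:1,total:2",
                     "origin:1,site4:0,total:3")
option_b = summarize("origin:1,site4:1,total:2",
                     "origin:1,site1:1,site2:1,site3:1,total:3")

for label, option in (("first", option_a), ("second", option_b)):
    print(label, option["per_site"],
          "replication total:", option["replication"]["total"],
          "search total:", option["search"]["total"])
```

In the first option site4 appears with a search count of 0, while in the second it is simply not listed in site_search_factor, which is the difference I am asking about.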
Thanks!