<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Multisite replication factor issues in Deployment Architecture</title>
    <link>https://community.splunk.com/t5/Deployment-Architecture/Multisite-replication-factor-issues/m-p/651643#M27318</link>
    <description>&lt;P&gt;We have the exact same issue: 2 sites with&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;site_replication_factor = origin:2,total:4&lt;/LI-CODE&gt;&lt;P&gt;When a site is down (for example, for disaster recovery testing or due to a datacenter/region outage), the other site starts replicating everything to reach total:4. Splunk even moves data to frozen in order to satisfy this replication factor again, so you can lose data due to this behavior&amp;nbsp;&lt;span class="lia-unicode-emoji" title=":frowning_face:"&gt;☹️&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Is there a way to tell Splunk that the maximum replication factor per site must be 2, not 4?&lt;/P&gt;</description>
    <pubDate>Sun, 23 Jul 2023 12:10:12 GMT</pubDate>
    <dc:creator>flotridai</dc:creator>
    <dc:date>2023-07-23T12:10:12Z</dc:date>
    <item>
      <title>Multisite replication factor issues</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/Multisite-replication-factor-issues/m-p/575014#M25043</link>
      <description>&lt;P&gt;Hi all,&lt;/P&gt;&lt;P&gt;I'm testing multisite indexer clustering with the configuration below and found undesired behaviour in the case of a site failure.&lt;/P&gt;&lt;P&gt;&lt;EM&gt;available_sites = site1,site2&lt;/EM&gt;&lt;BR /&gt;&lt;EM&gt;site_replication_factor = origin:2,site1:2,site2:2,total:4&lt;/EM&gt;&lt;BR /&gt;&lt;EM&gt;site_search_factor = origin:1,site1:1,site2:1,total:2&lt;/EM&gt;&lt;/P&gt;&lt;P&gt;As you can see, I have configured the replication factor "origin:2,site1:2,site2:2,total:4" so that I will have 2 replicas in each site. But in the case of a site failure, I observe that Splunk tries to replicate locally in the site that is still up in order to satisfy the 'total:4' condition. I think this can be a problem when the available disk space on the machines is limited.&lt;/P&gt;&lt;P&gt;Let's say the site2 indexer machines are at 80% disk usage and&amp;nbsp;site1 fails. When Splunk then tries to create 4 replicas in the same site (site2) because of the site failure, it can easily exhaust the disks.&lt;/P&gt;&lt;P&gt;According to Splunk support, this is the default behaviour, but I feel there needs to be additional control over it. Any advice or suggestions on this issue would be really helpful. Thank you.&lt;/P&gt;</description>
      <pubDate>Tue, 16 Nov 2021 10:29:19 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/Multisite-replication-factor-issues/m-p/575014#M25043</guid>
      <dc:creator>jpillai</dc:creator>
      <dc:date>2021-11-16T10:29:19Z</dc:date>
    </item>
    <item>
      <title>Re: Multisite replication factor issues</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/Multisite-replication-factor-issues/m-p/575015#M25044</link>
      <description>&lt;P&gt;Update: The failures we usually expect are network failures, where the failed site will be back in a few hours. In the meantime, we might not want 4 replicas in the site that is still up. And if we do need additional replicas, we want to create them manually.&lt;/P&gt;</description>
      <pubDate>Tue, 16 Nov 2021 10:32:53 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/Multisite-replication-factor-issues/m-p/575015#M25044</guid>
      <dc:creator>jpillai</dc:creator>
      <dc:date>2021-11-16T10:32:53Z</dc:date>
    </item>
    <item>
      <title>Re: Multisite replication factor issues</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/Multisite-replication-factor-issues/m-p/651643#M27318</link>
      <description>&lt;P&gt;We have the exact same issue: 2 sites with&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;site_replication_factor = origin:2,total:4&lt;/LI-CODE&gt;&lt;P&gt;When a site is down (for example, for disaster recovery testing or due to a datacenter/region outage), the other site starts replicating everything to reach total:4. Splunk even moves data to frozen in order to satisfy this replication factor again, so you can lose data due to this behavior&amp;nbsp;&lt;span class="lia-unicode-emoji" title=":frowning_face:"&gt;☹️&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Is there a way to tell Splunk that the maximum replication factor per site must be 2, not 4?&lt;/P&gt;</description>
      <pubDate>Sun, 23 Jul 2023 12:10:12 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/Multisite-replication-factor-issues/m-p/651643#M27318</guid>
      <dc:creator>flotridai</dc:creator>
      <dc:date>2023-07-23T12:10:12Z</dc:date>
    </item>
    <item>
      <title>Re: Multisite replication factor issues</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/Multisite-replication-factor-issues/m-p/651646#M27319</link>
      <description>&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;Based on&amp;nbsp;&lt;A href="https://docs.splunk.com/Documentation/Splunk/9.1.0/Indexer/Sitereplicationfactor" target="_blank"&gt;https://docs.splunk.com/Documentation/Splunk/9.1.0/Indexer/Sitereplicationfactor&lt;/A&gt;&amp;nbsp;it shouldn’t work like this. You could try adding site1:2,site2:2 to site_replication_factor. Then, based on the docs, it should work: when the other site is down, it should store 2 copies of each bucket on the current/origin site and report that the SRF cannot be met. If it does something else, you should report a bug to Splunk support.&lt;/P&gt;&lt;P&gt;As the docs are not crystal clear about this, you could also ask them to clarify this situation in the docs.&lt;/P&gt;&lt;P&gt;r. Ismo&lt;/P&gt;</description>
      <pubDate>Sun, 23 Jul 2023 14:22:13 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/Multisite-replication-factor-issues/m-p/651646#M27319</guid>
      <dc:creator>isoutamo</dc:creator>
      <dc:date>2023-07-23T14:22:13Z</dc:date>
    </item>
    <item>
      <title>Re: Multisite replication factor issues</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/Multisite-replication-factor-issues/m-p/651669#M27321</link>
      <description>&lt;P&gt;&lt;a href="https://community.splunk.com/t5/user/viewprofilepage/user-id/228599"&gt;@flotridai&lt;/a&gt;&amp;nbsp; I think you just need to explicitly specify each site's replication factor, as below.&lt;/P&gt;&lt;LI-CODE lang="python"&gt;site_replication_factor = origin:2,site1:2,site2:2,total:4&lt;/LI-CODE&gt;&lt;P&gt;In my case, I had misread the index status as Splunk trying to achieve RF 4 on the local site; in fact it was just showing that half the replicas were missing, and no replication was in progress.&lt;/P&gt;</description>
      <pubDate>Mon, 24 Jul 2023 04:05:56 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/Multisite-replication-factor-issues/m-p/651669#M27321</guid>
      <dc:creator>jpillai</dc:creator>
      <dc:date>2023-07-24T04:05:56Z</dc:date>
    </item>
  </channel>
</rss>

