Cosmos DB Multi-Region Sync Enabled – Azure Search Indexer Failures Observed, Requesting Sync Disablement or Resource Scale-Up

Raghava Kanike (NON EA SC ALT) 20 Reputation points Microsoft Employee

Cosmos DB Multi-Region Sync Enabled – Azure Search Indexer Failures Observed, Requesting Sync Disablement or Resource Scale-Up

  1. SAI JAGADEESH KUDIPUDI 3,470 Reputation points Microsoft External Staff Moderator

    Hi @Raghava Kanike (NON EA SC ALT) ,

    Thanks for reaching out — I understand you're experiencing Azure Search indexer failures when using Cosmos DB with multi-region sync. To help diagnose this, I need a few more details:

    1. Can you provide the exact error messages or failure logs returned by the Azure Search indexer?
    2. What is the current throughput (RU/s) configured for your Cosmos DB instance, and have you observed any spikes or throttling?
    3. How is your multi-region sync set up in Cosmos DB (number of regions, write/read regions, consistency level)?
    4. Have there been any recent changes to your Cosmos DB or Azure Search configurations?

    Once you share these details, I can identify the root cause.

    Thanks!

  2. SAI JAGADEESH KUDIPUDI 3,470 Reputation points Microsoft External Staff Moderator

    HI @Raghava Kanike (NON EA SC ALT) ,
    Just following up on the previous request.
    Could you please share the requested details so we can proceed further with the investigation?

    Thank you.

  3. Raghava Kanike (NON EA SC ALT) 20 Reputation points Microsoft Employee

    Hi @SAI JAGADEESH KUDIPUDI ,
    Our team has raised an additional request for the same(https://learn.microsoft.com/en-us/answers/questions/5904368/cosmos-db-multi-region-sync-enabled-azure-search-i), and it is currently being addressed.

    Thank you

  4. Santhosh Kumar Machukuri 0 Reputation points Microsoft External Staff Moderator

    Hi @Raghava Kanike (NON EA SC ALT) ,

    I hope you are doing well. Thanks for the update, kindly let us know how to proceed on this thread. Looking forward to your response.

  5. Raghava Kanike (NON EA SC ALT) 20 Reputation points Microsoft Employee

    Hi @Santhosh Kumar Machukuri
    We have already received the resolution in another active thread, so we can go ahead and close this one.


Sign in to comment

Answer accepted by question author

Deepak Krishna Bhalerao 80 Reputation points

When a multi-region synchronization process or regional expansion is running on an Azure Cosmos DB account, the account becomes locked against further manual scaling operations, and the background replication overhead can severely impact downstream services like Azure AI Search indexers. Because these operations modify the global infrastructure of your account, they cannot be canceled or modified via standard Control Plane (Azure Portal, CLI, or PowerShell) actions once initiated.

To stabilize your environment and prevent continuous indexer timeouts, you must proceed with the following actions immediately.

  1. File an Urgent Azure Support Ticket

Because the account's control plane is actively locked during regional data replication, only Microsoft Support engineers can manually abort or force-scale the synchronization from the backend.

  • Create a Ticket: Open a ticket via the Azure Portal Help + Support pane.
  • Classification: Set the problem type to Configuration and Setup / Scale-Up or Region Changes and the severity to Highest (Severity A / Critical Impact).
  • Required Details: Explicitly state that your environment is experiencing down-time, that the multi-region read/write sync has been stuck for 18+ hours, and request either an immediate backend scale-up or a forced disablement of the unfinished regional sync.
  1. Mitigate Azure AI Search Indexer Failures

While waiting for Azure support to clear the backend block, the Azure AI Search indexers will likely continue failing due to high request latencies or transient data source access blocks. Use these steps to mitigate the impact:

  • Pause the Indexer Schedules: Navigate to your search service, find the affected indexers, and clear their schedules to prevent them from flooding the stuck Cosmos DB instance with connections.
  • Switch to a Direct Region Connection (If Applicable): If your indexer allows a custom connection string, verify that it points explicitly to your primary stable region's endpoint rather than the global traffic manager routing endpoint, avoiding the replicating node.
  • Implement a Change Feed Architecture: For long-term stabilization, avoid pointing search indexers directly to high-throughput Cosmos DB instances. Transitioning to an architecture that combines the Cosmos DB Change Feed with an Azure Function will push updates directly to the search index safely without triggering resource contention. When a multi-region synchronization process or regional expansion is running on an Azure Cosmos DB account, the account becomes locked against further manual scaling operations, and the background replication overhead can severely impact downstream services like Azure AI Search indexers. Because these operations modify the global infrastructure of your account, they cannot be canceled or modified via standard Control Plane (Azure Portal, CLI, or PowerShell) actions once initiated. To stabilize your environment and prevent continuous indexer timeouts, you must proceed with the following actions immediately. 1. File an Urgent Azure Support Ticket Because the account's control plane is actively locked during regional data replication, only Microsoft Support engineers can manually abort or force-scale the synchronization from the backend.
    • Create a Ticket: Open a ticket via the Azure Portal Help + Support pane.
    • Classification: Set the problem type to Configuration and Setup / Scale-Up or Region Changes and the severity to Highest (Severity A / Critical Impact).
    • Required Details: Explicitly state that your environment is experiencing down-time, that the multi-region read/write sync has been stuck for 18+ hours, and request either an immediate backend scale-up or a forced disablement of the unfinished regional sync.
    2. Mitigate Azure AI Search Indexer Failures While waiting for Azure support to clear the backend block, the Azure AI Search indexers will likely continue failing due to high request latencies or transient data source access blocks. Use these steps to mitigate the impact:
    • Pause the Indexer Schedules: Navigate to your search service, find the affected indexers, and clear their schedules to prevent them from flooding the stuck Cosmos DB instance with connections.
    • Switch to a Direct Region Connection (If Applicable): If your indexer allows a custom connection string, verify that it points explicitly to your primary stable region's endpoint rather than the global traffic manager routing endpoint, avoiding the replicating node.
    • Implement a Change Feed Architecture: For long-term stabilization, avoid pointing search indexers directly to high-throughput Cosmos DB instances. Transitioning to an architecture that combines the Cosmos DB Change Feed with an Azure Function will push updates directly to the search index safely without triggering resource contention.
0 comments No comments

Sign in to comment

0 additional answers

Sign in to answer

Your answer