Lease Timeout and Cluster Shutdown

  • During our rebuild process, we change the AG replica settings to manual failover and asynchronous-commit mode on both the primary and the secondary, using statements of this form:

    ALTER AVAILABILITY GROUP *group_name* MODIFY REPLICA ON '*server_name*' WITH ( { AVAILABILITY_MODE = { SYNCHRONOUS_COMMIT | ASYNCHRONOUS_COMMIT } | FAILOVER_MODE = { AUTOMATIC | MANUAL } } )
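
    For illustration, a minimal sketch of the two statements for one replica. The names [MyAG] and N'SQLNODE1' are hypothetical placeholders, not our real ones. Each MODIFY REPLICA statement changes a single option, and automatic failover is only valid with synchronous commit, so the failover mode is switched first:

    ```sql
    -- Hypothetical names: [MyAG] and N'SQLNODE1' are placeholders.
    -- Run on the current primary, and repeat for each replica.

    -- 1. Switch to manual failover first; automatic failover is only
    --    allowed together with synchronous commit.
    ALTER AVAILABILITY GROUP [MyAG]
        MODIFY REPLICA ON N'SQLNODE1'
        WITH (FAILOVER_MODE = MANUAL);

    -- 2. Then switch the same replica to asynchronous commit.
    ALTER AVAILABILITY GROUP [MyAG]
        MODIFY REPLICA ON N'SQLNODE1'
        WITH (AVAILABILITY_MODE = ASYNCHRONOUS_COMMIT);
    ```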

    However, yesterday these statements took almost 25 minutes to switch the AG shown below to manual failover and asynchronous commit on both replicas. During this time, the function that checks for the primary replica in our jobs also ran slowly (close to 10 minutes) for a few jobs that were scheduled around then. After the 25 minutes, we saw a sudden spike in worker threads, which in turn caused the lease to time out and the cluster to go offline. The logs, extended events, and cluster events contain little that explains why the change took so long.

    ag_settings
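
    A rough sketch of checks that could be captured while such a change is in progress, assuming the standard HADR and OS DMVs are accessible on the instance. This is not what was run during the incident; it only shows where replica settings, worker thread usage, and current waits can be read:

    ```sql
    -- Replica settings, role, and health as seen from this instance.
    SELECT ag.name AS ag_name,
           ar.replica_server_name,
           ar.availability_mode_desc,
           ar.failover_mode_desc,
           ars.role_desc,
           ars.synchronization_health_desc
    FROM sys.availability_groups AS ag
    JOIN sys.availability_replicas AS ar
        ON ar.group_id = ag.group_id
    LEFT JOIN sys.dm_hadr_availability_replica_states AS ars
        ON ars.replica_id = ar.replica_id;

    -- Worker thread usage compared with the configured maximum.
    SELECT SUM(s.current_workers_count) AS current_workers,
           SUM(s.active_workers_count)  AS active_workers,
           SUM(s.work_queue_count)      AS queued_tasks,
           (SELECT max_workers_count FROM sys.dm_os_sys_info) AS max_workers
    FROM sys.dm_os_schedulers AS s
    WHERE s.status = N'VISIBLE ONLINE';

    -- Requests that are currently waiting, and who is blocking them.
    SELECT r.session_id,
           r.command,
           r.status,
           r.wait_type,
           r.wait_time,
           r.blocking_session_id
    FROM sys.dm_exec_requests AS r
    WHERE r.session_id <> @@SPID
      AND r.wait_type IS NOT NULL;
    ```

    A queued_tasks value that stays above zero while current_workers is close to max_workers would suggest worker thread exhaustion, which is one known way for a lease renewal to be missed.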

    I would appreciate it if anyone could point me in the right direction.

    • This topic was modified 5 years, 1 month ago by  Dhruva_51.
  • lifting the thread.
