Issue:

The database server crashed. Since the crash, the Always On availability group has been failing. The Windows Failover Cluster log indicates a cluster resource failure, and the SQL Server log contains many “I/O requests taking longer than 15 seconds” messages.

Cluster Event:

EventID 1069

Cluster resource ‘xxxxx’ of type ‘SQL Server Availability Group’ in clustered role ‘xxxxx’ failed.

Based on the failure policies for the resource and role, the cluster service may try to bring the resource online on this node or move the group to another node of the cluster and then restart it. Check the resource and group state using Failover Cluster Manager or the Get-ClusterResource Windows PowerShell cmdlet.

EventID 1205

The Cluster service failed to bring clustered role ‘xxxxx’ completely online or offline. One or more resources may be in a failed state. This may impact the availability of the clustered role.

SQL Server Log:

SQL Server has encountered x occurrence(s) of I/O requests taking longer than 15 seconds to complete on file xxxxxxxx.
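
To see which files are affected most, the slow I/O messages can be tallied per file. The following is a minimal Python sketch, not a definitive tool. It assumes the message wording quoted above and assumes the file path appears in square brackets in the actual log line; the function name and the sample lines (timestamps, paths) are hypothetical.

```python
import re
from collections import Counter

# Pattern follows the message text quoted in this post.
# Assumption: the file path is wrapped in square brackets in the real log line.
SLOW_IO = re.compile(
    r"SQL Server has encountered (\d+) occurrence\(s\) of I/O requests "
    r"taking longer than 15 seconds to complete on file \[([^\]]+)\]"
)

def slow_io_by_file(lines):
    """Sum the reported slow I/O occurrences per file (hypothetical helper)."""
    totals = Counter()
    for line in lines:
        match = SLOW_IO.search(line)
        if match:
            totals[match.group(2)] += int(match.group(1))
    return totals

if __name__ == "__main__":
    # Made-up sample lines, for illustration only.
    sample = [
        "2025-02-10 03:14:07.12 spid9s SQL Server has encountered 3 occurrence(s) "
        "of I/O requests taking longer than 15 seconds to complete on file "
        "[D:\\Data\\demo.mdf] in database id 5.",
        "2025-02-10 03:19:07.45 spid9s SQL Server has encountered 2 occurrence(s) "
        "of I/O requests taking longer than 15 seconds to complete on file "
        "[D:\\Data\\demo.mdf] in database id 5.",
    ]
    for path, count in slow_io_by_file(sample).most_common():
        print(f"{count:5d}  {path}")
```

A file that dominates the tally points at the volume or storage path to investigate first.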

Fix:

Fail over the Always On availability group to the secondary server, then restart the original primary server.
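
The failover itself is normally issued as a T-SQL `ALTER AVAILABILITY GROUP ... FAILOVER` statement, run on the secondary replica that is to become the new primary. As a rough sketch, the Python helper below only builds that statement text; it does not connect to SQL Server. The helper names and the availability group name `MyAg` are hypothetical, and the forced variant (`FORCE_FAILOVER_ALLOW_DATA_LOSS`) is included only to show the difference, since it can lose data.

```python
def quote_identifier(name):
    """Bracket-quote a T-SQL identifier, doubling any closing bracket."""
    return "[" + name.replace("]", "]]") + "]"

def failover_statement(ag_name, force=False):
    """Build the T-SQL failover statement for an availability group.

    Hypothetical helper: it returns text only. The statement is meant to be
    run on the target secondary replica, not on the current primary.
    """
    # A planned manual failover uses FAILOVER; the forced form risks data loss.
    action = "FORCE_FAILOVER_ALLOW_DATA_LOSS" if force else "FAILOVER"
    return f"ALTER AVAILABILITY GROUP {quote_identifier(ag_name)} {action};"

if __name__ == "__main__":
    # "MyAg" is a placeholder availability group name.
    print(failover_statement("MyAg"))
```

A planned manual failover expects a synchronized, synchronous-commit secondary, so it is worth checking the replica state before running the generated statement.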

Last modified: February 12, 2025
