ASM Health Checker Found 1 New Failures
The message is not a death knell for your database. Instead, it is an early warning system that Oracle ASM has detected a single, specific anomaly in your storage infrastructure. By methodically examining the ASM alert log, querying the dynamic performance views, and investigating the OS/storage layer, you can quickly identify whether the issue is a transient path failure, an offline disk, or a more serious metadata corruption.
The coffee hadn’t even finished brewing when Sarah saw the notification on her primary dashboard:
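ASM Health Checker found 1 new failures.

A first check is to look in the dynamic performance views for disks that report I/O errors or no reads at all:

SELECT * FROM v$asm_disk_iostat WHERE read_errs > 0 OR write_errs > 0 OR bytes_read = 0;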
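
The query above matches two rather different kinds of rows: disks with real read or write errors, and disks that have simply not been read from yet. A minimal Python sketch of how the result set might be split is shown here. The `DiskIoStat` class, the `triage` function, and the sample rows are hypothetical and only mirror the columns the query filters on; fetching the rows from the database is left out.

```python
from dataclasses import dataclass


@dataclass
class DiskIoStat:
    """One row of v$asm_disk_iostat, reduced to the columns used by the query."""
    group_number: int
    disk_number: int
    read_errs: int
    write_errs: int
    bytes_read: int


def triage(rows):
    """Split rows matched by the query into I/O-error disks and idle disks."""
    errors, idle = [], []
    for row in rows:
        if row.read_errs > 0 or row.write_errs > 0:
            errors.append(row)   # hard evidence of a failing disk or path
        elif row.bytes_read == 0:
            idle.append(row)     # matched only because nothing was read
    return errors, idle


# Hypothetical sample data, not taken from a real system.
sample = [
    DiskIoStat(1, 0, read_errs=0, write_errs=0, bytes_read=4096),
    DiskIoStat(1, 1, read_errs=3, write_errs=0, bytes_read=8192),
    DiskIoStat(2, 0, read_errs=0, write_errs=0, bytes_read=0),
]
errors, idle = triage(sample)
print("errors:", [(r.group_number, r.disk_number) for r in errors])
print("idle:", [(r.group_number, r.disk_number) for r in idle])
```

Rows in the error list deserve attention first; rows that match only on `bytes_read = 0` may belong to a disk that has not served any reads, so they are weaker evidence on their own.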
To share the alert with your team (for example in Slack or Teams), a short internal message is enough:

ASM Alert: New Failure Detected. The ASM Health Checker has just reported 1 new failure. Next steps: checking the […] and running tfactl diagcollect to grab the latest logs for investigation.
If the alert comes from F5 BIG-IP ASM (Application Security Manager) rather than Oracle ASM, it is often triggered when a monitored resource crosses a predefined threshold or a critical daemon stops responding.

Immediate review checklist

To review and resolve this failure, follow these steps:

1. Identify the failure source: Navigate to Security > Reporting > Settings > ASM Alerts in the Configuration utility. This screen displays which specific health alert was triggered (e.g., CPU usage, memory limits, or database connectivity).
2. Check daemon health: Verify that critical ASM processes such as asm_config_server are running. You can check this from the command line with tmsh show /sys service.
3. Investigate recent changes.
Longer-term remediation and prevention
| Scenario | Recommended Action |
|----------|--------------------|
| | Ignore if expected; check after maintenance. |
| Single disk failure | Replace or re-add disk: ALTER DISKGROUP DATA ONLINE DISK DATA_0001; |
| Multiple disks in same failure group | Investigate storage/LUN path issues urgently. |
| After power/storage event | Run asmcmd health check again; verify redundancy. |
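
The two disk-count rows of the table can be expressed as a small decision function. The sketch below is illustrative only: `recommend_action` and its input shape are assumptions, the disk and failure group names are made up, and the action strings are copied from the table. It assumes you have already collected the offline disks, for example from v$asm_disk rows whose mode_status is OFFLINE.

```python
from collections import Counter

# Action strings copied from the table above.
SINGLE_DISK = "Replace or re-add disk"
SAME_FAILGROUP = "Investigate storage/LUN path issues urgently"


def recommend_action(offline_disks):
    """Map a list of (disk_name, failure_group) tuples to a table row.

    Returns the recommended action, or None when no row of the table applies.
    """
    if not offline_disks:
        return None
    per_group = Counter(failgroup for _name, failgroup in offline_disks)
    if any(count > 1 for count in per_group.values()):
        return SAME_FAILGROUP
    if len(offline_disks) == 1:
        return SINGLE_DISK
    # Disks offline in different failure groups: not covered by the table.
    return None


# Hypothetical disk and failure group names.
print(recommend_action([("DATA_0001", "FG1")]))
print(recommend_action([("DATA_0001", "FG1"), ("DATA_0002", "FG1")]))
```

The maintenance and power-event rows depend on context that a disk list alone does not carry, so the sketch leaves them to the operator.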