Automation Suite install fails on rook-ceph.
Issue Description: Automation Suite install fails on rook-ceph
Root Cause: There are many things that could cause this but most likely there is an issue with node communication, the nodes are not in the right availability zones (for multinode) or there is clock skew.
Diagnosing / Resolving:
- For multinode, review the following: Advanced installation experience - Optional Enabling Resilience To Zonal Failures In A Multi Node- HA Ready Production Cluster
- For true multinode, there should be multiple availability zones.
- Its important that the requirements above are met for a HAA deployment. If they are not, that should be corrected.
- Enable kubectl: Accessing Automation Suite
- Verify if the tools pod was deployed and check the status of the rook-ceph instance
- kubectl -n rook-ceph exec -it deploy/rook-ceph-tools -- ceph status
- Check to see if the returned values have any warnings. For example if there is a clock skew warning, that could be the issue.(If this is the case, verify that chronyd is configured correctly)
- If ceph status fails, or there are no warnings, generate logs and open a ticket. The normal support bundle will not work, so generate an rke2 support bundle.