fix: pre-pull Rancher images and reset Rancher release during bootstrap
Deploy Cluster / Terraform (push) Successful in 28s
Deploy Cluster / Ansible (push) Failing after 27m30s

Rancher installs were stalling on transient Docker Hub TLS handshake timeouts
for rancher shell, webhook, and system-upgrade-controller images. Pre-pull the
required images onto all nodes after k3s comes up, extend the Rancher HelmRelease
timeout, and reset/force the Rancher HelmRelease before waiting on addon-rancher
so bootstrap can recover from stale failed remediation state.
This commit is contained in:
2026-04-22 11:00:54 +00:00
parent 8372d562ad
commit 9c0523e880
5 changed files with 32 additions and 2 deletions
@@ -5,6 +5,7 @@ metadata:
namespace: flux-system
spec:
interval: 10m
timeout: 15m
targetNamespace: cattle-system
chart:
spec: