Fix Keycloak blackbox probe: NetworkPolicy + internal URL (1 alert) #111
Labels
No labels
domain:backend
domain:devops
domain:frontend
status:approved
status:in-progress
status:needs-fix
status:qa
type:bug
type:devops
type:feature
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
forgejo_admin/pal-e-platform#111
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Lineage
plan-pal-e-platform→ Platform HardeningRepo
forgejo_admin/pal-e-platformUser Story
As a platform operator
I want the Keycloak probe to use the internal cluster URL with correct NetworkPolicy
So that EndpointDown reflects real outages, not probe misconfiguration
Context
Keycloak is healthy (200 on
/realms/masterfrom host). The alert is a false positive caused by two compounding issues:tailscalenamespace. Missing themonitoringnamespace rule every other platform service has.https://keycloak.tail5b443a.ts.netwhich routes through DERP relay and times out. All other core services use internal cluster URLs.Evidence: Pod 1/1 Running 4d2h,
curl -sk .../realms/masterreturns 200, blackbox from monitoring pod gets EOF on external URL and connection refused on internal URL (NetworkPolicy).File Targets
terraform/network-policies.tfline ~130 — addmonitoringnamespace to Keycloak ingress rulesterraform/main.tfline ~439 — change probe URL fromhttps://keycloak.tail5b443a.ts.nettohttp://keycloak.keycloak.svc.cluster.local:80/realms/masterFiles NOT to touch:
Acceptance Criteria
probe_success{service="keycloak"} == 1Test Expectations
tofu plan -lock=falseshows only the two expected changescurlfrom blackbox pod to internal URL returns 200Constraints
tofu apply(bundle with state drift apply if timing works)Checklist
tofu plancleanRelated
pal-e-platform— project boardScope Review: READY
Review note:
review-190-2026-03-18Scope is solid — all template sections present, both file targets verified against codebase, no blast radius concerns. Keycloak is the sole outlier (external probe URL + missing monitoring NetworkPolicy rule). Two-line fix, ready for agent dispatch.
Scope Review: READY
Review note:
review-190-2026-03-18All template sections present, both file targets verified against codebase. Keycloak NetworkPolicy (line 130) confirmed missing monitoring ingress; probe URL (line 440) confirmed as only platform-tier service using external URL. Internal service URL structure validated (port 80, namespace keycloak). Acceptance criteria are machine-verifiable. No blockers.
Blast radius note: Ollama NetworkPolicy also lacks monitoring ingress but has no blackbox probe — not in scope, flagged for future cleanup.