feat: synthetic monitoring + DORA dashboard fixes (Phases 14+15) #67
No reviewers
Labels
No labels
domain:backend
domain:devops
domain:frontend
status:approved
status:in-progress
status:needs-fix
status:qa
type:bug
type:devops
type:feature
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
forgejo_admin/pal-e-platform!67
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "66-feat-synthetic-monitoring-dora-dashboard"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Summary
Adds Blackbox Exporter for synthetic monitoring of all platform and application services, with alerting rules and a Grafana uptime dashboard. Also fixes DORA dashboard PromQL queries to use correct histogram_quantile syntax and the correct metric for the repo variable dropdown.
Changes
terraform/main.tf:helm_release.blackbox_exporter-- Blackbox Exporter v9.1.0 with ServiceMonitor targeting 13 endpoints (8 platform internal, 5 app external via Tailscale funnels), 60s probe intervalkubernetes_manifest.blackbox_alerts-- PrometheusRule withEndpointDown(critical, 2m) andEndpointSlowResponse(warning, 5m) alert ruleskubernetes_config_map_v1.uptime_dashboard-- Grafana dashboard ConfigMap for service uptimeterraform/dashboards/dora-dashboard.json:quantile()->histogram_quantile()with_bucketsuffixhistogram_quantile()with properby (le, repo)groupingdora_deployments_total->dora_pr_merges_total(deployments metric doesn't exist yet)terraform/dashboards/uptime-dashboard.json(new):Test Plan
tofu fmt -recursive-- passed (no formatting issues)tofu validate-- passed ("The configuration is valid")tofu plan-- state lock held by concurrent apply; plan deferred to post-merge. Expect 3 new resources:helm_release.blackbox_exporter,kubernetes_manifest.blackbox_alerts,kubernetes_config_map_v1.uptime_dashboardmonitoringnamespacedora_pr_merges_totalReview Checklist
Related
plan-pal-e-platform(Phases 14+15)Tofu Plan Output