docs: ArgoCD CFR spike decision record (#6) #8
No reviewers
Labels
No labels
domain:backend
domain:devops
domain:frontend
status:approved
status:in-progress
status:needs-fix
status:qa
type:bug
type:devops
type:feature
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
ldraney/pal-e-dora-exporter!8
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "6-argocd-cfr-spike"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Summary
Key Findings
argocd-server.argocd.svc.cluster.localbut has no ServiceMonitor -- Prometheus is not scraping its metricsargocd_app_sync_totalwithphaselabels (Succeeded/Failed/Error) on port 8082Deliverable
docs/argocd-cfr-decision.md-- full decision record with options evaluated, pros/cons, and follow-up ticketsTest Plan
Closes #6
🤖 Generated with Claude Code
PR #8 Review
DOMAIN REVIEW
Stack detected: Docs-only spike (ArgoCD / Prometheus / Kubernetes / PromQL domain)
Technical accuracy -- all verified correct:
argocd_app_sync_total,argocd_app_info,argocd_app_sync_duration_seconds_total) are accurateSucceeded,Failed,Error,Running) are correctincrease()over counters withphasefiltering, andclamp_min(..., 1)correctly prevents division-by-zeroserviceMonitorSelectorNilUsesHelmValues = falseis a real and operationally important kube-prometheus-stack setting -- good catch documenting itkubectlcommands are correct and actionableThree-option analysis quality:
Cross-references:
src/collectors/woodpecker.pyandterraform/modules/monitoring/main.tfare contextually appropriate for the pal-e-dora-exporter and pal-e-platform repos respectivelyMinor PromQL note (nit): The
phase=~"Succeeded|Failed|Error"regex in the denominator excludesRunningsyncs. This is correct for CFR calculation (in-progress syncs are neither success nor failure), but worth a one-line comment in the doc explaining the exclusion so future readers do not wonder if it is a bug.BLOCKERS
None. This is a docs-only spike with no code changes, no secrets, and no security surface.
NITS
PR body missing
## Changessection. SOP template (template-pr-body) expects## Summary,## Changes,## Test Plan,## Review Checklist,## Related Notes. The## Key Findingsand## Deliverablesections are useful but non-standard. For a docs-only spike this is cosmetic, not blocking.PR body missing
## Related Notessection. Should reference the plan slug or related pal-e-docs notes if any exist for this spike.PromQL comment suggestion. Add a brief inline comment in the decision doc explaining why
Runningphase is excluded from the denominator (see domain review above).Singular "Deliverable" header in PR body. Minor -- "Deliverables" (plural) would be consistent with spike template conventions.
Reference link validity. The
oneuptime.com/bloglink (reference #3) may be a hallucinated URL -- worth verifying it resolves before merge. The ArgoCD docs links are standard and should be fine.SOP COMPLIANCE
6-argocd-cfr-spike)Closes #6present in PR body.envfiles, or credentials committed## Changessection (nit for docs-only PR)## Related Notessection (nit for docs-only PR)PROCESS OBSERVATIONS
VERDICT: APPROVED