Fix Telegram chat_id type and dora-exporter OOM #53
Reference
forgejo_admin/pal-e-platform#53
Lineage
bug-alert-noise-broken-services: relates to plan-pal-e-platform → Phase 3 (alerting)
Repo
forgejo_admin/pal-e-platform
User Story
As a platform operator
I want the PrometheusOperator to sync successfully and the dora-exporter to stop OOM-killing
So that the monitoring stack is healthy and alerts are meaningful
Context
Two TF configuration issues in the monitoring stack:
PrometheusOperatorSyncFailed: The Telegram chat_id Helm value at line 317 uses type = "string", which forces the value to be quoted. The alertmanager config parser expects int64 for chat_id. Error: yaml: unmarshal errors: line 31: cannot unmarshal !!str '-520096...' into int64. This was flagged as a QA nit on PR #43.
OOMKilled (dora-exporter): The dora-exporter container at line 1030 has a 128Mi memory limit. It is being OOM-killed repeatedly (exit code 137). The current OOMKilled alert is critical severity.
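The chat_id fix can be sketched roughly as follows, assuming the value is passed through a set block on a helm_release resource (block syntax as in the 2.x Helm provider). The resource name, chart, value path, and variable name below are illustrative assumptions, not copied from terraform/main.tf. With type = "string" the value is handed to Helm as a string and rendered quoted; with type = "auto" Helm infers the type, so a numeric chat ID should render as an unquoted integer.

```hcl
# Hypothetical sketch: names and the value path are assumptions,
# not the actual contents of terraform/main.tf.
variable "telegram_chat_id" {
  type        = string
  description = "Telegram chat ID used by the Alertmanager receiver (assumed variable name)"
}

resource "helm_release" "kube_prometheus_stack" {
  name       = "kube-prometheus-stack"
  repository = "https://prometheus-community.github.io/helm-charts"
  chart      = "kube-prometheus-stack"
  namespace  = "monitoring" # assumed namespace

  set {
    # Assumed value path for the Telegram receiver's chat_id.
    name  = "alertmanager.config.receivers[0].telegram_configs[0].chat_id"
    value = var.telegram_chat_id

    # "string" forces a quoted YAML string; "auto" lets Helm infer an integer.
    type = "auto"
  }
}
```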
File Targets
Files the agent should modify:
terraform/main.tf line 317: change type = "string" to type = "auto" for chat_id
terraform/main.tf line 1030: change memory limit from "128Mi" to "256Mi"
Files the agent should NOT touch:
.woodpecker.yaml: CI pipeline, no changes needed
Acceptance Criteria
type = "auto" instead of type = "string"
tofu validate passes
tofu fmt produces no changes
Test Expectations
tofu validate passes
tofu fmt check produces no diff
Constraints
Checklist
tofu validate passes
tofu fmt clean
Related
bug-alert-noise-broken-services: pal-e-docs bug note
plan-pal-e-platform: Platform Hardening plan
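
For reference, the memory limit change listed under File Targets might look like the sketch below. It assumes the dora-exporter is declared as a kubernetes_deployment_v1 resource. The source does not say how the container is defined at line 1030, so the resource type, labels, namespace, and image are placeholders. Only the 128Mi to 256Mi change comes from the issue itself.

```hcl
# Hypothetical sketch: resource type, labels, namespace, and image are assumptions.
resource "kubernetes_deployment_v1" "dora_exporter" {
  metadata {
    name      = "dora-exporter"
    namespace = "monitoring" # assumed namespace
  }

  spec {
    replicas = 1

    selector {
      match_labels = { app = "dora-exporter" }
    }

    template {
      metadata {
        labels = { app = "dora-exporter" }
      }

      spec {
        container {
          name  = "dora-exporter"
          image = "example.invalid/dora-exporter:latest" # placeholder image

          resources {
            limits = {
              memory = "256Mi" # raised from "128Mi" to stop repeated OOM kills
            }
          }
        }
      }
    }
  }
}
```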