Fix embedding worker CrashLoopBackOff — set replicas to 0 #142
Labels
No labels
domain:backend
domain:devops
domain:frontend
status:approved
status:in-progress
status:needs-fix
status:qa
type:bug
type:devops
type:feature
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
forgejo_admin/pal-e-api#142
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Lineage
bug-embedding-worker-crashloop(no plan ancestry — bug fix)Repo
forgejo_admin/pal-e-docsUser Story
As a platform operator
I want the embedding worker deployment scaled to 0 replicas
So that CrashLoopBackOff noise stops and ArgoCD shows a clean sync
Context
Two embedding worker pods have been in CrashLoopBackOff for 3.5+ days (1000+ restarts). The error is
No module named pal_e_docs.embedding_worker— the deployment manifest pins an image tag (bec9a4ecb1...) that predates theembedding_worker.pymodule. The module exists in the repo atsrc/pal_e_docs/embedding_worker.pybut was never included in that image build. Embeddings are computed inline in the API, so this worker is not needed for current functionality.ArgoCD reconciles
kubectl scalechanges back toreplicas: 1, so the fix must go through the repo.File Targets
Files to modify:
k8s/embedding-worker.yaml— changereplicas: 1toreplicas: 0on line 6Files NOT to touch:
Acceptance Criteria
k8s/embedding-worker.yamlhasreplicas: 0Test Expectations
grep replicas k8s/embedding-worker.yamlshowsreplicas: 0Constraints
Checklist
Related
pal-e-docs— affected projectbug-embedding-worker-crashloop— investigation note in pal-e-docs