Previous build failed due to harbor quota and got the jobs stuck.
Clearing out the builds/jobs and making a new deployment has them failing on creation.
tools.cluebotng-trainer@tools-bastion-15:~$ toolforge components deployment show
Deployment ID: 20260408-220715-f39kf7ufyt
Created: 20260408-220715
Status: failed
Long status:
Got exception: Failed run for component grafana-alloy: 422 Client Error: Unprocessable Entity for url: https://api.svc.tools.eqiad1.wikimedia.cloud:30003/jobs/v1/tool/cluebotng-trainer/jobs/ (422): no details
Builds:
coordinator(successful): id:cluebotng-trainer-buildpacks-pipelinerun-6kxpf You can see the logs with `toolforge build logs cluebotng-trainer-buildpacks-pipelinerun-6kxpf`
file-api(successful): id:cluebotng-trainer-buildpacks-pipelinerun-fpb2w You can see the logs with `toolforge build logs cluebotng-trainer-buildpacks-pipelinerun-fpb2w`
grafana-alloy(successful): id:cluebotng-trainer-buildpacks-pipelinerun-zqzvk You can see the logs with `toolforge build logs cluebotng-trainer-buildpacks-pipelinerun-zqzvk`
pushgateway(successful): id:cluebotng-trainer-buildpacks-pipelinerun-k8d79 You can see the logs with `toolforge build logs cluebotng-trainer-buildpacks-pipelinerun-k8d79`
Runs:
coordinator(successful): created or updated job coordinator, [info](Job coordinator was updated in storage and runtime)
file-api(successful): created or updated job file-api, [info](Job file-api was updated in storage and runtime)
grafana-alloy(failed): 422 Client Error: Unprocessable Entity for url: https://api.svc.tools.eqiad1.wikimedia.cloud:30003/jobs/v1/tool/cluebotng-trainer/jobs/ (422): no details
pushgateway(skipped): Skipped due to previous failure
Tool config:
components:
coordinator:
build:
ref: refs/tags/v1.4.16
repository: https://github.com/cluebotng/trainer.git
use_latest_versions: true
run:
command: launcher ./deployment/entrypoint.sh run-edit-sets
mount: all
schedule: 13 13 * * *
file-api:
build:
ref: refs/tags/v1.2.4
repository: https://github.com/cluebotng/fileserver.git
use_latest_versions: true
run:
command: run-fileserver
mount: all
port: 8000
grafana-alloy:
build:
ref: refs/tags/v0.4.8
repository: https://github.com/cluebotng/external-grafana-alloy.git
use_latest_versions: true
run:
command: run-alloy
health_check_http: /health
mount: all
port: 8118
pushgateway:
build:
ref: refs/tags/v0.0.4
repository: https://github.com/cluebotng/external-pushgateway.git
use_latest_versions: true
run:
command: run-pushgateway
health_check_http: /-/healthy
mount: all
port: 9091tools.cluebotng-trainer@tools-bastion-15:~$ toolforge components deployment show
Deployment ID: 20260408-221003-y37iloyuxw
Created: 20260408-221003
Status: failed
Long status:
Got exception: Failed run for component grafana-alloy: 422 Client Error: Unprocessable Entity for url: https://api.svc.tools.eqiad1.wikimedia.cloud:30003/jobs/v1/tool/cluebotng-trainer/jobs/ (422): no details
Builds:
coordinator(skipped): id:cluebotng-trainer-buildpacks-pipelinerun-6kxpf Reusing existing build
file-api(skipped): id:cluebotng-trainer-buildpacks-pipelinerun-fpb2w Reusing existing build
grafana-alloy(skipped): id:cluebotng-trainer-buildpacks-pipelinerun-zqzvk Reusing existing build
pushgateway(skipped): id:cluebotng-trainer-buildpacks-pipelinerun-k8d79 Reusing existing build
Runs:
coordinator(successful): job coordinator is already up to date, [info](Job coordinator is already up to date)
file-api(successful): job file-api is already up to date, [info](Job file-api is already up to date)
grafana-alloy(failed): 422 Client Error: Unprocessable Entity for url: https://api.svc.tools.eqiad1.wikimedia.cloud:30003/jobs/v1/tool/cluebotng-trainer/jobs/ (422): no details
pushgateway(skipped): Skipped due to previous failure
Tool config:
components:
coordinator:
build:
ref: refs/tags/v1.4.16
repository: https://github.com/cluebotng/trainer.git
use_latest_versions: true
run:
command: launcher ./deployment/entrypoint.sh run-edit-sets
mount: all
schedule: 13 13 * * *
file-api:
build:
ref: refs/tags/v1.2.4
repository: https://github.com/cluebotng/fileserver.git
use_latest_versions: true
run:
command: run-fileserver
mount: all
port: 8000
grafana-alloy:
build:
ref: refs/tags/v0.4.8
repository: https://github.com/cluebotng/external-grafana-alloy.git
use_latest_versions: true
run:
command: run-alloy
health_check_http: /health
mount: all
port: 8118
pushgateway:
build:
ref: refs/tags/v0.0.4
repository: https://github.com/cluebotng/external-pushgateway.git
use_latest_versions: true
run:
command: run-pushgateway
health_check_http: /-/healthy
mount: all
port: 9091Trying with the previous config (file-api was changed to try and fix the 422, but then grafana started failing)
tools.cluebotng-trainer@tools-bastion-15:~$ toolforge components deployment show
Deployment ID: 20260408-221101-b5es1o04q3
Created: 20260408-221101
Status: failed
Long status:
Got exception: Failed run for component file-api: 422 Client Error: Unprocessable Entity for url: https://api.svc.tools.eqiad1.wikimedia.cloud:30003/jobs/v1/tool/cluebotng-trainer/jobs/ (422): no details
Builds:
coordinator(skipped): id:cluebotng-trainer-buildpacks-pipelinerun-6kxpf Reusing existing build
file-api(skipped): id:cluebotng-trainer-buildpacks-pipelinerun-fpb2w Reusing existing build
grafana-alloy(skipped): id:cluebotng-trainer-buildpacks-pipelinerun-zqzvk Reusing existing build
pushgateway(skipped): id:cluebotng-trainer-buildpacks-pipelinerun-k8d79 Reusing existing build
Runs:
coordinator(successful): job coordinator is already up to date, [info](Job coordinator is already up to date)
file-api(failed): 422 Client Error: Unprocessable Entity for url: https://api.svc.tools.eqiad1.wikimedia.cloud:30003/jobs/v1/tool/cluebotng-trainer/jobs/ (422): no details
grafana-alloy(skipped): Skipped due to previous failure
pushgateway(skipped): Skipped due to previous failure
Tool config:
components:
coordinator:
build:
ref: refs/tags/v1.4.16
repository: https://github.com/cluebotng/trainer.git
use_latest_versions: true
run:
command: launcher ./deployment/entrypoint.sh run-edit-sets
mount: all
schedule: 13 13 * * *
file-api:
build:
ref: refs/tags/v1.2.4
repository: https://github.com/cluebotng/fileserver.git
use_latest_versions: true
run:
command: run-fileserver
health_check_http: /_/health
mount: all
port: 8000
replicas: 2
grafana-alloy:
build:
ref: refs/tags/v0.4.8
repository: https://github.com/cluebotng/external-grafana-alloy.git
use_latest_versions: true
run:
command: run-alloy
health_check_http: /health
mount: all
port: 8118
pushgateway:
build:
ref: refs/tags/v0.0.4
repository: https://github.com/cluebotng/external-pushgateway.git
use_latest_versions: true
run:
command: run-pushgateway
health_check_http: /-/healthy
mount: all
port: 9091Jobs are missing as expected
tools.cluebotng-trainer@tools-bastion-15:~$ toolforge jobs list +-------------+-----------------------+----------------------------+ | Job name: | Job type: | Status: | +-------------+-----------------------+----------------------------+ | coordinator | schedule: 13 13 * * * | Waiting for scheduled time | | file-api | continuous | Running | +-------------+-----------------------+----------------------------+
Need someone to check the logs to see why this is failing, nothing has changed aside from a new image build.