Lately we have been having quite a few cronjob failures, with @phaultfinder opening various tasks.
On March 25th @ 15:00 we switched all jobs to eqiad, as part of {T413974}.
On April 2nd @ 13:17 we pooled codfw back for reads.
I updated the [[ https://logstash.wikimedia.org/app/dashboards#/view/d51552d0-e309-11ef-87d0-9371e01d3c68?_g=(filters%3A!()%2CrefreshInterval%3A(pause%3A!t%2Cvalue%3A0)%2Ctime%3A(from%3Anow-3d%2Cto%3Anow)) | mw-cron (MediaWiki Periodic Jobs on k8s) ]] dashboard in an effort to extract more information. Because the logs are multiline, the `MediaWiki Maintenance Jobs - k8s` search returns a lot of results, which makes filtering a bit difficult.
~~**Culprit #1**~~
~~Related to {T422455}.~~
```
{
  "query": {
    "regexp": {
      "log.keyword": ".*curl error: 28.*Timeout was reached.*"
    }
  }
}
```
**Culprit #2**
Additionally, some jobs seem to fail because they are unable to talk to the DB:
{F75318592}
```
{
  "query": {
    "regexp": {
      "log.keyword": ".*Error.+2006.+MySQL server has gone away.*"
    }
  }
}
```
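
For anyone who wants to sanity-check the two patterns before pasting them into the dashboard, here is a rough local sketch. It rebuilds the same `regexp` query bodies and tries the patterns against a few sample lines. Assumptions: the helper names (`regexp_query`, `classify`) and the sample log lines are made up for illustration and are not real mw-cron output; Python's `re.fullmatch` with `re.DOTALL` is used as a stand-in for the anchored, whole-value matching of `regexp` queries, which should be close enough for these simple patterns but is not the same engine.

```python
import json
import re

# Patterns copied verbatim from the two queries above.
PATTERNS = {
    "curl_timeout": ".*curl error: 28.*Timeout was reached.*",
    "mysql_gone_away": ".*Error.+2006.+MySQL server has gone away.*",
}


def regexp_query(pattern, field="log.keyword"):
    """Build a query body with the same shape as the ones in this task."""
    return {"query": {"regexp": {field: pattern}}}


def classify(line):
    """Return the names of all patterns that match the whole log line.

    regexp queries match against the entire field value, so fullmatch is
    used here; DOTALL lets '.' span the newlines of multiline logs
    (an assumption about how the field is stored).
    """
    return [
        name
        for name, pattern in PATTERNS.items()
        if re.fullmatch(pattern, line, flags=re.DOTALL)
    ]


# Hypothetical sample lines, for illustration only.
SAMPLES = [
    "Request failed: curl error: 28 (Timeout was reached)",
    "DBQueryError: Error 2006: MySQL server has gone away (some-host)",
    "Job finished successfully",
]

if __name__ == "__main__":
    for name, pattern in PATTERNS.items():
        print(name, json.dumps(regexp_query(pattern)))
    for line in SAMPLES:
        print(classify(line), "<-", line)
```

Keeping one named pattern per culprit makes it easy to add a third one later and to count how many failures fall under each.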