Error connecting to db1237 as user wikiadmin2023: :real_connect(): (HY000/2002): Connection timed out on labswiki
Closed, Resolved | Public | Production Error

Description

Error
message
Error connecting to db1237 as user wikiadmin2023: :real_connect(): (HY000/2002): Connection timed out
trace
from /srv/mediawiki/php-1.43.0-wmf.21/includes/libs/rdbms/database/Database.php(1210)
#0 /srv/mediawiki/php-1.43.0-wmf.21/includes/libs/rdbms/database/DatabaseMySQL.php(136): Wikimedia\Rdbms\Database->newExceptionAfterConnectError(string)
#1 /srv/mediawiki/php-1.43.0-wmf.21/includes/libs/rdbms/database/Database.php(265): Wikimedia\Rdbms\DatabaseMySQL->open(string, string, string, null, null, string)
#2 /srv/mediawiki/php-1.43.0-wmf.21/includes/libs/rdbms/loadbalancer/LoadBalancer.php(1056): Wikimedia\Rdbms\Database->initConnection()
#3 /srv/mediawiki/php-1.43.0-wmf.21/includes/libs/rdbms/loadbalancer/LoadBalancer.php(915): Wikimedia\Rdbms\LoadBalancer->reallyOpenConnection(int, Wikimedia\Rdbms\DatabaseDomain, array)
#4 /srv/mediawiki/php-1.43.0-wmf.21/includes/libs/rdbms/loadbalancer/LoadBalancer.php(789): Wikimedia\Rdbms\LoadBalancer->reuseOrOpenConnectionForNewRef(int, Wikimedia\Rdbms\DatabaseDomain, int)
#5 /srv/mediawiki/php-1.43.0-wmf.21/includes/libs/rdbms/loadbalancer/LoadBalancer.php(1867): Wikimedia\Rdbms\LoadBalancer->getServerConnection(int, string, int)
#6 /srv/mediawiki/php-1.43.0-wmf.21/includes/libs/objectcache/wancache/WANObjectCache.php(1729): Wikimedia\Rdbms\LoadBalancer->Wikimedia\Rdbms\{closure}(bool, int, array, null, array)
#7 /srv/mediawiki/php-1.43.0-wmf.21/includes/libs/objectcache/wancache/WANObjectCache.php(1559): WANObjectCache->fetchOrRegenerate(string, int, Closure, array, array)
#8 /srv/mediawiki/php-1.43.0-wmf.21/includes/libs/rdbms/loadbalancer/LoadBalancer.php(1878): WANObjectCache->getWithSetCallback(string, int, Closure, array)
#9 /srv/mediawiki/php-1.43.0-wmf.21/includes/libs/rdbms/loadbalancer/LoadBalancer.php(375): Wikimedia\Rdbms\LoadBalancer->getLagTimes()
#10 /srv/mediawiki/php-1.43.0-wmf.21/includes/libs/rdbms/loadbalancer/LoadBalancer.php(510): Wikimedia\Rdbms\LoadBalancer->getRandomNonLagged(array)
#11 /srv/mediawiki/php-1.43.0-wmf.21/includes/libs/rdbms/loadbalancer/LoadBalancer.php(446): Wikimedia\Rdbms\LoadBalancer->pickReaderIndex(array)
#12 /srv/mediawiki/php-1.43.0-wmf.21/includes/libs/rdbms/loadbalancer/LoadBalancer.php(769): Wikimedia\Rdbms\LoadBalancer->getReaderIndex(string)
#13 /srv/mediawiki/php-1.43.0-wmf.21/includes/libs/rdbms/database/DBConnRef.php(111): Wikimedia\Rdbms\LoadBalancer->getConnectionInternal(int, array, string, int)
#14 /srv/mediawiki/php-1.43.0-wmf.21/includes/libs/rdbms/database/DBConnRef.php(125): Wikimedia\Rdbms\DBConnRef->ensureConnection()
#15 /srv/mediawiki/php-1.43.0-wmf.21/includes/libs/rdbms/database/DBConnRef.php(338): Wikimedia\Rdbms\DBConnRef->__call(string, array)
#16 /srv/mediawiki/php-1.43.0-wmf.21/includes/libs/rdbms/querybuilder/SelectQueryBuilder.php(764): Wikimedia\Rdbms\DBConnRef->selectField(array, string, array, string, array, array)
#17 /srv/mediawiki/php-1.43.0-wmf.21/extensions/CheckUser/src/Services/CheckUserCentralIndexManager.php(88): Wikimedia\Rdbms\SelectQueryBuilder->fetchField()
#18 /srv/mediawiki/php-1.43.0-wmf.21/extensions/CheckUser/src/Jobs/PruneCheckUserDataJob.php(64): MediaWiki\CheckUser\Services\CheckUserCentralIndexManager->purgeExpiredRows(string, string)
#19 /srv/mediawiki/php-1.43.0-wmf.21/includes/jobqueue/JobRunner.php(369): MediaWiki\CheckUser\Jobs\PruneCheckUserDataJob->run()
#20 /srv/mediawiki/php-1.43.0-wmf.21/includes/jobqueue/JobRunner.php(330): JobRunner->doExecuteJob(MediaWiki\CheckUser\Jobs\PruneCheckUserDataJob)
#21 /srv/mediawiki/php-1.43.0-wmf.21/includes/jobqueue/JobRunner.php(225): JobRunner->executeJob(MediaWiki\CheckUser\Jobs\PruneCheckUserDataJob)
#22 /srv/mediawiki/php-1.43.0-wmf.21/maintenance/runJobs.php(101): JobRunner->run(array)
#23 /srv/mediawiki/php-1.43.0-wmf.21/maintenance/includes/MaintenanceRunner.php(703): RunJobs->execute()
#24 /srv/mediawiki/php-1.43.0-wmf.21/maintenance/run.php(51): MediaWiki\Maintenance\MaintenanceRunner->run()
#25 /srv/mediawiki/multiversion/MWScript.php(158): require_once(string)
#26 {main}
Impact

Purging of CheckUser data is not occurring consistently on labswiki. It needs to occur so that we stay in line with data retention policies. There have been 503 of these errors in the last 24 hours, which is also causing logstash spam.

This appears to have started after wmf.21 was deployed to labswiki via the train.

Event Timeline

In less than a month, wikitech will move into production, and this would then be resolved automatically. Is it breaking any important functionality?

This comment was removed by Dreamy_Jazz.

Looking at the stack trace again, I see that this isn't actually failing to purge data from the local CheckUser tables, but from the new central tables on extension1. Therefore this is the same issue as seen in T366574.

Purging the central tables is important, but they should not contain any specifically private data, as we do not have temporary accounts enabled on labswiki. As such, I think it's not something that must be resolved within the month.
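For anyone re-reading the trace the same way: the quickest check is to find the first frame that lives under `/extensions/`, since that shows which extension code issued the query. Below is a minimal sketch of that reading in Python. The helper name `first_extension_frame` and the regular expression are illustrative assumptions, not part of MediaWiki or any logstash tooling.

```python
import re

# Matches frames of the form "#17 /path/to/File.php(88): Some\Class->method()".
FRAME_RE = re.compile(r"^#(\d+) (\S+?)\((\d+)\): (.+)$")

def first_extension_frame(trace_lines):
    """Return (index, file, line, call) for the first frame under /extensions/, or None."""
    for raw in trace_lines:
        match = FRAME_RE.match(raw.strip())
        if match and "/extensions/" in match.group(2):
            return int(match.group(1)), match.group(2), int(match.group(3)), match.group(4)
    return None

# Two frames copied from the trace in the task description.
sample = [
    r"#16 /srv/mediawiki/php-1.43.0-wmf.21/includes/libs/rdbms/querybuilder/SelectQueryBuilder.php(764): Wikimedia\Rdbms\DBConnRef->selectField(array, string, array, string, array, array)",
    r"#17 /srv/mediawiki/php-1.43.0-wmf.21/extensions/CheckUser/src/Services/CheckUserCentralIndexManager.php(88): Wikimedia\Rdbms\SelectQueryBuilder->fetchField()",
]
frame = first_extension_frame(sample)
```

For the sample above, the helper picks frame #17 in `CheckUserCentralIndexManager.php`, which is the central index code path rather than the local table purge.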

As such, I think it's not something that must be resolved within the month.

In this case, is this no longer a train-blocker?

Yup, and I'll un-tag it as such.

It appears that, because of the DB connection issues, no data is actually being purged from the local CheckUser tables, because the query rolls back. This is a serious problem.
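To illustrate the failure mode described above, here is a rough, language-neutral sketch in Python using an in-memory SQLite database. It is not the CheckUser job code, and the real job may be structured differently. The table `local_rows`, the exception `CentralDbUnavailable`, and the function names are all hypothetical stand-ins. It only shows how a failing central-DB step inside the same unit of work can undo a local delete, and how running the central step separately lets the local purge commit.

```python
import sqlite3

class CentralDbUnavailable(Exception):
    """Hypothetical stand-in for the connection timeout seen in the trace."""

def purge_central_index(cutoff):
    # Assumption for the sketch: the central DB is always unreachable.
    raise CentralDbUnavailable("Connection timed out")

def purge_local_rows(conn, cutoff):
    # Delete expired rows from the (hypothetical) local table.
    cur = conn.execute("DELETE FROM local_rows WHERE ts < ?", (cutoff,))
    return cur.rowcount

def run_job_coupled(conn, cutoff):
    """Local delete and central purge share one transaction."""
    try:
        with conn:  # commits on success, rolls back if an exception escapes
            deleted = purge_local_rows(conn, cutoff)
            purge_central_index(cutoff)
        return deleted
    except CentralDbUnavailable:
        return 0  # the local delete was rolled back as well

def run_job_isolated(conn, cutoff):
    """Commit the local delete first, then attempt the central purge."""
    with conn:
        deleted = purge_local_rows(conn, cutoff)
    try:
        purge_central_index(cutoff)
    except CentralDbUnavailable:
        pass  # a real job would log this and retry later
    return deleted

def make_db():
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE local_rows (ts INTEGER)")
    conn.executemany("INSERT INTO local_rows VALUES (?)", [(1,), (2,), (10,)])
    conn.commit()
    return conn

def remaining(conn):
    return conn.execute("SELECT COUNT(*) FROM local_rows").fetchone()[0]
```

With a cutoff of 5, the coupled variant leaves all three rows in place, while the isolated variant removes the two expired rows even though the central step still fails.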

Dreamy_Jazz claimed this task.