Page MenuHomePhabricator

Investigate if stopping mysql with buffer_pool dump between 10.4 versions is safe
Closed, ResolvedPublic

Description

During the upgrades from 10.1 to 10.4 we saw a bunch of crashes. MariaDB thought they could be related to the innodb_buffer_pool_dump flag between restarts. We disabled it when upgrading.
However, now we are fully running 10.4 everywhere, we should test if this is safe to do again

Event Timeline

Marostegui moved this task from Triage to Ready on the DBA board.

I have done quite a bunch of testing and so far I have not been able to reproduce the crashes when doing 10.4->10.4 migrations. Going to try a few more things.

Mentioned in SAL (#wikimedia-operations) [2022-03-16T09:15:34Z] <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1099:3318 T303498', diff saved to https://phabricator.wikimedia.org/P22670 and previous config saved to /var/cache/conftool/dbconfig/20220316-091533-marostegui.json

Change 771324 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/puppet@production] db1099: Disable notifications

https://gerrit.wikimedia.org/r/771324

Change 771324 merged by Marostegui:

[operations/puppet@production] db1099: Disable notifications

https://gerrit.wikimedia.org/r/771324

I haven't been able to reproduce a crash again. Before closing this, I am going to upgrade a few more hosts as part of the s3 upgrade dumping the buffer pool, to see if we get some extra insights.

Change 775260 had a related patch set uploaded (by Marostegui; author: Marostegui):

[operations/cookbooks@master] mysql/upgrade.py: Allow buffer pool dumps

https://gerrit.wikimedia.org/r/775260

I haven't been able to replicate the crashes/see more errors when upgrading, so I am considering this fixed for now. We can reopen if we see something else coming.
Ready to close this ticket once the above patch is reviewed + merged.

Change 775260 merged by jenkins-bot:

[operations/cookbooks@master] mysql/upgrade.py: Allow buffer pool dumps

https://gerrit.wikimedia.org/r/775260