During the upgrades from 10.1 to 10.4 we saw a bunch of crashes. MariaDB thought they could be related to the innodb_buffer_pool_dump flag between restarts. We disabled it when upgrading.
However, now that we are fully running 10.4 everywhere, we should test whether it is safe to enable it again.
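For reference, a minimal sketch of how an upgrade script could toggle the dump before a restart. It assumes the flag mentioned above corresponds to the dynamic MariaDB variable `innodb_buffer_pool_dump_at_shutdown`. The helper names are hypothetical and are not taken from the actual cookbook. The functions only build SQL strings, and running them against a host is left to whatever client the script already uses.

```python
def buffer_pool_dump_statement(enable: bool) -> str:
    """Build the SQL that enables or disables the buffer pool dump at shutdown.

    Assumption: the flag discussed in this task is the dynamic global variable
    innodb_buffer_pool_dump_at_shutdown.
    """
    value = "ON" if enable else "OFF"
    return f"SET GLOBAL innodb_buffer_pool_dump_at_shutdown = {value};"


def buffer_pool_dump_check_statement() -> str:
    """Build the SQL that reads the current value, to verify before a restart."""
    return "SELECT @@GLOBAL.innodb_buffer_pool_dump_at_shutdown;"


if __name__ == "__main__":
    # Hypothetical usage: disable the dump before stopping the server for an upgrade.
    print(buffer_pool_dump_statement(enable=False))
    print(buffer_pool_dump_check_statement())
```

Keeping the statement construction separate from execution makes it easy to switch the behaviour back on later by changing a single argument.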
Details
Project | Branch | Lines +/- | Subject
---|---|---|---
operations/cookbooks | master | +1 -1 | mysql/upgrade.py: Allow buffer pool dumps
operations/puppet | production | +1 -0 | db1099: Disable notifications
Event Timeline
I have done quite a bit of testing and so far I have not been able to reproduce the crashes when doing 10.4->10.4 migrations. Going to try a few more things.
Mentioned in SAL (#wikimedia-operations) [2022-03-16T09:15:34Z] <marostegui@cumin1001> dbctl commit (dc=all): 'Depool db1099:3318 T303498', diff saved to https://phabricator.wikimedia.org/P22670 and previous config saved to /var/cache/conftool/dbconfig/20220316-091533-marostegui.json
Change 771324 had a related patch set uploaded (by Marostegui; author: Marostegui):
[operations/puppet@production] db1099: Disable notifications
Change 771324 merged by Marostegui:
[operations/puppet@production] db1099: Disable notifications
I still haven't been able to reproduce a crash. Before closing this, I am going to upgrade a few more hosts as part of the s3 upgrade with the buffer pool dump enabled, to see if we get any extra insights.
Change 775260 had a related patch set uploaded (by Marostegui; author: Marostegui):
[operations/cookbooks@master] mysql/upgrade.py: Allow buffer pool dumps
I haven't been able to replicate the crashes/see more errors when upgrading, so I am considering this fixed for now. We can reopen if we see something else coming.
Ready to close this ticket once the above patch is reviewed + merged.
Change 775260 merged by jenkins-bot:
[operations/cookbooks@master] mysql/upgrade.py: Allow buffer pool dumps