Request increased quota for wikiwho Cloud VPS project
Closed, Resolved · Public

Description

Project Name: wikiwho
Type of quota increase requested: CPU/RAM
Reason: Our VM (wikiwho-api.wikiwho.eqiad1.wikimedia.cloud) uses server-local storage instead of ceph. @Andrew has informed us that Cloud Services needs to upgrade the hypervisor that hosts WikiWho, and in order to do so the VM would need to be shut down entirely. We were told that building a new VM using ceph would prevent future downtime like this, hence this request.

In T334891 (Add more languages to WikiWho and build new VM), we've been adding new languages to the volumes using the current instance, since it has a lot of resources (g3.cores24.ram122.disk20). The thought was that once we added the languages, we could get away with a smaller instance flavor for the new VM. However, we didn't get as far as we wanted, and I definitely think we'll be adding more languages in the near future. Additionally, RAM consumption on the current instance is already at about 98%, even while we're not importing new languages, and disk usage is at 61%. The 24 VCPUs are not all in use, but they will be when we add more languages again.

So, all things considered, I think it's best if we build the new VM with the same resources as the old one: 24 VCPUs, 122GB RAM, and 20GB disk space. As a reminder, this time we would like the VM to be built using ceph storage. I'm assuming that if we are granted the additional quota, simply creating the instance in Horizon will use ceph by default.

Event Timeline

Restricted Application added a subscriber: Aklapper.

Note that I will be attending the Wikimedia Hackathon this week, followed by a week of holiday. I hope to get this done before then, but I understand some of Cloud Services will also be at the Hackathon, so we can take care of it there together if you'd prefer :) I say all of this as I understand there's a bit of urgency for Andrew to tend to the hypervisor. If Monday, May 22 comes and it still isn't migrated, I can see if @Ragesoss can assist while I'm away on holiday. We have a setup script so I suspect rebuilding the new VM shouldn't take very long.

+1 approved. Please ping on this ticket when the old VM is removed so we can revert (some of) the quota increase.

Mentioned in SAL (#wikimedia-cloud-feed) [2023-05-31T09:31:55Z] <wm-bot2> Increased quotas by 24 cores, 124928 ram (T336685) - cookbook ran by dcaro@vulcanus
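
For reference, the figures in the log entry above line up with the requested flavor: 122GB of RAM is 122 × 1024 = 124928 when expressed in MiB. The snippet below is only a rough sketch of that arithmetic. It assumes the flavor name follows a `<generation>.cores<N>.ram<GB>.disk<GB>` pattern and that the RAM quota is counted in MiB; `parse_flavor` and `quota_increase` are hypothetical helper names, not part of any Cloud VPS tooling.

```python
import re

# Assumed naming pattern: <generation>.cores<N>.ram<GB>.disk<GB>
FLAVOR_PATTERN = re.compile(
    r"(?P<generation>\w+)\.cores(?P<cores>\d+)\.ram(?P<ram_gb>\d+)\.disk(?P<disk_gb>\d+)"
)


def parse_flavor(name: str) -> dict:
    """Split a flavor name such as 'g3.cores24.ram122.disk20' into its parts."""
    match = FLAVOR_PATTERN.fullmatch(name)
    if match is None:
        raise ValueError(f"unrecognised flavor name: {name!r}")
    return {
        "generation": match.group("generation"),
        "cores": int(match.group("cores")),
        "ram_gb": int(match.group("ram_gb")),
        "disk_gb": int(match.group("disk_gb")),
    }


def quota_increase(name: str) -> dict:
    """Cores and RAM (in MiB, an assumption) needed to fit one more instance."""
    flavor = parse_flavor(name)
    return {"cores": flavor["cores"], "ram_mib": flavor["ram_gb"] * 1024}


print(quota_increase("g3.cores24.ram122.disk20"))
# {'cores': 24, 'ram_mib': 124928}
```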

dcaro claimed this task.
dcaro subscribed.

Sorry for the delay, done.

@dcaro @Andrew The old instance has now been deleted, so the temporary quota increase can be reverted. I'm not sure how to verify whether we're now using ceph storage rather than server-local storage, but everything seems to be running smoothly.

Thanks as always!

Thank you! I've reverted the quota change.