
Create helm chart for Speechoid
Open, Needs Triage · Public · 16 Estimated Story Points

Description

https://wikitech.wikimedia.org/wiki/Deployment_pipeline/Migration/Tutorial#Creating_a_Helm_Chart

How do we define hostnames/IPs for dependencies? For example, wikispeech-server needs to be aware of the other Speechoid services. In the docker-compose setup on wmflabs this is fairly simple. Also, Speechoid might not start up until the dependent services are available. We will probably set up a pod that bundles the complete Speechoid package with all of its services.

See https://github.com/karlwettin/wikispeech-docker-compose/blob/master/docker-compose.yml
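As a first sketch (not the actual chart), each Speechoid component from the docker-compose file could get its own Kubernetes Service, which gives it a stable in-cluster DNS name that wikispeech-server can use instead of the docker-compose hostnames. The name, namespace and labels below are assumptions for illustration; the only concrete value is the MaryTTS default HTTP port (59125).

```yaml
# Illustrative only: one Service per Speechoid component, so that e.g.
# wikispeech-server can reach MaryTTS at
# http://mary-tts.speechoid.svc.cluster.local:59125
apiVersion: v1
kind: Service
metadata:
  name: mary-tts          # hypothetical name, mirroring the docker-compose service
  namespace: speechoid    # hypothetical namespace
spec:
  selector:
    app: mary-tts
  ports:
    - port: 59125         # MaryTTS default HTTP port
      targetPort: 59125
```

If we instead bundle everything into a single pod as suggested above, the containers share a network namespace and reach each other on localhost; the trade-off is that the whole bundle then scales as one unit.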

Dependent services are defined in the wikispeech-server configuration. We will need to modify this to point at the Kubernetes service hostnames instead. See https://github.com/karlwettin/wikispeech-docker-compose/blob/master/compose-files/mockup.conf
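One way to wire this up, sketched with hypothetical value names (the real keys depend on the chart layout and on the wikispeech-server config format shown in mockup.conf), is to template the service URLs through the chart's values.yaml:

```yaml
# Hypothetical values.yaml fragment; key names and ports (other than MaryTTS's
# 59125) are placeholders, not the real wikispeech-server configuration keys.
wikispeech_server:
  services:
    marytts: "http://mary-tts.speechoid.svc.cluster.local:59125"
    symbolset: "http://symbolset.speechoid.svc.cluster.local:8080"   # placeholder port
    pronlex: "http://pronlex.speechoid.svc.cluster.local:8080"       # placeholder port
```

A ConfigMap template in the chart would then render these values into the actual wikispeech-server configuration file.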

We currently have a HAProxy in front of MaryTTS, built into our Blubber setup, acting as a request queue (one request at a time) to avoid overloading the service, since each request maxes out the CPU. Can we configure Kubernetes to do this instead? Also consider what happens when multiple cores are available: MaryTTS behaves differently between environments; on my local Ubuntu machine it uses every available core and maxes them out, while on the wmflabs installation I have only seen it use a single core. This needs consideration prior to deployment.
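Kubernetes has no built-in request queue equivalent to that HAProxy, so the closest native knobs are CPU requests/limits and a controlled replica count; whether that is acceptable instead of a queue is exactly what needs to be discussed with SRE. A minimal sketch, with placeholder numbers and image reference:

```yaml
# Sketch only: cap MaryTTS at one CPU so a single synthesis request cannot
# take over the node, and keep a single replica until a concurrency strategy
# (queueing, readiness-based backpressure or autoscaling) is agreed on.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: mary-tts
spec:
  replicas: 1
  selector:
    matchLabels:
      app: mary-tts
  template:
    metadata:
      labels:
        app: mary-tts
    spec:
      containers:
        - name: mary-tts
          image: mary-tts:placeholder   # placeholder image reference
          resources:
            requests:
              cpu: "1"
              memory: 2Gi               # placeholder
            limits:
              cpu: "1"                  # prevents MaryTTS from grabbing every core
              memory: 2Gi               # placeholder
```

Note that a CPU limit only caps how much the container gets; it does not queue requests, so concurrent requests would still contend for that one CPU and simply get slower.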

Event Timeline

kalle updated the task description.

A bit of IRC log from #wikimedia-serviceops:

17:19 < kalle> Does adding dependencies in a Helm chart mean that those dependencies will be bundled as containers running within that same pod?
17:21 < kalle> If so, how would one go about setting up the helm so that some dependency might be running in other pods, potentially automatically scaling the service up and down? Or is it meant that we should scale the one specific pod up and down?
17:22 < kalle> This is regarding the Wikispeech extension. Our backend, speechoid, is a bundle of quite a few services with rather simple dependencies.
17:23 < kalle> But some of the services are rather heavy on CPU, e.g. the speech synthesis.
17:24 < kalle> I was thinking it would make sense to scale those services only, having k8s balance the requests.
17:26 < kalle> Also, we have installed a HAProxy in front of one of the services, to act as a request queue. Only letting in one request at a time since it will consume 100% of the available threads. It feels like we should let k8s handle that when there are multiple instances up and running.
17:27 < kalle> For reference:
17:27 < kalle> https://gerrit.wikimedia.org/r/admin/repos/q/filter:services+wikispeech
17:28 < kalle> https://www.mediawiki.org/wiki/Wikispeech
17:59 < effie> kalle: is there a task related to rolling out speechoid to kubernetes?
18:00 < effie> it would be lovely if we could discuss those details on a task
18:20 < _joe_> kalle: yeah also, new services architectures are usually discussed before getting to the deployment phase with the stakeholders (including SRE)
18:21 < _joe_> maybe that was done. In that case, can you point me to the people you spoke with?
18:21 < _joe_> so that I can get a better idea of how the release was planned
18:23 < _joe_> if not, we will need to take some time to advise you on how to proceed. Horizontal pod autoscaling is not a great way to spawn new workers on demand, unless we are ok with having a lot of latency for individual requests.
18:26 < _joe_> what you probably want to do is to return 503 to the readiness probe while your container is processing a request (or multiple requests if we decide to serve more than one thread from the same pod)
18:26 < _joe_> but again, that won't probably work if not with a small number of incoming requests
22:15 < kalle> effie: https://phabricator.wikimedia.org/T265280
22:16 < kalle> _joe_: We've only talked to releng at this point, making sure they accept how we blubbered things up. 
22:16 < kalle> So this is really the initial ops contact, working our way towards the beta-cluster release.
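To make _joe_'s readiness-probe suggestion above concrete: the pod would expose an endpoint (the hypothetical /ready below, which does not exist in any of the services today) that returns 200 when idle and 503 while a request is being processed, so Kubernetes temporarily removes the pod from the Service and stops routing new requests to it. A sketch of the container-spec fragment:

```yaml
# Hypothetical: assumes the service (or a small sidecar) exposes /ready,
# answering 200 when idle and 503 while a synthesis request is in flight.
readinessProbe:
  httpGet:
    path: /ready          # hypothetical endpoint, would have to be implemented
    port: 59125           # MaryTTS default HTTP port; adjust for a sidecar
  periodSeconds: 2        # probe often so back-pressure kicks in quickly
  failureThreshold: 1
  successThreshold: 1
```

As noted in the log, this only behaves well at low request volumes; beyond that a real queue or more replicas is needed.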
kalle set the point value for this task to 16. (Oct 15 2020, 9:46 AM)
kalle moved this task from 🥴 Backlog to 🤕 Watching on the User-kalle board.
kalle moved this task from Backlog to Blocked on the Wikispeech-Jobrunner (Sprint) board.
Sebastian_Berlin-WMSE changed the task status from Open to Stalled. (Oct 29 2020, 9:12 AM)
Sebastian_Berlin-WMSE changed the task status from Stalled to Open.
Aklapper subscribed.

@kalle: Removing task assignee as this open task has been assigned for more than two years - see the email sent to the task assignee on February 22nd, 2023.
Please assign this task to yourself again if you still realistically [plan to] work on this task - it would be welcome! :)
If this task has been resolved in the meantime, or should not be worked on by anybody ("declined"), please update its task status via "Add Action… 🡒 Change Status".
Also see https://www.mediawiki.org/wiki/Bug_management/Assignee_cleanup for tips on how to best manage your individual work in Phabricator. Thanks!