Build blubber file for ORES
Closed, DeclinedPublic
Actions

Assigned To

Authored By

	Ladsgroup
	Nov 23 2018, 1:02 PM

Description

The blubber file(s) are needed to make the docker files and then helm charts

This is a heads up for SRE team mostly.

Related Objects
Search...

Status	Assigned	Task
Open	None	T198901 Migrate production services to kubernetes using the pipeline
Declined	None	T182331 [Epic] Deploy ORES in kubernetes cluster
Declined	• ACraze	T210268 Build blubber file for ORES
Resolved	• dduvall	T211625 Unify configuration for local build-context copies and variant artifacts
Resolved	• dduvall	T210267 Execution of the deployment pipeline should be configurable via .pipeline/config.yaml
Resolved	• dduvall	T222199 Post generated docs for pipelinelib

Event Timeline

Ladsgroup triaged this task as Low priority.Nov 23 2018, 1:02 PM

Ladsgroup created this task.

Ladsgroup raised the priority of this task from Low to Needs Triage.Nov 28 2018, 6:39 AM

Ladsgroup moved this task from Unsorted to New development on the Machine-Learning-Team board.

Ladsgroup updated the task description. (Show Details)Nov 28 2018, 11:37 AM

jijiki triaged this task as Medium priority.Dec 3 2018, 1:21 PM

thcipriani added a parent task: T212801: TEC3:O3:O3.1:Q3 Goal - Move cxserver, citoid, changeprop, eventgate (new service) and ORES (partially) through the production CD Pipeline.Jan 2 2019, 7:21 PM

thcipriani added a subtask: T211625: Unify configuration for local build-context copies and variant artifacts.Jan 8 2019, 5:36 PM

thcipriani added a subtask: T210267: Execution of the deployment pipeline should be configurable via .pipeline/config.yaml.

• Phabricator_maintenance moved this task from Backlog to Acknowledged on the SRE board.Jan 26 2019, 10:44 PM

akosiaris mentioned this in T182332: Refactor ORES puppet for Kubernetes.Feb 21 2019, 11:51 AM

thcipriani closed subtask T211625: Unify configuration for local build-context copies and variant artifacts as Resolved.Feb 25 2019, 5:47 PM

awight unsubscribed.Mar 21 2019, 4:04 PM

Ladsgroup unsubscribed.Apr 17 2019, 7:18 PM

thcipriani removed a parent task: T212801: TEC3:O3:O3.1:Q3 Goal - Move cxserver, citoid, changeprop, eventgate (new service) and ORES (partially) through the production CD Pipeline.Apr 30 2019, 4:14 PM

• ACraze claimed this task.May 18 2020, 10:01 PM

• ACraze edited projects, added Machine-Learning-Team (Active Tasks); removed Machine-Learning-Team.

Got a WIP PR here: https://github.com/wikimedia/ores/pull/345

Still need to slim down the production variant and handle the redis dep for the test variant.

A couple of questions here so far:

Does the base image need to come from the wmf docker registry? If so, then it might make sense for us to create an optimized base image that has the scipy + enchant binaries ready to go.

For the production image, we need to start a container to run the uwsgi service and also some separate containers to run the celery workers as well, can we specify that in blubberfile? Or do we need to create a deployment template for the helm chart somewhere?

In T210268#6167598, @ACraze wrote:

A couple of questions here so far:

Does the base image need to come from the wmf docker registry?

Yes, and it must inherit from the SRE-provided base images.

For the production image, we need to start a container to run the uwsgi service and also some separate containers to run the celery workers as well, can we specify that in blubberfile? Or do we need to create a deployment template for the helm chart somewhere?

Don't know that part; I don't know if we have any multi-container pods in production yet, sorry.

In T210268#6167607, @Jdforrester-WMF wrote:

In T210268#6167598, @ACraze wrote:

A couple of questions here so far:

For the production image, we need to start a container to run the uwsgi service and also some separate containers to run the celery workers as well, can we specify that in blubberfile?

Adding @dduvall for this as he is the best person to answer this.

Or do we need to create a deployment template for the helm chart somewhere?

Not sure I understand this part of the question. A helm chart (including a Deployment template) will have to be created anyway, and architecturally speaking there should be different charts for the the celery worker and uwsgi. Does that help?

Don't know that part; I don't know if we have any multi-container pods in production yet, sorry.

We do, but not of the nature mentioned above. There are sidecar containers that help the main application providing functionality like TLS termination or metrics collection. But that's entirely different from the pattern of colocating in the same pod 2 containers that are actively serving requests (in one way or another). The pattern of multi-app-container pods however is best to be avoided as it will interfere with the ability to increase capacity for one of the 2 parts at will as well as allow the 2 parts to cause side effects to each other when under stress.

In T210268#6167598, @ACraze wrote:

For the production image, we need to start a container to run the uwsgi service and also some separate containers to run the celery workers as well, can we specify that in blubberfile? Or do we need to create a deployment template for the helm chart somewhere?

We can build multiple images via PipelineLib using a .pipeline/config.yaml in the root of the repository. You can specify different variants of a single blubberfile be built or that different blubberfiles be used to build a variant. PipelineLib is documented on wikitech: https://wikitech.wikimedia.org/wiki/PipelineLib although you'll be a bit of a guinea pig for this use-case. I'm not aware of any other repos using it for this purpose; although it was designed with this purpose in mind :)

In T210268#6167598, @ACraze wrote:

Does the base image need to come from the wmf docker registry? If so, then it might make sense for us to create an optimized base image that has the scipy + enchant binaries ready to go.

If these dependencies are currently installed via apt then blubber should be able to install them just fine. The python base images have the wikimedia apt repos added as sources.

@thcipriani thanks, PipelineLib seems to be what I was missing here :)

Halfak moved this task from Parked to Review on the Machine-Learning-Team (Active Tasks) board.Jun 22 2020, 4:46 PM

Halfak moved this task from Review to Parked on the Machine-Learning-Team (Active Tasks) board.

Halfak edited projects, added Machine-Learning-Team; removed Machine-Learning-Team (Active Tasks).Jul 13 2020, 4:29 PM

Declining this as we've discovered ORES will not fit on k8s in it's current design

Here is a draft PR that we started on for documentation purposes: https://github.com/wikimedia/ores/pull/349

akosiaris mentioned this in T182331: [Epic] Deploy ORES in kubernetes cluster.Nov 20 2020, 7:48 AM

akosiaris mentioned this in T198901: Migrate production services to kubernetes using the pipeline.

• ACraze mentioned this in T279004: Production images for ORES/revscoring models.Mar 31 2021, 7:25 PM

thcipriani closed subtask T210267: Execution of the deployment pipeline should be configurable via .pipeline/config.yaml as Resolved.May 14 2021, 9:38 PM