[Infra] Investigate options for parallelising cypress tests
Open, Needs TriagePublic
Actions

Description

The expected implementation of cypress parallel testing is using their online service (https://www.cypress.io/) or a self-hosted version of the server software (https://sorry-cypress.dev/).

For the purposes of CI, we want to parallelise our cypress test suites in a docker environment, without relying on a third-party service.

Investigate the options for running Cypress tests in parallel in CI. Optionally demonstrate the different options with proofs-of-concept.

Acceptance Criteria

A detailed proposal and/or demo implementation of running Cypress tests in parallel in CI (without changing Quibble)

Details

Subject	Repo	Branch	Lines +/-
Use `@badeball/cypress-parallel` for parallel runs	mediawiki/extensions/WikibaseLexeme	master	+1 K -1 K
Add experimental agents for distributed parallel cypress runs	integration/config	master	+30 -1
Use `cypress-cloud` to support multiple parallel agents for cypress	mediawiki/extensions/WikibaseLexeme	master	+997 -1 K

Customize query in gerrit

Related Objects
Search...

Status	Assigned	Task
Open	None	T361118 [EPIC][Infra] Reduce CI test runtime for Wikibase and related extensions
Open	None	T370033 [Infra][SW] Make selenium E2E tests run in parallel in CI jobs
Open	karapayneWMDE	T374001 [Infra] Investigate options for parallelising cypress tests

Event Timeline

ArthurTaylor created this task.Sep 4 2024, 12:30 PM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptSep 4 2024, 12:30 PM

ArthurTaylor added a parent task: T370033: [Infra][SW] Make selenium E2E tests run in parallel in CI jobs.Sep 4 2024, 12:30 PM

Prio Notes:

Impact Area	Affected
production / end users	no
monitoring	no
development efforts	yes
onboarding efforts	no
additional stakeholders	maybe

ItamarWMDE renamed this task from Investigate options for parallelising cypress tests to [Infra] Investigate options for parallelising cypress tests.Sep 26 2024, 12:58 PM

ItamarWMDE moved this task from Incoming to [DOT] Prioritized on the wmde-wikidata-tech board.

ItamarWMDE moved this task from [DOT] Prioritized to [DOT] By Project on the wmde-wikidata-tech board.Oct 24 2024, 12:57 PM

ItamarWMDE added a project: Wikidata Dev Team.

ItamarWMDE moved this task from [DOT] By Project to [DOT] Prioritized on the wmde-wikidata-tech board.

WMDE-leszek moved this task from Incoming to [DOT] Tech Backlog on the Wikidata Dev Team board.Oct 29 2024, 10:07 PM

ItamarWMDE moved this task from [DOT] Tech Backlog to Unified DOT Backlog on the Wikidata Dev Team board.Nov 6 2024, 10:50 AM

When pulled into board, let's discuss what timebox should be given for this

I'm pulling this into the board because all my other tasks are blocked. I'll start looking at this and will report back regularly on progress. But as other tasks become unblocked I'll continue working on them, and I'll try and leave this in a state that other people can pick up.

Maintenance_bot moved this task from [DOT] Prioritized to Ongoing on the wmde-wikidata-tech board.Fri, Nov 22, 8:29 AM

Change #1094415 had a related patch set uploaded (by Arthur taylor; author: Arthur taylor):

[mediawiki/extensions/WikibaseLexeme@master] Use `cypress-cloud` to support multiple parallel agents for cypress

https://gerrit.wikimedia.org/r/1094415

gerritbot added a project: Patch-For-Review.Fri, Nov 22, 1:06 PM

Investigated the option of using cypress-cloud / currents.dev / sorry-cypress. Sorry Cypress provides an open-source, self-hosted equivalent of the cloud / SaaS cypress.io. A local version of the server-side can be run with:

$ docker run -p 1234:1234 agoldis/sorry-cypress-director

and starts listening for connections on https://localhost:1234. Combined with the attached patch, this in principle allows cypress test suite executions to be scaled across multiple agents / CI runners.

That's all well and good, but unfortunately it seems that Sorry-Cypress / currents.dev picked a fight with Cypress and lost. Cypress doesn't care about abandoning its users and cypress-cloud / currents only provides limited support on their beta channel - they appear to have pivoted to Playwright.

This seems to be a great example of greedy companies and the failures of the open-core funding model. It's bad news for our efforts to scale up cypress testing, and for me it calls into question the choice of using cypress at all.

timebox: 8 hours

Change #1097353 had a related patch set uploaded (by Arthur taylor; author: Arthur taylor):

[integration/config@master] Add experimental agents for distributed parallel cypress runs

https://gerrit.wikimedia.org/r/1097353

Change #1097356 had a related patch set uploaded (by Arthur taylor; author: Arthur taylor):

[mediawiki/extensions/WikibaseLexeme@master] Use `@badeball/cypress-parallel` for parallel runs

https://gerrit.wikimedia.org/r/1097356

Added two more patches (1, 2) looking at a different approach for multi-agent parallelism. @badeball/cypress-parallel takes the approach of splitting the tests locally and relying on an environment cue to determine which block of tests to run. In my local testing, I get the following for the cypress tests in WikibaseLexeme:

$ SELENIUM_PARALLEL_NODE_ID=1 MEDIAWIKI_USER=root MEDIAWIKI_PASSWORD=[password] MW_SERVER=http://wikidatawikidev.mediawiki.mwdd.localhost:8080/ MW_SCRIPT_PATH=w/ npm run cypress:parallel

Node 1: 01:10
Node 2: 00:56
Node 3: 01:16
Node 4: 00:55

vs. linear:

$ MEDIAWIKI_USER=root MEDIAWIKI_PASSWORD=[password] MW_SERVER=http://wikidatawikidev.mediawiki.mwdd.localhost:8080/ MW_SCRIPT_PATH=w/ npm run cypress:run

03:32

The second patch is a suggestion for how that might look in CI, but it is not tested and the approach has not been reviewed by WMF.

The @badeball/cypress-parallel library is not super active or popular. There are a number of forks and other packages with similar names and it's not clear how well-maintained the library is. But it is also a small library with limited functionality - it seems within our capacities to adopt the project if it gets abandoned by its maintainer, and it doesn't seem in direct competition with Cypress' business model (so it's unlikely to get sabotaged by the Cypress team).

ArthurTaylor removed ArthurTaylor as the assignee of this task.Mon, Nov 25, 12:30 PM

ArthurTaylor moved this task from In Development to Ready for Peer Review on the Wikidata Dev Team (Wikidata.org Slice) board.

Also worth noting here that wdio supports sharding out of the box . An equivalent agent setup to that outlined above could work for both frameworks, and would be transparent to the Quibble / Jenkins side.

ArthurTaylor mentioned this in T374003: [Infra] Investigate and explain benefits / drawbacks of parallelisation of browser / E2E tests at Quibble vs. extension level.Tue, Nov 26, 10:48 AM

FWIW, if we decide to go with parallelizing the browser tests inside one “test runner”, going with @badeball/cypress-parallel sounds fine to me – it’s not ideal, but if it breaks the worst case is probably just that we have to take it out and suffer longer build times again. But the general approach is also under discussion in T374003 at the moment.

karapayneWMDE claimed this task.Mon, Dec 2, 8:25 AM

karapayneWMDE moved this task from Ready for Peer Review to In Peer Review on the Wikidata Dev Team (Wikidata.org Slice) board.

Michael subscribed.Tue, Dec 10, 7:34 AM

[Infra] Investigate options for parallelising cypress testsOpen, Needs TriagePublicActions

Description

Details

Related ObjectsSearch...

Event Timeline

[Infra] Investigate options for parallelising cypress tests
Open, Needs TriagePublic
Actions

Related Objects
Search...