
[INVESTIGATION] Investigate flaky Mismatch Finder browser tests
Closed, Resolved · Public · 1 Estimated Story Point

Description

Mismatch Finder browser tests fail on CI very often. The failures are not consistent: the same tests sometimes pass and sometimes fail.

The following error, which appears to be a timeout, keeps reappearing in different tests (examples 1, 2, 3, 4):

Actual path [/_dusk/login/2] does not equal expected path [/results].
Failed asserting that '/_dusk/login/2' matches PCRE pattern "/^\/results$/u".

Event Timeline

ItamarWMDE renamed this task from Fix flacky Mismatch Finder browser tests to Fix flaky Mismatch Finder browser tests. Jul 1 2022, 11:21 AM
ItamarWMDE renamed this task from Fix flaky Mismatch Finder browser tests to [INVESTIGATION] Investigate flaky Mismatch Finder browser tests. (Edited) Jul 1 2022, 11:23 AM
ItamarWMDE updated the task description.

Prio Notes:

  • Affects development efforts
  • Doesn't affect external stakeholders
  • Doesn't affect end users
  • Doesn't affect onboarding efforts

From backlog refinement: a timebox is required (1/2 to 1 day).

The listed examples have in common that they fail when visiting a page after logging in the user:

$browser->loginAs(User::factory()->create())
        ->visit(new ResultsPage($mismatch->item_id))

It seems like the redirect has not happened by the time visit() is called, or is not happening at all. The screenshots of the failed tests show a blank page.
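
A minimal sketch of one possible way to make the timing explicit (not something that is already in the tests): navigate to the page's URL first, wait for the browser to actually land on it, and only then attach the page object so its path assertion runs after navigation has settled. waitForLocation() and on() are standard Dusk helpers; the /results path is taken from the error message above.

    $page = new ResultsPage($mismatch->item_id);

    $browser->loginAs(User::factory()->create())
            ->visit($page->url())          // navigate without running the page object's assertions yet
            ->waitForLocation('/results')  // wait (5 seconds by default) for the navigation to settle
            ->on($page);                   // now run the ResultsPage assertions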

In all the examples of using loginAs that I have seen, the user is always created outside of the loginAs call. I don't have a solid way of proving that this is the cause, but maybe the database operation that creates the user occasionally isn't finished by the time loginAs is called on CI?
If that is the case, then changing create() to make(), if we still want to call the factory from inside the loginAs call, could possibly solve the issue.
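
For illustration, the change being considered would look roughly like this (a sketch only; as the update below notes, it turned out not to be viable):

    // Sketch of the idea: build the user in memory instead of persisting it first.
    $browser->loginAs(User::factory()->make())
            ->visit(new ResultsPage($mismatch->item_id));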

From reading the docs, I gather that create() is meant for cases where we need to use the persisted data after it is created, which is not how we are using it in these tests.

Update: with make() the user is not persisted for the duration of the test, so we need to stick with create().

I am creating this test to see whether this might be the reason the tests sometimes fail: https://github.com/wmde/wikidata-mismatch-finder/pull/440


This seems to be relevant: https://laravel.com/docs/8.x/dusk#authentication

After using the loginAs method, the user session will be maintained for all tests within the file.
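
If that sticky session is what occasionally leaves the browser in an unexpected state, one option to try (a sketch, not something that was implemented here) is resetting authentication explicitly before logging in again, using Dusk's logout() helper; $user and $mismatch are assumed to be set up by the surrounding test:

    $this->browse(function (Browser $browser) use ($user, $mismatch) {
        $browser->logout()                                   // clear any session left over from a previous test in this file
                ->loginAs($user)
                ->visit(new ResultsPage($mismatch->item_id));
    });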


There is also this GitHub issue that is similar to the one we are having; the suggestion there is to check the SESSION_DOMAIN env variable: https://github.com/laravel/dusk/issues/408

Though I don't see how this could be affecting our tests in particular, because they are failing inconsistently.


Another possibility is that there is a database issue because we are using create() instead of make() to create the users. Why do we need to store the user in the database if we are using DatabaseMigrations and a fresh database is used for every test? The Laravel documentation is not very clear about this.

Update: we need to use create() instead of make() because loginAs checks the database for the user, and make() doesn't create an entry in the database, which causes the tests to fail.
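
To make the difference concrete (a minimal sketch, not project code): create() persists a row whose id the /_dusk/login/{id} route can resolve, while make() only builds the model in memory.

    $persisted = User::factory()->create();   // inserts a row; loginAs() can look this user up by id
    $inMemory  = User::factory()->make();     // builds the model only; no database row, no usable id

    $browser->loginAs($persisted);            // works: the Dusk login route finds the user
    // $browser->loginAs($inMemory);          // would fail: there is no record for the login route to load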

https://laracasts.com/discuss/channels/laravel/please-explain-dusk-databasemigrations-like-im-a-dummy

After the PoC PR has been merged for a while and more PRs have been added to the project, we can see that the tests are still failing randomly with the same issue, so we have to do another round of investigation. Examples 1, 2, 3

Unassigning myself, someone else might have a better idea.

I think we probably exceeded the original timebox for this, no?

Not yet :D but if we continue on it, yes.

https://github.com/wmde/wikidata-mismatch-finder/pull/459 was merged as a possible way to improve the situation. We will have to wait and see if this changes anything.

Unfortunately, this does not seem to have helped. Looking at the list of failed Test workflow runs on GitHub Actions, it seems like 42 failures occurred last month due to browser tests (although I didn't take the time to confirm that each one of them was due to the same flakiness error). As the allotted timebox for this investigation has been exceeded, I suggest we shelve this issue for now and get back to it at a later date.

Mh, maybe I'm doing something wrong, but when looking at CI actions that ran for the main branch (and thus excluding genuine bugs in PRs, unfinished PRs, and such), I see only four failed CI runs since my PR was merged: https://github.com/wmde/wikidata-mismatch-finder/actions?query=is%3Afailure+branch%3Amain

Three of them failed because of problems with Toolforge, and only one failed due to a browser test being flaky. That one flaky browser test is not the one I modified in #459 (ResultsTest.php), but AppTest.php.

Still, I agree with resolving this task and looking into the remaining flakiness another time.

Well, the flakiness didn't happen only on main. I looked at the total of failed runs for the Test workflow and sampled them to see what failed (predominantly browser tests).