
[SPIKE] Test MGAD Model on LiftWing
Open, Low, Public

Description

Background

For the initial experiment with machine-generated article descriptions, the model was hosted on Cloud VPS and Toolforge. It was recently migrated to Lift Wing by the Machine Learning and Research teams, and is ready for testing by Android engineers in a production setting.

Related tasks:

  • Migrate machine-generated article descriptions from Toolforge to Lift Wing (T343123) - the comments contain access instructions and documentation: T343123#9607328
  • Investigate increased preprocessing latencies on Lift Wing for the article-descriptions model (T358195) - ML team working on it; Android team tracking
  • Put API on Cloud VPS (T318384) - task for the initial setup

The task
  • Test out the API endpoint directly (see the sketch after this list)
    • Record latency (ideally under 3 seconds)
    • Verify that it returns 2 article descriptions per article
  • Establish/document any relevant differences between the Lift Wing-hosted model and the previous model
  • Express your opinion: is the model in a good enough state for us to re-release this feature?
  • Document implementation steps based on the outcome of the engineering investigation and share them with the PM before proceeding with implementation
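
A quick way to exercise the first two items is to call the model endpoint directly and time the round trip. The sketch below is a rough starting point rather than the documented procedure: the endpoint URL, the lang/title/num_beams payload fields, the "prediction" response key, and whether an API Gateway access token is required are all assumptions based on the general Lift Wing pattern; T343123#9607328 has the authoritative access instructions.

```python
# Rough sketch for timing one request to the Lift Wing article-descriptions model.
# Assumptions (verify against T343123#9607328): the public endpoint URL below,
# the lang/title/num_beams payload fields, and the "prediction" response key.
# An API Gateway access token (Authorization: Bearer ...) may also be required.
import time
import requests

ENDPOINT = "https://api.wikimedia.org/service/lw/inference/v1/models/article-descriptions:predict"
HEADERS = {"User-Agent": "MGAD-spike-latency-test"}

def suggest_descriptions(lang: str, title: str, num_beams: int = 2):
    """Request num_beams suggested descriptions and return (descriptions, latency_seconds)."""
    payload = {"lang": lang, "title": title, "num_beams": num_beams}
    start = time.monotonic()
    resp = requests.post(ENDPOINT, json=payload, headers=HEADERS, timeout=30)
    latency = time.monotonic() - start
    resp.raise_for_status()
    # "prediction" is assumed to hold the list of suggested descriptions;
    # print resp.json() once to confirm the actual response shape.
    return resp.json().get("prediction", []), latency

if __name__ == "__main__":
    descriptions, latency = suggest_descriptions("en", "Douglas Adams")
    print(f"{latency:.2f}s -> {descriptions}")
    assert len(descriptions) == 2, "expected two suggested descriptions per article"
```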

Event Timeline

HNordeenWMF updated the task description.
HNordeenWMF moved this task from Needs Triage to Up Next on the Wikipedia-Android-App-Backlog board.

Putting this in Blocked for now:
I found an issue at the gateway level (T365439) that makes it difficult for us to query the URLs of Lift Wing services without an ugly workaround in our network layer.

Otherwise, from some preliminary testing, the latency of the Lift Wing service for providing generated article descriptions seems to be on par with the previous service on Toolforge, and should therefore be perfectly good for us to start consuming.

The gateway API issue was fixed, so we can now continue testing the Lift Wing model. Here is an APK for anyone else who would like to try it:
https://github.com/wikimedia/apps-android-wikipedia/actions/runs/9305222379/artifacts/1552968133

From my testing so far, the latency seems to be in the same range as the previous wmcloud-hosted model. The latency is quite variable depending on the article, but the average seems to be ~3 seconds or less.

Most frequently, the suggested descriptions take around 3 seconds to load.
I found that, for roughly every 10 articles:

  • 1 took 8-10 seconds (which felt long)
  • 1-2 took 4-5 seconds
  • the rest took around 3 seconds
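
To reproduce this spread, a small batch run over a sample of articles can bucket the latencies the same way. This is a rough sketch under the same assumptions as the earlier one (endpoint URL and payload fields); the article list is purely illustrative, not the set tested above.

```python
# Sketch: measure the latency spread over a batch of articles and bucket the
# results roughly like the breakdown above. Same endpoint/payload assumptions
# as the earlier sketch; the article sample here is illustrative only.
import statistics
import time
import requests

ENDPOINT = "https://api.wikimedia.org/service/lw/inference/v1/models/article-descriptions:predict"
TITLES = [
    "Douglas Adams", "Toronto", "Cricket", "Baroque", "Photosynthesis",
    "Nairobi", "Mount Everest", "Impressionism", "Bioluminescence", "Sourdough",
]

latencies = []
for title in TITLES:
    start = time.monotonic()
    requests.post(ENDPOINT, json={"lang": "en", "title": title, "num_beams": 2}, timeout=30)
    latencies.append(time.monotonic() - start)

# Bucket into the same rough bands as observed: ~3s, 4-5s, 8s+.
buckets = {"~3s or less": 0, "4-5s": 0, "6-7s": 0, "8s+": 0}
for t in latencies:
    if t < 3.5:
        buckets["~3s or less"] += 1
    elif t < 5.5:
        buckets["4-5s"] += 1
    elif t < 8:
        buckets["6-7s"] += 1
    else:
        buckets["8s+"] += 1

print(f"median={statistics.median(latencies):.1f}s max={max(latencies):.1f}s buckets={buckets}")
```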



Is it possible to have all suggested article descriptions load in around 3 seconds?

> Is it possible to have all suggested article descriptions load in around 3 seconds?

Unfortunately not; there are bound to be outlier articles that cause the model to take a little longer to generate a suggestion, and I'm not sure the model can be optimized much further at the moment.

Yes, just linking back to some old performance data to back up @OTichonova's findings: T343123#9573432. @Dbrant is right, and I would advocate for launching. In parallel, the one thing that can be done is to follow up with ML Platform to see how close they are to being able to host on GPUs. If they can, that should be a silent change from your perspective (no code updates needed) while having a noticeable impact on latency, per T343123#9520331.

@JTannerWMF The logic for showing machine-suggested descriptions is still subject to an A/B/C test that we were originally running. Is this still applicable, or can it be removed?