Page MenuHomePhabricator
Feed Search

Yesterday

OKarakaya-WMF added a comment to P86608 Prototype serving Qwen3 embeddings with KServe using HF Transformers and ROCm-compatible FlashAttention-2.

Hey @kevinbazira do we already have a dockerfile to try this on staging?

Tue, Dec 16, 4:35 PM · Machine-Learning-Team

Mon, Dec 15

OKarakaya-WMF added a comment to P86608 Prototype serving Qwen3 embeddings with KServe using HF Transformers and ROCm-compatible FlashAttention-2.

Nice implementation!
Does last token pooling come from qwenlm?

Mon, Dec 15, 11:37 AM · Machine-Learning-Team

Fri, Dec 12

OKarakaya-WMF added a comment to T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
  1. Reporting 05/11/2025
Fri, Dec 12, 11:40 AM · OKR-Work, Goal, Machine-Learning-Team

Thu, Dec 11

OKarakaya-WMF updated the task description for T412338: Semantic Search - Embeddings Service for MVP.
Thu, Dec 11, 9:07 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF added a comment to T412338: Semantic Search - Embeddings Service for MVP.

Below I share how long it will take to generate embeddings with different set ups and I compare two models:
model_name = "Qwen/Qwen3-Embedding-0.6B"

  • float16, all chars

205/207038 [00:55<15:41:24, 3.66it/s]

  • float16 , first 300 chars.

206/207038 [00:27<7:37:03, 7.54it/s]

  • float32 , first 300 chars.

264/207038 [01:06<14:31:14, 3.96it/s]

  • float32 , all chars.

124/207038 [01:12<33:27:43, 1.72it/s]
model_name = "sentence-transformers/all-mpnet-base-v2"

  • float32 , all chars.

209/207038 [00:21<5:51:39, 9.80it/s]

  • float32 , first 300 chars.

228/207038 [00:10<2:32:48, 22.56it/s]

Thu, Dec 11, 9:05 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF updated the task description for T412338: Semantic Search - Embeddings Service for MVP.
Thu, Dec 11, 7:55 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF updated the task description for T412338: Semantic Search - Embeddings Service for MVP.
Thu, Dec 11, 7:46 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF created T412338: Semantic Search - Embeddings Service for MVP.
Thu, Dec 11, 7:39 AM · Semantic Search, Machine-Learning-Team

Tue, Dec 9

OKarakaya-WMF added a comment to T412055: linkrecommendation API does not include the model version in its output.

I agree.
Currently, the only indicator about the model version is the model hash (c4796c3c193d983980a445bb2a76f65def9f2459599fa6df055984bd851d3ca3 is the v2 zhwiki model)
I think we can switch to a semantic versioning.

Tue, Dec 9, 10:37 AM · Growth-Team, Add-Link-Structured-Task

Mon, Dec 8

OKarakaya-WMF added a comment to T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
  1. Reporting 05/11/2025
Mon, Dec 8, 12:14 PM · OKR-Work, Goal, Machine-Learning-Team

Fri, Nov 28

OKarakaya-WMF added a comment to T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.

Looking into 17days periods:

Fri, Nov 28, 11:16 AM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.

I've created a list of currently in use models.
These models below got at least one suggestion accept or suggestion reject since 2025-06-01.
The wikis are sorted by accept count. Therefore, the wikis above are used less.
I'll split the remaining deployments into 3.

  • Deployment 1: Deploy wikis between 1-50.
  • Deployment 2: Deploy wikis between 51-113.
  • Deployment 3: Deploy enwiki.

Please feel free to suggest another order.

Fri, Nov 28, 11:12 AM · OKR-Work, Goal, Machine-Learning-Team

Wed, Nov 26

OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Wed, Nov 26, 12:00 PM · OKR-Work, Goal, Machine-Learning-Team

Mon, Nov 24

OKarakaya-WMF added a comment to T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.

Started updating following wikis:

Mon, Nov 24, 11:56 AM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Mon, Nov 24, 11:54 AM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T410744: model reference-risk: reference_risk_score is always 0..

cool, thank you @Pablo ,

Mon, Nov 24, 11:11 AM · Machine-Learning-Team
OKarakaya-WMF added a comment to T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.

We got results for itwiki:

Mon, Nov 24, 10:54 AM · OKR-Work, Goal, Machine-Learning-Team

Fri, Nov 21

OKarakaya-WMF moved T410744: model reference-risk: reference_risk_score is always 0. from Unsorted to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Fri, Nov 21, 2:46 PM · Machine-Learning-Team
OKarakaya-WMF added a comment to T410744: model reference-risk: reference_risk_score is always 0..

The service works fine:
curl https://api.wikimedia.org/service/lw/inference/v1/models/reference-risk:predict -X POST -d '{"rev_id": 1322686680, "lang": "en"}'
{"model_name":"reference-risk","model_version":"2024-11","wiki_db":"enwiki","revision_id":1322686680,"reference_count":37,"survival_ratio":{"min":0.16666666666666666,"mean":0.6632285937319566,"median":0.6505386708644346},"reference_risk_score":0.08108108108108109}%
https://en.wikipedia.org/w/index.php?title=MarketStar&oldid=1322686680
The issue is that the Deprecated or Blacklisted domains are quiet rare (~120)
Please feel free to let me know if you get 0 for a url which is Deprecated or Blacklisted and we can take a look further.

Fri, Nov 21, 2:45 PM · Machine-Learning-Team
OKarakaya-WMF created T410744: model reference-risk: reference_risk_score is always 0..
Fri, Nov 21, 2:44 PM · Machine-Learning-Team

Thu, Nov 20

OKarakaya-WMF added a comment to T405185: Introduce case sensitivity to machine learning model for Add a Link.

thank you both @Sdkb and @Chipmunkdavis for reporting this issue,

Thu, Nov 20, 10:20 AM · Community Feedback (Growth), Machine-Learning-Team, Growth-Team, Add-Link-Structured-Task

Nov 14 2025

OKarakaya-WMF added a comment to T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
  1. Reporting 14/11/2025
Nov 14 2025, 7:51 AM · OKR-Work, Goal, Machine-Learning-Team

Nov 6 2025

OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Nov 6 2025, 3:49 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.

I've collected current performance rates and counts of the candidate wikis:

Nov 6 2025, 9:45 AM · OKR-Work, Goal, Machine-Learning-Team

Nov 5 2025

OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Nov 5 2025, 8:06 AM · OKR-Work, Goal, Machine-Learning-Team

Nov 4 2025

OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Nov 4 2025, 3:36 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Nov 4 2025, 3:30 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Nov 4 2025, 3:18 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF moved T400446: Update blubber version in inference services images from In Progress to Unsorted on the Machine-Learning-Team board.
Nov 4 2025, 10:11 AM · Essential-Work, Machine-Learning-Team

Oct 31 2025

OKarakaya-WMF moved T405359: Semantic Search POC - In article QA from In Progress to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Oct 31 2025, 8:58 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.
Oct 31 2025, 8:51 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

I'm sharing final evaluation results for this phase:

Oct 31 2025, 8:40 AM · Semantic Search, Machine-Learning-Team

Oct 30 2025

OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Oct 30 2025, 2:53 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Oct 30 2025, 12:52 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Oct 30 2025, 12:52 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Oct 30 2025, 12:33 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Oct 30 2025, 12:29 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Oct 30 2025, 12:27 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Oct 30 2025, 12:27 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Oct 30 2025, 11:47 AM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF moved T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines from Current Quarter Goals to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Oct 30 2025, 11:41 AM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

As discussed, I'm creating a new goal for deployments.
and I'm closing this goal.

Oct 30 2025, 11:41 AM · Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Oct 30 2025, 11:40 AM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Oct 30 2025, 11:38 AM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF created T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Oct 30 2025, 11:30 AM · OKR-Work, Goal, Machine-Learning-Team

Oct 29 2025

OKarakaya-WMF added a comment to T404460: Add a Link: Link Suggestions Code Review and rollout planning.

I've started a patch to deploy new models here: https://gerrit.wikimedia.org/r/c/research/mwaddlink/+/1199815
It's WIP but I think it should be ready to review tomorrow. I'll let you know.

Oct 29 2025, 3:56 PM · Growth-Team (FY2025-26 Q2 Sprint 2), Add-Link-Structured-Task

Oct 21 2025

OKarakaya-WMF added a comment to T405185: Introduce case sensitivity to machine learning model for Add a Link.

I think it should not be complicated to get offline scores for enwiki.
I'll get back to this and share the results soon after finishing some other tasks.
Please feel free to let me know if we should increase the priority.

Oct 21 2025, 10:47 AM · Community Feedback (Growth), Machine-Learning-Team, Growth-Team, Add-Link-Structured-Task

Oct 17 2025

OKarakaya-WMF updated the task description for T400446: Update blubber version in inference services images.
Oct 17 2025, 2:44 PM · Essential-Work, Machine-Learning-Team
OKarakaya-WMF updated the task description for T400446: Update blubber version in inference services images.
Oct 17 2025, 1:13 PM · Essential-Work, Machine-Learning-Team

Oct 16 2025

OKarakaya-WMF updated the task description for T400446: Update blubber version in inference services images.
Oct 16 2025, 11:59 AM · Essential-Work, Machine-Learning-Team
OKarakaya-WMF updated the task description for T400446: Update blubber version in inference services images.
Oct 16 2025, 11:59 AM · Essential-Work, Machine-Learning-Team
OKarakaya-WMF added a comment to T400446: Update blubber version in inference services images.

articlequality deployed to staging successfully:

Oct 16 2025, 11:58 AM · Essential-Work, Machine-Learning-Team
OKarakaya-WMF updated the task description for T400446: Update blubber version in inference services images.
Oct 16 2025, 10:40 AM · Essential-Work, Machine-Learning-Team
OKarakaya-WMF updated the task description for T400446: Update blubber version in inference services images.
Oct 16 2025, 10:38 AM · Essential-Work, Machine-Learning-Team

Oct 15 2025

OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

I've added questions from two large models into the prototype ui.
gpt-oss:120b, aya:35b
Overall evaluation is in progress.

Oct 15 2025, 2:07 PM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF moved T400446: Update blubber version in inference services images from Ready To Go to In Progress on the Machine-Learning-Team board.
Oct 15 2025, 12:41 PM · Essential-Work, Machine-Learning-Team
OKarakaya-WMF updated the task description for T400446: Update blubber version in inference services images.
Oct 15 2025, 12:41 PM · Essential-Work, Machine-Learning-Team
OKarakaya-WMF changed the status of T400446: Update blubber version in inference services images from Open to In Progress.
Oct 15 2025, 12:40 PM · Essential-Work, Machine-Learning-Team

Oct 13 2025

OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

I've split the models below into two groups:

Oct 13 2025, 10:46 AM · Semantic Search, Machine-Learning-Team

Oct 10 2025

OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.
  • I've checked several benchmarks related to QA generation:
Oct 10 2025, 12:57 PM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

Reporting (10/10/2025)

Progress update on the hypothesis for the week, including if something has shipped:

  • Last time, we discussed closing this goal, as the new models are moved to the new location
  • We will suggest Growth team that we deploy inference service if both teams agree.

Any updates on metrics related to this hypothesis (including baseline, target, or actuals, if applicable):

  • N/A

Any emerging blockers or risks:

  • N/A

Any unresolved dependencies:

  • N/A

New lessons from the hypothesis:

  • N/A

Changes to the hypothesis scope or timeline:

  • N/A
Oct 10 2025, 12:18 PM · Goal, Machine-Learning-Team

Oct 9 2025

OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

Thank you for the comments.
We can run the experiments on larger LLMs. I've checked that we can use some larger models (tested with gpt-oss:120b, llama4:maverick) on mllab.
I'll check further some public benchmarks, and see if we can re-run the experiments on a different set of LLMs.
I'll revisit the evaluation part if we can do it better with minimum human effort.

Oct 9 2025, 7:58 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Oct 9 2025, 7:49 AM · Semantic Search, Machine-Learning-Team

Oct 8 2025

OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Oct 8 2025, 8:04 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

Sharing the results for the larger dataset below.
I used evaluation model and the query model as same due to the limits on the cloud models.

Oct 8 2025, 8:04 AM · Semantic Search, Machine-Learning-Team

Oct 7 2025

OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Oct 7 2025, 11:06 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Oct 7 2025, 11:06 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Oct 7 2025, 10:54 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Oct 7 2025, 8:03 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Oct 7 2025, 8:02 AM · Semantic Search, Machine-Learning-Team

Oct 6 2025

OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

I've started a toolforge app
This is a Streamlit app where we keep the data in gitlab registry

Oct 6 2025, 1:41 PM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF added a comment to T404201: Semantic search prototype .

Can you share implementation? (dataset generation, and application)
I'm curious to know how it works in more details and it should help with the QA part to get answers as well.

Oct 6 2025, 7:11 AM · Research-engineering, Research

Oct 3 2025

OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Oct 3 2025, 12:05 PM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

Reporting (03/10/2025)

Progress update on the hypothesis for the week, including if something has shipped:

  • We have deployed the new models to the new location.

Any updates on metrics related to this hypothesis (including baseline, target, or actuals, if applicable):

  • N/A

Any emerging blockers or risks:

  • N/A

Any unresolved dependencies:

  • N/A

New lessons from the hypothesis:

  • N/A

Changes to the hypothesis scope or timeline:

  • As discussed, based on the availability of the Growth team, we can become the owner of the api. We can also split the goals to two:
    • Inference service deployments (MLTeam)
      • Current project has model per wiki. We have previously discussed how to reduce the number of models.
      • The project has a mariadb database where we store the data needed for inference.
    • Mediawiki deployments (Growth Team)
Oct 3 2025, 12:05 PM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

I've updated the prompt based on the previous scores.

Oct 3 2025, 7:28 AM · Semantic Search, Machine-Learning-Team

Oct 2 2025

OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

All models are deployed to the new location via the airflow dag.

Oct 2 2025, 2:04 PM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

Looking into the question related scores, we generally get low scores in question_relevance_to_title and curiosity.

Oct 2 2025, 11:49 AM · Semantic Search, Machine-Learning-Team

Oct 1 2025

OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

I've updated checks to a rubric based approach to:

  • Get better insights from generated QA
  • Compare models from multiple perspectives.
Oct 1 2025, 9:08 AM · Semantic Search, Machine-Learning-Team

Sep 30 2025

OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

Results for both gpt-oss:20b and aya-expanse:32b are available in the spreadsheet.

Sep 30 2025, 8:28 AM · Semantic Search, Machine-Learning-Team

Sep 26 2025

OKarakaya-WMF updated the task description for T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Sep 26 2025, 11:13 AM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

Reporting (26/09/2025)

Progress update on the hypothesis for the week, including if something has shipped:

  • We have agreed with Growth Team to collaborate in October 2025.

Any updates on metrics related to this hypothesis (including baseline, target, or actuals, if applicable):

  • N/A

Any emerging blockers or risks:

  • N/A

Any unresolved dependencies:

  • N/A

New lessons from the hypothesis:

  • N/A

Changes to the hypothesis scope or timeline:

  • We have shared an analysis about case-sensitive recommendations.
  • Deployments will start in October as agreed with the Growth Team.
Sep 26 2025, 11:13 AM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T405185: Introduce case sensitivity to machine learning model for Add a Link.

My short term suggestion is to make anchors case-sensitive and train/evaluate models. So that, we can analyse case where the performance increase/decrease.
Long term suggestion would be to have similarity between lower level embeddings (e.g. paragraph) as an additional feature.

Sep 26 2025, 11:07 AM · Community Feedback (Growth), Machine-Learning-Team, Growth-Team, Add-Link-Structured-Task

Sep 25 2025

OKarakaya-WMF added a comment to T405185: Introduce case sensitivity to machine learning model for Add a Link.

I'm sharing an analysis on case-insensitivity on enwiki and simplewiki.

Sep 25 2025, 10:28 AM · Community Feedback (Growth), Machine-Learning-Team, Growth-Team, Add-Link-Structured-Task

Sep 24 2025

OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

Alternative ranking strategy from Fabian:
https://huggingface.co/BAAI/bge-reranker-v2-gemma

Sep 24 2025, 12:32 PM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

Git branch for the current work

Sep 24 2025, 11:28 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Sep 24 2025, 11:23 AM · Semantic Search, Machine-Learning-Team

Sep 23 2025

OKarakaya-WMF added a comment to T404460: Add a Link: Link Suggestions Code Review and rollout planning.

hello @KStoller-WMF ,
I totally agree 💯 . All clear, thank you!

Sep 23 2025, 2:20 PM · Growth-Team (FY2025-26 Q2 Sprint 2), Add-Link-Structured-Task
OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Sep 23 2025, 2:08 PM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF created T405359: Semantic Search POC - In article QA.
Sep 23 2025, 2:06 PM · Semantic Search, Machine-Learning-Team

Sep 22 2025

OKarakaya-WMF added a comment to T404460: Add a Link: Link Suggestions Code Review and rollout planning.

About the release of new wikis that are above the release threshold in v2 and do not have add-a-link onboarding tasks;
I share the below the list of wikis filtered by the criteria above (47 in total);
The wikis are sorted by ~their size.

Sep 22 2025, 9:32 AM · Growth-Team (FY2025-26 Q2 Sprint 2), Add-Link-Structured-Task
OKarakaya-WMF added a project to T404460: Add a Link: Link Suggestions Code Review and rollout planning: Machine-Learning-Team.
Sep 22 2025, 8:21 AM · Growth-Team (FY2025-26 Q2 Sprint 2), Add-Link-Structured-Task
OKarakaya-WMF added a comment to T404460: Add a Link: Link Suggestions Code Review and rollout planning.

Hello good morning,

Sep 22 2025, 8:17 AM · Growth-Team (FY2025-26 Q2 Sprint 2), Add-Link-Structured-Task

Sep 19 2025

OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

Progress update on the hypothesis for the week, including if something has shipped:

  • We propose a release plan in collaboration to the Growth Team. I understand they also want to add the wikis to the tasks. Therefore, we will update the plan.

Any updates on metrics related to this hypothesis (including baseline, target, or actuals, if applicable):

  • N/A

Any emerging blockers or risks:

  • The serving patch needs to be reviewed/merged/deployed.

Any unresolved dependencies:

  • N/A

New lessons from the hypothesis:

  • N/A

Changes to the hypothesis scope or timeline:

  • We collaborate with the Growth Team on the release plan in scope of this task.
Sep 19 2025, 1:59 PM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T404460: Add a Link: Link Suggestions Code Review and rollout planning.
  • I know the inference api currently supports the wikis here.
  • Also I have the list of wikis that are below/above the release threshold
  • However, I'm missing the information about which wikis are enabled in tasks currently. Can you share this information? Have we already enabled tasks for all the wikis here. I can look into usage if this is not easy to find.
  • As we want to enable tasks for wikis, I think we should depend on the list of wikis currently enabled in tasks, rather than the list of wikis that are currently being served. They might be the same though. I just want to make sure.
Sep 19 2025, 1:36 PM · Growth-Team (FY2025-26 Q2 Sprint 2), Add-Link-Structured-Task

Sep 12 2025

OKarakaya-WMF updated the task description for T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Sep 12 2025, 12:27 PM · Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Sep 12 2025, 11:29 AM · Goal, Machine-Learning-Team

Sep 11 2025

OKarakaya-WMF updated the task description for T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Sep 11 2025, 12:28 PM · Goal, Machine-Learning-Team
OKarakaya-WMF updated subscribers of T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Sep 11 2025, 12:20 PM · Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Sep 11 2025, 12:19 PM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

I've calculated online scores for add-a-link here
I share the main highlights below:
We can re-use the notebook to calculate scores some time after the model releases.

Sep 11 2025, 12:15 PM · Goal, Machine-Learning-Team