Page MenuHomePhabricator

OKarakaya-WMF
User

Today

  • No visible events.

Tomorrow

  • No visible events.

Wednesday

  • No visible events.

User Details

User Since
Apr 1 2025, 7:13 AM (31 w, 5 d)
Availability
Available
LDAP User
Ozge
MediaWiki User
OKarakaya-WMF [ Global Accounts ]

Recent Activity

Thu, Nov 6

OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Thu, Nov 6, 3:49 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.

I've collected current performance rates and counts of the candidate wikis:

Thu, Nov 6, 9:45 AM · OKR-Work, Goal, Machine-Learning-Team

Wed, Nov 5

OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Wed, Nov 5, 8:06 AM · OKR-Work, Goal, Machine-Learning-Team

Tue, Nov 4

OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Tue, Nov 4, 3:36 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Tue, Nov 4, 3:30 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Tue, Nov 4, 3:18 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF moved T400446: Update blubber version in inference services images from In Progress to Unsorted on the Machine-Learning-Team board.
Tue, Nov 4, 10:11 AM · Essential-Work, Machine-Learning-Team

Fri, Oct 31

OKarakaya-WMF moved T405359: Semantic Search POC - In article QA from In Progress to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Fri, Oct 31, 8:58 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.
Fri, Oct 31, 8:51 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

I'm sharing final evaluation results for this phase:

Fri, Oct 31, 8:40 AM · Semantic Search, Machine-Learning-Team

Thu, Oct 30

OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Thu, Oct 30, 2:53 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Thu, Oct 30, 12:52 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Thu, Oct 30, 12:52 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Thu, Oct 30, 12:33 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Thu, Oct 30, 12:29 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Thu, Oct 30, 12:27 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Thu, Oct 30, 12:27 PM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Thu, Oct 30, 11:47 AM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF moved T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines from Current Quarter Goals to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Thu, Oct 30, 11:41 AM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

As discussed, I'm creating a new goal for deployments.
and I'm closing this goal.

Thu, Oct 30, 11:41 AM · Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Thu, Oct 30, 11:40 AM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Thu, Oct 30, 11:38 AM · OKR-Work, Goal, Machine-Learning-Team
OKarakaya-WMF created T408790: Q2 FY2025-26 Goal: Deploy Add-a-link v2 models to production.
Thu, Oct 30, 11:30 AM · OKR-Work, Goal, Machine-Learning-Team

Wed, Oct 29

OKarakaya-WMF added a comment to T404460: Add a Link: Rollout "Add a Link" Structured Task to Wikipedias that are supported by V2 model.

I've started a patch to deploy new models here: https://gerrit.wikimedia.org/r/c/research/mwaddlink/+/1199815
It's WIP but I think it should be ready to review tomorrow. I'll let you know.

Wed, Oct 29, 3:56 PM · Growth-Team (FY2025-26 Q2 Sprint 2), Add-Link-Structured-Task

Tue, Oct 21

OKarakaya-WMF added a comment to T405185: Introduce case sensitivity to machine learning model for Add a Link.

I think it should not be complicated to get offline scores for enwiki.
I'll get back to this and share the results soon after finishing some other tasks.

Tue, Oct 21, 10:47 AM · Community Feedback (Growth), Machine-Learning-Team, Growth-Team, Add-Link-Structured-Task

Fri, Oct 17

OKarakaya-WMF updated the task description for T400446: Update blubber version in inference services images.
Fri, Oct 17, 2:44 PM · Essential-Work, Machine-Learning-Team
OKarakaya-WMF updated the task description for T400446: Update blubber version in inference services images.
Fri, Oct 17, 1:13 PM · Essential-Work, Machine-Learning-Team

Thu, Oct 16

OKarakaya-WMF updated the task description for T400446: Update blubber version in inference services images.
Thu, Oct 16, 11:59 AM · Essential-Work, Machine-Learning-Team
OKarakaya-WMF updated the task description for T400446: Update blubber version in inference services images.
Thu, Oct 16, 11:59 AM · Essential-Work, Machine-Learning-Team
OKarakaya-WMF added a comment to T400446: Update blubber version in inference services images.

articlequality deployed to staging successfully:

Thu, Oct 16, 11:58 AM · Essential-Work, Machine-Learning-Team
OKarakaya-WMF updated the task description for T400446: Update blubber version in inference services images.
Thu, Oct 16, 10:40 AM · Essential-Work, Machine-Learning-Team
OKarakaya-WMF updated the task description for T400446: Update blubber version in inference services images.
Thu, Oct 16, 10:38 AM · Essential-Work, Machine-Learning-Team

Wed, Oct 15

OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

I've added questions from two large models into the prototype ui.
gpt-oss:120b, aya:35b
Overall evaluation is in progress.

Wed, Oct 15, 2:07 PM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF moved T400446: Update blubber version in inference services images from Ready To Go to In Progress on the Machine-Learning-Team board.
Wed, Oct 15, 12:41 PM · Essential-Work, Machine-Learning-Team
OKarakaya-WMF updated the task description for T400446: Update blubber version in inference services images.
Wed, Oct 15, 12:41 PM · Essential-Work, Machine-Learning-Team
OKarakaya-WMF changed the status of T400446: Update blubber version in inference services images from Open to In Progress.
Wed, Oct 15, 12:40 PM · Essential-Work, Machine-Learning-Team

Mon, Oct 13

OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

I've split the models below into two groups:

Mon, Oct 13, 10:46 AM · Semantic Search, Machine-Learning-Team

Oct 10 2025

OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.
  • I've checked several benchmarks related to QA generation:
Oct 10 2025, 12:57 PM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

Reporting (10/10/2025)

Progress update on the hypothesis for the week, including if something has shipped:

  • Last time, we discussed closing this goal, as the new models are moved to the new location
  • We will suggest Growth team that we deploy inference service if both teams agree.

Any updates on metrics related to this hypothesis (including baseline, target, or actuals, if applicable):

  • N/A

Any emerging blockers or risks:

  • N/A

Any unresolved dependencies:

  • N/A

New lessons from the hypothesis:

  • N/A

Changes to the hypothesis scope or timeline:

  • N/A
Oct 10 2025, 12:18 PM · Goal, Machine-Learning-Team

Oct 9 2025

OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

Thank you for the comments.
We can run the experiments on larger LLMs. I've checked that we can use some larger models (tested with gpt-oss:120b, llama4:maverick) on mllab.
I'll check further some public benchmarks, and see if we can re-run the experiments on a different set of LLMs.
I'll revisit the evaluation part if we can do it better with minimum human effort.

Oct 9 2025, 7:58 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Oct 9 2025, 7:49 AM · Semantic Search, Machine-Learning-Team

Oct 8 2025

OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Oct 8 2025, 8:04 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

Sharing the results for the larger dataset below.
I used evaluation model and the query model as same due to the limits on the cloud models.

Oct 8 2025, 8:04 AM · Semantic Search, Machine-Learning-Team

Oct 7 2025

OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Oct 7 2025, 11:06 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Oct 7 2025, 11:06 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Oct 7 2025, 10:54 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Oct 7 2025, 8:03 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Oct 7 2025, 8:02 AM · Semantic Search, Machine-Learning-Team

Oct 6 2025

OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

I've started a toolforge app
This is a Streamlit app where we keep the data in gitlab registry

Oct 6 2025, 1:41 PM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF added a comment to T404201: Semantic search prototype .

Can you share implementation? (dataset generation, and application)
I'm curious to know how it works in more details and it should help with the QA part to get answers as well.

Oct 6 2025, 7:11 AM · Research-engineering, Research

Oct 3 2025

OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Oct 3 2025, 12:05 PM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

Reporting (03/10/2025)

Progress update on the hypothesis for the week, including if something has shipped:

  • We have deployed the new models to the new location.

Any updates on metrics related to this hypothesis (including baseline, target, or actuals, if applicable):

  • N/A

Any emerging blockers or risks:

  • N/A

Any unresolved dependencies:

  • N/A

New lessons from the hypothesis:

  • N/A

Changes to the hypothesis scope or timeline:

  • As discussed, based on the availability of the Growth team, we can become the owner of the api. We can also split the goals to two:
    • Inference service deployments (MLTeam)
      • Current project has model per wiki. We have previously discussed how to reduce the number of models.
      • The project has a mariadb database where we store the data needed for inference.
    • Mediawiki deployments (Growth Team)
Oct 3 2025, 12:05 PM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

I've updated the prompt based on the previous scores.

Oct 3 2025, 7:28 AM · Semantic Search, Machine-Learning-Team

Oct 2 2025

OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

All models are deployed to the new location via the airflow dag.

Oct 2 2025, 2:04 PM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

Looking into the question related scores, we generally get low scores in question_relevance_to_title and curiosity.

Oct 2 2025, 11:49 AM · Semantic Search, Machine-Learning-Team

Oct 1 2025

OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

I've updated checks to a rubric based approach to:

  • Get better insights from generated QA
  • Compare models from multiple perspectives.
Oct 1 2025, 9:08 AM · Semantic Search, Machine-Learning-Team

Sep 30 2025

OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

Results for both gpt-oss:20b and aya-expanse:32b are available in the spreadsheet.

Sep 30 2025, 8:28 AM · Semantic Search, Machine-Learning-Team

Sep 26 2025

OKarakaya-WMF updated the task description for T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Sep 26 2025, 11:13 AM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

Reporting (26/09/2025)

Progress update on the hypothesis for the week, including if something has shipped:

  • We have agreed with Growth Team to collaborate in October 2025.

Any updates on metrics related to this hypothesis (including baseline, target, or actuals, if applicable):

  • N/A

Any emerging blockers or risks:

  • N/A

Any unresolved dependencies:

  • N/A

New lessons from the hypothesis:

  • N/A

Changes to the hypothesis scope or timeline:

  • We have shared an analysis about case-sensitive recommendations.
  • Deployments will start in October as agreed with the Growth Team.
Sep 26 2025, 11:13 AM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T405185: Introduce case sensitivity to machine learning model for Add a Link.

My short term suggestion is to make anchors case-sensitive and train/evaluate models. So that, we can analyse case where the performance increase/decrease.
Long term suggestion would be to have similarity between lower level embeddings (e.g. paragraph) as an additional feature.

Sep 26 2025, 11:07 AM · Community Feedback (Growth), Machine-Learning-Team, Growth-Team, Add-Link-Structured-Task

Sep 25 2025

OKarakaya-WMF added a comment to T405185: Introduce case sensitivity to machine learning model for Add a Link.

I'm sharing an analysis on case-insensitivity on enwiki and simplewiki.

Sep 25 2025, 10:28 AM · Community Feedback (Growth), Machine-Learning-Team, Growth-Team, Add-Link-Structured-Task

Sep 24 2025

OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

Alternative ranking strategy from Fabian:
https://huggingface.co/BAAI/bge-reranker-v2-gemma

Sep 24 2025, 12:32 PM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF added a comment to T405359: Semantic Search POC - In article QA.

Git branch for the current work

Sep 24 2025, 11:28 AM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Sep 24 2025, 11:23 AM · Semantic Search, Machine-Learning-Team

Sep 23 2025

OKarakaya-WMF added a comment to T404460: Add a Link: Rollout "Add a Link" Structured Task to Wikipedias that are supported by V2 model.

hello @KStoller-WMF ,
I totally agree 💯 . All clear, thank you!

Sep 23 2025, 2:20 PM · Growth-Team (FY2025-26 Q2 Sprint 2), Add-Link-Structured-Task
OKarakaya-WMF updated the task description for T405359: Semantic Search POC - In article QA.
Sep 23 2025, 2:08 PM · Semantic Search, Machine-Learning-Team
OKarakaya-WMF created T405359: Semantic Search POC - In article QA.
Sep 23 2025, 2:06 PM · Semantic Search, Machine-Learning-Team

Sep 22 2025

OKarakaya-WMF added a comment to T404460: Add a Link: Rollout "Add a Link" Structured Task to Wikipedias that are supported by V2 model.

About the release of new wikis that are above the release threshold in v2 and do not have add-a-link onboarding tasks;
I share the below the list of wikis filtered by the criteria above (47 in total);
The wikis are sorted by ~their size.

Sep 22 2025, 9:32 AM · Growth-Team (FY2025-26 Q2 Sprint 2), Add-Link-Structured-Task
OKarakaya-WMF added a project to T404460: Add a Link: Rollout "Add a Link" Structured Task to Wikipedias that are supported by V2 model: Machine-Learning-Team.
Sep 22 2025, 8:21 AM · Growth-Team (FY2025-26 Q2 Sprint 2), Add-Link-Structured-Task
OKarakaya-WMF added a comment to T404460: Add a Link: Rollout "Add a Link" Structured Task to Wikipedias that are supported by V2 model.

Hello good morning,

Sep 22 2025, 8:17 AM · Growth-Team (FY2025-26 Q2 Sprint 2), Add-Link-Structured-Task

Sep 19 2025

OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

Progress update on the hypothesis for the week, including if something has shipped:

  • We propose a release plan in collaboration to the Growth Team. I understand they also want to add the wikis to the tasks. Therefore, we will update the plan.

Any updates on metrics related to this hypothesis (including baseline, target, or actuals, if applicable):

  • N/A

Any emerging blockers or risks:

  • The serving patch needs to be reviewed/merged/deployed.

Any unresolved dependencies:

  • N/A

New lessons from the hypothesis:

  • N/A

Changes to the hypothesis scope or timeline:

  • We collaborate with the Growth Team on the release plan in scope of this task.
Sep 19 2025, 1:59 PM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T404460: Add a Link: Rollout "Add a Link" Structured Task to Wikipedias that are supported by V2 model.
  • I know the inference api currently supports the wikis here.
  • Also I have the list of wikis that are below/above the release threshold
  • However, I'm missing the information about which wikis are enabled in tasks currently. Can you share this information? Have we already enabled tasks for all the wikis here. I can look into usage if this is not easy to find.
  • As we want to enable tasks for wikis, I think we should depend on the list of wikis currently enabled in tasks, rather than the list of wikis that are currently being served. They might be the same though. I just want to make sure.
Sep 19 2025, 1:36 PM · Growth-Team (FY2025-26 Q2 Sprint 2), Add-Link-Structured-Task

Sep 12 2025

OKarakaya-WMF updated the task description for T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Sep 12 2025, 12:27 PM · Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Sep 12 2025, 11:29 AM · Goal, Machine-Learning-Team

Sep 11 2025

OKarakaya-WMF updated the task description for T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Sep 11 2025, 12:28 PM · Goal, Machine-Learning-Team
OKarakaya-WMF updated subscribers of T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Sep 11 2025, 12:20 PM · Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Sep 11 2025, 12:19 PM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

I've calculated online scores for add-a-link here
I share the main highlights below:
We can re-use the notebook to calculate scores some time after the model releases.

Sep 11 2025, 12:15 PM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

enwiki results:

Sep 11 2025, 7:46 AM · Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Sep 11 2025, 7:40 AM · Goal, Machine-Learning-Team

Sep 9 2025

OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

csv in the previous comment is also available here:

Sep 9 2025, 2:28 PM · Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Sep 9 2025, 12:44 PM · Goal, Machine-Learning-Team

Sep 8 2025

OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

I've picked the best scores and compared v1 (results from current prod) vs v2 (results from the new pipeline).

Sep 8 2025, 8:40 AM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

Benchmark completed (except for enwiki):

Sep 8 2025, 7:21 AM · Goal, Machine-Learning-Team

Sep 2 2025

OKarakaya-WMF added a comment to T400902: Create an analytics service user for the ML team.

Thank you @brouberol ,

Sep 2 2025, 2:37 PM · Essential-Work, Data-Platform-SRE (2025.08.16 - 2025.09.05), Machine-Learning-Team
OKarakaya-WMF updated the task description for T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Sep 2 2025, 2:08 PM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T275358: XGboost fails to load JSON model on macOS when using Docker image built on Linux host.

This will be fixed in scope of https://phabricator.wikimedia.org/T398950

Sep 2 2025, 8:20 AM · Growth-Team-Filtering, Growth-Team, Add-Link-Structured-Task
OKarakaya-WMF updated the task description for T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Sep 2 2025, 8:19 AM · Goal, Machine-Learning-Team

Aug 28 2025

OKarakaya-WMF added a comment to T400902: Create an analytics service user for the ML team.

Use_the_yarn_CLI page works like charm!

Aug 28 2025, 11:29 AM · Essential-Work, Data-Platform-SRE (2025.08.16 - 2025.09.05), Machine-Learning-Team
OKarakaya-WMF added a comment to T400902: Create an analytics service user for the ML team.

hey @brouberol ,

I'm getting following errors. Could it be related to the patches above?

ozge@stat1010:~$ kinit
Password for analytics-ml/stat1010.eqiad.wmnet@WIKIMEDIA:
kinit: Password incorrect while getting initial credentials
hdfs dfs -ls /user/ozge/addalink
Permission denied: user=analytics-ml, access=EXECUTE, inode="/user/ozge":ozge:ozge:drwxr-x---
hdfs dfs -ls /tmp/ozge/addalink
ls: Permission denied: user=analytics-ml, access=READ_EXECUTE, inode="/tmp/ozge/addalink":ozge:hdfs:drwxr-x---
Aug 28 2025, 7:52 AM · Essential-Work, Data-Platform-SRE (2025.08.16 - 2025.09.05), Machine-Learning-Team

Aug 27 2025

OKarakaya-WMF added a comment to T400902: Create an analytics service user for the ML team.

oh thanks :)

Aug 27 2025, 2:02 PM · Essential-Work, Data-Platform-SRE (2025.08.16 - 2025.09.05), Machine-Learning-Team
OKarakaya-WMF added a comment to T400902: Create an analytics service user for the ML team.

I don't have access to yarn logs. Is it expected?

Aug 27 2025, 1:12 PM · Essential-Work, Data-Platform-SRE (2025.08.16 - 2025.09.05), Machine-Learning-Team
OKarakaya-WMF updated the task description for T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Aug 27 2025, 1:04 PM · Goal, Machine-Learning-Team
OKarakaya-WMF updated the task description for T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Aug 27 2025, 1:01 PM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

staging release airflow dag tested on dev with three wikis and it works well.

Aug 27 2025, 12:50 PM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.

airflow dag mr for staging release:
https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/1638

Aug 27 2025, 12:45 PM · Goal, Machine-Learning-Team

Aug 26 2025

OKarakaya-WMF updated the task description for T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Aug 26 2025, 12:03 PM · Goal, Machine-Learning-Team
OKarakaya-WMF added a comment to T400902: Create an analytics service user for the ML team.

cool, no problem! it's back to normal 😍

Aug 26 2025, 10:31 AM · Essential-Work, Data-Platform-SRE (2025.08.16 - 2025.09.05), Machine-Learning-Team
OKarakaya-WMF added a comment to T400902: Create an analytics service user for the ML team.

I'm getting following errors. Could it be related to the patches above?

Aug 26 2025, 9:48 AM · Essential-Work, Data-Platform-SRE (2025.08.16 - 2025.09.05), Machine-Learning-Team
OKarakaya-WMF updated the task description for T398950: Q1 FY2025-26 Goal: Scaling Add-a-link to more wikis via production (airflow) pipelines.
Aug 26 2025, 7:49 AM · Goal, Machine-Learning-Team