Page MenuHomePhabricator

gkyziridis (George Kyziridis)
ML-Engineer

Today

  • No visible events.

Tomorrow

  • No visible events.

Tuesday

  • No visible events.

User Details

User Since
Jan 6 2025, 12:21 PM (56 w, 6 d)
Availability
Available
IRC Nick
georgekyz
LDAP User
Gkyziridis
MediaWiki User
GKyziridis-WMF [ Global Accounts ]

Recent Activity

Fri, Feb 6

gkyziridis added a comment to T396495: Build model training pipeline for tone check using WMF ML Airflow instance.

Update

Since the task: T406217 is finished we have a first version of end-to-end pipeline including all the basic steps of an ML-Lifecycle: Data Generation -> Model Training -> Export model in S3 bucket.
More info could be found here: https://phabricator.wikimedia.org/T398970

Fri, Feb 6, 1:18 PM · Data-Platform-SRE (2026.01.23 - 2026.02.13), Essential-Work, Editing-team (Tracking), Machine-Learning-Team
gkyziridis added a comment to T398970: Q1 FY2025-26 Goal: Airflow training pipeline for Tone check model.

Generate Data (SparkSubmitOperator) -> Train/Validation/Test split (SparkSubmitOperator) -> Copy from HDFS to a PVC (WMFKubernetesPodOperator) -> Train model on GPU pod (WMFKubernetesPodOperator) -> Copy retrained model to S3 (PythonOperator)

Fri, Feb 6, 1:13 PM · Goal, Machine-Learning-Team
gkyziridis closed T406217: Export retrained Tone-check model to an S3 bucket, a subtask of T398970: Q1 FY2025-26 Goal: Airflow training pipeline for Tone check model, as Resolved.
Fri, Feb 6, 1:07 PM · Goal, Machine-Learning-Team
gkyziridis closed T406217: Export retrained Tone-check model to an S3 bucket as Resolved.
Fri, Feb 6, 1:07 PM · Patch-For-Review, Machine-Learning-Team
gkyziridis closed T396495: Build model training pipeline for tone check using WMF ML Airflow instance, a subtask of T365301: Tone Check: Prompt people to revise promotional language, as Resolved.
Fri, Feb 6, 1:06 PM · Epic, EditCheck, VisualEditor
gkyziridis closed T396495: Build model training pipeline for tone check using WMF ML Airflow instance, a subtask of T391940: FY2024-25 Q4 Goal: Productionize tone check model, as Resolved.
Fri, Feb 6, 1:05 PM · Goal, Machine-Learning-Team
gkyziridis closed T396495: Build model training pipeline for tone check using WMF ML Airflow instance, a subtask of T398970: Q1 FY2025-26 Goal: Airflow training pipeline for Tone check model, as Resolved.
Fri, Feb 6, 1:05 PM · Goal, Machine-Learning-Team
gkyziridis closed T396495: Build model training pipeline for tone check using WMF ML Airflow instance as Resolved.
Fri, Feb 6, 1:05 PM · Data-Platform-SRE (2026.01.23 - 2026.02.13), Essential-Work, Editing-team (Tracking), Machine-Learning-Team
gkyziridis added a comment to T415892: Add Multilingual RevertRisk predictions to mediawiki.page_revert_risk_prediction_change.

Hey, I am working on this, I think that I have finished the implementation for publishing the predictions in events. I am now testing it locally.
Based on this: https://wikitech.wikimedia.org/wiki/Machine_Learning/LiftWing/Streams I think there are these steps:

  1. Implementation on inference-services side (this is what I am testing).
  2. Test it and deploy the new model server versions.
  3. Configure Changeprop.
  4. Configure the new changes in the mediawiki-config repo.
Fri, Feb 6, 12:43 PM · Event-Platform, Data-Engineering, Machine-Learning-Team

Tue, Feb 3

gkyziridis closed T411786: ORES is not working on testwiki as Resolved.
Tue, Feb 3, 10:31 AM · Automoderator, Moderator-Tools-Team, Machine-Learning-Team, ORES

Fri, Jan 30

gkyziridis moved T406217: Export retrained Tone-check model to an S3 bucket from In Progress to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Fri, Jan 30, 12:20 PM · Patch-For-Review, Machine-Learning-Team
gkyziridis moved T396495: Build model training pipeline for tone check using WMF ML Airflow instance from Ready To Go to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Fri, Jan 30, 12:20 PM · Data-Platform-SRE (2026.01.23 - 2026.02.13), Essential-Work, Editing-team (Tracking), Machine-Learning-Team

Thu, Jan 29

gkyziridis added a comment to T412357: Install AMD GPU + torch version of ML Labs machines.

Hey @Isaac, this ticket is assigned to @klausman but he is currently on his sabbatical. He will start working on this when he is back, I think around next month (???).
I am tagging @DPogorzelski-WMF here for visibility, maybe he has something more to add.

Thu, Jan 29, 12:25 PM · Machine-Learning-Team
gkyziridis added a comment to T406217: Export retrained Tone-check model to an S3 bucket.

Update

The end-to-end tone-check retraining pipeline succeeded, we solved the issues of Multy-Attach PVC.

image.png (946×2 px, 112 KB)

The new version of the retrained tone-check model is successfully copied to the dedicated S3 bucket under: s3://wmf-ml-models/retrained-models/tone-check/, here are the logs of the export step:
1tone-check-training-dag-move-model-to-s3-nv8wgsew
2 ▶ Log message source details
3[2026-01-28, 22:24:03 UTC] {local_task_job_runner.py:123} ▶ Pre task execution logs
4[2026-01-28, 22:24:04 UTC] {crypto.py:82} WARNING - empty cryptography key - values will not be stored encrypted.
5[2026-01-28, 22:24:05 UTC] {tone_check_training_dag.py:101} INFO - [+] S3 client loaded !
6[2026-01-28, 22:24:05 UTC] {tone_check_training_dag.py:103} INFO - Searching files in /mnt/model-training/tone_check/20260128T134152/output_model:
7[2026-01-28, 22:24:05 UTC] {tone_check_training_dag.py:109} INFO - - File: /mnt/model-training/tone_check/20260128T134152/output_model/checkpoint-21530/config.json | wmf-ml-models | retrained-models/tone-check/checkpoint-21530/config.json
8[2026-01-28, 22:24:05 UTC] {tone_check_training_dag.py:109} INFO - - File: /mnt/model-training/tone_check/20260128T134152/output_model/checkpoint-21530/model.safetensors | wmf-ml-models | retrained-models/tone-check/checkpoint-21530/model.safetensors
9[2026-01-28, 22:24:12 UTC] {tone_check_training_dag.py:109} INFO - - File: /mnt/model-training/tone_check/20260128T134152/output_model/checkpoint-21530/special_tokens_map.json | wmf-ml-models | retrained-models/tone-check/checkpoint-21530/special_tokens_map.json
10[2026-01-28, 22:24:12 UTC] {tone_check_training_dag.py:109} INFO - - File: /mnt/model-training/tone_check/20260128T134152/output_model/checkpoint-21530/rng_state.pth | wmf-ml-models | retrained-models/tone-check/checkpoint-21530/rng_state.pth
11[2026-01-28, 22:24:12 UTC] {tone_check_training_dag.py:109} INFO - - File: /mnt/model-training/tone_check/20260128T134152/output_model/checkpoint-21530/tokenizer_config.json | wmf-ml-models | retrained-models/tone-check/checkpoint-21530/tokenizer_config.json
12[2026-01-28, 22:24:13 UTC] {tone_check_training_dag.py:109} INFO - - File: /mnt/model-training/tone_check/20260128T134152/output_model/checkpoint-21530/vocab.txt | wmf-ml-models | retrained-models/tone-check/checkpoint-21530/vocab.txt
13[2026-01-28, 22:24:13 UTC] {tone_check_training_dag.py:109} INFO - - File: /mnt/model-training/tone_check/20260128T134152/output_model/checkpoint-21530/tokenizer.json | wmf-ml-models | retrained-models/tone-check/checkpoint-21530/tokenizer.json
14[2026-01-28, 22:24:13 UTC] {tone_check_training_dag.py:109} INFO - - File: /mnt/model-training/tone_check/20260128T134152/output_model/checkpoint-21530/training_args.bin | wmf-ml-models | retrained-models/tone-check/checkpoint-21530/training_args.bin
15[2026-01-28, 22:24:14 UTC] {tone_check_training_dag.py:109} INFO - - File: /mnt/model-training/tone_check/20260128T134152/output_model/checkpoint-21530/scheduler.pt | wmf-ml-models | retrained-models/tone-check/checkpoint-21530/scheduler.pt
16[2026-01-28, 22:24:14 UTC] {tone_check_training_dag.py:109} INFO - - File: /mnt/model-training/tone_check/20260128T134152/output_model/checkpoint-21530/trainer_state.json | wmf-ml-models | retrained-models/tone-check/checkpoint-21530/trainer_state.json
17[2026-01-28, 22:24:14 UTC] {tone_check_training_dag.py:109} INFO - - File: /mnt/model-training/tone_check/20260128T134152/output_model/checkpoint-21530/optimizer.pt | wmf-ml-models | retrained-models/tone-check/checkpoint-21530/optimizer.pt
18[2026-01-28, 22:24:29 UTC] {tone_check_training_dag.py:112} INFO - [+] Files uploded correctly at: s3://wmf-ml-models/retrained-models/tone-check//
19[2026-01-28, 22:24:29 UTC] {python.py:240} INFO - Done. Returned value was: None

Here are the content of the S3 bucket:

$ s3cmd -c /etc/s3cmd/cfg.d/ml-team.cfg ls -H s3://wmf-ml-models/retrained-models/tone-check/checkpoint-21530/
2026-01-28 22:24   865   s3://wmf-ml-models/retrained-models/tone-check/checkpoint-21530/config.json
2026-01-28 22:24   678M  s3://wmf-ml-models/retrained-models/tone-check/checkpoint-21530/model.safetensors
2026-01-28 22:24  1357M  s3://wmf-ml-models/retrained-models/tone-check/checkpoint-21530/optimizer.pt
2026-01-28 22:24    13K  s3://wmf-ml-models/retrained-models/tone-check/checkpoint-21530/rng_state.pth
2026-01-28 22:24  1064   s3://wmf-ml-models/retrained-models/tone-check/checkpoint-21530/scheduler.pt
2026-01-28 22:24   695   s3://wmf-ml-models/retrained-models/tone-check/checkpoint-21530/special_tokens_map.json
2026-01-28 22:24     2M  s3://wmf-ml-models/retrained-models/tone-check/checkpoint-21530/tokenizer.json
2026-01-28 22:24  1330   s3://wmf-ml-models/retrained-models/tone-check/checkpoint-21530/tokenizer_config.json
2026-01-28 22:24     9K  s3://wmf-ml-models/retrained-models/tone-check/checkpoint-21530/trainer_state.json
2026-01-28 22:24     5K  s3://wmf-ml-models/retrained-models/tone-check/checkpoint-21530/training_args.bin
2026-01-28 22:24   972K  s3://wmf-ml-models/retrained-models/tone-check/checkpoint-21530/vocab.txt
Thu, Jan 29, 8:53 AM · Patch-For-Review, Machine-Learning-Team
gkyziridis created P88090 Logs of move_model_to_s3_task from Airflow.
Thu, Jan 29, 8:50 AM · Machine-Learning-Team

Wed, Jan 28

gkyziridis added a comment to T405358: Add LiftWing streams data to event_sanitized (increase data retention).

We are currently do not store anywhere the predictions from the rr-multilingual model so we cannot export them in the same way that we are doing for the rr-language-agnostic one.
If there is this necessity, I can open a new Phabricator task in order to start developing the first step of saving the slice of the rr-multilingual predictions into the event stream, and then we can add them to the refinery and export them into the event_sanitized as we do for the rr-langugage-agnostic.

Wed, Jan 28, 2:32 PM · Lift-Wing, Machine-Learning-Team
gkyziridis added a comment to T405358: Add LiftWing streams data to event_sanitized (increase data retention).

@gkyziridis I'm testing this out today but only seeing revertrisk-language-agnostic for an example revision on enwiki, is that expected?

spark-sql (default)> select predicted_classification from event.mediawiki_page_revert_risk_prediction_change_v1 where revision.rev_id = 1333904928;
predicted_classification
{"model_name":"revertrisk-language-agnostic","model_version":"3","predictions":["false"],"probabilities":{"false":0.7348057627677917,"true":0.26519423723220825}}
Wed, Jan 28, 1:54 PM · Lift-Wing, Machine-Learning-Team
gkyziridis created P87991 Error during training pipeline.
Wed, Jan 28, 11:27 AM

Tue, Jan 27

gkyziridis added a comment to T406217: Export retrained Tone-check model to an S3 bucket.

I also checked the PVC using kubectl and I see that the PVC is "RWO": "ReadWriteOnce" I am not sure if this makes the problem:

$ kube_env airflow-ml-deploy dse-k8s-eqiad
$ kubectl get pvc airflow-ml-model-training -n airflow-dev
NAME                        STATUS   VOLUME                                     CAPACITY   ACCESS MODES   STORAGECLASS   AGE
airflow-ml-model-training   Bound    pvc-8a6a2920-8d7e-4616-8ab6-a6a70b26d116   20Gi       RWO            ceph-rbd-ssd   151d
Tue, Jan 27, 8:56 AM · Patch-For-Review, Machine-Learning-Team

Wed, Jan 21

gkyziridis added a comment to T406217: Export retrained Tone-check model to an S3 bucket.
$ s3cmd -c /etc/s3cmd/cfg.d/ml-team.cfg ls -H --recursive s3://wmf-ml-models/retrained-models/tone-check/checkpoint-63618/
2026-01-20 13:33   865   s3://wmf-ml-models/retrained-models/tone-check/checkpoint-63618/config.json
2026-01-20 13:33   678M  s3://wmf-ml-models/retrained-models/tone-check/checkpoint-63618/model.safetensors
2026-01-20 13:33  1357M  s3://wmf-ml-models/retrained-models/tone-check/checkpoint-63618/optimizer.pt
2026-01-20 13:33    13K  s3://wmf-ml-models/retrained-models/tone-check/checkpoint-63618/rng_state.pth
2026-01-20 13:33  1064   s3://wmf-ml-models/retrained-models/tone-check/checkpoint-63618/scheduler.pt
2026-01-20 13:33   695   s3://wmf-ml-models/retrained-models/tone-check/checkpoint-63618/special_tokens_map.json
2026-01-20 13:33     2M  s3://wmf-ml-models/retrained-models/tone-check/checkpoint-63618/tokenizer.json
2026-01-20 13:33  1330   s3://wmf-ml-models/retrained-models/tone-check/checkpoint-63618/tokenizer_config.json
2026-01-20 13:33    24K  s3://wmf-ml-models/retrained-models/tone-check/checkpoint-63618/trainer_state.json
2026-01-20 13:33     5K  s3://wmf-ml-models/retrained-models/tone-check/checkpoint-63618/training_args.bin
2026-01-20 13:33   972K  s3://wmf-ml-models/retrained-models/tone-check/checkpoint-63618/vocab.txt
Wed, Jan 21, 2:43 PM · Patch-For-Review, Machine-Learning-Team
gkyziridis created P87832 Airflow logs for move_model_to_s3_task.
Wed, Jan 21, 2:42 PM

Tue, Jan 20

gkyziridis moved T411786: ORES is not working on testwiki from In Progress to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Tue, Jan 20, 2:56 PM · Automoderator, Moderator-Tools-Team, Machine-Learning-Team, ORES
gkyziridis moved T406217: Export retrained Tone-check model to an S3 bucket from Ready To Go to In Progress on the Machine-Learning-Team board.
Tue, Jan 20, 2:56 PM · Patch-For-Review, Machine-Learning-Team

Thu, Jan 15

gkyziridis created P87559 Promptathon - LLM gemma3:4b model results.
Thu, Jan 15, 3:52 PM

Mon, Jan 12

gkyziridis created P87372 Error in isvc during rr-multilingual deployment.
Mon, Jan 12, 3:36 PM
gkyziridis added a comment to T406179: Q2 FY2025-26 Goal: Host Wikidata Revert Risk model on LiftWing.

Weekly Update:

  • The Wikimedia Enterprise team conducted load tests to simulate their traffic and shared results in T409388#11483570
  • We are working on optimizing the revertrisk-wikidata inference service to achieve the Enterprise team's latency target in T414060
Mon, Jan 12, 3:28 PM · OKR-Work, Goal, Wikimedia Enterprise - Content Integrity, Wikimedia Enterprise, Wikidata, Lift-Wing, Machine-Learning-Team

Jan 9 2026

gkyziridis added a comment to T411786: ORES is not working on testwiki.
curl -s -X \
POST "https://inference.svc.eqiad.wmnet:30443/v1/models/revertrisk-language-agnostic:predict" \
-d '{"rev_id": 2, "lang": "test"}' \
-H "Host: revertrisk-language-agnostic.revertrisk.wikimedia.org"
Jan 9 2026, 3:38 PM · Automoderator, Moderator-Tools-Team, Machine-Learning-Team, ORES

Jan 6 2026

gkyziridis added a comment to T411786: ORES is not working on testwiki.

Things we need to keep in mind:

  • Testwiki is not a canonical/normal wiki so it is excluded from canonical_wikis list
  • Testwiki is not a supported wiki for the revertrisk model, so predictions will be completely inaccurate.
  • We treat testwiki as enwiki on the fly in order for the revert-risk model server to accept such API hits posting {"lang"="test"}
Jan 6 2026, 11:20 AM · Automoderator, Moderator-Tools-Team, Machine-Learning-Team, ORES
gkyziridis edited P86768 Testing locally revertrisk base model treating testwiki' as 'enwiki' on the fly..
Jan 6 2026, 10:58 AM · Machine-Learning-Team, ml-model-requests
gkyziridis created P86768 Testing locally revertrisk base model treating testwiki' as 'enwiki' on the fly..
Jan 6 2026, 10:57 AM · Machine-Learning-Team, ml-model-requests
gkyziridis moved T407155: [SPIKE] Define process for validating Tone Check model eval data for languages staff members do not speak from In Progress to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Jan 6 2026, 9:40 AM · Machine-Learning-Team, EditCheck, VisualEditor

Dec 18 2025

gkyziridis moved T411786: ORES is not working on testwiki from 2025-2026 Q2 Done to In Progress on the Machine-Learning-Team board.
Dec 18 2025, 10:21 AM · Automoderator, Moderator-Tools-Team, Machine-Learning-Team, ORES
gkyziridis moved T411786: ORES is not working on testwiki from Unsorted to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Dec 18 2025, 10:21 AM · Automoderator, Moderator-Tools-Team, Machine-Learning-Team, ORES
gkyziridis moved T410663: Upgrade AMD GPU + torch version of ML Labs machines from Ready To Go to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Dec 18 2025, 10:21 AM · Essential-Work, Machine-Learning-Team
gkyziridis closed T410663: Upgrade AMD GPU + torch version of ML Labs machines as Resolved.
Dec 18 2025, 10:20 AM · Essential-Work, Machine-Learning-Team
gkyziridis claimed T411786: ORES is not working on testwiki.
Dec 18 2025, 8:55 AM · Automoderator, Moderator-Tools-Team, Machine-Learning-Team, ORES

Dec 17 2025

gkyziridis added a comment to T411786: ORES is not working on testwiki.

The testwiki is not a canonical Wikipedia, it is a testing environment where articles can be written in any language, and it wasn’t part of the RR model’s training data, so we excluded from the list of canonical Wikipedias. So the, RR model doesn’t support testwiki.
We can easily parse this in the following two requests to enwiki and testwiki, respectively:

Dec 17 2025, 2:54 PM · Automoderator, Moderator-Tools-Team, Machine-Learning-Team, ORES

Dec 15 2025

gkyziridis created P86609 RR Threshold Analysis Query .
Dec 15 2025, 12:37 PM · Machine-Learning-Team

Dec 11 2025

gkyziridis updated the task description for T412357: Install AMD GPU + torch version of ML Labs machines.
Dec 11 2025, 10:45 AM · Machine-Learning-Team
gkyziridis updated the task description for T412357: Install AMD GPU + torch version of ML Labs machines.
Dec 11 2025, 10:45 AM · Machine-Learning-Team
gkyziridis added a comment to T410663: Upgrade AMD GPU + torch version of ML Labs machines.

Nevertheless, this combination of versions seems to fix the issue using the GPU image, so your curiosity is in a super good shape towards the correct direction :P.

Haha, always happy to be accidentally helpful :) Once it's deployed on ml-lab1002, happy to test but definitely looking promising!

Dec 11 2025, 10:44 AM · Essential-Work, Machine-Learning-Team
gkyziridis updated the task description for T412357: Install AMD GPU + torch version of ML Labs machines.
Dec 11 2025, 10:37 AM · Machine-Learning-Team
gkyziridis updated the task description for T412357: Install AMD GPU + torch version of ML Labs machines.
Dec 11 2025, 10:36 AM · Machine-Learning-Team
gkyziridis updated the task description for T412357: Install AMD GPU + torch version of ML Labs machines.
Dec 11 2025, 10:33 AM · Machine-Learning-Team
gkyziridis updated the task description for T412357: Install AMD GPU + torch version of ML Labs machines.
Dec 11 2025, 10:30 AM · Machine-Learning-Team
gkyziridis updated the task description for T412357: Install AMD GPU + torch version of ML Labs machines.
Dec 11 2025, 10:30 AM · Machine-Learning-Team
gkyziridis created T412357: Install AMD GPU + torch version of ML Labs machines.
Dec 11 2025, 10:29 AM · Machine-Learning-Team

Dec 10 2025

gkyziridis added a comment to T410663: Upgrade AMD GPU + torch version of ML Labs machines.

Thanks @gkyziridis for digging into this! Out of curiosity, why not jump to the current stable versions (2.9.1 for torch and 6.4 for AMD)? I see you commented that line out in the initial file that at least had torch at 2.9.1.

Dec 10 2025, 11:37 AM · Essential-Work, Machine-Learning-Team
gkyziridis created P86494 Dockerfile torch2.9.1+rocm6.4.
Dec 10 2025, 11:24 AM · Machine-Learning-Team
gkyziridis updated the title for P86473 Train 'jhu-clsp/mmBERT-base' using Trainer on GPU from Train 'answerdotai/ModernBERT-base' using Trainer on GPU to Train 'jhu-clsp/mmBERT-base' using Trainer on GPU.
Dec 10 2025, 11:14 AM · Machine-Learning-Team
gkyziridis created P86493 Train 'jhu-clsp/mmBERT-base' using Trainer on GPU.
Dec 10 2025, 11:14 AM · Machine-Learning-Team

Dec 9 2025

gkyziridis edited P86473 Train 'jhu-clsp/mmBERT-base' using Trainer on GPU.
Dec 9 2025, 3:29 PM · Machine-Learning-Team
gkyziridis edited P86473 Train 'jhu-clsp/mmBERT-base' using Trainer on GPU.
Dec 9 2025, 3:29 PM · Machine-Learning-Team
gkyziridis edited P86473 Train 'jhu-clsp/mmBERT-base' using Trainer on GPU.
Dec 9 2025, 3:28 PM · Machine-Learning-Team
gkyziridis edited P86473 Train 'jhu-clsp/mmBERT-base' using Trainer on GPU.
Dec 9 2025, 2:48 PM · Machine-Learning-Team
gkyziridis added a comment to T410663: Upgrade AMD GPU + torch version of ML Labs machines.

I built the image using: docker build --network=host -t torch_rocm3 .

Dec 9 2025, 2:38 PM · Essential-Work, Machine-Learning-Team
gkyziridis updated the title for P86467 Dockerfile torch2.6.0+rocm6.1 CPU from Dockerfile to Dockerfile torch2.6.0+rocm6.1 CPU.
Dec 9 2025, 2:30 PM · Machine-Learning-Team, ml-model-requests
gkyziridis edited P86473 Train 'jhu-clsp/mmBERT-base' using Trainer on GPU.
Dec 9 2025, 2:29 PM · Machine-Learning-Team
gkyziridis created P86478 Dockerfile torch2.6.0+rocm6.1 GPU.
Dec 9 2025, 2:20 PM · Machine-Learning-Team
gkyziridis edited P86473 Train 'jhu-clsp/mmBERT-base' using Trainer on GPU.
Dec 9 2025, 2:18 PM · Machine-Learning-Team
gkyziridis edited P86473 Train 'jhu-clsp/mmBERT-base' using Trainer on GPU.
Dec 9 2025, 1:29 PM · Machine-Learning-Team
gkyziridis edited P86473 Train 'jhu-clsp/mmBERT-base' using Trainer on GPU.
Dec 9 2025, 1:25 PM · Machine-Learning-Team
gkyziridis edited P86473 Train 'jhu-clsp/mmBERT-base' using Trainer on GPU.
Dec 9 2025, 1:24 PM · Machine-Learning-Team
gkyziridis created P86473 Train 'jhu-clsp/mmBERT-base' using Trainer on GPU.
Dec 9 2025, 1:22 PM · Machine-Learning-Team
gkyziridis edited P86472 Train 'answerdotai/ModernBERT-base' using Trainer on CPU.
Dec 9 2025, 1:14 PM · Machine-Learning-Team
gkyziridis updated the title for P86472 Train 'answerdotai/ModernBERT-base' using Trainer on CPU from Train model using Trainer to Train 'answerdotai/ModernBERT-base' using Trainer on CPU.
Dec 9 2025, 1:13 PM · Machine-Learning-Team
gkyziridis edited P86472 Train 'answerdotai/ModernBERT-base' using Trainer on CPU.
Dec 9 2025, 1:12 PM · Machine-Learning-Team
gkyziridis created P86472 Train 'answerdotai/ModernBERT-base' using Trainer on CPU.
Dec 9 2025, 1:11 PM · Machine-Learning-Team
gkyziridis edited P86468 Docker run.
Dec 9 2025, 10:59 AM · Machine-Learning-Team
gkyziridis edited P86468 Docker run.
Dec 9 2025, 10:55 AM · Machine-Learning-Team
gkyziridis created P86468 Docker run.
Dec 9 2025, 10:47 AM · Machine-Learning-Team
gkyziridis created P86467 Dockerfile torch2.6.0+rocm6.1 CPU.
Dec 9 2025, 10:28 AM · Machine-Learning-Team, ml-model-requests
gkyziridis claimed T410663: Upgrade AMD GPU + torch version of ML Labs machines.
Dec 9 2025, 10:24 AM · Essential-Work, Machine-Learning-Team

Dec 3 2025

gkyziridis added a comment to T409438: Enable revertrisk filters in thwiki.

Model configuration and threshold configuration have been deployed.
The next step is to backport adding thwiki to the ORES dblist: https://gerrit.wikimedia.org/r/c/operations/mediawiki-config/+/1207923.
Let's discuss in engineering weekly when we would like to go ahead and do that along with the backfill script.

Dec 3 2025, 2:11 PM · Moderator-Tools-Team, Thai-Sites, OKR-Work, Machine-Learning-Team, MediaWiki-extensions-ORES, PersonalDashboard, MediaWiki-Recent-changes

Nov 27 2025

gkyziridis added a comment to T408607: AI/ML Infrastructure Request: Assistance in Rolling out Revert Risk to wikis that don't have damaging/goodfaith models.

When we start the actual deployment:
Due to the fact that we have a huge number of wikis which are needed to be deployed, I suggest to to do it in batches. Right now, in the patch above only the thresholds are set for each wiki, that means that if this patch is merged and deployed nothing will be changed. In the next iterations, when we start to deploying the wikis we need to enable ORES model and enable the UI as well. Only then the thresholds which are configured in the patch will be functional. So, I suggest to enable ORES model in batches e.g. for 4-5 wikis per batch. This will take some time to finish all batches, but we can easily handle issues that could occur during the backport deployments

Nov 27 2025, 10:32 AM · Patch-For-Review, MediaWiki-Recent-changes, PersonalDashboard, Moderator-Tools-Team, Machine-Learning-Team
gkyziridis added a comment to T409438: Enable revertrisk filters in thwiki.
Update

I configure all the rr thresholds for all the wikis and enabled the model for all of them in this patch: https://gerrit.wikimedia.org/r/c/operations/mediawiki-config/+/1212086 .
I excluded thwiki from the above patch since you are using it for the MVP.
I also avoided to run the composer manage-dblist add {wiki_name} ores for all the wikis, which means that whenever we deploy all these wikis we need to run the composer for all of them.

Nov 27 2025, 10:31 AM · Moderator-Tools-Team, Thai-Sites, OKR-Work, Machine-Learning-Team, MediaWiki-extensions-ORES, PersonalDashboard, MediaWiki-Recent-changes

Nov 26 2025

gkyziridis added a comment to T409438: Enable revertrisk filters in thwiki.

I think that there is one more step which needs to be done which is to run: composer manage-dblist add {wiki_name} ores. I do not see thwiki to be added under "dblists/ores.dblist" file in this patch -> https://gerrit.wikimedia.org/r/c/operations/mediawiki-config/+/1207932

Nov 26 2025, 9:29 AM · Moderator-Tools-Team, Thai-Sites, OKR-Work, Machine-Learning-Team, MediaWiki-extensions-ORES, PersonalDashboard, MediaWiki-Recent-changes

Nov 25 2025

gkyziridis updated the title for P85517 [Locust Tests] RevertRisk - Wikidata using staging configuration (Autoscaling is off) from [Load Tests] Revert-Risk WikiData to [Locust Tests] RevertRisk - Wikidata using staging configuration (Autoscaling is off).
Nov 25 2025, 2:12 PM · Machine-Learning-Team, ml-model-requests
gkyziridis added a comment to T409388: Test liftwing wikidata revert risk API for scale and latency.

The revertrisk-wikidata inference service production endpoint uses similar scaling configs that other revertrisk inference-services use: https://github.com/wikimedia/operations-deployment-charts/blob/8412fc655d3b1e10b38cf0c954d910b820e93a05/helmfile.d/ml-services/revertrisk/values.yaml#L145-L150

IMO the prod endpoint should scale well unless results from the WME folks say otherwise.

Nov 25 2025, 2:11 PM · Wikimedia-Enterprise-Kanban-On-Call, Machine-Learning-Team
gkyziridis created P85616 [Locust Tests] RevertRisk - Wikidata using prod configuration.
Nov 25 2025, 2:06 PM · ml-model-requests, Machine-Learning-Team

Nov 24 2025

gkyziridis added a comment to T400423: Q1 FY2025-26 Goal: Enable volunteer evaluation of Tone Check model in additional languages.

Progress update on the hypothesis for the week, including if something has shipped:

Nov 24 2025, 2:48 PM · OKR-Work, Goal, Machine-Learning-Team
gkyziridis added a comment to T409388: Test liftwing wikidata revert risk API for scale and latency.

Hey @kevinbazira thank very much for running the loading tests for Revert-Risk wikidata.
I think we should change a little bit the configuration in order to simulate a more realistic scenario close to reality.
We also need to run heavier tests spawning more users in order to check our API's capacity and capability to handle maximum RPS.
I ran three different locust tests with heavier configuration, you can see the results in the following phab paste:

1# 500 users | 5 per second
2$ MODEL_LOCUST_DIR="revertrisk_wikidata" make run-locust-test
3[2025-11-24 13:19:16,836] stat1010/INFO/locust.main: Run time limit set to 120 seconds
4[2025-11-24 13:19:16,837] stat1010/INFO/locust.main: Starting Locust 2.31.5
5[2025-11-24 13:19:16,837] stat1010/INFO/locust.runners: Ramping to 500 users at a rate of 5.00 per second
6[2025-11-24 13:20:55,994] stat1010/INFO/locust.runners: All users spawned: {"RevertriskWikidata": 500} (500 total users)
7[2025-11-24 13:21:16,348] stat1010/INFO/locust.main: --run-time limit reached, shutting down
8Load test results are within the threshold
9[2025-11-24 13:21:16,556] stat1010/INFO/locust.main: Shutting down (exit code 1)
10Type Name # reqs # fails | Avg Min Max Med | req/s failures/s
11--------|----------------------------------------------------------------------------|-------|-------------|-------|-------|-------|-------|--------|-----------
12POST /v1/models/revertrisk-wikidata:predict 1202 33(2.75%) | 18076 472 46826 12000 | 10.05 0.28
13--------|----------------------------------------------------------------------------|-------|-------------|-------|-------|-------|-------|--------|-----------
14 Aggregated 1202 33(2.75%) | 18076 472 46826 12000 | 10.05 0.28
15
16Response time percentiles (approximated)
17Type Name 50% 66% 75% 80% 90% 95% 98% 99% 99.9% 99.99% 100% # reqs
18--------|--------------------------------------------------------------------------------|--------|------|------|------|------|------|------|------|------|------|------|------
19POST /v1/models/revertrisk-wikidata:predict 12000 26000 32000 35000 41000 42000 44000 45000 47000 47000 47000 1202
20--------|--------------------------------------------------------------------------------|--------|------|------|------|------|------|------|------|------|------|------|------
21 Aggregated 12000 26000 32000 35000 41000 42000 44000 45000 47000 47000 47000 1202
22
23Error report
24# occurrences Error
25------------------|---------------------------------------------------------------------------------------------------------------------------------------------
2633 POST /v1/models/revertrisk-wikidata:predict: BadStatusCode('https://inference-staging.svc.codfw.wmnet:30443/v1/models/revertrisk-wikidata:predict', code=502)
27------------------|---------------------------------------------------------------------------------------------------------------------------------------------
28
29+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
30
31# 500 users | 2 per second
32$ MODEL_LOCUST_DIR="revertrisk_wikidata" make run-locust-test
33[2025-11-24 13:13:03,964] stat1010/INFO/locust.main: Run time limit set to 120 seconds
34[2025-11-24 13:13:03,964] stat1010/INFO/locust.main: Starting Locust 2.31.5
35[2025-11-24 13:13:03,965] stat1010/INFO/locust.runners: Ramping to 500 users at a rate of 2.00 per second
36[2025-11-24 13:15:03,496] stat1010/INFO/locust.main: --run-time limit reached, shutting down
37Load test results are within the threshold
38[2025-11-24 13:15:03,651] stat1010/INFO/locust.main: Shutting down (exit code 1)
39Type Name # reqs # fails | Avg Min Max Med | req/s failures/s
40--------|----------------------------------------------------------------------------|-------|-------------|-------|-------|-------|-------|--------|-----------
41POST /v1/models/revertrisk-wikidata:predict 879 9(1.02%) | 10939 474 25179 11000 | 7.35 0.08
42--------|----------------------------------------------------------------------------|-------|-------------|-------|-------|-------|-------|--------|-----------
43 Aggregated 879 9(1.02%) | 10939 474 25179 11000 | 7.35 0.08
44
45Response time percentiles (approximated)
46Type Name 50% 66% 75% 80% 90% 95% 98% 99% 99.9% 99.99% 100% # reqs
47--------|--------------------------------------------------------------------------------|--------|------|------|------|------|------|------|------|------|------|------|------
48POST /v1/models/revertrisk-wikidata:predict 11000 14000 16000 17000 20000 21000 23000 24000 25000 25000 25000 879
49--------|--------------------------------------------------------------------------------|--------|------|------|------|------|------|------|------|------|------|------|------
50 Aggregated 11000 14000 16000 17000 20000 21000 23000 24000 25000 25000 25000 879
51
52Error report
53# occurrences Error
54------------------|---------------------------------------------------------------------------------------------------------------------------------------------
559 POST /v1/models/revertrisk-wikidata:predict: BadStatusCode('https://inference-staging.svc.codfw.wmnet:30443/v1/models/revertrisk-wikidata:predict', code=502)
56------------------|---------------------------------------------------------------------------------------------------------------------------------------------
57
58+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
59
60# 100 users | 5 per second
61$ MODEL_LOCUST_DIR="revertrisk_wikidata" make run-locust-test
62[2025-11-24 13:26:48,568] stat1010/INFO/locust.main: Run time limit set to 120 seconds
63[2025-11-24 13:26:48,568] stat1010/INFO/locust.main: Starting Locust 2.31.5
64[2025-11-24 13:26:48,569] stat1010/INFO/locust.runners: Ramping to 100 users at a rate of 5.00 per second
65[2025-11-24 13:27:07,640] stat1010/INFO/locust.runners: All users spawned: {"RevertriskWikidata": 100} (100 total users)
66[2025-11-24 13:28:48,102] stat1010/INFO/locust.main: --run-time limit reached, shutting down
67Load test results are within the threshold
68[2025-11-24 13:28:48,215] stat1010/INFO/locust.main: Shutting down (exit code 1)
69Type Name # reqs # fails | Avg Min Max Med | req/s failures/s
70--------|----------------------------------------------------------------------------|-------|-------------|-------|-------|-------|-------|--------|-----------
71POST /v1/models/revertrisk-wikidata:predict 1742 4(0.23%) | 3314 81 6776 3400 | 14.58 0.03
72--------|----------------------------------------------------------------------------|-------|-------------|-------|-------|-------|-------|--------|-----------
73 Aggregated 1742 4(0.23%) | 3314 81 6776 3400 | 14.58 0.03
74
75Response time percentiles (approximated)
76Type Name 50% 66% 75% 80% 90% 95% 98% 99% 99.9% 99.99% 100% # reqs
77--------|--------------------------------------------------------------------------------|--------|------|------|------|------|------|------|------|------|------|------|------
78POST /v1/models/revertrisk-wikidata:predict 3400 3800 4000 4200 4600 4900 5400 5700 6500 6800 6800 1742
79--------|--------------------------------------------------------------------------------|--------|------|------|------|------|------|------|------|------|------|------|------
80 Aggregated 3400 3800 4000 4200 4600 4900 5400 5700 6500 6800 6800 1742
81
82Error report
83# occurrences Error
84------------------|---------------------------------------------------------------------------------------------------------------------------------------------
854 POST /v1/models/revertrisk-wikidata:predict: BadStatusCode('https://inference-staging.svc.codfw.wmnet:30443/v1/models/revertrisk-wikidata:predict', code=502)
86------------------|---------------------------------------------------------------------------------------------------------------------------------------------

Nov 24 2025, 2:04 PM · Wikimedia-Enterprise-Kanban-On-Call, Machine-Learning-Team
gkyziridis edited P85517 [Locust Tests] RevertRisk - Wikidata using staging configuration (Autoscaling is off).
Nov 24 2025, 1:37 PM · Machine-Learning-Team, ml-model-requests
gkyziridis created P85517 [Locust Tests] RevertRisk - Wikidata using staging configuration (Autoscaling is off).
Nov 24 2025, 1:30 PM · Machine-Learning-Team, ml-model-requests

Nov 20 2025

gkyziridis added a comment to T407155: [SPIKE] Define process for validating Tone Check model eval data for languages staff members do not speak.

For the dewiki we had spotted an issue which is described here: T407155#11311194 regarding many english samples used for training the model in dewiki. In order to overcome this, I used translation only where the english samples exists inside the dewiki dataset.

Nov 20 2025, 3:02 PM · Machine-Learning-Team, EditCheck, VisualEditor

Nov 17 2025

gkyziridis added a comment to T407155: [SPIKE] Define process for validating Tone Check model eval data for languages staff members do not speak.

You can try tweaking the filters in the notebook, such as loosening the diff size conditions, expanding the revert time periods, or asking the community for more signals if possible.

Nov 17 2025, 4:38 PM · Machine-Learning-Team, EditCheck, VisualEditor

Nov 14 2025

gkyziridis added a project to T409931: Fix logging on Revertrisk: Essential-Work.
Nov 14 2025, 10:30 AM · Essential-Work, Machine-Learning-Team

Nov 12 2025

gkyziridis created T409931: Fix logging on Revertrisk.
Nov 12 2025, 1:54 PM · Essential-Work, Machine-Learning-Team
gkyziridis added a comment to T409657: Revertrisk multilingual predictor returning 500s.
Update

The issue I am facing for reproducing the error is that we are logging the incoming request if it is successful (status code 200), but we do not log it if is not.
We need somehow to log it immediately after we receive it in order to reproduce it.
I will open a ticket for upgrading the logging on the model server side: https://phabricator.wikimedia.org/T409931

Nov 12 2025, 1:04 PM · Essential-Work, Machine-Learning-Team

Nov 10 2025

gkyziridis added a comment to T409657: Revertrisk multilingual predictor returning 500s.

@gkyziridis I am not 100% sure if the rev-id in the task's description is the problematic one, I thought it was when checking the logs but you may need to review /home/elukey/T409657 on deploy2002 to get other testing samples :(

Nov 10 2025, 2:49 PM · Essential-Work, Machine-Learning-Team
gkyziridis added a comment to T409657: Revertrisk multilingual predictor returning 500s.
# Request
$ curl -i -X \
POST localhost:8080/v1/models/revertrisk-multilingual:predict \
-d '{"lang": "ru", "rev_id": 149673768}'
Nov 10 2025, 2:29 PM · Essential-Work, Machine-Learning-Team
gkyziridis moved T408607: AI/ML Infrastructure Request: Assistance in Rolling out Revert Risk to wikis that don't have damaging/goodfaith models from In Progress to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Nov 10 2025, 1:08 PM · Patch-For-Review, MediaWiki-Recent-changes, PersonalDashboard, Moderator-Tools-Team, Machine-Learning-Team
gkyziridis added a project to T409657: Revertrisk multilingual predictor returning 500s: Essential-Work.
Nov 10 2025, 1:08 PM · Essential-Work, Machine-Learning-Team
gkyziridis changed the visibility for T409657: Revertrisk multilingual predictor returning 500s.
Nov 10 2025, 10:21 AM · Essential-Work, Machine-Learning-Team

Nov 6 2025

gkyziridis added a comment to T408607: AI/ML Infrastructure Request: Assistance in Rolling out Revert Risk to wikis that don't have damaging/goodfaith models.

Optimal Thresholds Table

Nov 6 2025, 12:26 PM · Patch-For-Review, MediaWiki-Recent-changes, PersonalDashboard, Moderator-Tools-Team, Machine-Learning-Team
gkyziridis edited P84751 Revert Risk Threshold Analysis Results.
Nov 6 2025, 10:26 AM

Nov 5 2025

gkyziridis edited P84751 Revert Risk Threshold Analysis Results.
Nov 5 2025, 3:06 PM
gkyziridis edited P84751 Revert Risk Threshold Analysis Results.
Nov 5 2025, 1:43 PM
gkyziridis edited P84751 Revert Risk Threshold Analysis Results.
Nov 5 2025, 12:56 PM
gkyziridis edited P84751 Revert Risk Threshold Analysis Results.
Nov 5 2025, 11:39 AM