achou (AikoChou)
Machine Learning Engineer

User Details

User Since
Feb 15 2022, 2:51 PM (217 w, 2 d)
Availability
Available
IRC Nick
aiko
LDAP User
Unknown
MediaWiki User
AChou-WMF

Recent Activity

Wed, Apr 15

achou added a comment to T416384: Reduce logstash logs from machine learning infra.

I dug deeper into the logs from kserve-container and identified a few opportunities to reduce log volume.

Wed, Apr 15, 7:39 AM · Patch-For-Review, Machine-Learning-Team (Q4 FY2025-26)

Fri, Apr 10

achou moved T422812: Upgrade edit-check to kserve==0.17 from Backlog to In Progress on the Machine-Learning-Team (Q4 FY2025-26) board.
Fri, Apr 10, 3:17 PM · Patch-For-Review, Machine-Learning-Team (Q4 FY2025-26)
achou moved T416384: Reduce logstash logs from machine learning infra from Backlog to In Progress on the Machine-Learning-Team (Q4 FY2025-26) board.
Fri, Apr 10, 3:17 PM · Patch-For-Review, Machine-Learning-Team (Q4 FY2025-26)
achou claimed T416384: Reduce logstash logs from machine learning infra.
Fri, Apr 10, 3:17 PM · Patch-For-Review, Machine-Learning-Team (Q4 FY2025-26)
achou added a comment to T416384: Reduce logstash logs from machine learning infra.
  1. The logs are not emitted as JSON or ECS, so errors such as Python stack traces produce one log entry per line. That is wasteful on the Logstash side, and also not great for human readers who need to investigate an outage the day after. If the logs are no longer on the pods because of rotation, getting a complete stack trace from Logstash is really tedious.
  2. We have a mixture of kserve traces, uvicorn access logs, latency timings, etc. Do we need all of them?
Fri, Apr 10, 3:13 PM · Patch-For-Review, Machine-Learning-Team (Q4 FY2025-26)
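
The first point in the T416384 comment above (one Logstash entry per stack-trace line) can be illustrated with a minimal sketch using only Python's standard `logging` module. This is not the formatter used by the inference services; the logger name and the JSON field names (`level`, `logger`, `message`, `stack_trace`) are assumptions chosen for illustration and do not follow the ECS schema exactly.

```python
import io
import json
import logging


class JsonFormatter(logging.Formatter):
    """Serialize each record, including any traceback, as one JSON line."""

    def format(self, record: logging.LogRecord) -> str:
        payload = {
            "level": record.levelname,
            "logger": record.name,
            "message": record.getMessage(),
        }
        if record.exc_info:
            # formatException returns the multi-line traceback as one string;
            # json.dumps escapes its newlines, so the entry stays on one line.
            payload["stack_trace"] = self.formatException(record.exc_info)
        return json.dumps(payload)


def build_logger(stream) -> logging.Logger:
    # "example-model-server" is a hypothetical logger name.
    logger = logging.getLogger("example-model-server")
    logger.handlers.clear()
    logger.propagate = False
    handler = logging.StreamHandler(stream)
    handler.setFormatter(JsonFormatter())
    logger.addHandler(handler)
    logger.setLevel(logging.INFO)
    return logger


def demo() -> str:
    """Log one exception and return everything written to the stream."""
    buffer = io.StringIO()
    logger = build_logger(buffer)
    try:
        1 / 0
    except ZeroDivisionError:
        logger.exception("prediction failed")
    return buffer.getvalue()
```

With a formatter like this, a whole traceback travels as a single log entry, so it can be read back in one piece even after the pod logs have rotated.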
achou updated subscribers of T419455: Generate OpenAPI descriptions for Lift Wing APIs.
Fri, Apr 10, 10:05 AM · Lift-Wing, Machine-Learning-Team

Thu, Apr 9

achou added a comment to T422812: Upgrade edit-check to kserve==0.17.

Completed local validation for edit-check after upgrading to kserve==0.17.

Thu, Apr 9, 2:25 PM · Patch-For-Review, Machine-Learning-Team (Q4 FY2025-26)
achou added a comment to T422812: Upgrade edit-check to kserve==0.17.

During local validation after the kserve==0.17 upgrade, Edit-Check hit a startup regression.

Thu, Apr 9, 2:17 PM · Patch-For-Review, Machine-Learning-Team (Q4 FY2025-26)
achou added a subtask for T422591: Update kserve Python package to 0.17 across all inference services: T422812: Upgrade edit-check to kserve==0.17.
Thu, Apr 9, 12:00 PM · Machine-Learning-Team (Q4 FY2025-26), Lift-Wing
achou added a parent task for T422812: Upgrade edit-check to kserve==0.17: T422591: Update kserve Python package to 0.17 across all inference services.
Thu, Apr 9, 12:00 PM · Patch-For-Review, Machine-Learning-Team (Q4 FY2025-26)
achou created T422812: Upgrade edit-check to kserve==0.17.
Thu, Apr 9, 12:00 PM · Patch-For-Review, Machine-Learning-Team (Q4 FY2025-26)
achou added a comment to T422797: Upgrade revise-tone-task-generator to kserve==0.17.

Completed local upgrade and validation work for revise-tone-task-generator as part of the KServe 0.17 migration.

Thu, Apr 9, 11:47 AM · Machine-Learning-Team (Q4 FY2025-26)
achou moved T422797: Upgrade revise-tone-task-generator to kserve==0.17 from Backlog to In Progress on the Machine-Learning-Team (Q4 FY2025-26) board.
Thu, Apr 9, 9:32 AM · Machine-Learning-Team (Q4 FY2025-26)
achou added a subtask for T422591: Update kserve Python package to 0.17 across all inference services: T422797: Upgrade revise-tone-task-generator to kserve==0.17.
Thu, Apr 9, 9:31 AM · Machine-Learning-Team (Q4 FY2025-26), Lift-Wing
achou added a parent task for T422797: Upgrade revise-tone-task-generator to kserve==0.17: T422591: Update kserve Python package to 0.17 across all inference services.
Thu, Apr 9, 9:31 AM · Machine-Learning-Team (Q4 FY2025-26)
achou created T422797: Upgrade revise-tone-task-generator to kserve==0.17.
Thu, Apr 9, 9:31 AM · Machine-Learning-Team (Q4 FY2025-26)

Wed, Apr 8

achou closed T418102: Edit Suggestions - Edit suggestion generation with pre-defined edit types, a subtask of T409863: Q2 FY2025-26 Goal: Generate a list of edit suggestions using machine learning, as Resolved.
Wed, Apr 8, 12:13 PM · OKR-Work, Goal, Machine-Learning-Team
achou closed T418102: Edit Suggestions - Edit suggestion generation with pre-defined edit types as Resolved.

This task is complete. We'll open new tasks for the follow-on hypotheses.

Wed, Apr 8, 12:13 PM · Machine-Learning-Team (Q4 FY2025-26)

Wed, Apr 1

achou added a comment to T418102: Edit Suggestions - Edit suggestion generation with pre-defined edit types.

Possible Extensions — Generation

Wed, Apr 1, 10:37 AM · Machine-Learning-Team (Q4 FY2025-26)

Mon, Mar 30

achou added a comment to T418102: Edit Suggestions - Edit suggestion generation with pre-defined edit types.

Edit Suggestions Experiment — Progress Update

Mon, Mar 30, 10:57 AM · Machine-Learning-Team (Q4 FY2025-26)

Tue, Mar 24

achou added a comment to T415892: Add Multilingual RevertRisk predictions to mediawiki.page_revert_risk_prediction_change.

@gkyziridis quick follow-up: what's the current status of this task? I recall we verified it works on staging. Is there anything left to do before we move it to production?

Tue, Mar 24, 3:08 PM · Machine-Learning-Team (Q4 FY2025-26), Data-Engineering-Radar, Event-Platform, Data-Engineering
achou added a comment to T418102: Edit Suggestions - Edit suggestion generation with pre-defined edit types.

Key observations from model outputs (prompt, results)

Tue, Mar 24, 12:25 PM · Machine-Learning-Team (Q4 FY2025-26)

Wed, Mar 18

achou added a comment to T418102: Edit Suggestions - Edit suggestion generation with pre-defined edit types.

Experiment Plan

  1. Local Experiments
    • Use a smaller model.
    • Run on a curated set of articles (sampled across each pa_class and main_topic).
    • Manually review outputs to identify patterns where the model produces incorrect or low-quality suggestions
  2. Lab Experiments
    • Use a larger model for broader evaluation.
    • Run on the full dataset.
    • Review outputs:
      • Targeted review: Re-run the same selected articles from local experiments to evaluate whether issues are mitigated with a larger model
      • Random sampling: Identify any new incorrect patterns not observed during local experiments
  3. We want to answer these questions:
    • What is the risk of generating this incorrect suggestion?
    • How would incorrect suggestions impact the user experience?
  4. Handling Incorrect Suggestions (for future iterations)
    • Model-side mitigations:
      • Instruct the model to:
        • Skip uncertain cases
        • Perform self-verification / double-checking (to reduce hallucinations)
    • Content-side mitigations:
      • Improve content formatting where issues stem from poor or inconsistent article structure
  5. Content Scope:
    • Focus on pure prose content: Exclude templates, structured markup, links, references, etc.
    • To be explored in future iterations:
      • Mathematical / scientific representations
      • Tables
      • References
      • Section structure and formatting
Wed, Mar 18, 11:23 AM · Machine-Learning-Team (Q4 FY2025-26)
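
The "curated set of articles (sampled across each pa_class and main_topic)" step in the plan above could be approached with stratified sampling. The following is a rough sketch under assumptions: the article records are plain dicts, the `page_id` field and the per-stratum sample size are hypothetical, and only `pa_class` and `main_topic` come from the plan itself.

```python
import random
from collections import defaultdict


def stratified_sample(articles, per_stratum, seed=0):
    """Pick up to `per_stratum` articles for each (pa_class, main_topic) pair."""
    rng = random.Random(seed)  # fixed seed so the curated set is reproducible
    strata = defaultdict(list)
    for article in articles:
        strata[(article["pa_class"], article["main_topic"])].append(article)

    sample = []
    for key in sorted(strata):  # sorted for a stable iteration order
        group = strata[key]
        sample.extend(rng.sample(group, min(per_stratum, len(group))))
    return sample
```

Keeping the seed fixed means the same articles can be re-run later, which matches the "targeted review" step of re-running the locally selected articles with a larger model.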

Mar 17 2026

achou moved T418102: Edit Suggestions - Edit suggestion generation with pre-defined edit types from Ready To Go to In Progress on the Machine-Learning-Team board.
Mar 17 2026, 12:26 PM · Machine-Learning-Team (Q4 FY2025-26)
achou claimed T418102: Edit Suggestions - Edit suggestion generation with pre-defined edit types.
Mar 17 2026, 12:26 PM · Machine-Learning-Team (Q4 FY2025-26)

Mar 12 2026

achou created P89848 test_event.json.
Mar 12 2026, 2:03 PM

Mar 11 2026

achou moved T418976: Migrate embeddings inference service from HF Transformers+CK FlashAttention to vLLM+AITER from Unsorted to In Progress on the Machine-Learning-Team board.
Mar 11 2026, 11:38 AM · OKR-Work, Semantic Search, Machine-Learning-Team
achou moved T418493: Integrate Article Topic model with the new caching service from Unsorted to In Progress on the Machine-Learning-Team board.
Mar 11 2026, 11:36 AM · Machine-Learning-Team (Q4 FY2025-26), Patch-For-Review, OKR-Work
achou moved T419235: Fix revertrisk Pyrra SLO from Unsorted to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Mar 11 2026, 11:31 AM · Machine-Learning-Team
achou closed T419235: Fix revertrisk Pyrra SLO, a subtask of T398948: Q1 FY2025-26 Goal: Operational Excellence - LiftWing Platform Updates & Improvements, as Resolved.
Mar 11 2026, 11:31 AM · Goal, Machine-Learning-Team
achou closed T419235: Fix revertrisk Pyrra SLO as Resolved.
Mar 11 2026, 11:31 AM · Machine-Learning-Team
achou moved T419071: PHP 8.5 CI failure in ORES: "Using null as an array offset is deprecated, use an empty string instead" from Unsorted to Watching on the Machine-Learning-Team board.
Mar 11 2026, 11:23 AM · MW-1.46-notes (1.46.0-wmf.19; 2026-03-10), Machine-Learning-Team, ORES, PHP 8.5 support
achou moved T413081: Review draft research roadmap for AI stratgy for editors from Unsorted to Watching on the Machine-Learning-Team board.
Mar 11 2026, 11:23 AM · Machine-Learning-Team, Research
achou moved T414485: Upgrade ML clusters to kubernetes 1.31 from Unsorted to In Progress on the Machine-Learning-Team board.
Mar 11 2026, 11:21 AM · ServiceOps new, Machine-Learning-Team, Kubernetes, Prod-Kubernetes
achou added a parent task for T419040: kserve helm status is broken across ml clusters: T418722: Incident: 2026-02-23 ml-serve.
Mar 11 2026, 11:20 AM · Patch-For-Review, Machine-Learning-Team
achou added a subtask for T418722: Incident: 2026-02-23 ml-serve: T419040: kserve helm status is broken across ml clusters.
Mar 11 2026, 11:20 AM · Machine-Learning-Team
achou moved T418202: ORES API query is slow from Unsorted to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Mar 11 2026, 11:19 AM · Machine-Learning-Team, Regression, ORES
achou moved T418173: LiftWing edit-check:predict model is 404ing from Unsorted to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Mar 11 2026, 11:19 AM · OKR-Work, Editing-team (Editing-18Feb-27Feb-2026), MW-1.46-notes (1.46.0-wmf.16; 2026-02-17), Growth-Team, Machine-Learning-Team, Revise-Tone-Structured-Task, EditCheck, VisualEditor
achou moved T418223: ORES/LiftWing infrastructure is not working for filtering Recent Changes edits from In Progress to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Mar 11 2026, 11:19 AM · Regression, Moderator-Tools-Team, MediaWiki-Recent-changes, Russian-Sites, MediaWiki-extensions-ORES, Machine-Learning-Team
achou moved T418223: ORES/LiftWing infrastructure is not working for filtering Recent Changes edits from Unsorted to In Progress on the Machine-Learning-Team board.
Mar 11 2026, 11:18 AM · Regression, Moderator-Tools-Team, MediaWiki-Recent-changes, Russian-Sites, MediaWiki-extensions-ORES, Machine-Learning-Team
achou moved T418722: Incident: 2026-02-23 ml-serve from Unsorted to In Progress on the Machine-Learning-Team board.
Mar 11 2026, 11:18 AM · Machine-Learning-Team
achou added a parent task for T418173: LiftWing edit-check:predict model is 404ing: T418722: Incident: 2026-02-23 ml-serve.
Mar 11 2026, 11:17 AM · OKR-Work, Editing-team (Editing-18Feb-27Feb-2026), MW-1.46-notes (1.46.0-wmf.16; 2026-02-17), Growth-Team, Machine-Learning-Team, Revise-Tone-Structured-Task, EditCheck, VisualEditor
achou added a subtask for T418722: Incident: 2026-02-23 ml-serve: T418173: LiftWing edit-check:predict model is 404ing.
Mar 11 2026, 11:17 AM · Machine-Learning-Team
achou added a subtask for T418722: Incident: 2026-02-23 ml-serve: T418202: ORES API query is slow.
Mar 11 2026, 11:16 AM · Machine-Learning-Team
achou added a parent task for T418202: ORES API query is slow: T418722: Incident: 2026-02-23 ml-serve.
Mar 11 2026, 11:16 AM · Machine-Learning-Team, Regression, ORES
achou added a subtask for T418722: Incident: 2026-02-23 ml-serve: T418223: ORES/LiftWing infrastructure is not working for filtering Recent Changes edits.
Mar 11 2026, 11:16 AM · Machine-Learning-Team
achou added a parent task for T418223: ORES/LiftWing infrastructure is not working for filtering Recent Changes edits: T418722: Incident: 2026-02-23 ml-serve.
Mar 11 2026, 11:16 AM · Regression, Moderator-Tools-Team, MediaWiki-Recent-changes, Russian-Sites, MediaWiki-extensions-ORES, Machine-Learning-Team
achou moved T419527: Increase batch size in edit-check service from Unsorted to In Progress on the Machine-Learning-Team board.
Mar 11 2026, 11:10 AM · Editing-team (Tracking), OKR-Work (WE1 FY2025-26), ml-model-requests, Lift-Wing, Machine-Learning-Team

Mar 2 2026

achou added a comment to T417794: Emit article quality predictions as a stream and expose in EventStreams API..

We likely need a new event schema for this use case. The schema that Lift Wing has been using assumes classification outputs only. The article quality model, however, produces a continuous score between 0 and 1. We can also return a derived label and additional computed features. Examples:

Mar 2 2026, 1:13 PM · Event-Platform, Machine-Learning-Team, Data-Engineering

Feb 20 2026

achou updated subscribers of T416384: Reduce logstash logs from machine learning infra.

I can see that most logs have "kubernetes.pod_name" values like:
"controller-..."
"webhook-..."
"kserve-controller-manager-..."
"istio-ingressgateway-..."
These are system logs generated by kserve, which seem to be the majority.
@DPogorzelski-WMF Is there a way to reduce the system logs from kserve?
@elukey sorry to ping you, but maybe you have some insights here

Attached screenshots: pod-name-1.png, pod-name-2.png, pod-name-3.png

Feb 20 2026, 11:17 AM · Patch-For-Review, Machine-Learning-Team (Q4 FY2025-26)
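
One way to quantify the observation in the comment above is to bucket log entries by pod-name prefix. This is only a sketch: the four prefixes are the ones quoted in the comment, while the function name, the "system"/"other" labels, and the assumption that pod names are available as plain strings are illustrative.

```python
from collections import Counter

# Prefixes quoted in the comment; everything else is counted as "other".
SYSTEM_PREFIXES = (
    "controller-",
    "webhook-",
    "kserve-controller-manager-",
    "istio-ingressgateway-",
)


def count_by_origin(pod_names):
    """Count log entries from system pods versus all other pods."""
    counts = Counter()
    for name in pod_names:
        # str.startswith accepts a tuple and matches any of its prefixes.
        origin = "system" if name.startswith(SYSTEM_PREFIXES) else "other"
        counts[origin] += 1
    return counts
```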
achou added a comment to T415892: Add Multilingual RevertRisk predictions to mediawiki.page_revert_risk_prediction_change.

How does this sound folks?

@gkyziridis Sounds good to me! :)

EVENTGATE_STREAM=mediawiki.page_revert_risk_multilingual_prediction_change.v1, this will separate the stream, right?

In addition to this, we'll need to create a mediawiki-config change like https://gerrit.wikimedia.org/r/c/operations/mediawiki-config/+/1133603 (this is for adding the RRLA stream).
And please test the whole workflow in staging changeprop + staging Lift Wing first before moving to production.

Feb 20 2026, 10:18 AM · Machine-Learning-Team (Q4 FY2025-26), Data-Engineering-Radar, Event-Platform, Data-Engineering

Feb 3 2026

achou added a comment to T415892: Add Multilingual RevertRisk predictions to mediawiki.page_revert_risk_prediction_change.

Answering @gkyziridis's questions:

… what kind of optimization do you have in mind?

I meant optimizing for latency and throughput, since the model server will need to handle every new edit once we produce rr-multilingual predictions to an event stream. The source is the page_change event stream (every Wikipedia edit triggers a predict request), so the model server needs to be fast enough to keep up with the incoming edit rate.

Feb 3 2026, 10:49 AM · Machine-Learning-Team (Q4 FY2025-26), Data-Engineering-Radar, Event-Platform, Data-Engineering
achou updated subscribers of T412357: Install AMD GPU + torch version of ML Labs machines.

@Isaac: Kevin is currently working on the vLLM image for Lift Wing in T415627: Update WMF Debian vLLM image to support latest upstream software stack. He's been testing different versions and building the image on our ML-build machine, which will then be pushed to the WMF registry.

Feb 3 2026, 1:10 AM · Machine-Learning-Team

Feb 2 2026

achou added a comment to T415892: Add Multilingual RevertRisk predictions to mediawiki.page_revert_risk_prediction_change.

Should these be emitted into the same stream, or should we make a new stream for this?

I was thinking of using the same stream. When I proposed the name in T326179#10711809, my idea was to put all the predictions from revert-risk models (rr-language-agnostic, rr-multilingual, rr-wikidata) in one stream.

Feb 2 2026, 8:39 AM · Machine-Learning-Team (Q4 FY2025-26), Data-Engineering-Radar, Event-Platform, Data-Engineering

Jan 30 2026

achou closed T408341: Q1 FY2025-26 Goal: Task generation engine for Revise Tone task, a subtask of T396162: [EPIC] Revise Tone: Structured Task (WE1.1.2, FY25-26), as Resolved.
Jan 30 2026, 4:36 PM · GrowthExperiments-NewcomerTasks, OKR-Work (WE1 FY2025-26), Patch-For-Review, Revise-Tone-Structured-Task, Epic, EditCheck, Growth-Structured-Tasks, Growth-Team
achou closed T408341: Q1 FY2025-26 Goal: Task generation engine for Revise Tone task as Resolved.

The Revise Tone experiment launched on Monday, the 26th of January!

Jan 30 2026, 4:36 PM · OKR-Work, Goal, Machine-Learning-Team
achou moved T398974: Share code between Research & ML teams from In Progress to Watching on the Machine-Learning-Team board.
Jan 30 2026, 3:14 PM · Research (FY2025-26-Research-October-December), Machine-Learning-Team, Research-engineering
achou moved T412701: AbuseFilter: revertrisk_level should not trigger for non-mainspace edits from Unsorted to Watching on the Machine-Learning-Team board.
Jan 30 2026, 3:11 PM · Machine-Learning-Team, Product Safety and Integrity, AbuseFilter, ORES
achou moved T401021: Data Persistence Design Review: Improve Tone Suggested Edits newcomer task from In Progress to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Jan 30 2026, 3:09 PM · User-Eevans, Data-Engineering (Q3 FY25/26 January 1st - March 31th), Data-Persistence-Design-Review, Revise-Tone-Structured-Task, OKR-Work, Machine-Learning-Team, Growth-Team, Data-Persistence
achou closed T401021: Data Persistence Design Review: Improve Tone Suggested Edits newcomer task, a subtask of T392283: Q1 FY2025-26 Goal: Apply the Tone Check model to published articles, to learn whether we can build a pool of high-quality structured tasks for new editors, as Resolved.
Jan 30 2026, 3:09 PM · OKR-Work, Goal, Machine-Learning-Team
achou closed T401021: Data Persistence Design Review: Improve Tone Suggested Edits newcomer task as Resolved.

Resolved this task. Really appreciate all the input and collaboration from everyone. :)

Jan 30 2026, 3:09 PM · User-Eevans, Data-Engineering (Q3 FY25/26 January 1st - March 31th), Data-Persistence-Design-Review, Revise-Tone-Structured-Task, OKR-Work, Machine-Learning-Team, Growth-Team, Data-Persistence
achou closed T401021: Data Persistence Design Review: Improve Tone Suggested Edits newcomer task, a subtask of T408341: Q1 FY2025-26 Goal: Task generation engine for Revise Tone task, as Resolved.
Jan 30 2026, 3:08 PM · OKR-Work, Goal, Machine-Learning-Team
achou moved T414385: Export all current (wiki_id, page_id) data from ml_cache.page_paragraph_tone_scores (Cassandra) from Unsorted to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Jan 30 2026, 2:57 PM · User-Eevans, Growth-Team, Revise-Tone-Structured-Task, Machine-Learning-Team
achou moved T411897: Revise Tone: Exclude certain sections from Tone Recommendations from Unsorted to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Jan 30 2026, 2:56 PM · Machine-Learning-Team, Revise-Tone-Structured-Task, OKR-Work, Growth-Team
achou moved T411892: Revise Tone: Exclude direct quotes from Tone Recommendations from Unsorted to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Jan 30 2026, 2:56 PM · Machine-Learning-Team, Revise-Tone-Structured-Task, OKR-Work, EditCheck, Growth-Team
achou moved T412210: Use HTML instead of wikitext for Revise Tone Task Generator in LiftWing from In Progress to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Jan 30 2026, 2:56 PM · Growth-Team, Revise-Tone-Structured-Task, Machine-Learning-Team
achou closed T411897: Revise Tone: Exclude certain sections from Tone Recommendations, a subtask of T396162: [EPIC] Revise Tone: Structured Task (WE1.1.2, FY25-26), as Resolved.
Jan 30 2026, 2:55 PM · GrowthExperiments-NewcomerTasks, OKR-Work (WE1 FY2025-26), Patch-For-Review, Revise-Tone-Structured-Task, Epic, EditCheck, Growth-Structured-Tasks, Growth-Team
achou closed T411897: Revise Tone: Exclude certain sections from Tone Recommendations as Resolved.

This task has been resolved by T412210: Use HTML instead of wikitext for Revise Tone Task Generator in LiftWing

Jan 30 2026, 2:55 PM · Machine-Learning-Team, Revise-Tone-Structured-Task, OKR-Work, Growth-Team
achou closed T411892: Revise Tone: Exclude direct quotes from Tone Recommendations, a subtask of T396162: [EPIC] Revise Tone: Structured Task (WE1.1.2, FY25-26), as Resolved.
Jan 30 2026, 2:55 PM · GrowthExperiments-NewcomerTasks, OKR-Work (WE1 FY2025-26), Patch-For-Review, Revise-Tone-Structured-Task, Epic, EditCheck, Growth-Structured-Tasks, Growth-Team
achou closed T411892: Revise Tone: Exclude direct quotes from Tone Recommendations as Resolved.

This task has been resolved by T412210: Use HTML instead of wikitext for Revise Tone Task Generator in LiftWing

Jan 30 2026, 2:55 PM · Machine-Learning-Team, Revise-Tone-Structured-Task, OKR-Work, EditCheck, Growth-Team
achou added a comment to T401021: Data Persistence Design Review: Improve Tone Suggested Edits newcomer task.

@Ottomata Yes, we proceeded with Option D (more info in T409469). Btw, we moved from wikitext to HTML for the Revise Tone task generator in Lift Wing - now we fetch HTML content from the REST API instead of wikitext from the MediaWiki API (T412210).

Jan 30 2026, 2:48 PM · User-Eevans, Data-Engineering (Q3 FY25/26 January 1st - March 31th), Data-Persistence-Design-Review, Revise-Tone-Structured-Task, OKR-Work, Machine-Learning-Team, Growth-Team, Data-Persistence
achou closed T412210: Use HTML instead of wikitext for Revise Tone Task Generator in LiftWing, a subtask of T408341: Q1 FY2025-26 Goal: Task generation engine for Revise Tone task, as Resolved.
Jan 30 2026, 2:36 PM · OKR-Work, Goal, Machine-Learning-Team
achou closed T412210: Use HTML instead of wikitext for Revise Tone Task Generator in LiftWing as Resolved.
Jan 30 2026, 2:36 PM · Growth-Team, Revise-Tone-Structured-Task, Machine-Learning-Team

Jan 29 2026

achou created T415892: Add Multilingual RevertRisk predictions to mediawiki.page_revert_risk_prediction_change.
Jan 29 2026, 1:40 PM · Machine-Learning-Team (Q4 FY2025-26), Data-Engineering-Radar, Event-Platform, Data-Engineering

Jan 19 2026

achou added a comment to T408341: Q1 FY2025-26 Goal: Task generation engine for Revise Tone task.

Weekly Report

Jan 19 2026, 12:00 PM · OKR-Work, Goal, Machine-Learning-Team

Jan 12 2026

achou created P87410 Revise Tone: Cassandra connection (eqiad).
Jan 12 2026, 7:29 PM
achou created P87318 Revise Tone: asyncio.TimeoutError and TypeError from mwparserfromhtml.
Jan 12 2026, 10:39 AM
achou added a comment to T408341: Q1 FY2025-26 Goal: Task generation engine for Revise Tone task.

Weekly Report

Jan 12 2026, 10:08 AM · OKR-Work, Goal, Machine-Learning-Team

Jan 8 2026

achou added a parent task for T412210: Use HTML instead of wikitext for Revise Tone Task Generator in LiftWing: T408341: Q1 FY2025-26 Goal: Task generation engine for Revise Tone task.
Jan 8 2026, 4:39 PM · Growth-Team, Revise-Tone-Structured-Task, Machine-Learning-Team
achou added a subtask for T408341: Q1 FY2025-26 Goal: Task generation engine for Revise Tone task: T412210: Use HTML instead of wikitext for Revise Tone Task Generator in LiftWing.
Jan 8 2026, 4:39 PM · OKR-Work, Goal, Machine-Learning-Team

Jan 7 2026

achou added a comment to T412210: Use HTML instead of wikitext for Revise Tone Task Generator in LiftWing.

@Isaac Yes, that would be very helpful! I've +1'd the MR. :)

Jan 7 2026, 4:55 PM · Growth-Team, Revise-Tone-Structured-Task, Machine-Learning-Team

Jan 6 2026

achou added a comment to T412210: Use HTML instead of wikitext for Revise Tone Task Generator in LiftWing.

let me know if that fixes things? I checked one of your examples locally and that seemed to do the trick but will be good to have a second pair of eyes and more examples.

I tested it and it fixes the issue. Thanks for the quick fix :)

Jan 6 2026, 1:54 PM · Growth-Team, Revise-Tone-Structured-Task, Machine-Learning-Team

Dec 22 2025

achou moved T412210: Use HTML instead of wikitext for Revise Tone Task Generator in LiftWing from Unsorted to In Progress on the Machine-Learning-Team board.
Dec 22 2025, 4:55 PM · Growth-Team, Revise-Tone-Structured-Task, Machine-Learning-Team
achou claimed T412210: Use HTML instead of wikitext for Revise Tone Task Generator in LiftWing.
Dec 22 2025, 4:54 PM · Growth-Team, Revise-Tone-Structured-Task, Machine-Learning-Team
achou updated subscribers of T412210: Use HTML instead of wikitext for Revise Tone Task Generator in LiftWing.

For the issues we want to address with the HTML parser,

Dec 22 2025, 4:49 PM · Growth-Team, Revise-Tone-Structured-Task, Machine-Learning-Team
achou added a comment to T411897: Revise Tone: Exclude certain sections from Tone Recommendations.

By using the HTML parser's plaintext functionality and specifying elements to exclude, we should be able to filter out reference lists, external links, tables, infoboxes, data tables, and image captions. When I parsed the direct quote examples from the spreadsheet, the results show only text that appears in the main article prose.

Dec 22 2025, 3:50 PM · Machine-Learning-Team, Revise-Tone-Structured-Task, OKR-Work, Growth-Team
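
The idea in the comment above (extract plaintext while excluding specified elements) can be sketched with Python's standard `html.parser`. This is a stand-in for illustration, not the HTML parser library used in the task, and the excluded tag set below is an assumption rather than the actual configuration.

```python
from html.parser import HTMLParser

# Hypothetical exclusion list: tables, quotes, captions, reference markers.
EXCLUDED_TAGS = {"table", "blockquote", "figcaption", "sup"}
# Block-level tags after which a space is added to keep sentences apart.
BLOCK_TAGS = {"p", "div", "li", "h2", "h3"}


class ProseExtractor(HTMLParser):
    """Collect text that is not nested inside any excluded element."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0  # > 0 while inside an excluded element
        self._chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in EXCLUDED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in EXCLUDED_TAGS:
            if self._skip_depth > 0:
                self._skip_depth -= 1
        elif tag in BLOCK_TAGS and self._skip_depth == 0:
            self._chunks.append(" ")

    def handle_data(self, data):
        if self._skip_depth == 0:
            self._chunks.append(data)

    def text(self):
        # Collapse runs of whitespace left behind by removed elements.
        return " ".join("".join(self._chunks).split())


def extract_prose(html):
    parser = ProseExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()
```

A depth counter is used instead of a boolean so that nested excluded elements (for example a reference marker inside a table) do not re-enable text collection too early.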
achou added a comment to T411892: Revise Tone: Exclude direct quotes from Tone Recommendations.

Using an HTML parser, I parsed all the examples labeled "Tone issue in direct quote" from the spreadsheet "Revise Tone: Articles to feed the model". Overall, the results look very good.

Dec 22 2025, 3:38 PM · Machine-Learning-Team, Revise-Tone-Structured-Task, OKR-Work, EditCheck, Growth-Team

Dec 10 2025

achou created T412210: Use HTML instead of wikitext for Revise Tone Task Generator in LiftWing.
Dec 10 2025, 12:29 PM · Growth-Team, Revise-Tone-Structured-Task, Machine-Learning-Team

Dec 8 2025

achou closed T408538: Create a Revise Tone Task Generator in LiftWing as Resolved.
Dec 8 2025, 10:58 AM · Patch-For-Review, Machine-Learning-Team
achou closed T408538: Create a Revise Tone Task Generator in LiftWing, a subtask of T408341: Q1 FY2025-26 Goal: Task generation engine for Revise Tone task, as Resolved.
Dec 8 2025, 10:58 AM · OKR-Work, Goal, Machine-Learning-Team
achou moved T392283: Q1 FY2025-26 Goal: Apply the Tone Check model to published articles, to learn whether we can build a pool of high-quality structured tasks for new editors from Current Quarter Goals to 2025-2026 Q2 Done on the Machine-Learning-Team board.
Dec 8 2025, 10:58 AM · OKR-Work, Goal, Machine-Learning-Team
achou moved T408690: Move inference-services repo from Gerrit to GitLab from Unsorted to Ready To Go on the Machine-Learning-Team board.
Dec 8 2025, 10:56 AM · Machine-Learning-Team
achou moved T411485: Enable revert risk filters for first batch of wikis: < 1000 monthly edits from Unsorted to Watching on the Machine-Learning-Team board.
Dec 8 2025, 10:56 AM · Moderator-Tools-Team, Patch-For-Review, OKR-Work (WE1 FY2025-26), MediaWiki-Recent-changes, MediaWiki-extensions-ORES, Machine-Learning-Team
achou moved T411487: Enable revert risk filters for the second batch of wikis: > 1000 AND <= 2000 monthly edits from Unsorted to Watching on the Machine-Learning-Team board.
Dec 8 2025, 10:56 AM · Machine-Learning-Team, OKR-Work, MediaWiki-extensions-ORES, MediaWiki-Recent-changes, Moderator-Tools-Team
achou moved T411489: Enable revert risk filters for the third batch of wikis: > 2000 AND <= 5000 monthly edits from Unsorted to Watching on the Machine-Learning-Team board.
Dec 8 2025, 10:56 AM · OKR-Work, MediaWiki-extensions-ORES, MediaWiki-Recent-changes, Machine-Learning-Team, Moderator-Tools-Team
achou moved T411490: Enable revert risk filters for the fourth batch of wikis: > 5000 AND <= 10000 monthly edits from Unsorted to Watching on the Machine-Learning-Team board.
Dec 8 2025, 10:55 AM · Machine-Learning-Team, OKR-Work, MediaWiki-extensions-ORES, MediaWiki-Recent-changes, Moderator-Tools-Team
achou moved T411492: Enable revert risk filters for the fifth batch of wikis: > 10000 AND <= 30000 monthly edits from Unsorted to Watching on the Machine-Learning-Team board.
Dec 8 2025, 10:55 AM · Machine-Learning-Team, OKR-Work, MediaWiki-extensions-ORES, MediaWiki-Recent-changes, Moderator-Tools-Team
achou moved T411493: Enable revert risk filters for the sixth batch of wikis: > 30000 AND <= 70000 monthly edits from Unsorted to Watching on the Machine-Learning-Team board.
Dec 8 2025, 10:55 AM · Machine-Learning-Team, OKR-Work, MediaWiki-extensions-ORES, MediaWiki-Recent-changes, Moderator-Tools-Team
achou moved T411494: Enable revert risk filters for the seventh batch of wikis: > 70000 AND <= 150000 monthly edits from Unsorted to Watching on the Machine-Learning-Team board.
Dec 8 2025, 10:55 AM · Machine-Learning-Team, OKR-Work, MediaWiki-extensions-ORES, MediaWiki-Recent-changes, Moderator-Tools-Team
achou moved T411495: Enable revert risk filters for the eighth batch of wikis: > 150000 monthly edits from Unsorted to Watching on the Machine-Learning-Team board.
Dec 8 2025, 10:55 AM · OKR-Work, MediaWiki-extensions-ORES, MediaWiki-Recent-changes, Machine-Learning-Team, Moderator-Tools-Team