This is a parent task to capture the Q1 work by Applied Sciences (Research) related to Knowledge Integrity. It will capture prioritization decisions and major weekly updates related to tasks in this bucket from July - September 2025. More fine-grained updates and coordination will occur in the subtasks as appropriate. It follows the Q4 task (T391717).
Description
Description
Details
Details
- Due Date
- Sep 30 2025, 4:00 AM
Related Objects
Related Objects
- Mentioned Here
- T351225: Productionized Edit Types
T398482: develop a content diff index design plan
T393634: Research support for NPOV workstreams
T391717: [Q4 FY 24-25 Applied Science] Knowledge Integrity Research
T391940: FY2024-25 Q4 Goal: Productionize tone check model
T398071: Proof of Concept: RecSys for Patrollers
Event Timeline
Comment Actions
Weekly Update
- Tone Check edit check
- Discussing how to incorporate user feedback
- Designing the structured task
- Improving the Model card
Comment Actions
Weekly Update
- Moderator recommendation: No updates
- Tone Check edit check: Supporting ML and Product on designing structured task around tone check.
- Edit diffs: @fkaelin to update.
Comment Actions
Weekly update for edit diffs:
- Notebook for exploring query patterns.
- Switched to using edit types instead of tokenizing the added/removed words using the mwtokenizer. The structured output of edit-types is much preferable to imposing structure on the aggregated raw tokens. Running edit-types at scale is computationally challenging due the memory hungriness of the parsing library (mwparserfromhell). The approach to have a daily dag that appends to an iceberg table seems to work, likely because the suspected memory leaks don't have "enough time" to explode the job, as each batch only computes one day of data. This will be helpful for T351225.
Comment Actions
Weekly Update
- Tone Check edit check: Supporting team with the slide preparation for the staff meeting presentation.
- Moderator recommendation: Working on the experiment design.
- Split editors by user group and finding recommendation for specific group.
- This drastically decrease the number of users (and data sparsity)
Comment Actions
Weekly report
- Edit diffs:
- a prototype for querying editor history is deployed on cloud vps, with a design plan document.
- Tone check:
- Represented research at Tone Check product presentation at All Staff meeting
- Met with (part of) the Tone Check team to discuss how to incorporate user feedback from the future Tone Check structured task.
- We need to validate newcomers expertise before consider their feedback to retrain the model.
Comment Actions
Weekly report
- Moderator recommendation: Work in selecting the task to recommend and categorize them according to their predictability.
Comment Actions
Weekly Report
- Edit diffs: collected feedback about the prototype from product management.