Page MenuHomePhabricator

XiaoXiao-WMF
User

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Thursday

  • Clear sailing ahead.

User Details

User Since
Nov 27 2023, 7:13 PM (59 w, 7 h)
Availability
Available
LDAP User
Unknown
MediaWiki User
XiaoXiao-WMF [ Global Accounts ]

Recent Activity

Thu, Jan 9

XiaoXiao-WMF triaged T383361: Enable collection of JS features: fonts and canvas as High priority.
Thu, Jan 9, 7:10 PM · Research-engineering, Research (FY2024-25-Research-January-March), CheckUser, Trust and Safety Product Team, FY2024-25 WE4.2
XiaoXiao-WMF added a comment to T383361: Enable collection of JS features: fonts and canvas.

@kostajh please triage and adjust the ticket with deadlines etc...

Thu, Jan 9, 7:10 PM · Research-engineering, Research (FY2024-25-Research-January-March), CheckUser, Trust and Safety Product Team, FY2024-25 WE4.2
XiaoXiao-WMF created T383361: Enable collection of JS features: fonts and canvas.
Thu, Jan 9, 7:07 PM · Research-engineering, Research (FY2024-25-Research-January-March), CheckUser, Trust and Safety Product Team, FY2024-25 WE4.2

Wed, Jan 8

XiaoXiao-WMF added a project to T381031: WE4.2.10 Add more browser signals to client hints pipeline to generate unique device identifier : Research-engineering.
Wed, Jan 8, 7:47 PM · OKR-Work, Research-engineering, Research (FY2024-25-Research-January-March), CheckUser, Trust and Safety Product Team, FY2024-25 WE4.2

Tue, Jan 7

XiaoXiao-WMF assigned T383061: Algorithm creation to MunizaA.
Tue, Jan 7, 2:29 PM · FY2024-25 WE4.2, CheckUser, Trust and Safety Product Team

Mon, Jan 6

XiaoXiao-WMF changed the status of T381031: WE4.2.10 Add more browser signals to client hints pipeline to generate unique device identifier from Open to In Progress.
Mon, Jan 6, 9:28 PM · OKR-Work, Research-engineering, Research (FY2024-25-Research-January-March), CheckUser, Trust and Safety Product Team, FY2024-25 WE4.2
XiaoXiao-WMF changed the status of T383061: Algorithm creation, a subtask of T381031: WE4.2.10 Add more browser signals to client hints pipeline to generate unique device identifier , from Open to In Progress.
Mon, Jan 6, 9:26 PM · OKR-Work, Research-engineering, Research (FY2024-25-Research-January-March), CheckUser, Trust and Safety Product Team, FY2024-25 WE4.2
XiaoXiao-WMF changed the status of T383061: Algorithm creation from Open to In Progress.
Mon, Jan 6, 9:26 PM · FY2024-25 WE4.2, CheckUser, Trust and Safety Product Team
XiaoXiao-WMF changed the status of T383060: Dataset creation, a subtask of T381031: WE4.2.10 Add more browser signals to client hints pipeline to generate unique device identifier , from Open to In Progress.
Mon, Jan 6, 5:04 PM · OKR-Work, Research-engineering, Research (FY2024-25-Research-January-March), CheckUser, Trust and Safety Product Team, FY2024-25 WE4.2
XiaoXiao-WMF changed the status of T383060: Dataset creation from Open to In Progress.
Mon, Jan 6, 5:04 PM · CheckUser, Trust and Safety Product Team, FY2024-25 WE4.2
XiaoXiao-WMF updated the task description for T383060: Dataset creation.
Mon, Jan 6, 5:04 PM · CheckUser, Trust and Safety Product Team, FY2024-25 WE4.2
XiaoXiao-WMF assigned T383060: Dataset creation to fkaelin.
Mon, Jan 6, 4:24 PM · CheckUser, Trust and Safety Product Team, FY2024-25 WE4.2
XiaoXiao-WMF removed a project from T383060: Dataset creation: Research (FY2024-25-Research-January-March).
Mon, Jan 6, 4:17 PM · CheckUser, Trust and Safety Product Team, FY2024-25 WE4.2
XiaoXiao-WMF edited projects for T383061: Algorithm creation, added: FY2024-25 WE4.2; removed Research (FY2024-25-Research-January-March).
Mon, Jan 6, 4:16 PM · FY2024-25 WE4.2, CheckUser, Trust and Safety Product Team
XiaoXiao-WMF removed a project from T383061: Algorithm creation: FY2024-25 WE4.2.
Mon, Jan 6, 4:15 PM · FY2024-25 WE4.2, CheckUser, Trust and Safety Product Team
XiaoXiao-WMF updated the task description for T383061: Algorithm creation.
Mon, Jan 6, 3:08 PM · FY2024-25 WE4.2, CheckUser, Trust and Safety Product Team
XiaoXiao-WMF created T383061: Algorithm creation.
Mon, Jan 6, 3:06 PM · FY2024-25 WE4.2, CheckUser, Trust and Safety Product Team
XiaoXiao-WMF created T383060: Dataset creation.
Mon, Jan 6, 3:01 PM · CheckUser, Trust and Safety Product Team, FY2024-25 WE4.2

Dec 13 2024

XiaoXiao-WMF added a comment to T377498: Phase 2: Article categorization metrics, fine-tuning metrics, optimization tooling.

Stretch goals may not be completed by end of Q2 - will continue in Q3.

Dec 13 2024, 7:01 PM · Research-engineering, Research
XiaoXiao-WMF assigned T377498: Phase 2: Article categorization metrics, fine-tuning metrics, optimization tooling to MunizaA.
Dec 13 2024, 7:01 PM · Research-engineering, Research
XiaoXiao-WMF assigned T382070: Deploy pipeline under DSE namespace to fkaelin.
Dec 13 2024, 6:58 PM · Research-engineering, Research
XiaoXiao-WMF renamed T381031: WE4.2.10 Add more browser signals to client hints pipeline to generate unique device identifier from WE4.2.10 Add more browser signals to client hints pipeline to generate unique device identifier locality-sensitive hash to WE4.2.10 Add more browser signals to client hints pipeline to generate unique device identifier .
Dec 13 2024, 6:57 PM · OKR-Work, Research-engineering, Research (FY2024-25-Research-January-March), CheckUser, Trust and Safety Product Team, FY2024-25 WE4.2
XiaoXiao-WMF triaged T382070: Deploy pipeline under DSE namespace as High priority.
Dec 13 2024, 6:56 PM · Research-engineering, Research
XiaoXiao-WMF edited projects for T381031: WE4.2.10 Add more browser signals to client hints pipeline to generate unique device identifier , added: Research (FY2024-25-Research-January-March); removed Research.
Dec 13 2024, 6:55 PM · OKR-Work, Research-engineering, Research (FY2024-25-Research-January-March), CheckUser, Trust and Safety Product Team, FY2024-25 WE4.2
XiaoXiao-WMF edited projects for T382072: Offline pipelines, added: Research, Research-engineering; removed Research (FY2024-25-Research-January-March).
Dec 13 2024, 4:05 PM · Research-engineering, Research

Dec 12 2024

XiaoXiao-WMF created T382072: Offline pipelines.
Dec 12 2024, 2:36 PM · Research-engineering, Research
XiaoXiao-WMF created T382070: Deploy pipeline under DSE namespace.
Dec 12 2024, 2:25 PM · Research-engineering, Research
XiaoXiao-WMF edited projects for T382068: Relforge embedding experimentation, added: Research (FY2024-25-Research-January-March); removed Research.
Dec 12 2024, 2:14 PM · Research (FY2024-25-Research-January-March), Research-engineering
XiaoXiao-WMF created T382068: Relforge embedding experimentation.
Dec 12 2024, 2:14 PM · Research (FY2024-25-Research-January-March), Research-engineering

Dec 5 2024

XiaoXiao-WMF added a comment to T381031: WE4.2.10 Add more browser signals to client hints pipeline to generate unique device identifier .

Update ext.checkUser.clientHints to obtain list of fonts and generate a canvas fingerprint

Have we exhausted all avenues of passive fingerprinting? Canvas and font fingerprinting feel like a massive overreach in terms of violating a user's privacy in a way that a user cannot explicitly opt out of. (Outside of ceasing to edit Wikipedia)

Dec 5 2024, 6:17 PM · OKR-Work, Research-engineering, Research (FY2024-25-Research-January-March), CheckUser, Trust and Safety Product Team, FY2024-25 WE4.2

Dec 3 2024

XiaoXiao-WMF removed a project from T381031: WE4.2.10 Add more browser signals to client hints pipeline to generate unique device identifier : ml-model-requests.
Dec 3 2024, 1:50 PM · OKR-Work, Research-engineering, Research (FY2024-25-Research-January-March), CheckUser, Trust and Safety Product Team, FY2024-25 WE4.2

Nov 28 2024

XiaoXiao-WMF moved T376204: TempAccount updates to research pipelines from Backlog to FY2024-25-Research-January-March on the Research board.
Nov 28 2024, 8:04 PM · Research (FY2024-25-Research-January-March), Research-engineering
XiaoXiao-WMF moved T380752: Migrate Relforge to Opensearch from Backlog to Watching on the Research board.
Nov 28 2024, 7:59 PM · Data-Platform-SRE (2025.01.11 - 2025.01.31), Patch-For-Review, Research, Discovery-Search (Current work)
XiaoXiao-WMF moved T360794: Implement stream of HTML content on mw.page_change event from Backlog to Watching on the Research board.
Nov 28 2024, 7:57 PM · Research, Data-Engineering, Event-Platform
XiaoXiao-WMF added a project to T360794: Implement stream of HTML content on mw.page_change event: Research.
Nov 28 2024, 7:56 PM · Research, Data-Engineering, Event-Platform

Nov 27 2024

XiaoXiao-WMF created T381031: WE4.2.10 Add more browser signals to client hints pipeline to generate unique device identifier .
Nov 27 2024, 7:18 PM · OKR-Work, Research-engineering, Research (FY2024-25-Research-January-March), CheckUser, Trust and Safety Product Team, FY2024-25 WE4.2

Nov 26 2024

XiaoXiao-WMF added a parent task for T360794: Implement stream of HTML content on mw.page_change event: T380874: Incremental HTML wiki content dataset to support "Who are moderators" SDS 1.2.3.
Nov 26 2024, 5:48 PM · Research, Data-Engineering, Event-Platform
XiaoXiao-WMF added a subtask for T380874: Incremental HTML wiki content dataset to support "Who are moderators" SDS 1.2.3: T360794: Implement stream of HTML content on mw.page_change event.
Nov 26 2024, 5:48 PM · Data-Engineering, Research
XiaoXiao-WMF updated the task description for T380874: Incremental HTML wiki content dataset to support "Who are moderators" SDS 1.2.3.
Nov 26 2024, 5:48 PM · Data-Engineering, Research

Nov 20 2024

XiaoXiao-WMF changed the status of T377266: DSE kubernetes namespace for llm-inference from Open to In Progress.
Nov 20 2024, 3:10 PM · Data-Platform-SRE (2024.11.30 - 2024.12.20), Research-engineering, Data-Platform, Research
XiaoXiao-WMF removed a project from T372707: research code hand-over and resolve requests/comments from research engineers: Research.
Nov 20 2024, 3:09 PM · Research-Freezer, Epic, Wikidata data quality and trust, Wikidata, address-knowledge-gaps, Knowledge-Integrity

Nov 19 2024

XiaoXiao-WMF moved T379288: Plan for access control with opensearch from Backlog to Watching on the Research board.
Nov 19 2024, 3:31 PM · Discovery-Search
XiaoXiao-WMF added a comment to T379543: Update the research team's DAGs to use miniforge instead of miniconda.

@BTullis Can you please comment on the urgency/timeline if you have any for updating the DAG?

Nov 19 2024, 3:31 PM · Research-engineering, Research
XiaoXiao-WMF added a comment to T379161: create "ml-model-requests" tag.

Thank you !

Nov 19 2024, 3:29 PM · Project-Admins
XiaoXiao-WMF added a project to T379543: Update the research team's DAGs to use miniforge instead of miniconda: Research-engineering.
Nov 19 2024, 3:29 PM · Research-engineering, Research
XiaoXiao-WMF edited projects for T377267: Consolidate article based data pipelines, added: Research-Freezer; removed Research.
Nov 19 2024, 3:26 PM · Research-Freezer, Research-engineering
XiaoXiao-WMF edited projects for T377265: Simplify dependencies between research code repositories for ML, added: Research-Freezer; removed Research.
Nov 19 2024, 3:25 PM · Research-Freezer, Research-engineering
XiaoXiao-WMF added a parent task for T377265: Simplify dependencies between research code repositories for ML: T368615: technical debt and process enhancement.
Nov 19 2024, 3:21 PM · Research-Freezer, Research-engineering
XiaoXiao-WMF added a subtask for T368615: technical debt and process enhancement: T377265: Simplify dependencies between research code repositories for ML.
Nov 19 2024, 3:21 PM · Research-engineering, Research
XiaoXiao-WMF added a comment to T367446: Consolidate duplicated configuration/constants.

Will not work on it this quarter. Will move the task back when we get back to it.

Nov 19 2024, 3:13 PM · Research-Freezer, Research-engineering
XiaoXiao-WMF edited projects for T367446: Consolidate duplicated configuration/constants, added: Research-Freezer; removed Research.
Nov 19 2024, 3:12 PM · Research-Freezer, Research-engineering
XiaoXiao-WMF edited projects for T351677: [Research Engineering Request] Produce image datasets, added: Research-Freezer; removed Research.
Nov 19 2024, 3:11 PM · Research-Freezer, Research-engineering
XiaoXiao-WMF added a comment to T351677: [Research Engineering Request] Produce image datasets.

As per comment above, will move to freezer and reprioritize.

Nov 19 2024, 3:11 PM · Research-Freezer, Research-engineering
XiaoXiao-WMF added a comment to T342916: Add new "Readability" gap to Knowledge Gaps pipeline.

Will need reprioritize, moving to freezer.

Nov 19 2024, 3:10 PM · Research-Freezer, Research-engineering
XiaoXiao-WMF edited projects for T342916: Add new "Readability" gap to Knowledge Gaps pipeline, added: Research-Freezer; removed Research.
Nov 19 2024, 3:10 PM · Research-Freezer, Research-engineering
XiaoXiao-WMF removed a project from T346473: Training dataset validation: Research.
Nov 19 2024, 3:09 PM · Research-Freezer, Research-engineering
XiaoXiao-WMF added a project to T346473: Training dataset validation: Research-Freezer.
Nov 19 2024, 3:09 PM · Research-Freezer, Research-engineering
XiaoXiao-WMF moved T378761: HTML diff dataset for SDS 1.2.3 from Backlog to In Progress on the Research board.
Nov 19 2024, 3:08 PM · Research-engineering, Research
XiaoXiao-WMF moved T377266: DSE kubernetes namespace for llm-inference from Backlog to In Progress on the Research board.
Nov 19 2024, 3:07 PM · Data-Platform-SRE (2024.11.30 - 2024.12.20), Research-engineering, Data-Platform, Research
XiaoXiao-WMF moved T366528: Deployment of model updates from Backlog to Watching on the Research board.
Nov 19 2024, 3:05 PM · Research-engineering, Machine-Learning-Team, Research

Nov 18 2024

XiaoXiao-WMF added a comment to T379161: create "ml-model-requests" tag.

Following the template:

Nov 18 2024, 4:48 PM · Project-Admins

Nov 6 2024

XiaoXiao-WMF created T379161: create "ml-model-requests" tag.
Nov 6 2024, 2:35 PM · Project-Admins

Nov 5 2024

XiaoXiao-WMF added a comment to T378761: HTML diff dataset for SDS 1.2.3.

@diego What's the decision of creating one-off html dumps? From the description, there are two options. If we have decided to go with option 1, then we should start right now. If that's the case, please put this task to in-progress.

Nov 5 2024, 5:46 PM · Research-engineering, Research
XiaoXiao-WMF moved T376206: Quicksurvey audience selection with TempAccounts from Backlog to Watching on the Research board.
Nov 5 2024, 5:29 PM · Temporary accounts, QuickSurveys, Research

Oct 31 2024

XiaoXiao-WMF moved T376674: [SPIKE] Investigate efficacy for the Reference Needed API from Backlog to Watching on the Research board.
Oct 31 2024, 5:15 PM · Research, EditCheck, Editing-team
XiaoXiao-WMF added a project to T376674: [SPIKE] Investigate efficacy for the Reference Needed API: Research.
Oct 31 2024, 5:13 PM · Research, EditCheck, Editing-team

Oct 29 2024

XiaoXiao-WMF moved T340854: Verify if the Python recommendation API can support the use cases of the nodejs one from Backlog to Watching on the Research board.
Oct 29 2024, 5:29 PM · Research, ContentTranslation, Machine-Learning-Team, Epic
XiaoXiao-WMF added a comment to T376206: Quicksurvey audience selection with TempAccounts.

@TAndic Any update on this?

Oct 29 2024, 3:59 PM · Temporary accounts, QuickSurveys, Research
XiaoXiao-WMF moved T370147: [Epic] Migration of the Elasticsearch 7.10 search cluster to replacement backend search engine from Backlog to Watching on the Research board.
Oct 29 2024, 3:55 PM · Research, Discovery-Search (Current work), Epic
XiaoXiao-WMF added a project to T370147: [Epic] Migration of the Elasticsearch 7.10 search cluster to replacement backend search engine: Research.
Oct 29 2024, 3:55 PM · Research, Discovery-Search (Current work), Epic
XiaoXiao-WMF updated subscribers of T376204: TempAccount updates to research pipelines.

@KinneretG I think you are aware of this change, can you please comment on the community impact? Thanks!

Oct 29 2024, 3:48 PM · Research (FY2024-25-Research-January-March), Research-engineering

Oct 25 2024

XiaoXiao-WMF moved T376116: Implement support for temporary accounts in revertrisk models from Backlog to Watching on the Research board.
Oct 25 2024, 8:26 PM · Machine-Learning-Team, Research, Temporary accounts

Oct 17 2024

isarantopoulos awarded T371902: Request to host the Reference Need Model on LiftWing a Yellow Medal token.
Oct 17 2024, 7:44 AM · Lift-Wing, Machine-Learning-Team

Oct 16 2024

XiaoXiao-WMF updated subscribers of T376851: Turing Test for Patrolling.
Oct 16 2024, 4:54 PM · research-ideas
XiaoXiao-WMF added a project to T377266: DSE kubernetes namespace for llm-inference: Research-engineering.
Oct 16 2024, 2:24 PM · Data-Platform-SRE (2024.11.30 - 2024.12.20), Research-engineering, Data-Platform, Research

Oct 15 2024

XiaoXiao-WMF closed T368614: Essential work - model quantization as Resolved.

Mark as resolved for Q1 deliverables.

Oct 15 2024, 3:01 PM · Research, Essential-Work, Research-engineering

Oct 11 2024

XiaoXiao-WMF added a comment to T368389: WE4.3.1 - IP traffic.
  1. We have provided two approaches: 1) hard-coded logic based, 2) model. The two approaches arrive very similar performance metrics, which suggested that the latter (model) can be seen as merely a confirmation of the former (logic based), i.e. if the ML model does not out-smart the logics, it may imply that the problem in hand may not need a ML based approach. Should we observe other behaviors in the future, we could come back to revisit.
Oct 11 2024, 12:41 PM · Knowledge-Integrity, OKR-Work, Research (FY2024-25-Research-July-September)

Oct 10 2024

XiaoXiao-WMF created T376851: Turing Test for Patrolling.
Oct 10 2024, 12:02 AM · research-ideas

Oct 9 2024

XiaoXiao-WMF updated the task description for T376674: [SPIKE] Investigate efficacy for the Reference Needed API.
Oct 9 2024, 1:14 PM · Research, EditCheck, Editing-team

Oct 8 2024

XiaoXiao-WMF added a subtask for T293465: Edit Types Research: T351225: Productionized Edit Types.
Oct 8 2024, 11:50 PM · Research, Epic
XiaoXiao-WMF added a parent task for T351225: Productionized Edit Types: T293465: Edit Types Research.
Oct 8 2024, 11:50 PM · Data-Engineering-Icebox, Research, Data-Engineering, Event-Platform, Research-engineering
XiaoXiao-WMF updated the task description for T376674: [SPIKE] Investigate efficacy for the Reference Needed API.
Oct 8 2024, 12:44 PM · Research, EditCheck, Editing-team

Oct 5 2024

XiaoXiao-WMF created T376548: Learning to defer a decision-making to moderators when (non)ML-based decisions are deemed unreliable.
Oct 5 2024, 9:57 PM · research-ideas
XiaoXiao-WMF added a comment to T354455: [long] Train model for auto-generating SQL queries.

LLMs have shown promising results (raw SQL of course) in this front, probably worth a try - could start with llama3-70b (some claims this is as good as chatgpt4)

Oct 5 2024, 7:51 PM · research-ideas

Oct 4 2024

XiaoXiao-WMF added a comment to T371658: Codify team processes for research code to Production code.

no update, P&T offsite

Oct 4 2024, 11:38 PM · Research-management, Research
XiaoXiao-WMF added a comment to T375291: Research Infrastructure component accountability.

no update, ready to discuss again as a group

Oct 4 2024, 11:36 PM · Research-management, Research
XiaoXiao-WMF edited projects for T333701: Implement "visibility metric" as percentage of orphan articles in a category., added: Research-Freezer; removed Research.
Oct 4 2024, 11:31 PM · Research-Freezer
XiaoXiao-WMF closed T368388: WE4.2.1 - Unique Device as Resolved.

The research work for Q1 has concluded.

Oct 4 2024, 1:52 PM · OKR-Work, Knowledge-Integrity, FY2024-25 WE4.2.1 Unique device identification model, Research (FY2024-25-Research-July-September), Research-engineering
XiaoXiao-WMF closed T368389: WE4.3.1 - IP traffic as Resolved.

@CDanis we will close this task. Please reach out to me if you need further assistance.

Oct 4 2024, 1:51 PM · Knowledge-Integrity, OKR-Work, Research (FY2024-25-Research-July-September)

Sep 27 2024

XiaoXiao-WMF added a comment to T371658: Codify team processes for research code to Production code.

no update, will pick up next week

Sep 27 2024, 9:02 PM · Research-management, Research
XiaoXiao-WMF added a comment to T375291: Research Infrastructure component accountability.

update:

  • second round review, mostly on database as a service definition and discussion
Sep 27 2024, 9:02 PM · Research-management, Research
XiaoXiao-WMF added a comment to T341519: Team-Interface: Discoverability of the documentation for ongoing or completed research project.

@leila sorry need clarification - what do you mean by "other categories of 'technical documentation'"? Are you asking for more categories which is at the same level of the existing headers, or subcategories within "technical documentation"?

Sep 27 2024, 5:32 PM · Essential-Work, Design-Research, Research-management, Research

Sep 25 2024

XiaoXiao-WMF created T375691: Essential work - Monitoring and Stats for election-related articles and edits.
Sep 25 2024, 8:14 PM · Research, Movement-Insights

Sep 24 2024

XiaoXiao-WMF edited projects for T369371: [Research Engineering Request] Deploy the new Wikidata Revert Risk Model, added: Research-Freezer; removed Research.
Sep 24 2024, 5:05 PM · Research-Freezer, Wikidata.org, Wikidata, Research-engineering

Sep 23 2024

XiaoXiao-WMF added a project to T336766: Release data for interaction between gaps: Research-engineering.
Sep 23 2024, 5:46 PM · Research-Freezer, Research-engineering
XiaoXiao-WMF edited projects for T344644: Investigate articles_deleted as content gap metric, added: Research-Freezer; removed Research.
Sep 23 2024, 5:37 PM · Research-Freezer, Research-engineering
XiaoXiao-WMF edited projects for T344851: Add cumulative metrics to the knowledge gaps pipeline, added: Research-Freezer; removed Research.
Sep 23 2024, 5:36 PM · Research-Freezer, Research-engineering
XiaoXiao-WMF edited projects for T343061: Denylist for language agnostic revert risk model, added: Research-Freezer; removed Research.
Sep 23 2024, 5:35 PM · Research-Freezer, Research-engineering
XiaoXiao-WMF added a comment to T343061: Denylist for language agnostic revert risk model.

moving to freezer

Sep 23 2024, 5:35 PM · Research-Freezer, Research-engineering
XiaoXiao-WMF edited projects for T351225: Productionized Edit Types, added: Research-Freezer; removed Research.
Sep 23 2024, 5:30 PM · Data-Engineering-Icebox, Research, Data-Engineering, Event-Platform, Research-engineering