diego (Diego S-T)
Senior Research Scientist

User Details

User Since
Aug 8 2017, 10:56 AM (293 w, 5 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
Diego (WMF) [ Global Accounts ]

Recent Activity

Wed, Mar 22

diego updated subscribers of T332021: Wikidata Articlequality ORES/ML model needs updating after MUL.
Wed, Mar 22, 4:02 PM · artificial-intelligence, Machine-Learning-Team, Item Quality Scoring Improvement, revscoring, wmde-wikidata-tech, Wikidata, ORES

Sat, Mar 11

diego added a comment to T328813: Develop a ML-based service to detect vandalism on Wikidata.
  • New features have slightly improved the accuracy (now 75%). I'm still working on improving the model.
Sat, Mar 11, 2:15 AM · Wikidata data quality and trust, Wikidata, Research (FY2022-23-Research-January-March)
diego added a comment to T314386: Create a multilingual model to predict reverts on Wikipedia.
  • @Trokhymovych has finished the version of this model covering 47 languages. @MunizaA reviewed and adapted the code, and now we are coordinating with @achou to update the model on Lift Wing.
Sat, Mar 11, 2:13 AM · Research (FY2022-23-Research-January-March)
diego added a comment to T314385: Create a language agnostic model to predict reverts on Wikipedia.
  • We are discussing the schema for this (and other) ML-generated events (T331401)
Sat, Mar 11, 2:10 AM · Research (FY2022-23-Research-January-March), Machine-Learning-Team, Lift-Wing

Wed, Mar 8

diego added a comment to T329071: Integration of Revert Risk Scores to Recent Changes as a filter.

The problem that I see with 1) is that we are already filtering (and rightfully so) a lot of events, while researchers may want the whole stream scored.

@elukey, do we know which data is being filtered out?

Wed, Mar 8, 4:36 PM · Data-Engineering-Planning, Event-Platform Value Stream, Machine-Learning-Team, Edit-Review-Improvements-Integrated-Filters, Research, Growth-Team
diego updated subscribers of T329071: Integration of Revert Risk Scores to Recent Changes as a filter.
Wed, Mar 8, 4:35 PM · Data-Engineering-Planning, Event-Platform Value Stream, Machine-Learning-Team, Edit-Review-Improvements-Integrated-Filters, Research, Growth-Team

Tue, Mar 7

diego added a comment to T331401: Design event schema for ML scores/recommendations on current page state.

And for the revert-risk model:

score:
    model_name: revertrisk
    model_version: 1.0.1
    prediction:
    - true
    probability:
        true: 0.9
        false: 0.1

This works for me.
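
A minimal sketch of how a payload with the shape above could be assembled and sanity-checked. The `Score` class, the `build_revertrisk_score` helper, and the 0.5 threshold are hypothetical names and values chosen for illustration; they are not part of the schema under discussion.

```python
from dataclasses import dataclass, asdict


@dataclass
class Score:
    """Score payload mirroring the fields in the example above."""
    model_name: str
    model_version: str
    prediction: list
    probability: dict

    def validate(self) -> None:
        # The two class probabilities should sum to 1 (within rounding).
        total = sum(self.probability.values())
        if abs(total - 1.0) > 1e-6:
            raise ValueError(f"probabilities sum to {total}, expected 1.0")


def build_revertrisk_score(p_revert: float, threshold: float = 0.5) -> dict:
    # threshold is an assumed cut-off, not something the task specifies.
    score = Score(
        model_name="revertrisk",
        model_version="1.0.1",
        prediction=[p_revert >= threshold],
        probability={"true": p_revert, "false": 1.0 - p_revert},
    )
    score.validate()
    return {"score": asdict(score)}


event = build_revertrisk_score(0.9)
```

Keeping `prediction` as a list and `probability` as a mapping follows the example; whether other models would reuse the same shape is left open here.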

Tue, Mar 7, 6:09 PM · Event-Platform Value Stream (Sprint 10), Data-Engineering, Machine-Learning-Team, Research

Mon, Mar 6

diego added a comment to T314386: Create a multilingual model to predict reverts on Wikipedia.
  • We are working on increasing the number of languages covered by this model. Currently, the model hosted on Lift Wing covers 6. The next version will cover 47. (Keep in mind that the language agnostic model, T314385, covers all wikis.)
Mon, Mar 6, 12:26 AM · Research (FY2022-23-Research-January-March)

Sun, Mar 5

diego added a parent task for T329071: Integration of Revert Risk Scores to Recent Changes as a filter: T314385: Create a language agnostic model to predict reverts on Wikipedia.
Sun, Mar 5, 11:58 PM · Data-Engineering-Planning, Event-Platform Value Stream, Machine-Learning-Team, Edit-Review-Improvements-Integrated-Filters, Research, Growth-Team
diego added a subtask for T314385: Create a language agnostic model to predict reverts on Wikipedia: T329071: Integration of Revert Risk Scores to Recent Changes as a filter.
Sun, Mar 5, 11:57 PM · Research (FY2022-23-Research-January-March), Machine-Learning-Team, Lift-Wing
diego added a comment to T314385: Create a language agnostic model to predict reverts on Wikipedia.
  • We are discussing how to integrate this model on the Recent Changes page on MediaWiki (T329071)
Sun, Mar 5, 11:57 PM · Research (FY2022-23-Research-January-March), Machine-Learning-Team, Lift-Wing
diego edited projects for T314385: Create a language agnostic model to predict reverts on Wikipedia, added: Research (FY2022-23-Research-January-March); removed Research (FY2022-23-Research-October-December).
Sun, Mar 5, 11:55 PM · Research (FY2022-23-Research-January-March), Machine-Learning-Team, Lift-Wing
diego added a comment to T328813: Develop a ML-based service to detect vandalism on Wikidata.
  • Currently I'm working on feature engineering. The current model has around 72% accuracy on balanced data.
Sun, Mar 5, 11:52 PM · Wikidata data quality and trust, Wikidata, Research (FY2022-23-Research-January-March)

Fri, Mar 3

diego updated subscribers of T326179: Proposal: Create a stream end point for Revision Risk Model.
Fri, Mar 3, 11:20 AM · Event-Platform Value Stream, Data-Engineering, Machine-Learning-Team, Research

Thu, Mar 2

diego added a comment to T326179: Proposal: Create a stream end point for Revision Risk Model.

We should all sync up and work on some big standardized modeling design decisions and ideas. It would be great if we could share intentions and strategies for the future so we can prioritize work between ML and Event Platform especially.

I would add Research to this ;)

Thu, Mar 2, 4:26 PM · Event-Platform Value Stream, Data-Engineering, Machine-Learning-Team, Research

Mon, Feb 27

diego updated subscribers of T329071: Integration of Revert Risk Scores to Recent Changes as a filter.
Mon, Feb 27, 2:37 PM · Data-Engineering-Planning, Event-Platform Value Stream, Machine-Learning-Team, Edit-Review-Improvements-Integrated-Filters, Research, Growth-Team

Feb 24 2023

diego added a comment to T330148: Support the Revert-Review API/tool on Toolforge.

@achou the main requestor will be the aforementioned API, for evaluating the model. I don't expect high traffic. Let's say a couple of thousand requests per week.

Feb 24 2023, 10:00 AM · Machine-Learning-Team, Lift-Wing

Feb 18 2023

diego added a comment to T305888: Reference Quality in English Wikipedia.
  • The paper was officially accepted at WWW'23. We made some final updates to the text.
Feb 18 2023, 5:24 AM · Research (FY2022-23-Research-January-March)
diego added a comment to T314384: Develop a ML-based service to predict reverts on Wikipedia(s).
  • Discussing the integration of Revert Risk on MediaWiki: T329071
Feb 18 2023, 5:24 AM · Research, Epic
diego added a comment to T328813: Develop a ML-based service to detect vandalism on Wikidata.
  • Still working on the data evaluation. Currently I'm studying the use of tags and user groups and their relation with reverts.
Feb 18 2023, 5:22 AM · Wikidata data quality and trust, Wikidata, Research (FY2022-23-Research-January-March)

Feb 16 2023

diego added a comment to T329071: Integration of Revert Risk Scores to Recent Changes as a filter.

Apparently this is currently done by: https://www.mediawiki.org/wiki/Extension:ORES

Feb 16 2023, 1:40 PM · Data-Engineering-Planning, Event-Platform Value Stream, Machine-Learning-Team, Edit-Review-Improvements-Integrated-Filters, Research, Growth-Team

Feb 7 2023

diego updated subscribers of T329071: Integration of Revert Risk Scores to Recent Changes as a filter.
Feb 7 2023, 5:36 PM · Data-Engineering-Planning, Event-Platform Value Stream, Machine-Learning-Team, Edit-Review-Improvements-Integrated-Filters, Research, Growth-Team
diego created T329071: Integration of Revert Risk Scores to Recent Changes as a filter.
Feb 7 2023, 4:07 PM · Data-Engineering-Planning, Event-Platform Value Stream, Machine-Learning-Team, Edit-Review-Improvements-Integrated-Filters, Research, Growth-Team

Feb 3 2023

diego added a comment to T328813: Develop a ML-based service to detect vandalism on Wikidata.
  • We are working on manually evaluating reverts to identify the right data to train the model.
Feb 3 2023, 10:03 PM · Wikidata data quality and trust, Wikidata, Research (FY2022-23-Research-January-March)
diego created T328813: Develop a ML-based service to detect vandalism on Wikidata.
Feb 3 2023, 10:01 PM · Wikidata data quality and trust, Wikidata, Research (FY2022-23-Research-January-March)
diego added a comment to T305888: Reference Quality in English Wikipedia.
  • We have submitted the Camera Ready version of this paper.
  • We have started working on evaluating references in other languages.
Feb 3 2023, 9:59 PM · Research (FY2022-23-Research-January-March)
diego added a comment to T314385: Create a language agnostic model to predict reverts on Wikipedia.
  • We are coordinating with the ML-team to create a public stream with this model's score.
Feb 3 2023, 9:58 PM · Research (FY2022-23-Research-January-March), Machine-Learning-Team, Lift-Wing

Jan 27 2023

diego added a comment to T314386: Create a multilingual model to predict reverts on Wikipedia.
  • We are working on a paper to share the results of this model.
Jan 27 2023, 6:24 PM · Research (FY2022-23-Research-January-March)
diego edited projects for T314386: Create a multilingual model to predict reverts on Wikipedia, added: Research (FY2022-23-Research-January-March); removed Research (FY2022-23-Research-October-December).
Jan 27 2023, 6:23 PM · Research (FY2022-23-Research-January-March)
diego closed T327763: Generate lists of unillustrated sections in enwiki that have image suggestions from other wikis as Resolved.
Jan 27 2023, 6:23 PM · Section-Level-Image-Suggestions
diego closed T327763: Generate lists of unillustrated sections in enwiki that have image suggestions from other wikis, a subtask of T319419: Section Level Image Suggestions based on cross-lingual Section Alignment, as Resolved.
Jan 27 2023, 6:22 PM · Research (FY2022-23-Research-October-December), Section-Level-Image-Suggestions
diego edited projects for T305888: Reference Quality in English Wikipedia, added: Research (FY2022-23-Research-January-March); removed Research (FY2022-23-Research-October-December).
Jan 27 2023, 6:21 PM · Research (FY2022-23-Research-January-March)
diego added a comment to T305888: Reference Quality in English Wikipedia.
  • Our paper was conditionally accepted at TheWebConf'23 (a.k.a WWW'23)
Jan 27 2023, 6:21 PM · Research (FY2022-23-Research-January-March)

Jan 25 2023

diego added a comment to T327763: Generate lists of unillustrated sections in enwiki that have image suggestions from other wikis.

@Miriam please find the data in csv format here and the code used to generate it here.

Jan 25 2023, 9:22 PM · Section-Level-Image-Suggestions
diego added a comment to T327763: Generate lists of unillustrated sections in enwiki that have image suggestions from other wikis.

@Miriam what should we do with sections that have multiple images? Do you want

<page id>,<page title>,<section title>,<img_title>,<n_recommendations>

?
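
One possible reading of the layout above, sketched under assumptions: a section with several images gets one row per (section, image) pair. The column names are normalized from the proposed header, and the rows are made-up placeholders rather than real suggestions.

```python
import csv
import io

# Column names adapted from the proposed layout (assumed spelling).
FIELDS = ["page_id", "page_title", "section_title", "img_title", "n_recommendations"]


def write_suggestions(rows, out):
    """Write one CSV row per (section, image) pair."""
    writer = csv.DictWriter(out, fieldnames=FIELDS)
    writer.writeheader()
    for row in rows:
        writer.writerow(row)


# Made-up rows: the same section appears twice, once per suggested image.
rows = [
    {"page_id": 1, "page_title": "Example, page", "section_title": "History",
     "img_title": "A.jpg", "n_recommendations": 3},
    {"page_id": 1, "page_title": "Example, page", "section_title": "History",
     "img_title": "B.jpg", "n_recommendations": 1},
]

buf = io.StringIO()
write_suggestions(rows, buf)
csv_text = buf.getvalue()
```

Using the `csv` module rather than joining strings by hand keeps titles that contain commas correctly quoted.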

Jan 25 2023, 4:00 PM · Section-Level-Image-Suggestions

Jan 19 2023

diego added a comment to T317700: Enable product analytics to use revision risk to assess edit quality in feature analyses.

The model is already available, check here how to use it: T314385#8496547

Jan 19 2023, 3:41 PM · EditCheck, Research, Product-Analytics, VisualEditor

Jan 13 2023

diego added a comment to T314385: Create a language agnostic model to predict reverts on Wikipedia.
  • We have presented the results of this project at the WMF's "Monthly Tech All Meeting"
Jan 13 2023, 9:25 PM · Research (FY2022-23-Research-January-March), Machine-Learning-Team, Lift-Wing
diego added a comment to T305888: Reference Quality in English Wikipedia.

The final decision should be out this week.

Jan 13 2023, 9:24 PM · Research (FY2022-23-Research-January-March)

Jan 9 2023

diego updated subscribers of T317768: Proposal: deprecate the mediawiki.revision-score stream in favour of more streams like mediawiki-revision-score-<model>.

Hi all, we have a use case here T326179. These models are already hosted on LiftWing. The suggested end-point could be mediawiki-revision-score-revert-risk-la

Jan 9 2023, 6:33 PM · Data-Engineering-Planning, Research, Machine-Learning-Team
diego updated subscribers of T314384: Develop a ML-based service to predict reverts on Wikipedia(s).

For the record, here is a snippet (by @achou) to try the models from the WMF's cluster

Jan 9 2023, 4:31 PM · Research, Epic

Jan 3 2023

diego renamed T326179: Proposal: Create a stream end point for Revision Risk Model from Create an stream end point for Revision Risk Model to Proposal: Create a stream end point for Revision Risk Model.
Jan 3 2023, 10:35 PM · Event-Platform Value Stream, Data-Engineering, Machine-Learning-Team, Research
diego added a comment to T314385: Create a language agnostic model to predict reverts on Wikipedia.

For the record, here is a snippet (by @achou) to try this model:

Jan 3 2023, 9:37 PM · Research (FY2022-23-Research-January-March), Machine-Learning-Team, Lift-Wing
diego created T326179: Proposal: Create a stream end point for Revision Risk Model.
Jan 3 2023, 9:35 PM · Event-Platform Value Stream, Data-Engineering, Machine-Learning-Team, Research
diego updated subscribers of T314386: Create a multilingual model to predict reverts on Wikipedia.

For the record, here is a snippet (by @achou) to try this model:

Jan 3 2023, 9:25 PM · Research (FY2022-23-Research-January-March)
diego added a comment to T314385: Create a language agnostic model to predict reverts on Wikipedia.
Jan 3 2023, 9:24 PM · Research (FY2022-23-Research-January-March), Machine-Learning-Team, Lift-Wing

Dec 23 2022

diego added a comment to T319419: Section Level Image Suggestions based on cross-lingual Section Alignment.

Update

Dec 23 2022, 4:58 PM · Research (FY2022-23-Research-October-December), Section-Level-Image-Suggestions
diego added a comment to T314385: Create a language agnostic model to predict reverts on Wikipedia.
  • Multilingual and language-agnostic models have been deployed to production. Check the details in the related tasks.
  • We are now onboarding @Sheilakaruku to work on developing a user interface to work with these models (T318634)
Dec 23 2022, 3:44 PM · Research (FY2022-23-Research-January-March), Machine-Learning-Team, Lift-Wing
diego added a comment to T305888: Reference Quality in English Wikipedia.
  • We have received the reviews from the WWW, and submitted the rebuttal. Now, we need to wait for the final decision.
Dec 23 2022, 3:41 PM · Research (FY2022-23-Research-January-March)

Dec 22 2022

diego added a comment to T321224: Wikidata Item Quality Model.

I'm trying to implement a link-prediction task on Wikidata, to be used as a proxy for claims coverage. I'm building on top of Goyal & Ferrara's work. The existing libraries might require some tweaks to work on the full Wikidata Graph, but before addressing the scalability issues I want to test this approach on a small sample to check its suitability.

Dec 22 2022, 9:14 PM · Research (FY2022-23-Research-January-March), Linked-Open-Data-Network-Program, Wikidata

Dec 13 2022

diego added a comment to T321947: Data collection for the Knowledge Integrity Risk Composite Index.

Regarding article quality, you can find the scores for all revisions in all languages from 2020-01-01 until 2022-09-31 here: /user/dsaez/paramita_article_quality/scores_all_v3_from_2020-01-01.parquet (HDFS)

Dec 13 2022, 7:56 AM · Research (FY2022-23-Research-October-December)

Dec 12 2022

diego added a comment to T318348: [SPIKE] Section-level topic relevance score.

@mfossati do you have an update on the status of the evaluation for proposal 2? thanks!

I sent the data check request as per our formal cross-team process, solicited Research during the last meeting, and personally pinged @diego and @MunizaA . I haven't seen or received any update so far, I'll follow up again at the next meeting.

Dec 12 2022, 10:23 PM · Structured-Data-Backlog (Current Work), Research-Backlog, Section-Topics
diego added a comment to T318348: [SPIKE] Section-level topic relevance score.

As mentioned before in our meetings, the main problem we have is the confusing usage of "Blue Links" as a synonym for "topics". In NLP, topics are either categories or clusters of documents. The second important problem we have is the lack of an evaluation task or guidelines. If we are using links as tags, and we want to evaluate the importance/relevance of such tags, we need a task, because relevance depends on the context.

Dec 12 2022, 10:20 PM · Structured-Data-Backlog (Current Work), Research-Backlog, Section-Topics

Nov 21 2022

diego added a comment to T321947: Data collection for the Knowledge Integrity Risk Composite Index.

Based on your work at T314384, we would love to incorporate new fields like:

  • vandalism_count
  • vandalism_ratio
  • vandalism_reverts_ratio
  • seconds_to_revert_vandalism_avg

Your feedback would be highly appreciated, so thanks in advance for your interest and happy to brainstorm together on this :)

Just to clarify: we have a "revert probability", and we can't claim this is "vandalism". Unlike the previous model, we have just a single score.
You might be interested in collecting abuse filter information. I have some code to do that, and from there you might be able to compute something like "abuse filter hits".

Nov 21 2022, 3:31 PM · Research (FY2022-23-Research-October-December)

Nov 4 2022

diego added a comment to T318634: Develop a web app for patrolling based on the new ML-based service to predict reverts.

Please remember to record your contributions on the Outreachy website! The deadline is (today) Nov 4th!

Nov 4 2022, 12:19 AM · Outreachy (Round 25)

Nov 1 2022

diego updated the task description for T314384: Develop a ML-based service to predict reverts on Wikipedia(s).
Nov 1 2022, 4:40 PM · Research, Epic
diego added a comment to T314384: Develop a ML-based service to predict reverts on Wikipedia(s).

It has been decided to focus on knowledge integrity risks from two categories of our taxonomy:

Nov 1 2022, 4:39 PM · Research, Epic
diego added a comment to T318634: Develop a web app for patrolling based on the new ML-based service to predict reverts.

Hi @diego, should we mention a single contribution on the Outreachy website or mention our contribution in multiple parts?

Nov 1 2022, 12:02 PM · Outreachy (Round 25)

Oct 31 2022

diego added a comment to T321594: Deploy revert-risk-model to production.

Yeah! Thanks @achou! Can you please write an example here of how to hit the endpoint?

Oct 31 2022, 12:34 PM · Machine-Learning-Team, Lift-Wing
diego added a comment to T318634: Develop a web app for patrolling based on the new ML-based service to predict reverts.

Hello everybody,

Oct 31 2022, 12:04 PM · Outreachy (Round 25)

Oct 24 2022

diego added a comment to T321224: Wikidata Item Quality Model.

I understand your concerns, but I'll start by considering "Instance Of" as the main "category" for the item. We could later try to cluster instances based on statement similarities, but I would keep that for later.

Oct 24 2022, 6:29 PM · Research (FY2022-23-Research-January-March), Linked-Open-Data-Network-Program, Wikidata
diego added a comment to T314385: Create a language agnostic model to predict reverts on Wikipedia.
  • The code is being refactored by @MunizaA and reviewed by @achou. They are trying to find the optimal architecture in order to make the code easier to maintain and update.
  • We have found that the model performs poorly on anonymous edits. I'm working on updating the model to improve this.
Oct 24 2022, 6:09 PM · Research (FY2022-23-Research-January-March), Machine-Learning-Team, Lift-Wing
diego added a comment to T319419: Section Level Image Suggestions based on cross-lingual Section Alignment.
  • The code is fully functional, and can be found here.
  • @MunizaA is experimenting with AirFlow to be able to run this script periodically.
Oct 24 2022, 6:04 PM · Research (FY2022-23-Research-October-December), Section-Level-Image-Suggestions

Oct 19 2022

diego added a comment to T318634: Develop a web app for patrolling based on the new ML-based service to predict reverts.

@diego I experienced some issues when comparing the revisions for all the 5000 recentchanges: their individual wikitext contents are quite large, and running them through mwedittypes.SimpleEditTypes() returns an empty output.

Hi @Caseyy0000. Please check the documentation for mwedittypes here. It could be that in some specific cases the library fails, but that should be very exceptional.

Oct 19 2022, 2:18 PM · Outreachy (Round 25)

Oct 17 2022

diego added a comment to T314863: [SPIKE] Section topics article-level relevance score.

Looks good to me!

Oct 17 2022, 3:53 PM · Data Pipelines, Structured-Data-Backlog (Current Work), Research-Backlog, Section-Topics

Oct 12 2022

diego added a comment to T318634: Develop a web app for patrolling based on the new ML-based service to predict reverts.

Hi everyone, I am Andy, an Outreachy applicant, nice to meet you all,
Please, I have a question: is there any recommended video tutorial for getting started with the mediawiki/mwapi API?
I will be grateful.

Oct 12 2022, 9:43 AM · Outreachy (Round 25)

Oct 7 2022

diego updated the task description for T318634: Develop a web app for patrolling based on the new ML-based service to predict reverts.
Oct 7 2022, 12:44 PM · Outreachy (Round 25)

Oct 6 2022

diego changed the visibility for T318634: Develop a web app for patrolling based on the new ML-based service to predict reverts.
Oct 6 2022, 3:08 PM · Outreachy (Round 25)
diego updated the task description for T318634: Develop a web app for patrolling based on the new ML-based service to predict reverts.
Oct 6 2022, 3:07 PM · Outreachy (Round 25)
diego updated the task description for T318634: Develop a web app for patrolling based on the new ML-based service to predict reverts.
Oct 6 2022, 12:58 PM · Outreachy (Round 25)

Oct 5 2022

diego updated Other Assignee for T319419: Section Level Image Suggestions based on cross-lingual Section Alignment, added: MunizaA.
Oct 5 2022, 1:57 PM · Research (FY2022-23-Research-October-December), Section-Level-Image-Suggestions
diego added a subtask for T311814: [EPIC] Section-level image suggestions data pipeline: T319419: Section Level Image Suggestions based on cross-lingual Section Alignment.
Oct 5 2022, 1:57 PM · Structured-Data-Backlog (Current Work), Data Pipelines, Section-Level-Image-Suggestions, Research-Backlog, Epic
diego added a parent task for T319419: Section Level Image Suggestions based on cross-lingual Section Alignment: T311814: [EPIC] Section-level image suggestions data pipeline.
Oct 5 2022, 1:57 PM · Research (FY2022-23-Research-October-December), Section-Level-Image-Suggestions
diego created T319419: Section Level Image Suggestions based on cross-lingual Section Alignment.
Oct 5 2022, 1:55 PM · Research (FY2022-23-Research-October-December), Section-Level-Image-Suggestions

Oct 3 2022

diego closed T319233: Pageviews API: Problems accessing data from python (requests) as Resolved.
Oct 3 2022, 6:36 PM · Data-Engineering, Pageviews-API
diego added a comment to T319233: Pageviews API: Problems accessing data from python (requests).

Sorry, I've read the documentation here: https://meta.wikimedia.org/wiki/User-Agent_policy and everything is clear. I'm going to close this ticket.
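
For anyone hitting the same error, a minimal sketch of a request that sends an identifying User-Agent header, as the policy linked above asks. The `USER_AGENT` string and the `build_pageviews_request` helper are hypothetical placeholders, and the standard-library `urllib` is used here only to keep the example self-contained; with `requests`, the same header can be passed through the `headers=` argument.

```python
import urllib.parse
import urllib.request

# Hypothetical identification string: replace the tool name and contact
# details with your own.
USER_AGENT = "ExampleResearchBot/0.1 (https://example.org/bot; bot@example.org)"


def build_pageviews_request(article, start, end, project="en.wikipedia"):
    """Build (but do not send) a per-article Pageviews API request."""
    title = urllib.parse.quote(article, safe="")
    url = (
        "https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article/"
        f"{project}/all-access/user/{title}/daily/{start}/{end}"
    )
    # Without an explicit User-Agent, the library's default is sent instead.
    return urllib.request.Request(url, headers={"User-Agent": USER_AGENT})


req = build_pageviews_request("Wikipedia", "20220901", "20220930")
# To actually send it: urllib.request.urlopen(req)
```

The request is only constructed here, not sent, so the sketch can be run without network access.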

Oct 3 2022, 6:36 PM · Data-Engineering, Pageviews-API
diego added a comment to T319233: Pageviews API: Problems accessing data from python (requests).

I see that the error says:

Scripted requests from your IP have been blocked.

However, the error persists from different IPs.

Oct 3 2022, 6:34 PM · Data-Engineering, Pageviews-API
diego created T319233: Pageviews API: Problems accessing data from python (requests).
Oct 3 2022, 6:32 PM · Data-Engineering, Pageviews-API

Sep 27 2022

diego added a comment to T314385: Create a language agnostic model to predict reverts on Wikipedia.
  • @MunizaA has written the model to be hosted on Liftwing and shared it with @achou.
  • @achou is testing the model locally before uploading to Liftwing
  • @diego is working on creating a new model adding new features.
Sep 27 2022, 3:20 PM · Research (FY2022-23-Research-January-March), Machine-Learning-Team, Lift-Wing

Sep 26 2022

diego updated subscribers of T318634: Develop a web app for patrolling based on the new ML-based service to predict reverts.
Sep 26 2022, 11:15 PM · Outreachy (Round 25)
diego created T318634: Develop a web app for patrolling based on the new ML-based service to predict reverts.
Sep 26 2022, 11:14 PM · Outreachy (Round 25)

Sep 10 2022

diego renamed T305888: Reference Quality in English Wikipedia from Reference Quality in English Wikipedia / Internship to Reference Quality in English Wikipedia.
Sep 10 2022, 3:49 PM · Research (FY2022-23-Research-January-March)
diego added a comment to T314385: Create a language agnostic model to predict reverts on Wikipedia.
  • We have developed a language agnostic ML model to predict reverts.
  • The model has an accuracy of 80% on a balanced dataset, compared to the 66% given by ORES.
  • Research code is available here.
  • @MunizaA is working on implementing the code as a service, and then @AikoChou will deploy it to LiftWing. T
Sep 10 2022, 3:48 PM · Research (FY2022-23-Research-January-March), Machine-Learning-Team, Lift-Wing
diego added a comment to T305888: Reference Quality in English Wikipedia.
  • We are currently working on writing the results
Sep 10 2022, 3:39 PM · Research (FY2022-23-Research-January-March)
diego moved T288333: Understanding the spread of disinformation on Wikipedia from In Progress to Staged on the Research board.
Sep 10 2022, 3:38 PM · Research
diego moved T305888: Reference Quality in English Wikipedia from In Progress to FY2022-23-Research-July-September on the Research board.
Sep 10 2022, 3:38 PM · Research (FY2022-23-Research-January-March)

Aug 31 2022

diego moved T314384: Develop a ML-based service to predict reverts on Wikipedia(s) from FY2022-23-Research-July-September to In Progress on the Research board.
Aug 31 2022, 3:08 PM · Research, Epic
diego moved T314384: Develop a ML-based service to predict reverts on Wikipedia(s) from Staged to FY2022-23-Research-July-September on the Research board.
Aug 31 2022, 3:05 PM · Research, Epic
diego moved T314386: Create a multilingual model to predict reverts on Wikipedia from Staged to FY2022-23-Research-July-September on the Research board.
Aug 31 2022, 3:05 PM · Research (FY2022-23-Research-January-March)
diego moved T314385: Create a language agnostic model to predict reverts on Wikipedia from Staged to FY2022-23-Research-July-September on the Research board.
Aug 31 2022, 3:05 PM · Research (FY2022-23-Research-January-March), Machine-Learning-Team, Lift-Wing
diego moved T314385: Create a language agnostic model to predict reverts on Wikipedia from FY2022-23-Research-July-September to Staged on the Research board.
Aug 31 2022, 3:05 PM · Research (FY2022-23-Research-January-March), Machine-Learning-Team, Lift-Wing
diego moved T314385: Create a language agnostic model to predict reverts on Wikipedia from Staged to FY2022-23-Research-July-September on the Research board.
Aug 31 2022, 3:04 PM · Research (FY2022-23-Research-January-March), Machine-Learning-Team, Lift-Wing

Aug 26 2022

diego triaged T314386: Create a multilingual model to predict reverts on Wikipedia as High priority.
Aug 26 2022, 5:02 PM · Research (FY2022-23-Research-January-March)

Aug 17 2022

diego added a comment to T315262: Requesting access to Analytic Cluster for Trokhymovych.

Thanks @Ottomata, the contract finishes on December 15th.

Aug 17 2022, 2:41 PM · SRE, SRE-Access-Requests
diego added a comment to T315262: Requesting access to Analytic Cluster for Trokhymovych.

Hi @Trokhymovych thanks for sharing the key, I assume you need shell/ssh access is that correct? Could you advise of exactly what systems you'll need to ssh into?

@cmooney we need to give access to @Trokhymovych to the stat machines and Spark Cluster.

Aug 17 2022, 11:18 AM · SRE, SRE-Access-Requests

Aug 15 2022

diego created T315262: Requesting access to Analytic Cluster for Trokhymovych.
Aug 15 2022, 9:15 PM · SRE, SRE-Access-Requests

Aug 10 2022

Dzahn awarded T292955: Requesting access to Analytic Cluster for Muniza a Orange Medal token.
Aug 10 2022, 3:44 PM · SRE, SRE-Access-Requests

Aug 9 2022

diego updated subscribers of T292955: Requesting access to Analytic Cluster for Muniza.

Hi @BCornwall, just to say this is a high priority for us. We have already lost 4 days of work with @MunizaA being locked out of the servers.

Aug 9 2022, 7:35 AM · SRE, SRE-Access-Requests

Aug 5 2022

diego added a comment to T292955: Requesting access to Analytic Cluster for Muniza.

@Muehlenhoff I just re-confirmed via call with @MunizaA that ssh-key is correct.

Aug 5 2022, 11:03 AM · SRE, SRE-Access-Requests
diego added a comment to T292955: Requesting access to Analytic Cluster for Muniza.

@Dzahn, there is a -ctr email: maslam-ctr@wikimedia.org , would that solve the problem?

Aug 5 2022, 10:27 AM · SRE, SRE-Access-Requests

Aug 4 2022

diego added a comment to T292955: Requesting access to Analytic Cluster for Muniza.

Thanks @RhinosF1 !

Aug 4 2022, 1:49 PM · SRE, SRE-Access-Requests