diego (Diego S-T)
Senior Research Scientist

User Details

User Since: Aug 8 2017, 10:56 AM (349 w, 14 h)
Availability: Available
LDAP User: Unknown
MediaWiki User: Diego (WMF) [ Global Accounts ]

Recent Activity

Mar 1 2024

diego updated the task description for T341820: Evaluate and improve the Revert Risk model for Wikidata..

Mar 1 2024, 8:01 PM · Research (FY2023-24-Research-April-June)

diego added a comment to T341820: Evaluate and improve the Revert Risk model for Wikidata..

We have improved the model's accuracy. I'm currently working on making the model faster so that it can work in real time.

Mar 1 2024, 7:48 PM · Research (FY2023-24-Research-April-June)

diego moved T357036: References Model: Multilingual Reference Need from Backlog to FY2023-24-Research-January-March on the Research board.

Mar 1 2024, 7:47 PM · Research (FY2023-24-Research-April-June)

Feb 29 2024

diego added a comment to T348298: Add revertrisk-language-agnostic to RecentChanges filters.

@kostajh, to the best of my knowledge @KStoller-WMF is leading this project. We had a meeting in January and I gave my input there. I think other teams that have run community testing processes can say more about this. Technically, we could target a certain precision, which would involve different thresholds per wiki. Using the Knowledge Observatory data it should be easy to compute these numbers; however, maintenance could be hard, so my understanding is that the decision was to go for a single threshold for all wikis.
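To illustrate the per-wiki option mentioned above, here is a minimal sketch of how a threshold could be derived from labeled scores for a target precision. It is not the method used in the project: the function names, the `(wiki, score, label)` row layout, and the sample data are all assumptions made up for this example.

```python
from collections import defaultdict


def threshold_for_precision(scores, labels, target_precision):
    """Return the lowest score threshold whose precision meets the target.

    `labels` are 1 for reverted edits and 0 otherwise. An edit is flagged
    when its score is >= the threshold. Returns None if no threshold works.
    """
    # Walk the edits from the highest score down, tracking cumulative counts.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    best = None
    true_pos = 0
    flagged = 0
    for i, (score, label) in enumerate(ranked):
        flagged += 1
        true_pos += label
        # Only evaluate at the last edit of a run of tied scores.
        is_last_of_tie = i + 1 == len(ranked) or ranked[i + 1][0] != score
        if is_last_of_tie and true_pos / flagged >= target_precision:
            best = score
    return best


def thresholds_per_wiki(rows, target_precision):
    """`rows` is an iterable of (wiki, score, label) tuples (assumed layout)."""
    by_wiki = defaultdict(lambda: ([], []))
    for wiki, score, label in rows:
        by_wiki[wiki][0].append(score)
        by_wiki[wiki][1].append(label)
    return {
        wiki: threshold_for_precision(scores, labels, target_precision)
        for wiki, (scores, labels) in by_wiki.items()
    }


# Made-up labeled scores, for illustration only.
rows = [
    ("wiki_a", 0.95, 1), ("wiki_a", 0.90, 1), ("wiki_a", 0.80, 0), ("wiki_a", 0.60, 1),
    ("wiki_b", 0.99, 1), ("wiki_b", 0.97, 0), ("wiki_b", 0.50, 0),
]
print(thresholds_per_wiki(rows, target_precision=0.75))
# {'wiki_a': 0.6, 'wiki_b': 0.99}
```

The two wikis end up with different thresholds for the same target, which is the maintenance cost that a single shared threshold avoids.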

Feb 29 2024, 1:43 PM · Patch-For-Review, MW-1.42-notes (1.42.0-wmf.16; 2024-01-30), Wikipedia-Android-App-Backlog, Growth-Team, MediaWiki-extensions-ORES, Machine-Learning-Team

Feb 28 2024

diego renamed T357036: References Model: Multilingual Reference Need from reference model build to Rerefences Model: Multilingual Reference Need .

Feb 28 2024, 4:00 PM · Research (FY2023-24-Research-April-June)

Feb 17 2024

diego updated subscribers of T356102: Allow calling revertrisk language agnostic and revert risk multilingual APIs in a pre-save context.

my two cents:

Feb 17 2024, 4:28 PM · Research, Machine-Learning-Team

Feb 9 2024

diego added a comment to T341820: Evaluate and improve the Revert Risk model for Wikidata..

To improve the interaction between structured and text data, I'm experimenting with a full PyTorch approach.

Feb 9 2024, 10:55 PM · Research (FY2023-24-Research-April-June)

Feb 5 2024

diego created T356708: wmfdata.__version__ doesn't exist in wmfdata-python.

Feb 5 2024, 9:27 PM · Data-Engineering, Wmfdata-Python

Jan 24 2024

diego updated subscribers of T349755: Training pipeline for Revert Risk Language Agnostic (RRLA) model.

Jan 24 2024, 4:54 PM · Knowledge-Integrity, Research

Jan 22 2024

diego added a comment to T341819: Explore alternatives for Revert Risk model improvements for Wikipedia.

We are using this task as an umbrella for reporting improvements and our coordination with product teams regarding the Revert Risk models.
Given that the model proved good enough for the Automoderator project, and will also be integrated into the MediaWiki Recent Changes feed (T352217), I think we can resolve this task and report future updates related to revert risk on the epic task, T314384.

Jan 22 2024, 4:47 PM · Research (FY2023-24-Research-October-December)

diego moved T341820: Evaluate and improve the Revert Risk model for Wikidata. from FY2023-24-Research-October-December to FY2023-24-Research-January-March on the Research board.

Jan 22 2024, 4:35 PM · Research (FY2023-24-Research-April-June)

Jan 20 2024

diego added a comment to T341820: Evaluate and improve the Revert Risk model for Wikidata..

We are preparing a new dataset (using diffs) to train the model.
We are experimenting with language models, such as mBERT and LaBSE, to evaluate structured (claims) edits.

Jan 20 2024, 1:56 AM · Research (FY2023-24-Research-April-June)

Jan 18 2024

diego added a comment to T352839: RevertRisk model readiness for temporary accounts.

@MunizaA, until we have enough training data we should treat temporary accounts as anonymous users. In practice this means overwriting the temporary users' features.
So, basically
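The original snippet that followed is not preserved here. As a purely illustrative sketch of the idea (overwriting user features so a temporary account is scored like an anonymous user), something along these lines could work. Every feature name and default value below is hypothetical and is not taken from the actual Revert Risk feature schema.

```python
# HYPOTHETICAL feature names and defaults; the real Revert Risk schema may differ.
ANONYMOUS_USER_DEFAULTS = {
    "user_is_anonymous": True,
    "user_revision_count": 0,
    "user_age_days": 0,
    "user_groups": (),
}


def as_anonymous(features, is_temporary_account):
    """Return features with user fields overwritten for temporary accounts."""
    if not is_temporary_account:
        return features
    # Copy so the caller's dict is left untouched, then overwrite user fields.
    return {**features, **ANONYMOUS_USER_DEFAULTS}


# Made-up input, for illustration only.
edit = {"user_is_anonymous": False, "user_revision_count": 3, "page_length": 1200}
print(as_anonymous(edit, is_temporary_account=True))
```

Non-user features (here `page_length`) pass through unchanged, so only the account-related signal is neutralized.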

Jan 18 2024, 10:18 PM · Research, Moderator-Tools-Team, Temporary accounts, Trust and Safety Product Team, Machine-Learning-Team

Jan 17 2024

diego added a comment to T352839: RevertRisk model readiness for temporary accounts.

Ideally, by the time we are deploying to pilot wikis, the model will understand that revisions made by temp accounts should be scored differently than if those revisions came from full accounts. I am not sure how much you'll be able to do, though, without a lot of real world data of temp account edits?

Jan 17 2024, 8:19 PM · Research, Moderator-Tools-Team, Temporary accounts, Trust and Safety Product Team, Machine-Learning-Team

Dec 22 2023

diego updated subscribers of T341820: Evaluate and improve the Revert Risk model for Wikidata..

We have obtained 590 labels from 540 different revisions. Data is available here.
This is the confusion matrix:

92	28
56	364

This gives the following scores:

	Revert Risk	ORES
Precision:	0.93	0.91
F1:	0.90	0.91
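As a quick consistency check, the Revert Risk precision and F1 can be recomputed from the confusion matrix. The comment does not label the matrix axes, so the sketch below assumes the scikit-learn layout (rows are true classes, columns are predicted classes, ordered negative then positive). Under that assumption the numbers match the Revert Risk column.

```python
# Confusion matrix as reported in the comment.
# ASSUMPTION: scikit-learn layout, i.e. rows are true classes and columns are
# predicted classes, ordered [negative, positive].
matrix = [[92, 28],
          [56, 364]]

(tn, fp), (fn, tp) = matrix

precision = tp / (tp + fp)
recall = tp / (tp + fn)
f1 = 2 * precision * recall / (precision + recall)

print(f"revisions: {tn + fp + fn + tp}")  # 540
print(f"precision: {precision:.2f}")       # 0.93
print(f"recall:    {recall:.2f}")          # 0.87
print(f"F1:        {f1:.2f}")              # 0.90
```

The four cells sum to 540, the number of distinct revisions mentioned above, which suggests the matrix is computed per revision rather than per label.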

Dec 22 2023, 8:44 PM · Research (FY2023-24-Research-April-June)

diego updated the task description for T341820: Evaluate and improve the Revert Risk model for Wikidata..

Dec 22 2023, 8:00 PM · Research (FY2023-24-Research-April-June)

diego updated subscribers of T352839: RevertRisk model readiness for temporary accounts.

Ok! I understand.
Currently, Revert Risk uses several user features. I think the "revision count" could be used as a replacement for the "anonymous" field. However, the most straightforward solution would probably be to replace the "anonymous" column with a "temporary" column.

Dec 22 2023, 6:54 PM · Research, Moderator-Tools-Team, Temporary accounts, Trust and Safety Product Team, Machine-Learning-Team

Dec 18 2023

diego added a comment to T352839: RevertRisk model readiness for temporary accounts.

Hi @kostajh, I'm not sure I understand the question. Are you proposing to add the "user status" (temporary/full) as a feature in Revert Risk?

Dec 18 2023, 3:00 PM · Research, Moderator-Tools-Team, Temporary accounts, Trust and Safety Product Team, Machine-Learning-Team

diego added a comment to T353164: Citations/References ML Proposal - Research & Enterprise.

Few comments:

Dec 18 2023, 2:39 PM · Wikimedia Enterprise - Content Integrity, Research

diego updated the task description for T353164: Citations/References ML Proposal - Research & Enterprise.

Dec 18 2023, 2:31 PM · Wikimedia Enterprise - Content Integrity, Research

Dec 8 2023

diego added a comment to T341819: Explore alternatives for Revert Risk model improvements for Wikipedia.

The Privacy Engineering team has reviewed the model and found no privacy-related concerns.
The patch for adding Revert Risk to the Recent Changes feed has been merged. This enables the option of integrating Revert Risk into MediaWiki. Now I'm working on finding adequate thresholds for RR scores (T351897) in order to add the corresponding MediaWiki tags.

Dec 8 2023, 3:45 PM · Research (FY2023-24-Research-October-December)

diego added a comment to T341820: Evaluate and improve the Revert Risk model for Wikidata..

Currently we have around 200 labels. WMDE is helping to increase this number.
We are preparing a new dataset for training the RR Wikidata model.

Dec 8 2023, 3:36 PM · Research (FY2023-24-Research-April-June)

Dec 1 2023

diego added a comment to T341820: Evaluate and improve the Revert Risk model for Wikidata..

No updates this week.

Dec 1 2023, 4:34 PM · Research (FY2023-24-Research-April-June)

diego added a comment to T341819: Explore alternatives for Revert Risk model improvements for Wikipedia.

The Moderation Tools team is running tests and community discussions to implement the Automoderator project. We are coordinating with them to learn about potential areas of improvement for RR.
There have been some community initiatives to evaluate the quality of the RR models. In T336934, a group of rowiki editors manually labeled a set of risky revisions. We have analyzed these results, which show reasonably good performance.
The ML team is working on integrating RRLA into the Recent Changes feed (T348298). We are working on defining the best thresholds for this integration (T351897).
We have been working with Wikimedia Enterprise to clarify some doubts about the RRLA model (T346095).

Dec 1 2023, 4:21 PM · Research (FY2023-24-Research-October-December)

Nov 24 2023

diego added a comment to T341820: Evaluate and improve the Revert Risk model for Wikidata..

I have presented the Revert Risk model for Wikidata and the Annotool at the WikiProject LD4 Wikidata gathering.
We have started collecting new annotations on the second Wikidata labeling. The campaign is available here. @Lydia_Pintscher is helping us to find more annotators (thanks!).

Nov 24 2023, 6:26 PM · Research (FY2023-24-Research-April-June)

Nov 23 2023

diego added a comment to T336934: Enable communities to configure automated reversion of bad edits.

In T336934#9355952, @Strainu wrote:

In T336934#9350484, @diego wrote:

Great. Having some manual labels is always valuable.
I did a quick check and saw that there are a few cases where the RR scores are not higher than 0.93. For example, this one:

That is very weird. I checked the code and a few of the pages identified in a newer run and did not see any mismatch. Is it possible for the score to change? I know in ores it was possible in certain conditions.

I can't think of a case where this is possible, but I'll have a look.

Anyhow, I've done some cleaning, and merged the datasets, and then I've computed some scores:

These scores seem to be based on the prediction, not the score returned by the algorithm, so they seem a bit useless in the context of a reverter - the community will almost certainly not accept a 53% success rate. Can you advise on why you chose these and not the score-based results, which seem better?

I've done both; you can find them in the Jupyter notebook. In summary, the precision is very similar (almost identical) to that of ORES rowiki-damaging.

Nov 23 2023, 9:06 PM · Automoderator, Epic, Moderator-Tools-Team

diego added a subtask for T314384: Develop a ML-based service to predict reverts on Wikipedia(s): T351897: Set the thresholds Revert Risk models to be used on the Recent Changes Feed (via ORES Extension).

Nov 23 2023, 3:36 PM · Machine-Learning-Team, Research, Epic

diego added a parent task for T351897: Set the thresholds Revert Risk models to be used on the Recent Changes Feed (via ORES Extension): T314384: Develop a ML-based service to predict reverts on Wikipedia(s).

Nov 23 2023, 3:36 PM · Research

diego set Due Date to Dec 13 2023, 11:00 PM on T351897: Set the thresholds Revert Risk models to be used on the Recent Changes Feed (via ORES Extension).

Nov 23 2023, 3:34 PM · Research

diego created T351897: Set the thresholds Revert Risk models to be used on the Recent Changes Feed (via ORES Extension).

Nov 23 2023, 3:33 PM · Research

Nov 21 2023

diego renamed T343064: Expand types of edits for Wikidata revert risk model from Expand edit types for Wikidata revert risk model to Expand types of edits for Wikidata revert risk model.

Nov 21 2023, 6:58 PM · Research

diego updated subscribers of T346095: Investigate the tangible differences between the results from goodfaith vs revertrisk.

Older revisions.

Concerns around our understanding of the limitations of the model for older edits given the training window. If there is a user that is looking at taking a full snapshot of either our current corpus of Wikipedia, or a past version, both include revisions from a broader window of time than the training window specifically and may show “latent” bad revisions that either perform differently with the LiftWing model or are uncaught.

I am curious what you may recommend to evaluate older content that could be vandalized without us knowing due to a lack of revisions/content attention by editors.

I'm not completely sure I understand your question. What I can say is that any model will have some time drift, and that includes RR and ORES. I think the model's precision would decay if we used it on very old data, but it probably tends to a certain limit (I would assume that the same is true for ORES, and that model is probably already working close to its boundaries). The Language Agnostic model shouldn't be difficult to run on a large old dataset. I understand that @fkaelin and @Pablo have been working on running the model on large data, so if you have a specific question to be answered, the four of us could try to design an experiment to answer it.

Performance on different types of pages. You already addressed this in part, but what I mean by different page types isn’t necessarily subject-related (though cross-language data is helpful as well) but instead based on the metadata of the page.

How does the model typically perform on revisions in pages with low/high pageviews, low/high amounts of content, more/less edits, etc. This is less critical for our use-case, but we are imagining cases where a user may want to create their own filtering system based on their tolerance for risk and may want an approach that divides article approaches based on metadata.

Let us know if there are potential low-risk exercises we can collaborate on to subsect the data.

I don't have such statistics. Maybe the Knowledge Integrity Observatory has some data to answer this (@Pablo?).

What to know that we do not know.

This is what I was trying to pull on with the question on use. If ORES had fallen out of style among some users and/or grown in use with others- why? If we can understand points of friction with use (Usability? Performance? Different approach needed?) it will help us integrate learnings as we design similar features (credibility signals, etc.).

I don't think there is a clear pattern here. I think the adoption/attrition of these tools is opportunistic, in the sense that ppl use them according to their needs. With no other options, ppl use what is available. And even with more tools available, developers use what fits best in their workflows, or even what they have seen working in the past. Unless there is a dramatic difference in accuracy between models, I don't think differences in model quality are easy for developers to assess.
The attrition is probably related to the (lack of) success of the tools built on the ML models, and not directly to the models themselves (although model quality probably has an impact on the tools' success).

Nov 21 2023, 6:30 PM · Wikimedia Enterprise - Content Integrity, Wikimedia Enterprise

diego added a comment to T336934: Enable communities to configure automated reversion of bad edits.

Great. Having some manual labels is always valuable.
I did a quick check and saw that there are a few cases where the RR scores are not higher than 0.93. For example, this one:

Nov 21 2023, 6:27 PM · Automoderator, Epic, Moderator-Tools-Team

Nov 14 2023

diego added a comment to T336934: Enable communities to configure automated reversion of bad edits.

Hi @Strainu, Diego here from the WMF Research team.

Nov 14 2023, 12:05 AM · Automoderator, Epic, Moderator-Tools-Team

Nov 13 2023

diego added a comment to T349410: Document and curate the learnings from executing the existing roadmap.

We would also include @cwylo !

Nov 13 2023, 11:40 PM · Research (FY2023-24-Research-April-June)

Nov 6 2023

diego updated subscribers of T346095: Investigate the tangible differences between the results from goodfaith vs revertrisk.

Hi, Diego here from Research. I'll take some of the questions you raised:

Nov 6 2023, 11:15 PM · Wikimedia Enterprise - Content Integrity, Wikimedia Enterprise

diego added a comment to T348298: Add revertrisk-language-agnostic to RecentChanges filters.

Nov 6 2023, 2:20 PM · Patch-For-Review, MW-1.42-notes (1.42.0-wmf.16; 2024-01-30), Wikipedia-Android-App-Backlog, Growth-Team, MediaWiki-extensions-ORES, Machine-Learning-Team

Oct 31 2023

diego added a comment to T350061: [Annotool] Errors loading edits.

@MunizaA got it. Would this mean creating another project?

Oct 31 2023, 2:46 PM · Research

Oct 30 2023

diego created T350061: [Annotool] Errors loading edits.

Oct 30 2023, 4:49 PM · Research

Oct 27 2023

diego added a comment to T341820: Evaluate and improve the Revert Risk model for Wikidata..

Weekly updates

Oct 27 2023, 8:28 PM · Research (FY2023-24-Research-April-June)

diego added a comment to T349755: Training pipeline for Revert Risk Language Agnostic (RRLA) model.

Weekly Updates

Oct 27 2023, 8:26 PM · Knowledge-Integrity, Research

Oct 26 2023

diego added a comment to T349739: [Annotool] Include additional information on private project exports.

In T349739#9283232, @Aklapper wrote:

@diego: Please add at least one project tag to tasks, so other people can also find this task when searching via projects. Please set appropriate project tags when possible. Thanks!

Oct 26 2023, 2:58 PM · Research

diego added a comment to T349295: Determine technical approach for Automoderator edit revert component.

@jsn.sherman this might be of your interest: T338792

Oct 26 2023, 2:40 PM · MediaWiki-Platform-Team (Radar), Moderator-Tools-Team (Kanban), Spike, Automoderator

Oct 25 2023

diego added a subtask for T314384: Develop a ML-based service to predict reverts on Wikipedia(s): T349755: Training pipeline for Revert Risk Language Agnostic (RRLA) model.

Oct 25 2023, 7:56 PM · Machine-Learning-Team, Research, Epic

diego added a parent task for T349755: Training pipeline for Revert Risk Language Agnostic (RRLA) model: T314384: Develop a ML-based service to predict reverts on Wikipedia(s).

Oct 25 2023, 7:56 PM · Knowledge-Integrity, Research

diego created T349755: Training pipeline for Revert Risk Language Agnostic (RRLA) model.

Oct 25 2023, 7:55 PM · Knowledge-Integrity, Research

diego added a subtask for T344016: Improvements to Annotool: T349739: [Annotool] Include additional information on private project exports.

Oct 25 2023, 4:39 PM · Research

diego added a parent task for T349739: [Annotool] Include additional information on private project exports: T344016: Improvements to Annotool.

Oct 25 2023, 4:39 PM · Research

diego created T349739: [Annotool] Include additional information on private project exports.

Oct 25 2023, 4:38 PM · Research

diego added a project to T341820: Evaluate and improve the Revert Risk model for Wikidata.: Epic.

Oct 25 2023, 4:35 PM · Research (FY2023-24-Research-April-June)

diego added a comment to T348666: Add randomization to the revision order showed in Annotool.

This feature is working correctly. From my perspective this task can be marked solved.

Oct 25 2023, 4:34 PM · Research

diego updated the task description for T344016: Improvements to Annotool.

Oct 25 2023, 4:33 PM · Research

Oct 24 2023

diego created T349649: Update links for Research Team meetings to enable "Join by phone" from outside of USA.

Oct 24 2023, 4:20 PM · Research, Research-management

diego added a comment to T219903: Keep research.wikimedia.org landing page updated.

looks good. Thanks.

Oct 24 2023, 1:53 PM · Patch-For-Review, periodic-update, Research

Oct 23 2023

diego updated the task description for T341820: Evaluate and improve the Revert Risk model for Wikidata..

Oct 23 2023, 8:57 PM · Research (FY2023-24-Research-April-June)

Oct 17 2023

diego added a comment to T219903: Keep research.wikimedia.org landing page updated.

Hi @DDeSouza, please add the following papers:

Oct 17 2023, 2:35 PM · Patch-For-Review, periodic-update, Research

Oct 11 2023

diego triaged T348666: Add randomization to the revision order showed in Annotool as High priority.

Oct 11 2023, 3:24 PM · Research

diego added a subtask for T344016: Improvements to Annotool: T348666: Add randomization to the revision order showed in Annotool.

Oct 11 2023, 3:21 PM · Research

diego added a parent task for T348666: Add randomization to the revision order showed in Annotool: T344016: Improvements to Annotool.

Oct 11 2023, 3:21 PM · Research

diego created T348666: Add randomization to the revision order showed in Annotool.

Oct 11 2023, 3:21 PM · Research

Oct 10 2023

diego added a comment to T340427: Check home/HDFS leftovers of paramd.

Hi! The standard archival process works well. Thanks!

Oct 10 2023, 12:15 AM · Data-Platform-SRE

Oct 6 2023

diego added a comment to T341820: Evaluate and improve the Revert Risk model for Wikidata..

Weekly Updates

Oct 6 2023, 8:07 PM · Research (FY2023-24-Research-April-June)

diego updated subscribers of T341819: Explore alternatives for Revert Risk model improvements for Wikipedia.

Weekly Updates

Oct 6 2023, 8:05 PM · Research (FY2023-24-Research-October-December)

Oct 5 2023

diego added a comment to T347136: Review Revert Risk reports from WME.

In T347136#9225891, @prabhat wrote:

In the last 50 hours, we haven't seen any "Unsupported lang" issue.
Thanks for fixing this.

Oct 5 2023, 3:44 PM · Machine-Learning-Team

diego updated the task description for T348264: Review WWW'24 Sumbission .

Oct 5 2023, 3:26 PM · Research-outreach

diego created T348264: Review WWW'24 Sumbission .

Oct 5 2023, 3:25 PM · Research-outreach

Sep 27 2023

diego added a comment to T347330: Expand language support for Revert Risk Model.

Let's use the updated CSV for now. Later, let's coordinate with @fkaelin to periodically update these values, for both the RRLA and Article quality models.

Sep 27 2023, 2:17 AM · Machine-Learning-Team, Research

Sep 15 2023

diego added a comment to T341820: Evaluate and improve the Revert Risk model for Wikidata..

Weekly Updates

Sep 15 2023, 10:23 PM · Research (FY2023-24-Research-April-June)

diego added a comment to T341819: Explore alternatives for Revert Risk model improvements for Wikipedia.

Weekly Updates

Sep 15 2023, 10:18 PM · Research (FY2023-24-Research-October-December)

Sep 12 2023

diego added a comment to T344016: Improvements to Annotool.

@MunizaA, could we please add an action to finish a project? By "finish" I mean keeping the project data but no longer showing the project in the front end.

Sep 12 2023, 6:05 PM · Research

diego updated the task description for T344016: Improvements to Annotool.

Sep 12 2023, 6:04 PM · Research

diego added a comment to T341820: Evaluate and improve the Revert Risk model for Wikidata..

Sure, the data is public (we just remove the labeler username). As I mentioned in the previous comment the amount of data is pretty low, but you can find it here.

Sep 12 2023, 5:55 PM · Research (FY2023-24-Research-April-June)

Sep 8 2023

diego added a comment to T341819: Explore alternatives for Revert Risk model improvements for Wikipedia.

Weekly Updates

Sep 8 2023, 8:01 PM · Research (FY2023-24-Research-October-December)

diego added a comment to T341820: Evaluate and improve the Revert Risk model for Wikidata..

Weekly Updates

Sep 8 2023, 8:00 PM · Research (FY2023-24-Research-April-June)

Sep 1 2023

diego added a comment to T341819: Explore alternatives for Revert Risk model improvements for Wikipedia.

Weekly Updates

Sep 1 2023, 7:19 PM · Research (FY2023-24-Research-October-December)

diego added a comment to T341820: Evaluate and improve the Revert Risk model for Wikidata..

Weekly Updates

Sep 1 2023, 7:16 PM · Research (FY2023-24-Research-April-June)

Aug 25 2023

diego added a comment to T341820: Evaluate and improve the Revert Risk model for Wikidata..

Weekly Updates

Aug 25 2023, 11:46 AM · Research (FY2023-24-Research-April-June)

diego added a comment to T341819: Explore alternatives for Revert Risk model improvements for Wikipedia.

Weekly Updates

Aug 25 2023, 11:43 AM · Research (FY2023-24-Research-October-December)

Aug 22 2023

diego updated the task description for T344016: Improvements to Annotool.

Aug 22 2023, 4:53 PM · Research

diego added a comment to T344537: Fast Vandalism Detection .

Revert Risk Language Agnostic should do the job; its inference time is below 200 ms.

Aug 22 2023, 4:35 PM · Machine-Learning-Team

Aug 21 2023

diego added a comment to T341820: Evaluate and improve the Revert Risk model for Wikidata..

Hi @emwille! Sure, I'll be happy to present. I'll send you an email to coordinate. Thanks!

Aug 21 2023, 4:31 PM · Research (FY2023-24-Research-April-June)

diego updated subscribers of T341819: Explore alternatives for Revert Risk model improvements for Wikipedia.

Weekly Updates

Aug 21 2023, 11:14 AM · Research (FY2023-24-Research-October-December)

diego added a comment to T341820: Evaluate and improve the Revert Risk model for Wikidata..

Weekly Updates

Aug 21 2023, 11:10 AM · Research (FY2023-24-Research-April-June)

diego updated the task description for T341820: Evaluate and improve the Revert Risk model for Wikidata..

Aug 21 2023, 11:08 AM · Research (FY2023-24-Research-April-June)

Aug 15 2023

diego updated the task description for T344016: Improvements to Annotool.

Aug 15 2023, 5:58 PM · Research

diego added a comment to T344016: Improvements to Annotool.

Thanks for the input. To make the annotations as useful as possible for training the algorithm, it would be good to have labels that are not too specific and that generalize as much as possible. Do you think that something like "long-term vandalism" or "hijacked item" could be a good name for the phenomenon you are describing?

Aug 15 2023, 5:56 PM · Research

diego closed T290008: Autocomplete is very slow (unusable) in Newpyter as Resolved.

Aug 15 2023, 5:40 PM · Data-Platform-SRE, Data-Engineering-Jupyter, Data-Engineering

diego added a comment to T290008: Autocomplete is very slow (unusable) in Newpyter .

@BTullis , yes, this has been solved in the current environment, thanks!

Aug 15 2023, 5:40 PM · Data-Platform-SRE, Data-Engineering-Jupyter, Data-Engineering

Aug 11 2023

diego added a comment to T341819: Explore alternatives for Revert Risk model improvements for Wikipedia.

Weekly Updates

Aug 11 2023, 4:21 PM · Research (FY2023-24-Research-October-December)

diego added a comment to T341820: Evaluate and improve the Revert Risk model for Wikidata..

Weekly Updates

Aug 11 2023, 4:17 PM · Research (FY2023-24-Research-April-June)

diego added a comment to T340427: Check home/HDFS leftovers of paramd.

Would you like me to move this in bulk to a new directory within your home, such as: /home/dsaez/paramd-archive

This sounds good, and it is enough!

Aug 11 2023, 4:10 PM · Data-Platform-SRE

diego updated the task description for T344016: Improvements to Annotool.

Aug 11 2023, 5:59 AM · Research

diego added a comment to T344016: Improvements to Annotool.

In T344016#9085650, @Huntster wrote:

In the description you list "Remove list of languages". Instead of removing the drop-down box, can it not instead be made functional, i.e. only display tasks in the chosen language? For example, I'm only proficient in English, so it is likely better for me to focus on tasks in English, rather than additionally displaying Spanish, German, etc. Being able to choose which language these training tasks are from would be useful.

Aug 11 2023, 5:59 AM · Research