Page MenuHomePhabricator

Miriam (Miriam Redi)
User

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Wednesday

  • Clear sailing ahead.

User Details

User Since
Sep 25 2017, 10:36 AM (121 w, 5 h)
Availability
Available
LDAP User
Miriam
MediaWiki User
Miriam (WMF) [ Global Accounts ]

Recent Activity

Fri, Jan 17

Miriam updated subscribers of T242844: Release data from a public health related research conducted by WMF and formal collaborators.

We will met with @JFishback_WMF and let you know of next steps

Fri, Jan 17, 10:49 AM · Security, Privacy, Research, Analytics

Thu, Jan 16

Miriam updated the task description for T242229: Test the feasibility of a classifier trained on Commons categories.
Thu, Jan 16, 1:36 PM · Research (FY2019-20-Research-January-March), artificial-intelligence
Miriam updated subscribers of T242969: A list of meaningful Commons Categories whose images can be used to train image classifiers.
Thu, Jan 16, 1:36 PM · Research (FY2019-20-Research-January-March), artificial-intelligence
Miriam updated subscribers of T242229: Test the feasibility of a classifier trained on Commons categories.
Thu, Jan 16, 1:36 PM · Research (FY2019-20-Research-January-March), artificial-intelligence
Miriam updated subscribers of T242971: A report on accuracy and performance of the classification models .
Thu, Jan 16, 1:36 PM · Research (FY2019-20-Research-January-March), artificial-intelligence
Miriam created T242971: A report on accuracy and performance of the classification models .
Thu, Jan 16, 1:35 PM · Research (FY2019-20-Research-January-March), artificial-intelligence
Miriam created T242970: A set of prototypes of image classifiers trained on images from Commons Categories.
Thu, Jan 16, 1:34 PM · Research (FY2019-20-Research-January-March), artificial-intelligence
Miriam created T242969: A list of meaningful Commons Categories whose images can be used to train image classifiers.
Thu, Jan 16, 1:32 PM · Research (FY2019-20-Research-January-March), artificial-intelligence

Wed, Jan 15

Miriam created T242844: Release data from a public health related research conducted by WMF and formal collaborators.
Wed, Jan 15, 12:16 PM · Security, Privacy, Research, Analytics

Mon, Jan 13

Miriam added a comment to T242600: Submit Citation Usage Camera Ready Paper to the Web Conference 2020.

Weekly update: paper was accepted with good reviews !! there is some work to do to respond to reviewers' comments, which we will do in the next month.

Mon, Jan 13, 4:48 PM · Research (FY2019-20-Research-January-March)
Miriam added a comment to T242598: Organize Wiki Workshop 2020.

Weekly update: sent invitation emails to a few speakers and upcoming submission deadline reminder.

Mon, Jan 13, 4:46 PM · Research (FY2019-20-Research-January-March)
Miriam added a comment to T242229: Test the feasibility of a classifier trained on Commons categories.

Weekly update:

  • downloaded the list of coco-stuff classes which include highly generic categories of people, animals, and things which exist in the visual world: https://github.com/nightrome/cocostuff
  • downloaded the list of categories in Commons, with the counts of the number of images per categories.
  • to create the initial seed of categories we want to consider for object categorization in Commons, I computed fasttext vectors on both COCO categories and Commons Categories, and I am checking what are the commons categories that we can use to represent COCO categories.
Mon, Jan 13, 4:46 PM · Research (FY2019-20-Research-January-March), artificial-intelligence
Miriam added a comment to T242601: Supervise Outreachy Internship on Releasing data dumps for Citation Needed Classifiers.

Weekly update: progress as expected. I had to double check the output of the citation needed models as there were some inconsistencies, which were solved by looking at some of the data. There is need for a text pre-processing pipeline which is close to the one we used for training, otherwise the model might give some unexpected output.

Mon, Jan 13, 4:41 PM · Research (FY2019-20-Research-January-March)
Miriam added a comment to T242603: Investigate Knowledge Gaps in Multimedia.

Weekly Update: none. Team work is on hold until ICWSM deadline.

Mon, Jan 13, 4:39 PM · Research (FY2019-20-Research-January-March)
Miriam added a comment to T242635: Start formal collaboration around the project "Understanding Readers' Image Usage in Wikipedia".

Weekly update: project proposal is ready. Investigating possibilities with Rutgers' university. We might decide to merge collaboration with Rutgers' within the Visual Knowledge Gaps project (T242603). In that case, I will reach out to further contacts starting last week of Jan .

Mon, Jan 13, 4:37 PM · Research (FY2019-20-Research-January-March)
Miriam created T242635: Start formal collaboration around the project "Understanding Readers' Image Usage in Wikipedia".
Mon, Jan 13, 4:32 PM · Research (FY2019-20-Research-January-March)
Miriam created T242603: Investigate Knowledge Gaps in Multimedia.
Mon, Jan 13, 1:04 PM · Research (FY2019-20-Research-January-March)
Miriam moved T228442: Design and implement an API for "citation needed" tag recommendation from FY2019-20-Research-January-March to In Progress on the Research board.
Mon, Jan 13, 12:56 PM · Research
Miriam created T242601: Supervise Outreachy Internship on Releasing data dumps for Citation Needed Classifiers.
Mon, Jan 13, 12:56 PM · Research (FY2019-20-Research-January-March)
Miriam moved T242600: Submit Citation Usage Camera Ready Paper to the Web Conference 2020 from Staged to FY2019-20-Research-January-March on the Research board.
Mon, Jan 13, 12:51 PM · Research (FY2019-20-Research-January-March)
Miriam updated the task description for T212228: Reader citation usage (quantitative).
Mon, Jan 13, 12:51 PM · Research
Miriam created T242600: Submit Citation Usage Camera Ready Paper to the Web Conference 2020.
Mon, Jan 13, 12:51 PM · Research (FY2019-20-Research-January-March)
Miriam updated the task description for T212228: Reader citation usage (quantitative).
Mon, Jan 13, 12:49 PM · Research
Miriam moved T228442: Design and implement an API for "citation needed" tag recommendation from In Progress to FY2019-20-Research-January-March on the Research board.
Mon, Jan 13, 12:48 PM · Research
Miriam moved T242598: Organize Wiki Workshop 2020 from Staged to FY2019-20-Research-January-March on the Research board.
Mon, Jan 13, 12:47 PM · Research (FY2019-20-Research-January-March)
Miriam moved T242229: Test the feasibility of a classifier trained on Commons categories from In Progress to FY2019-20-Research-January-March on the Research board.
Mon, Jan 13, 12:47 PM · Research (FY2019-20-Research-January-March), artificial-intelligence
Miriam created T242598: Organize Wiki Workshop 2020.
Mon, Jan 13, 12:47 PM · Research (FY2019-20-Research-January-March)

Wed, Jan 8

Miriam moved T228441: Design a pipeline for image classification from Staged to In Progress on the Research board.
Wed, Jan 8, 4:28 PM · Research, artificial-intelligence
Miriam moved T242229: Test the feasibility of a classifier trained on Commons categories from Staged to In Progress on the Research board.
Wed, Jan 8, 4:28 PM · Research (FY2019-20-Research-January-March), artificial-intelligence
Miriam updated the task description for T155538: General image classifier for commons.
Wed, Jan 8, 4:23 PM · Wikimedia-Hackathon-2017, Research-Backlog, Wiki-Loves-Monuments (2017), artificial-intelligence, Research ideas, Scoring-platform-team
Miriam updated the task description for T228441: Design a pipeline for image classification.
Wed, Jan 8, 4:22 PM · Research, artificial-intelligence
Miriam created T242229: Test the feasibility of a classifier trained on Commons categories.
Wed, Jan 8, 4:22 PM · Research (FY2019-20-Research-January-March), artificial-intelligence
Miriam updated the task description for T228441: Design a pipeline for image classification.
Wed, Jan 8, 4:19 PM · Research, artificial-intelligence
Miriam updated the task description for T155538: General image classifier for commons.
Wed, Jan 8, 4:17 PM · Wikimedia-Hackathon-2017, Research-Backlog, Wiki-Loves-Monuments (2017), artificial-intelligence, Research ideas, Scoring-platform-team

Thu, Jan 2

Miriam closed T221934: Visualize Wiki Commons Images, a subtask of T215413: Image Classification Working Group, as Resolved.
Thu, Jan 2, 3:26 PM · Reading-Admin, SDC General, Wikidata, Multimedia, Discovery-Search, Analytics, Research
Miriam closed T221934: Visualize Wiki Commons Images as Resolved.

Visualization available at http://wikiview.net

Thu, Jan 2, 3:26 PM · Research
Miriam updated the task description for T221934: Visualize Wiki Commons Images.
Thu, Jan 2, 3:25 PM · Research
Miriam moved T236142: Evaluate existing multimodal learning methods for Wikipedia image recommendation [Master thesis] from Staged to In Progress on the Research board.
Thu, Jan 2, 3:24 PM · Research
Miriam moved T236157: [ImgRec UCU] Evaluation of the basic model from Staged to In Progress on the Research board.
Thu, Jan 2, 3:24 PM · Research
Miriam moved T236156: [ImgRec UCU] Implementation of the basic model for a project from Staged to In Progress on the Research board.
Thu, Jan 2, 3:24 PM · Research
Miriam moved T236159: [ImgRec UCU] Evaluation of improvements from Staged to In Progress on the Research board.
Thu, Jan 2, 3:24 PM · Research
Miriam moved T236158: [ImgRec UCU] Planning and implementation of improvements from Staged to In Progress on the Research board.
Thu, Jan 2, 3:24 PM · Research
Miriam moved T236161: [ImgRec] Thesis wrap up and submission from Staged to In Progress on the Research board.
Thu, Jan 2, 3:24 PM · Research
Miriam closed T221761: Test GPUs with an end-to-end training task (Photo vs Graphics image classifier), a subtask of T148843: Remove computational bottlenecks in stats machine via adding a GPU that can be used to train ML models, as Resolved.
Thu, Jan 2, 3:23 PM · Patch-For-Review, User-Elukey, Operations, Analytics, Research-management
Miriam closed T221761: Test GPUs with an end-to-end training task (Photo vs Graphics image classifier) as Resolved.
Thu, Jan 2, 3:23 PM · Analytics, Research-management
Miriam updated the task description for T228441: Design a pipeline for image classification.
Thu, Jan 2, 3:22 PM · Research, artificial-intelligence
Miriam moved T228442: Design and implement an API for "citation needed" tag recommendation from Staged to In Progress on the Research board.
Thu, Jan 2, 3:22 PM · Research
Miriam closed T228444: Prepare presentation for CSS Summer School as Resolved.

Done, presentation "Machine Vision in Social Computing" available at: https://docs.google.com/presentation/d/1O1Ni40tovcFIh_Btj3qpWFh5Pz8j5Bh_U0u5rqfTbW0/edit?usp=sharing

Thu, Jan 2, 3:21 PM · Research
Miriam reassigned T233707: A system for releasing data dumps from a classifier detecting unsourced sentences in Wikipedia from Miriam to AikoChou.
Thu, Jan 2, 3:17 PM · User-ArielGlenn, Research, Outreachy (Round 19)
Miriam moved T233707: A system for releasing data dumps from a classifier detecting unsourced sentences in Wikipedia from Staged to In Progress on the Research board.
Thu, Jan 2, 3:16 PM · User-ArielGlenn, Research, Outreachy (Round 19)
Miriam moved T199188: [1.2] Research study to understand how readers use citations from Staged to In Progress on the Research board.
Thu, Jan 2, 3:16 PM · Knowledge-Integrity, Research, Epic
Miriam updated subscribers of T228442: Design and implement an API for "citation needed" tag recommendation.

The output of the first stage of this research, as well as the opportunities arisen for collaborations and internships, led to the decision of having an intermediate step for this task.
In collaboration with @AikoChou, an Outreachy intern, we will have a system that can release periodic data dumps exposing sentences in Wikipedia needing citations. This task is tracked at: T233707.

Thu, Jan 2, 3:15 PM · Research
Miriam updated the task description for T228442: Design and implement an API for "citation needed" tag recommendation.
Thu, Jan 2, 3:08 PM · Research
Miriam updated the task description for T228442: Design and implement an API for "citation needed" tag recommendation.
Thu, Jan 2, 3:07 PM · Research
Miriam added a comment to T212228: Reader citation usage (quantitative).

I'll leave this task open until we hear back from the Web Conference

Thu, Jan 2, 3:05 PM · Research
Miriam updated the task description for T212228: Reader citation usage (quantitative).
Thu, Jan 2, 3:04 PM · Research
Miriam closed T227790: Why We Leave Wikipedia Research, a subtask of T212228: Reader citation usage (quantitative), as Declined.
Thu, Jan 2, 3:03 PM · Research
Miriam closed T227790: Why We Leave Wikipedia Research as Declined.
Thu, Jan 2, 3:03 PM · Research
Miriam changed the status of T227790: Why We Leave Wikipedia Research, a subtask of T212228: Reader citation usage (quantitative), from Open to Stalled.
Thu, Jan 2, 3:03 PM · Research
Miriam changed the status of T227790: Why We Leave Wikipedia Research from Open to Stalled.

When started this sub-task as part of our efforts on citation usage studies, we reealised that this is a huge effort that will be spun off to a completely new project

Thu, Jan 2, 3:03 PM · Research

Dec 11 2019

Miriam added a comment to T219903: Keep research.wikipedia.org landing page updated.

@leila @Isaac does this look good? https://commons.wikimedia.org/wiki/File:Wiki_Workshop_2020_banner.png

Dec 11 2019, 11:47 AM · Research

Dec 2 2019

Miriam added a comment to T221761: Test GPUs with an end-to-end training task (Photo vs Graphics image classifier).

Update after changing learning rate and modifying few parameters in the training:

  • we reach 91% accuracy on the validation set (2% improvement compared to previous model) and a significant loss reduction.
  • Manual validation on an external dataset shows improvements on overfitting (80% of new photos and 100% of new graphics are correctly recognized)
  • Problems persist for near-abstract photos and for extremely low-res photos
  • All this with an training time of 18-minutes only for a dataset of 150k images!
Dec 2 2019, 10:36 AM · Analytics, Research-management

Nov 29 2019

Miriam added a comment to T237605: Create kerberos principals for users.

Hi! Can I have access too please? My username is mirrys

elukey@krb1001:~$ sudo manage_principals.py create mirrys --email_address=mredi@wikimedia.org
Principal successfully created.
Successfully sent email to mredi@wikimedia.org

Done! :)

Nov 29 2019, 9:42 AM · Patch-For-Review, Analytics-Kanban, Analytics

Nov 28 2019

Miriam updated subscribers of T237605: Create kerberos principals for users.

Hi! Can I have access too please? My username is mirrys

Nov 28 2019, 6:03 PM · Patch-For-Review, Analytics-Kanban, Analytics

Nov 7 2019

Miriam updated the task description for T221761: Test GPUs with an end-to-end training task (Photo vs Graphics image classifier).
Nov 7 2019, 3:12 PM · Analytics, Research-management
Miriam updated subscribers of T221761: Test GPUs with an end-to-end training task (Photo vs Graphics image classifier).

Finally I managed to do some progress. I built a simple model for testing purposes.

Nov 7 2019, 3:11 PM · Analytics, Research-management

Oct 22 2019

Miriam updated the task description for T236142: Evaluate existing multimodal learning methods for Wikipedia image recommendation [Master thesis].
Oct 22 2019, 12:55 PM · Research
Miriam added a comment to T236142: Evaluate existing multimodal learning methods for Wikipedia image recommendation [Master thesis].

Proposal submitted on September 30th: https://github.com/OlehOnyshchak/WikiImageRecommendation/blob/master/Papers/Project_Proposal/paper.pdf

Oct 22 2019, 11:50 AM · Research
Miriam updated the task description for T236142: Evaluate existing multimodal learning methods for Wikipedia image recommendation [Master thesis].
Oct 22 2019, 11:49 AM · Research
Miriam created T236142: Evaluate existing multimodal learning methods for Wikipedia image recommendation [Master thesis].
Oct 22 2019, 11:40 AM · Research

Oct 21 2019

Miriam closed T236066: Submit Wiki Workshop 2020 proposal to the Web Conference as Resolved.
Oct 21 2019, 4:17 PM · Research
Miriam closed T236067: Submit paper to the Web Conference 2020, a subtask of T212228: Reader citation usage (quantitative), as Resolved.
Oct 21 2019, 1:57 PM · Research
Miriam closed T236067: Submit paper to the Web Conference 2020 as Resolved.
Oct 21 2019, 1:57 PM · Research
Miriam updated the task description for T212228: Reader citation usage (quantitative).
Oct 21 2019, 1:57 PM · Research
Miriam created T236067: Submit paper to the Web Conference 2020.
Oct 21 2019, 1:57 PM · Research
Miriam updated the task description for T212228: Reader citation usage (quantitative).
Oct 21 2019, 1:52 PM · Research
Miriam created T236066: Submit Wiki Workshop 2020 proposal to the Web Conference.
Oct 21 2019, 1:39 PM · Research

Oct 15 2019

Miriam closed T230417: Image Matching API as Resolved.
Oct 15 2019, 12:36 PM · Research, Wikimania-Hackathon-2019
Miriam added a comment to T234519: Your first task: classify sample statements using Citation Needed Models.

Hello, @Miriam! My name is Monique and I don't know python, it will be difficult for me? Thank you in advance!

Oct 15 2019, 12:35 PM · Outreachy (Round 19)

Oct 7 2019

Miriam added a comment to T234606: Your second task: classify statements within an article.

Hi @Samwalton9 ,
Which section title takes precedence? I noticed there's different levels and was wondering about which one to use.

Oct 7 2019, 8:45 AM · Outreachy (Round 19)

Oct 3 2019

Miriam added a comment to T234519: Your first task: classify sample statements using Citation Needed Models.

@Achillesheel02 thanks for the feedback and yes, please send me to my email.
Thanks!

Oct 3 2019, 5:03 PM · Outreachy (Round 19)
Miriam added a comment to T234519: Your first task: classify sample statements using Citation Needed Models.

Hi @Miriam ..can we use python 3.6 for the tasks?

Oct 3 2019, 12:16 PM · Outreachy (Round 19)
Miriam added a comment to T234519: Your first task: classify sample statements using Citation Needed Models.

Also, am a little confused if this is part of the projects for interns,i saw Outreachy (round 9) and jumped right in , kindly guide me through

Oct 3 2019, 11:36 AM · Outreachy (Round 19)
Miriam updated the task description for T234519: Your first task: classify sample statements using Citation Needed Models.
Oct 3 2019, 11:34 AM · Outreachy (Round 19)
Miriam added a comment to T234519: Your first task: classify sample statements using Citation Needed Models.

Thank you for creating this task, @Miriam.
I was able to install the required libraries in my development environment.
I tried looking for sample_data.txt file in the project's GitHub repo, but I'm afraid I can't seem to find it there. The only data file that I can see is test_input_data_sample.txt file.
I'll appreciate a hint on locating this sample_data.txt file.

Oct 3 2019, 11:34 AM · Outreachy (Round 19)
Miriam added a comment to T233709: Onboarding Task: getting familiar with the machine learning models for Citation Need.

Hi @Lucideuclid - many thanks for your interest in this project :)

Oct 3 2019, 10:49 AM · Outreachy (Round 19)
Miriam created T234519: Your first task: classify sample statements using Citation Needed Models.
Oct 3 2019, 10:47 AM · Outreachy (Round 19)

Sep 27 2019

Miriam updated subscribers of T233893: drop CitatitionUsage data on mysql .

Hi @Nuria and @elukey - please feel free to drop this dataset from mysql. @tizianopiccardi -the main user for this data - also confirmed.

Sep 27 2019, 12:48 PM · Analytics-Kanban, Analytics, Analytics-EventLogging

Sep 24 2019

Miriam created T233709: Onboarding Task: getting familiar with the machine learning models for Citation Need.
Sep 24 2019, 11:33 AM · Outreachy (Round 19)

Sep 18 2019

Miriam added a comment to T199736: Help accessing SWAP for research collaborators.

Thanks @elukey! @MGerlach you should now be able to access the notebooks. Thanks!

Sep 18 2019, 1:37 PM · Research
Miriam updated subscribers of T199736: Help accessing SWAP for research collaborators.

Hi @elukey -our new team member, @MGerlach, is having the same issue (he is not able to access SWAP)- see T232707#5502540
Could you please add him to the wmf group as well?

Sep 18 2019, 10:34 AM · Research

Aug 21 2019

Miriam added a comment to T228440: Computer Vision Consultation from Research.

Examples of issues of Google Vision API, related to the fact that their models are probably trained on unevenly distributed data: https://docs.google.com/presentation/d/1qfD3q9Ij79_luAKXNdVvZoPFsUvFQKcvIGmIvWC19z0/edit?usp=sharing

Aug 21 2019, 3:39 PM · Structured-Data-Backlog (Current Work), Research, SDC-Statements (Machine-vision-depicts), Structured-Data-Team-Current-Work
Miriam updated the task description for T228440: Computer Vision Consultation from Research.
Aug 21 2019, 3:35 PM · Structured-Data-Backlog (Current Work), Research, SDC-Statements (Machine-vision-depicts), Structured-Data-Team-Current-Work

Aug 14 2019

Miriam updated subscribers of T225964: [SDC] Build a depicts tag suggestion tool that is powered by machine vision platforms.
Aug 14 2019, 1:49 PM · Structured-Data-Backlog (Current Work), SDC-Statements (Machine-vision-depicts), Structured-Data-Team-Current-Work, Structured Data Engineering

Aug 13 2019

Miriam updated the task description for T229267: Plan for Research team's acitivities during Wikimania 2019.
Aug 13 2019, 2:30 PM · Research-management, Research
Miriam moved T230417: Image Matching API from Backlog to Project Idea on the Wikimania-Hackathon-2019 board.
Aug 13 2019, 2:25 PM · Research, Wikimania-Hackathon-2019
Miriam created T230417: Image Matching API.
Aug 13 2019, 2:24 PM · Research, Wikimania-Hackathon-2019

Aug 7 2019

Miriam added a comment to T229267: Plan for Research team's acitivities during Wikimania 2019.

So Rachel said we are welcome to join the open mic session at the beginning of the hackathon. Each speaker has 1-1.5 minutes to pitch projects " that newcomers can participate in and/or needs significant collaboration from other attendees".
She suggested that, once we have the mini-projects ready, we share them on phab : https://phabricator.wikimedia.org/project/view/3922 and also on the Wikimania Hackathon telegram channel.

Aug 7 2019, 5:37 PM · Research-management, Research

Aug 6 2019

Miriam added a comment to T229267: Plan for Research team's acitivities during Wikimania 2019.

@leila mini-projects for hackathon can be added to the phab workboard for the Hackathon: https://phabricator.wikimedia.org/project/view/3922/. Checking now with Rachel about the pitches.

Aug 6 2019, 12:08 PM · Research-management, Research

Aug 5 2019

Miriam updated the task description for T229267: Plan for Research team's acitivities during Wikimania 2019.
Aug 5 2019, 3:36 PM · Research-management, Research