Page MenuHomePhabricator
Feed Advanced Search

Thu, Sep 23

Pablo added a comment to T288340: Define expectations with the stakeholder(s).

Weekly updates:

Thu, Sep 23, 4:10 PM · Research (FY2021-22-Research-July-Sept)
Pablo added a comment to T288339: Formalize the problem space and create a dataset of spambots activities.

Weekly updates:

  • Three meetings have been held (two with T&S staff and one with a steward) to identify why few spambots had cross-wiki edits: most activity of spambots might be stored as hits in the logs of AbuseFilter or Spamblacklist.
  • These conversations also led to formalizing the problem of characterizing spambots as the characterizing whether a URL/domain is related to spambot-driven activities.
  • The problem of missing data on deleted revisions has been reviewed with research engineering and more examples will be analyzed to better identify how these contents are handled in our databases.
Thu, Sep 23, 4:08 PM · Research (FY2021-22-Research-July-Sept)

Tue, Sep 21

Pablo closed T274250: Review for the ICWSM Workshop on Data for Most Vulnerable as Resolved.
Tue, Sep 21, 4:06 PM · Research

Fri, Sep 17

Pablo added a comment to T288339: Formalize the problem space and create a dataset of spambots activities.

Weekly updates:

Fri, Sep 17, 3:30 PM · Research (FY2021-22-Research-July-Sept)
Pablo added a comment to T288340: Define expectations with the stakeholder(s).

Weekly updates:

Fri, Sep 17, 3:24 PM · Research (FY2021-22-Research-July-Sept)

Mon, Sep 13

Pablo updated subscribers of T288339: Formalize the problem space and create a dataset of spambots activities.
Mon, Sep 13, 3:55 PM · Research (FY2021-22-Research-July-Sept)

Fri, Sep 10

Pablo added a comment to T288339: Formalize the problem space and create a dataset of spambots activities.

Weekly updates:

Fri, Sep 10, 2:47 PM · Research (FY2021-22-Research-July-Sept)
Pablo added a comment to T288340: Define expectations with the stakeholder(s).

Weekly updates:

Fri, Sep 10, 2:47 PM · Research (FY2021-22-Research-July-Sept)

Wed, Sep 8

Pablo updated subscribers of T290547: Research Showcase November 2021.
Wed, Sep 8, 9:20 AM · Research (FY2021-22-Research-Oct-Dec)
Pablo created T290547: Research Showcase November 2021.
Wed, Sep 8, 7:01 AM · Research (FY2021-22-Research-Oct-Dec)

Fri, Sep 3

Pablo added a comment to T288339: Formalize the problem space and create a dataset of spambots activities.

Weekly updates:

Fri, Sep 3, 4:12 PM · Research (FY2021-22-Research-July-Sept)
Pablo added a comment to T288340: Define expectations with the stakeholder(s).

Weekly updates:

Fri, Sep 3, 4:12 PM · Research (FY2021-22-Research-July-Sept)
Pablo closed T287232: Co-organize a session for Wikimania 2021 (collective track) as Resolved.
Fri, Sep 3, 3:04 PM · Research (FY2021-22-Research-July-Sept)

Aug 20 2021

Pablo added a comment to T288339: Formalize the problem space and create a dataset of spambots activities.

Weekly updates:

  • The initial list of spambots has been extended with stats around the edit count per existing wiki.
  • A preliminary data analysis has been conducted revealing that only a minor fraction of globally locked editors as spambots had generated cross-wiki activity.
Aug 20 2021, 7:01 PM · Research (FY2021-22-Research-July-Sept)
Pablo added a comment to T288340: Define expectations with the stakeholder(s).

Weekly updates:

  • Stats on the number of globally locked editors over time have been added to the dashboard.
  • The 1st Moderator Tools monthly meeting was held and the current dashboard was showcased.
Aug 20 2021, 7:00 PM · Research (FY2021-22-Research-July-Sept)

Aug 13 2021

Pablo added a comment to T288340: Define expectations with the stakeholder(s).

Weekly updates:

  • A dataset has been created with monthly stats per wiki (number of edits, number of active editors, ratio of edits by special editors/bots/anonymous users, ratio of minor edits, ratio of edits that are reverts, ratio of edits that the namespace of the page is categorized as content, average number of seconds elapsed since the previous revision made on the current page)
  • A dashboard has been built to expose the above metrics
Aug 13 2021, 2:22 PM · Research (FY2021-22-Research-July-Sept)

Aug 6 2021

Pablo added a comment to T288340: Define expectations with the stakeholder(s).

Weekly updates:

  • Brainstorm of data signals with the Moderator Tools team
Aug 6 2021, 11:01 AM · Research (FY2021-22-Research-July-Sept)
Pablo added a comment to T288339: Formalize the problem space and create a dataset of spambots activities.

Weekly updates:

Aug 6 2021, 11:01 AM · Research (FY2021-22-Research-July-Sept)
Pablo removed a project from T288339: Formalize the problem space and create a dataset of spambots activities: Epic.
Aug 6 2021, 11:00 AM · Research (FY2021-22-Research-July-Sept)
Pablo removed a project from T288340: Define expectations with the stakeholder(s): Epic.
Aug 6 2021, 11:00 AM · Research (FY2021-22-Research-July-Sept)
Pablo edited projects for T288339: Formalize the problem space and create a dataset of spambots activities, added: Research (FY2021-22-Research-July-Sept); removed Research.
Aug 6 2021, 10:59 AM · Research (FY2021-22-Research-July-Sept)
Pablo created T288340: Define expectations with the stakeholder(s).
Aug 6 2021, 9:58 AM · Research (FY2021-22-Research-July-Sept)
Pablo created T288339: Formalize the problem space and create a dataset of spambots activities.
Aug 6 2021, 9:58 AM · Research (FY2021-22-Research-July-Sept)
Pablo created T288338: [EPIC] Spambot Detection System to Support Stewards.
Aug 6 2021, 9:37 AM · Epic, Research
Pablo created T288337: [EPIC] Wikipedia Knowledge Integrity Risk Observatory.
Aug 6 2021, 9:27 AM · Epic, Research
Pablo added a comment to T287232: Co-organize a session for Wikimania 2021 (collective track).

Information about the session is already on the program https://wikimania.wikimedia.org/wiki/2021:Submissions/Indicators_for_the_Wikimedia_Projects

Aug 6 2021, 7:41 AM · Research (FY2021-22-Research-July-Sept)

Jul 27 2021

Pablo assigned T287232: Co-organize a session for Wikimania 2021 (collective track) to marcmiquel.
Jul 27 2021, 9:16 AM · Research (FY2021-22-Research-July-Sept)

Jul 23 2021

Pablo updated the task description for T287232: Co-organize a session for Wikimania 2021 (collective track).
Jul 23 2021, 9:02 AM · Research (FY2021-22-Research-July-Sept)
Pablo created T287232: Co-organize a session for Wikimania 2021 (collective track).
Jul 23 2021, 8:48 AM · Research (FY2021-22-Research-July-Sept)

May 12 2021

Pablo renamed T282657: Adding data from centralauth to the lake and the mediawiki_history dataset from Add global locks to mediawiki_history to Adding data from centralauth to the lake and the mediawiki_history dataset.
May 12 2021, 11:06 AM · Research, Analytics
Pablo added a comment to T282657: Adding data from centralauth to the lake and the mediawiki_history dataset.

Thanks @JAllemandou! I confirm that these data is stored in the centralauth DB, so I am happy to update the title of this task accordingly :)

May 12 2021, 11:05 AM · Research, Analytics
Pablo created T282657: Adding data from centralauth to the lake and the mediawiki_history dataset.
May 12 2021, 8:42 AM · Research, Analytics

Feb 18 2021

Pablo added a comment to T274631: Requesting access to Analytic Cluster for Research Scientist (Paragon).

It worked! Thanks @MoritzMuehlenhoff @Ottomata :)

Feb 18 2021, 1:36 PM · SRE, SRE-Access-Requests

Feb 17 2021

Pablo added a comment to T274631: Requesting access to Analytic Cluster for Research Scientist (Paragon).

I have managed to set up the config and ssh in but I am not able to connect to JupyterLab (https://wikitech.wikimedia.org/wiki/Analytics/Systems/Jupyter#JupyterLab) using my WikiTech credentials ("Invalid username or password").

Feb 17 2021, 6:00 PM · SRE, SRE-Access-Requests

Feb 12 2021

Pablo renamed T274631: Requesting access to Analytic Cluster for Research Scientist (Paragon) from Requesting access to Analytic Cluster for Research Intern (Paragon) to Requesting access to Analytic Cluster for Research Scientist (Paragon).
Feb 12 2021, 1:45 PM · SRE, SRE-Access-Requests
Pablo updated the task description for T274631: Requesting access to Analytic Cluster for Research Scientist (Paragon).
Feb 12 2021, 1:11 PM · SRE, SRE-Access-Requests
Pablo created T274631: Requesting access to Analytic Cluster for Research Scientist (Paragon).
Feb 12 2021, 11:44 AM · SRE, SRE-Access-Requests

Feb 9 2021

Pablo renamed T274250: Review for the ICWSM Workshop on Data for Most Vulnerable from Review for the ICWSM Workshop on Data for Most Vulnerable (2nd Ed) to Review for the ICWSM Workshop on Data for Most Vulnerable.
Feb 9 2021, 1:32 PM · Research
Pablo created T274250: Review for the ICWSM Workshop on Data for Most Vulnerable.
Feb 9 2021, 1:31 PM · Research

Feb 2 2021

Pablo updated Pablo.
Feb 2 2021, 1:35 PM