
Number and proportion of bot edits per project
Open, Needs Triage, Public

Description

As a Wikidata Analytics PM, @Manuel would like to be able to

  • identify bot-maintained projects (e.g. Egyptian-Arabic WP and its astronomical objects, Cebuano WP) so that they can be excluded from the Sitelinks datasets (see: T288611).

Criteria:

  • e.g. bot edits per project over the last month
  • a table with one row per wiki_db and is_bot
  • counting the number of revisions
  • in the previous month.
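
A minimal sketch of the aggregation described by these criteria, assuming the revisions are available as records with `wiki_db`, `is_bot` and an ISO-formatted `timestamp` field. The record layout, the `count_revisions` helper and the sample rows are hypothetical and only illustrate the shape of the table; the real source table and query may look different.

```python
from collections import Counter


def count_revisions(revisions, month):
    """Count revisions per (wiki_db, is_bot) for one month.

    `month` is a "YYYY-MM" prefix; the record fields are assumed
    for illustration, not taken from an actual schema.
    """
    counts = Counter()
    for rev in revisions:
        # Keep only revisions whose timestamp falls in the given month.
        if rev["timestamp"].startswith(month):
            counts[(rev["wiki_db"], rev["is_bot"])] += 1
    return counts


# Made-up sample rows, not real edit data.
sample = [
    {"wiki_db": "arzwiki", "is_bot": True, "timestamp": "2021-07-03"},
    {"wiki_db": "arzwiki", "is_bot": True, "timestamp": "2021-07-15"},
    {"wiki_db": "arzwiki", "is_bot": False, "timestamp": "2021-07-20"},
    {"wiki_db": "cebwiki", "is_bot": True, "timestamp": "2021-07-01"},
    {"wiki_db": "cebwiki", "is_bot": True, "timestamp": "2021-06-30"},
    {"wiki_db": "dewiki", "is_bot": False, "timestamp": "2021-07-09"},
]

table = count_revisions(sample, "2021-07")
for (wiki_db, is_bot), n in sorted(table.items()):
    print(wiki_db, is_bot, n)
```

Each printed line corresponds to one row of the requested table: project, bot flag, and revision count for the month.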

Finally, we will use this information to filter out bot-edited projects from the Sitelinks dataset.

Event Timeline

@Manuel

The following dataset, all data from July 2021:

should be enough to help us resolve our dilemma in relation to T288611.

Columns:

  • wiki - project
  • botEdits - number of bot edits
  • humanEdits - number of human edits
  • botP - percentage of bot edits
  • totalEdits - total number of edits.

We need a project exclusion criterion for the Sitelinks dataset(s): what percentage of bot edits are we willing to accept before we call a project "bot maintained"?
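
To make the question concrete, here is a rough sketch of how such a criterion could be applied once a threshold is agreed on. It reuses the column names `wiki`, `botEdits` and `humanEdits` from the dataset; the helper names, the sample rows and the value of `THRESHOLD_PCT` are placeholders for illustration only, not a proposed cutoff.

```python
def bot_share(bot_edits, human_edits):
    """Return the percentage of bot edits (the botP column)."""
    total = bot_edits + human_edits
    if total == 0:
        # No edits at all: treat the bot share as zero.
        return 0.0
    return 100.0 * bot_edits / total


def bot_maintained(rows, threshold_pct):
    """List the wikis whose bot share is strictly above the threshold."""
    return [
        row["wiki"]
        for row in rows
        if bot_share(row["botEdits"], row["humanEdits"]) > threshold_pct
    ]


# Made-up rows, not values from the July 2021 dataset.
rows = [
    {"wiki": "wiki_a", "botEdits": 900, "humanEdits": 100},
    {"wiki": "wiki_b", "botEdits": 50, "humanEdits": 950},
    {"wiki": "wiki_c", "botEdits": 0, "humanEdits": 0},
]

THRESHOLD_PCT = 80.0  # placeholder only; the actual cutoff is the open question
excluded = bot_maintained(rows, THRESHOLD_PCT)
print(excluded)
```

Whether the comparison should be strict, and whether projects with very few total edits need separate handling, would also have to be decided together with the threshold.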

@GoranSMilovanovic: Per emails from Sep18 and Oct20 and https://www.mediawiki.org/wiki/Bug_management/Assignee_cleanup , I am resetting the assignee of this task because there has not been progress lately (please correct me if I am wrong!). Resetting the assignee avoids the impression that somebody is already working on this task. It also allows others to potentially work towards fixing this task. Please claim this task again when you plan to work on it (via Add Action...Assign / Claim in the dropdown menu) - it would be welcome. Thanks for your understanding!