Page MenuHomePhabricator

Cleaning via bot: Vast bot-lead clean up of files storage system (Lili + Commons)
Open, HighPublic

Description

Bot checking and correcting when needed the following is welcome. Subtasks are:

  • T297676: 1. Commons+LL: Correct filename > ISO code > Rename zho audios into cmn (200 items).
  • T298414: 2. Commons+LL: Correct filename > ISO code > Renaming audios missing their ISO code (2000+ Cantonese items, likely others)
  • T297635: 3. Commons+LL: Correct filename > Rename files with better field separator since - is present in 5% of our account names (all, ~800,000)
  • T298415: 4. Commons+LL: Correct filename > word spellings > De-capitalized common names when possible (likely in 10s of thousands)
  • T298413: 5. Commons: Correct category > Recategorize files on a per speaker basis (all, ~800,000)
Important

Bot will need bot rights on both Wikimedia LinguaLibre + Commons.
Edits should occurs simultaneously on both LinguaLibre + Commons.
All rightful edits should occurs at once. For example, see T297676 .

Suggestions

Use WikiapiJS and WikiapiJS-Eggs, which are easy to code with dual Lili wikibase + Commons wiki-pages edit. So far:

  • Commons: Given category name, list members by filename.
  • Commons: Edit wiki page : replace x by y
  • Commons: Move wiki page : rename x into y
  • LinguaLibre: Edit wikibase property value: given target x (filename), property y, set its to value z.
  • LinguaLibre: Edit wikibase property value: given target x (filename), property label, set its to value z.

Event Timeline

Yug updated the task description. (Show Details)
Yug updated the task description. (Show Details)
Yug updated the task description. (Show Details)
Yug updated the task description. (Show Details)
Yug updated the task description. (Show Details)
Yug updated the task description. (Show Details)
Yug triaged this task as High priority.Jul 6 2022, 11:22 AM
Yug renamed this task from Vast bot-lead clean up of files storage system (Lili + Commons) to Cleaning via bot: Vast bot-lead clean up of files storage system (Lili + Commons).Jul 7 2022, 8:22 PM