Page MenuHomePhabricator

Labeling some bots (active in Git/Gerrit) as bots
Closed, ResolvedPublic

Description

To improve the results of some queries, we need to label bots as such. This is done in the Sorting Hat database, which stores identities, by setting the field profiles.is_bot to 1. We should at least label all bots that are known to us to be bots.

Event Timeline

jgbarah claimed this task.
jgbarah raised the priority of this task from to High.
jgbarah updated the task description. (Show Details)
jgbarah added subscribers: Acs, Dicortazar, Aklapper, Qgil.

After looking at the available information, the current list of identities labeled as "bot" is:

  • l10n-bot, l10n-bot@translatewiki.net
  • jenkins-bot, NULL
  • "Wikimedia Jenkins Bot", jenkins-slave@gallium.wikimedia.org
  • jenkins-bot, jenkins-bot@wikimedia.org
  • wmf-jenkins-bot, NULL
  • mw-jenkinsbot, NULL
  • jenkins-bot,jenkins-bot@gerrit. wikimedia.org

Second field is the email address declared by the bot, not always available. Closing the task for now. Please, feel free to reopen if you notice new bots, or some error in this list.

Warning: right now maybe not all queries consider this field (is_bot) as they should, but most already do, and in a short time all should do.

Aklapper renamed this task from Labeling some bots as bots to Labeling some bots (active in Git/Gerrit) as bots.Aug 27 2015, 8:23 PM
Aklapper set Security to None.
Aklapper moved this task from Backlog to Doing on the ECT-August-2015 board.
Aklapper moved this task from Backlog to Doing on the wikimedia.biterg.io board.

Wondering if the list in korma of bots can be automatically updated by pulling data from that very link that Lego provided.
Even if it cannot, providing some basic info where that list of bots is stored by korma and how (often) it is updated is welcome so we can put this on https://www.mediawiki.org/wiki/Community_metrics

The list of bots is currently maintained in the Sorting Hat (identities) database, in table profiles. If the field "is_bot" is 1, the identity is considered a bot. Otherwise, that field is 0.

Unfortunately, except for changing the database, there is no other way (for now) of tagging an identity as a bot.

Added Contributors | Bots subsection at the Community Metrics wiki with this info.

From the list in Group Non-Interactive Users, I tagged gerritpatchuploader@gmail.com, yuvipanda+suchabot@gmail.com, and wikidata-services@wikimedia.de as bots. I couldn't find jdlrobson+frankie@gmail.com in our database. Maybe it didn't act?

I couldn't find @Jdlrobson+frankie@gmail.com in our database. Maybe it didn't act?

Probably, might have been just an experiment.

I couldn't find @Jdlrobson+frankie@gmail.com in our database. Maybe it didn't act?

Probably, might have been just an experiment.

OK, In that case, I suggest that we close this task as done.