Page MenuHomePhabricator

Avoid indexing of local "copies" of the central user page
Open, NormalPublic

Description

I noticed today that user pages from Meta-Wiki are also indexed from seemingly random wikis, possibly all. e.g. https://sw.wikibooks.org/wiki/Mtumiaji:Krinkle is a result in Google on the second page for my username (not higher than the version from Meta-Wiki, but still).

We should probably mark these as Noindex and/or ensure the Canonical link header is set to the central version?

Event Timeline

Krinkle created this task.Oct 1 2017, 2:24 AM
Restricted Application added a subscriber: Aklapper. · View Herald TranscriptOct 1 2017, 2:24 AM
Krinkle renamed this task from Avoid indexing local "copies" of the central user page to Avoid indexing of local "copies" of the central user page.Oct 1 2017, 2:24 AM

There is a similar task, which proposes that __NOINDEX__ on the gobal user page should also be applied on local copies: T90475.

Setting the canonical link should be easy enough. Do we also need to mark these as no index in addition to that?

Change 381882 had a related patch set uploaded (by Legoktm; owner: Legoktm):
[mediawiki/extensions/GlobalUserPage@master] Set canonical URL of remote pages to central page

https://gerrit.wikimedia.org/r/381882

Krinkle triaged this task as Normal priority.Oct 3 2017, 6:10 PM

Change 381882 merged by jenkins-bot:
[mediawiki/extensions/GlobalUserPage@master] Set canonical URL of remote pages to central page

https://gerrit.wikimedia.org/r/381882

@Krinkle: My global userpage on Wikimedia Commons still appears in Google results, even with the text "… you see on this page was copied from". Is this an intentional exception by the Commons community?

Hmmm....

Is this an intentional exception by the Commons community?

Not that I'm aware of...

Same for the dewiki page; the search terms "Benutzer ToBeFree de wikipedia" and "User ToBeFree Wikimedia Commons" are very specific, but to my knowledge, a properly working "NOINDEX" should prevent, or even remove, these results.

ToBeFree reopened this task as Open.Sun, Jul 28, 8:05 PM

reopening, but perhaps it's the wrong phabricator task. T90475 seems to be relevant too. Judging by the title of this task here, however, it isn't yet fixed.