Avoid indexing of local "copies" of the central user page
Closed, ResolvedPublic

Description

I noticed today that user pages from Meta-Wiki are also indexed from seemingly random wikis, possibly all. e.g. https://sw.wikibooks.org/wiki/Mtumiaji:Krinkle is a result in Google on the second page for my username (not higher than the version from Meta-Wiki, but still).

We should probably mark these as Noindex and/or ensure the Canonical link header is set to the central version?

Krinkle created this task.Oct 1 2017, 2:24 AM
Restricted Application added a subscriber: Aklapper. · View Herald TranscriptOct 1 2017, 2:24 AM
Krinkle renamed this task from Avoid indexing local "copies" of the central user page to Avoid indexing of local "copies" of the central user page.Oct 1 2017, 2:24 AM

There is a similar task, which proposes that __NOINDEX__ on the gobal user page should also be applied on local copies: T90475.

Setting the canonical link should be easy enough. Do we also need to mark these as no index in addition to that?

Change 381882 had a related patch set uploaded (by Legoktm; owner: Legoktm):
[mediawiki/extensions/GlobalUserPage@master] Set canonical URL of remote pages to central page

https://gerrit.wikimedia.org/r/381882

Krinkle triaged this task as Normal priority.Oct 3 2017, 6:10 PM

Change 381882 merged by jenkins-bot:
[mediawiki/extensions/GlobalUserPage@master] Set canonical URL of remote pages to central page

https://gerrit.wikimedia.org/r/381882