Page MenuHomePhabricator

Google search traffic sending users to the wrong Chinese language versions of Wikipedia
Open, Needs TriagePublicBUG REPORT

Description

Hi all - feel free to add others to this ticket as I'm not sure exactly who the correct owner should be.

Our contacts at Google reached out to say one of their engineering teams is having a technical issue where Google Search results are displaying or linking to the wrong versions of Chinese Wikipedia, and they're asking if there's someone they can connect with to troubleshoot.

Specifically, the technical issues cited by the Google team are:
When Chinese Wikipedia makes on-the-fly C2C conversion (plus some word replacement), the Google system can't tell which Chinese variant it is receiving unless href="/wiki/....." is replaced with href="/zh-{region}/...." It seems this isn't working because of the automatic referrer-dependent redirection.

Google has asked if it would be possible for Wikipedia to stop redirecting zh.wikipedia.org/zh-{region,script}/<subject> coming from Google search (or Google crawler) to zh.wikipedia.org/wiki/<subject> because as long as this redirects Google cannot send its users to the right version of Chinese Wikipedia.

I believe the links below are to the relevant supporting documentation on our side. If someone could let me know who to connect with internally about this I'd appreciate it - thanks!

https://meta.wikimedia.org/wiki/Automatic_conversion_between_simplified_and_traditional_Chinese
https://www.mediawiki.org/wiki/Writing_systems#LanguageConverter

Event Timeline

It seems this isn't working because of the automatic referrer-dependent redirection

Yes. The function is provided by a gadget on zhwiki: https://zh.wikipedia.org/wiki/MediaWiki:Gadget-variant-link-fix.js
I think the two links you provided are unrelated. T108443: Google doesn't honor canonical URLs of zh.wiki is more relevant.

Per T108443#6279559, T294716 is likely to help, but the patch connected needs to be amended, and the owner of it seems not active.