Page MenuHomePhabricator

Gapfinder recommendation fails with articles having special characters in title
Open, Needs TriagePublic

Description

Go to http://recommend.wmflabs.org/ -
choose a source language as Russian or any language with special characters - ie- when there is URL encoding used
choose English as the target language
Click on a recommended page with special characters in title - it fails to load on the next page and the interface cannot proceed further
Click on a recommended page which only has regular a-z,A-Z,0-9 characters and the page loads fine

Instead of going via the recommend page, go to the same source article - for instance the German article
https://de.wikipedia.org/wiki/Daheim_sterben_die_Leut%E2%80%99
Click on the English interwiki link if preferences are set up and the translate tools work fine from there.

The above recommend page fails presumably due to the URL encoding and decoding involved

My system language is English and I use a Firefox browser (54.0.1).

Attached example German page fails if character has umlaut but works otherwise

Event Timeline

Shyamal created this task.Jul 24 2017, 1:48 PM
Shyamal updated the task description. (Show Details)Jul 24 2017, 1:52 PM
Shyamal updated the task description. (Show Details)
Shyamal updated the task description. (Show Details)Jul 24 2017, 4:46 PM
Shyamal renamed this task from Gapfinder fails to open recommended articles with non-Latin titles to Gapfinder recommendation fails with articles having special characters in title.Jul 25 2017, 3:02 AM
Shyamal updated the task description. (Show Details)
Shyamal updated the task description. (Show Details)Jul 25 2017, 3:06 AM

Hi @Shyamal, thanks for reporting this. It looks like the title is getting double-encoded somewhere along the way. GapFinder is in the process of being migrated to a new codebase, and it doesn't look like this problem exists there: https://recommend-alpha.wmflabs.org/translation?source=ru&target=en

We'll keep this in mind if we start back-porting changes.

Shyamal added a comment.EditedJul 26 2017, 11:30 AM

Thanks, the pages load on the new code site but it appears that clicking on the "add translation" has no effect, it puts in the original text as it is. Not sure if I need to do some additional settings (but I certainly have not checked the "Don't use machine translation".

Thanks, the pages load on the new code site but it appears that clicking on the "add translation" has no effect, it puts in the original text as it is. Not sure if I need to do some additional settings (but I certainly have not checked the "Don't use machine translation".

It sounds like this issue is after clicking "Translate" in GapFinder. That button takes you into the Content Translation tool, which is developed independently from GapFinder.