As a WWT user, I want characters not handled by WhoColor to receive support in WWT (if possible), so that I can access information on more content.
Background: Some words are not highlighted and have no revision popup in WWT. This is because they contain characters the WhoColor API does not support. This is a known issue in the WhoColor code. I raised here. From this comment in the WhoColor code:
# token is not found. because most probably it contains some characters that has different length # in lower and upper case such as 'İstanbul'
So far, I have only found this affecting the character İ (LATIN CAPITAL LETTER I WITH DOT ABOVE). However, this appears frequently in Turkish Wikipedia. From a random sample of 1000 articles on trwiki I found this character in about 700 of them. (Some of the appearances are in templates, so have no user-facing affect.) This includes the article https://tr.wikipedia.org/wiki/%C4%B0stanbul (which is unfortunate).
Acceptance Criteria:
- Investigate if there is any way we can support or fix this issue
- Implement a fix, if possible
Steps to reproduce problem:
- Go to https://tr.wikipedia.org/wiki/%C4%B0stanbul
- Turn on WWT
- Hover over and/or attempt to click on the first word "İstanbul"
Expected behavior: "İstanbul" is highlighted and a revision popup appears when clicked
Observed behavior: Nothing happens
Visual Example (inability to click on 'Istanbul'):