What is the problem?
In the HTML that the WhoColor API returns, and that WhoWroteThat inserts into the page, any wikitext in [[ ]] (e.g. links, files) gets treated as one token. We therefore treat this entire token as being added by one contributor as part of one revision (the revision that added the first [[). But, this might not actually be the case. For example, if a later contributor modifies the caption of an image this won't be reflected in WWT.
For example, the image in the top right here has been modified by at least two contributors. But, clicking on that image will show information for only the first contributor (once T231959 is fixed).
Normally, each word in an article gets treated as a separate token, so we know who wrote each word and when.
Visual Examples:
Possible solutions
- Do nothing. Perhaps this is not a big problem.
- Fix WhoColor API. This might be tricky, as tokenising each word inside [[ ]] will often produce invalid wikitext.
- Don't show revision details about things like images, internal links, etc. This would be a shame, as we do have this information.
- Find some other way of extracting the revision details of individual words inside files, links, etc. The WhoColor API does give us that information already.
- Something else. I might suggest a spike of some sort to investigate possibilities.
Steps to reproduce problem
See the example above.
Will happen whenever two or more contributors (or the same contributor in separate revisions) add/modify a link, file, etc.