Currently, if a paragraph length exceeds 10000 bytes, you get no word-level diff, as illustrated by this example. The limit is for performance reason, however we can totally investigate this deeper.
- What are the worst-case examples of diffable text?
- What's the impact of this limit on non-Latin wikis? What's the average paragraph length in bytes on Chinese Wikipedia compared to English, for example?
- What affects performance more - paragraph length or number of words?
- On modern hardware (as opposed to 15 years ago when DairikiDiff was created and 10 years ago for wikidiff2), can we afford to increase the limit?