Looks great. Are there any more words (or word variants) that you would like to add to the list before we encode it in our modeling library?
For example, our English tests include many variants of each curse word, e.g. "shit", "sh1t", "shiiit".
More is generally better. This isn't the last chance to extend the list, though it may be the last chance to do so directly on the wiki. Future extensions will need to happen in code, but that isn't very difficult. For the words we try to match on English Wikipedia, see its test set: https://github.com/wiki-ai/revscoring/blob/master/revscoring/languages/tests/test_english.py
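To give a rough idea of why variants are worth listing, here is a minimal Python sketch of how one pattern could cover several spellings of the same word. It is only an illustration: the names `BADWORD_RE` and `find_badwords` are made up for this example and are not revscoring's actual code or API.

```python
import re

# Hypothetical pattern, not taken from revscoring: one regex that covers
# several spellings of the same word.
#   sh+     "s" followed by one or more "h"
#   [i1]+   one or more "i" or the digit "1" (covers "sh1t", "shiiit")
#   t+      one or more "t"
# \b on both sides keeps the match to whole words.
BADWORD_RE = re.compile(r"\bsh+[i1]+t+\b", re.IGNORECASE)


def find_badwords(text):
    """Return every whole word in `text` that matches the pattern."""
    return BADWORD_RE.findall(text)


if __name__ == "__main__":
    print(find_badwords("shit, sh1t and shiiit should match; shirt should not"))
```

Knowing the variants people actually use in your language is what makes it possible to write a pattern like this, which is why extra examples help.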