Currently the result of
ccnorm("ABCDEFGHIJKLMNOPQRSTUVWXYZ_abcdefghijklmnopqrstuwxyz")
is
ABCDEFGH1JK1MN0PQR5TUVWXYZ_ABCDEFGH1JK1MN0PQR5TUWXYZ
This makes the creation of filters on [[Special:AbuseFilter]] not intuitive, since if we want to catch all variations of a word like "testing" and try to use something like
words :="TESTING|VANDALIZING"; ccnorm(added_lines) rlike words) & !(ccnorm(removed_lines) rlike words)
it won't work. Instead of this natural approach, the text would need to be changed to
words :="TE5T1NG|VANDA11Z1NG";
You can confirm the problem on [[Special:AbuseFilter/tools]], by using the following:
words :="TESTING|VANDALIZING"; ccnorm("I'm testing here. I'm vandalizing the article!") rlike words
The regex above will not match, but it will match in the following:
words := "TE5T1NG|VANDA11Z1NG"; ccnorm("I'm testing here. I'm vandalizing the article!") rlike words
Could this be fixed?