Equivset should normalize some diacriticals
Closed, ResolvedPublic
Actions

Assigned To

Authored By

	Xqt
	May 1 2017, 8:46 AM

Description

Abuse Filter ccnorm should normalize some diacriticals like:

Ċ -> C
Ď -> D
È -> E
Ê -> E
ǝ -> E
Ĥ -> H
Ñ -> N
Ň -> N
ᴬ -> A
ᴰ -> D
ᴱ -> E
ᴴ -> H
ᴸ -> L
ᴹ -> M
ᴿ -> R
ᵀ -> T
ᶜ -> C

Details

	Subject	Repo	Branch	Lines +/-
	Add characters from the "Phonetic Extensions" Unicode Block (1D00-1DBF)	mediawiki/libs/Equivset	master	+393 -39
	Expand set for lower/upper case characters which are alone in the set	mediawiki/libs/Equivset	master	+549 -27

Customize query in gerrit

Related Objects
Search...

		Status	Subtype	Assigned	Task
		Resolved		Umherirrender	T27619 Add more characters to ccnorm
		Resolved		Umherirrender	T164180 Equivset should normalize some diacriticals

Event Timeline

Xqt created this task.May 1 2017, 8:46 AM

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptMay 1 2017, 8:46 AM

Xqt added a parent task: T27619: Add more characters to ccnorm.May 8 2017, 4:44 PM

He7d3r renamed this task from Abuse Filter ccnorm should should normalize some diacriticals to Abuse Filter ccnorm should normalize some diacriticals.Sep 10 2017, 5:10 PM

He7d3r updated the task description. (Show Details)

matej_suchanek added a project: Equivset.Nov 24 2017, 5:55 PM

Daimona updated the task description. (Show Details)Apr 13 2018, 7:27 AM

Huji renamed this task from Abuse Filter ccnorm should normalize some diacriticals to Equivset should normalize some diacriticals.Apr 13 2018, 12:52 PM

Huji removed a project: AbuseFilter.

ǝ -> E

There is already another mapping for that character

Change 818287 had a related patch set uploaded (by Umherirrender; author: Umherirrender):

[mediawiki/libs/Equivset@master] Expand set for lower/upper case characters which are alone in the set

https://gerrit.wikimedia.org/r/818287

gerritbot added a project: Patch-For-Review.Jul 29 2022, 3:06 AM

Change 818287 merged by jenkins-bot:

[mediawiki/libs/Equivset@master] Expand set for lower/upper case characters which are alone in the set

https://gerrit.wikimedia.org/r/818287

Maintenance_bot removed a project: Patch-For-Review.Mar 31 2023, 7:10 AM

Ċ -> C
Ď -> D
È -> E
Ê -> E
ǝ -> E
Ĥ -> H
Ñ -> N
Ň -> N

are now part of Equivset, it needs a new release to get them working in AbuseFilter on wmf wikis

Some letters still to be done

Change 904823 had a related patch set uploaded (by Umherirrender; author: Umherirrender):

[mediawiki/libs/Equivset@master] Add characters from the "Phonetic Extensions" Unicode Block (1D00-1DBF)

https://gerrit.wikimedia.org/r/904823

gerritbot added a project: Patch-For-Review.Mar 31 2023, 3:50 PM

Change 904823 merged by jenkins-bot:

[mediawiki/libs/Equivset@master] Add characters from the "Phonetic Extensions" Unicode Block (1D00-1DBF)

https://gerrit.wikimedia.org/r/904823

All letters mention in this task are now part of Equivset, it needs a new release to get them working in AbuseFilter on wmf wikis

Maintenance_bot removed a project: Patch-For-Review.Apr 11 2023, 4:10 PM

Equivset should normalize some diacriticalsClosed, ResolvedPublicActions

Description

Details

Related ObjectsSearch...

Event Timeline

Equivset should normalize some diacriticals
Closed, ResolvedPublic
Actions

Related Objects
Search...