
Port CopyPatrol to bn.wiki
Open, Stalled, Needs Triage · Public

Description

Enable the CopyPatrol tool for bn.wikipedia.org (https://tools.wmflabs.org/copypatrol/bn). The tool was recently translated into Bengali.

Event Timeline

Shahadat created this task. Sun, Jan 5, 5:11 PM
Restricted Application added a project: Community-Tech. Sun, Jan 5, 5:11 PM
Restricted Application added a subscriber: Aklapper.

(https://meta.wikimedia.org/wiki/CopyPatrol says to see T141379 for discussion of support for other languages. That task was resolved years ago and links to T145431. The on-wiki documentation might welcome updates, especially on how to request making it available on another wiki.)

Shahadat updated the task description. Mon, Jan 6, 7:11 AM
Ammarpad changed the task status from Open to Stalled. Mon, Jan 6, 1:07 PM
Ammarpad added subscribers: Niharika, Ammarpad.

Per T145431#2768400, you need to complete the remaining 2730 labels to reach the minimum number of labels needed before the tool can be enabled. You should advertise it to your community so they can help label the edits at https://labels.wmflabs.org/ui/bnwiki/. There is also documentation at https://meta.wikimedia.org/wiki/Wiki_labels

MusikAnimal changed the task status from Stalled to Open. Mon, Jan 13, 11:33 PM
MusikAnimal added a subscriber: MusikAnimal.

> Per T145431#2768400, you need to complete the remaining 2730 labels to reach the minimum number of labels needed before the tool can be enabled. You should advertise it to your community so they can help label the edits at https://labels.wmflabs.org/ui/bnwiki/. There is also documentation at https://meta.wikimedia.org/wiki/Wiki_labels

I'm guessing that's to do with the ORES scores? CopyPatrol does surface these but it does not rely on them. I believe there's nothing technically holding us back from adding new wikis.

> The on-wiki documentation might welcome updates, especially on how to request making it available on another wiki.

Right you are. I'll get something added.


@Shahadat To answer this request -- CopyPatrol is a bit resource-intensive, so the only real requirement is that your wiki will regularly make use of CopyPatrol. Ideally we'd see some community support for it. Could you point us to a discussion?

On that note, it seems French Wikipedia has lost interest :( https://tools.wmflabs.org/copypatrol/fr/leaderboard

MusikAnimal changed the task status from Open to Stalled. Wed, Jan 15, 4:28 AM

@Shahadat Bad news... the copyright detection service we use does not support Bengali :(

From https://www.ithenticate.com/products/faqs:

What languages does iThenticate support?
The iThenticate interface currently supports the following languages: English, Korean, Japanese, German, Spanish Latin American, Brazilian Portuguese, Dutch, Italian, French, Simplified and Traditional Chinese, and Arabic.
Which international languages does iThenticate have content for in its database?
iThenticate searches for content matches in the following 30 languages: Chinese (simplified and traditional), Japanese, Thai, Korean, Catalan, Croatian, Czech, Danish, Dutch, Finnish, French, German, Hungarian, Italian, Norwegian (Bokmal, Nynorsk), Polish, Portuguese, Romanian, Serbian, Slovak, Slovenian, Spanish, Swedish, Arabic, Greek, Hebrew, Farsi, Russian, and Turkish. Please note that iThenticate will match text between text of the same language.

I am very sorry! That should have been the first thing I checked. What we can do is contact iThenticate and ask that Bengali be added, but I can't make any promises this will amount to anything. Thanks for taking the time to file this task and seek community support, nonetheless!

While we can't offer automation through CopyPatrol, I did want to make you aware of https://tools.wmflabs.org/copyvios, if you weren't already. You can give it any article and it will search for identical content on the web. You could add a link to it, say, at https://bn.wikipedia.org/wiki/MediaWiki:Pageinfo-footer so that it is more easily accessible. I hope this is of some help.

I'm going to mark this task as stalled, in hopes iThenticate does add support for Bengali in the future.
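
For anyone filing a similar request for another wiki, the language check that settled this task can be sketched in a few lines of Python. This is only an illustration: the language names are copied from the iThenticate FAQ quoted above, while the constant and function names are made up for this example and are not part of CopyPatrol or iThenticate.

```python
# Languages iThenticate lists as having matchable content, copied from the
# FAQ quoted in this task. The list may change over time, so treat it as a
# snapshot, not an authoritative source.
ITHENTICATE_MATCH_LANGUAGES = frozenset({
    "chinese", "japanese", "thai", "korean", "catalan", "croatian",
    "czech", "danish", "dutch", "finnish", "french", "german",
    "hungarian", "italian", "norwegian", "polish", "portuguese",
    "romanian", "serbian", "slovak", "slovenian", "spanish", "swedish",
    "arabic", "greek", "hebrew", "farsi", "russian", "turkish",
})


def is_match_language(name: str) -> bool:
    """Return True if the named language appears in the quoted FAQ list.

    Hypothetical helper: the comparison ignores case and surrounding spaces.
    """
    return name.strip().lower() in ITHENTICATE_MATCH_LANGUAGES


if __name__ == "__main__":
    for language in ("Bengali", "French"):
        status = "listed" if is_match_language(language) else "not listed"
        print(f"{language}: {status}")
```

Running the check before opening a task would surface the blocker up front; it says nothing about community interest or resource usage, which were the other considerations raised here.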