Page MenuHomePhabricator
Feed Advanced Search

Feb 7 2024

seth added a comment to T341626: BlockedExternalDomains.json / Special:BlockedExternalDomains: Save username and addition time for each entry.

yes, please use iso time format and no local formats.
iso has a lot of advantages, e.g., it's alpha-sortable, it's international, it's easily readable by spam-fighters from other wikis, it begins with the most important part (the year).

Feb 7 2024, 3:09 PM · MW-1.42-notes (1.42.0-wmf.13; 2024-01-09 ), SpamBlacklist, AbuseFilter

Nov 2 2023

seth awarded T350445: not possible to edit in the german language wikipedia a Like token.
Nov 2 2023, 11:20 PM · User-notice-archive, DBA

Oct 26 2023

seth added a comment to T216657: encoded URLs at webarchive might cirvumvent the spamblacklist.

It looks like the behavior has already changed, but not for the better. Now
https://web.archive.org/web/20140914080957/https://support.google.com/adsense/answer/176201?hl=de is not blocked anymore?

Oct 26 2023, 6:10 PM · Internet-Archive, SpamBlacklist

Nov 21 2022

seth created T323522: Toolforge Kubernetes perl profile: module PHP::Serialization needed.
Nov 21 2022, 5:59 PM · User-bd808, Toolforge (Software install/update)

Oct 9 2022

seth created T320347: if action='move', then content (new_wikitext, old_wikitext) is not available.
Oct 9 2022, 1:22 PM · AbuseFilter

Oct 7 2022

seth closed T319613: Migrate camelbot from Toolforge GridEngine to Toolforge Kubernetes as Resolved.

thanks for the precise and helpful answers!

Oct 7 2022, 8:44 PM · Grid-Engine-to-K8s-Migration
seth added a comment to T319613: Migrate camelbot from Toolforge GridEngine to Toolforge Kubernetes.

hi!
thanks for your comment. but i don't see where it answers any of my questions.

Oct 7 2022, 6:47 AM · Grid-Engine-to-K8s-Migration

Oct 6 2022

seth added a comment to T319613: Migrate camelbot from Toolforge GridEngine to Toolforge Kubernetes.

formerly i used two cronjobs

*/5 * * * * jsub -once -j y -quiet -v LC_ALL=en_US.UTF-8 -release buster -mem 4g /data/project/camelbot/bot/camelbot.pl --rc-monitoring=db --username=CamelBot
*/30 * * * * jsub -once -j y -quiet -v LC_ALL=en_US.UTF-8 /data/project/camelbot/bot/camelbot_copy_for_cat_dead.pl --cat-dead --username=CamelBot
Oct 6 2022, 11:36 PM · Grid-Engine-to-K8s-Migration
seth closed T320111: Migrate url-converter from Toolforge GridEngine to Toolforge Kubernetes as Resolved.

resolved via

webservice stop
webservice --backend=kubernetes perl5.32 start
Oct 6 2022, 10:56 PM · Grid-Engine-to-K8s-Migration
seth closed T320032: Migrate searchsbl from Toolforge GridEngine to Toolforge Kubernetes as Resolved.
Oct 6 2022, 10:55 PM · Grid-Engine-to-K8s-Migration
seth added a comment to T320032: Migrate searchsbl from Toolforge GridEngine to Toolforge Kubernetes.

i did

webservice stop
webservice --backend=kubernetes perl5.32 start

and it seems to work. so maybe this could be closed now?

Oct 6 2022, 10:52 PM · Grid-Engine-to-K8s-Migration

Jun 15 2022

seth added a comment to T305308: Buster grid problem: some perl modules are missing.

ah, great! thanks! it works! 👍

Jun 15 2022, 8:47 PM · Toolforge (Software install/update), cloud-services-team (Kanban)

Jun 6 2022

seth added a comment to T305308: Buster grid problem: some perl modules are missing.

i guess, this gets some kind of urgent, as buster is the new default now.
my script at toolforge is only running if i use the -release=stretch param.

Jun 6 2022, 8:29 AM · Toolforge (Software install/update), cloud-services-team (Kanban)

Apr 9 2022

seth awarded T6198: Suggestion to implement page-level links into uploaded PDF files [[media:foo.pdf|page=n]] a Like token.
Apr 9 2022, 10:17 PM · Multimedia, MediaWiki-File-management, Commons, All-and-every-Wikisource, MediaWiki-Parser

Apr 2 2022

seth updated the task description for T305308: Buster grid problem: some perl modules are missing.
Apr 2 2022, 5:03 PM · Toolforge (Software install/update), cloud-services-team (Kanban)
seth created T305308: Buster grid problem: some perl modules are missing.
Apr 2 2022, 4:54 PM · Toolforge (Software install/update), cloud-services-team (Kanban)

Feb 6 2022

seth added a comment to T27524: blacklist may become too big.

i'm not sure.
as we deleted a lot of superfluous entries in the last years, the sizes of the SBLs are now the same (meta: 290kB) or even smaller (w:en: 140kB) than 2010.
as computers got faster during that time, it might be a not-so-big issue right now. however, i don't know what the situation will look like in the next years.
at least i'd say that this issue is of low priority.

Feb 6 2022, 10:49 PM · SpamBlacklist

Jun 28 2020

seth awarded T242089: Consider keeping user entered URL and removing tracking parameters a Like token.
Jun 28 2020, 9:36 AM · Security, Privacy Engineering, Citoid

Jun 24 2020

seth added a comment to T254649: Rename SpamBlacklist.

'spam' should not be replaced by 'url', because only linked urls are harmed by the lists. 'website' would not be correct, because the list entries do not necessarily block whole websites, but can also block single webpages. so i think 'link' or even 'external link' should be part of the name.

Jun 24 2020, 10:03 PM · SpamBlacklist

Jan 23 2020

seth added a comment to T243131: visual editor seems to automatically replace long (good) urls with short urls.
Jan 23 2020, 7:24 PM · Citoid, VisualEditor

Jan 18 2020

seth created T243131: visual editor seems to automatically replace long (good) urls with short urls.
Jan 18 2020, 8:03 PM · Citoid, VisualEditor

Dec 30 2019

seth added a comment to T241605: https://tools.wmflabs.org/url-converter/ does not run anymore.

that's strange. now it works for me, too.
i tried it several times, about 30 minutes ago. and i always got strange different error messages.
however, now it works. so i guess this issue can be closed.
thanks!

Dec 30 2019, 10:57 PM · cloud-services-team
seth created T241605: https://tools.wmflabs.org/url-converter/ does not run anymore.
Dec 30 2019, 10:43 PM · cloud-services-team

Jun 2 2019

Restricted Application added a project to T224838: ECHO/includes/DiscussionParser.php: minor review of regexp usage: Growth-Team.
Jun 2 2019, 7:34 PM · Growth-Team-Filtering, Patch-Needs-Improvement, Growth-Team, Notifications

Apr 13 2019

seth created T220874: Special:AbuseLog throws when viewing details or examining (BadMethodCallException).
Apr 13 2019, 9:01 AM · AbuseFilter

Feb 20 2019

seth created T216657: encoded URLs at webarchive might cirvumvent the spamblacklist.
Feb 20 2019, 9:04 PM · Internet-Archive, SpamBlacklist

Feb 17 2019

seth added a comment to T181024: AbuseFilter should not cast arrays into strings.

Oops, ok, I understand. Then I totally agree with both of you. Sorry for the interruption and thanks for the explanation. I should have read the php manual.

Feb 17 2019, 10:25 PM · Patch-Needs-Improvement, AbuseFilter
seth added a comment to T181024: AbuseFilter should not cast arrays into strings.

Of course, it is legal in some programming languages, but then the array would be converted into something reasonable, e.g. into the length of the array (as in perl).
Do you know any common languages, where array arr in
arr > 3
would be converted to a string by default? And if so, will then the '3' also be converted to a string? And the '>' will do a lexicographical comparison than?
Im my opinion this is far to much magic and the abuse filter language should not be contra-intuitive for programmers.

Feb 17 2019, 8:52 PM · Patch-Needs-Improvement, AbuseFilter
seth added a comment to T181024: AbuseFilter should not cast arrays into strings.

Right now I had a problem with
added_links > 10
Because of the implicit type casting, the syntax is correct, but this code does not do what one could expect, i.e.,
length(added_links) > 10.

Feb 17 2019, 5:40 PM · Patch-Needs-Improvement, AbuseFilter

Feb 9 2019

seth added a comment to T214343: Create a Perl Docker image for use on the Toolforge Kubernetes cluster.

I've used CPAN locally at the old toolserver (ran by DaB.). Since there is labs, I just asked for some packages, and the admins installed them. For me this is an easy way.

Feb 9 2019, 9:04 AM · User-bd808, Kubernetes, cloud-services-team (Kanban), Toolforge

Feb 5 2019

seth added a comment to T214343: Create a Perl Docker image for use on the Toolforge Kubernetes cluster.

i'd like to test somehow, which perl-modules are missing.

Feb 5 2019, 11:18 PM · User-bd808, Kubernetes, cloud-services-team (Kanban), Toolforge

Nov 14 2018

seth awarded T14896: Spam Blacklist shouldn't be fooled by similar-looking Unicode characters a Like token.
Nov 14 2018, 9:38 PM · Patch-Needs-Improvement, Trust-and-Safety, Stewards-and-global-tools, SpamBlacklist

May 21 2018

seth added a comment to T34159: urls should be decoded before regexp matching.

A similar (or actually the same) problem occurs with translate.google.[^\/]{2,5}/translate.
Right now, google-translate-urls can be used to circumvent the SBL, and we don't know a good way to cope with that problem via SBL or edit filter.
It would be better, if the SBL finds the blocked URLs inside the google-translate url.

May 21 2018, 7:29 PM · Patch-Needs-Improvement, SpamBlacklist

Feb 22 2018

seth added a comment to T129171: increase $wgAbuseFilterConditionLimit for dewiki.

Hi!
Sorry, but I can't confirm, because during the last months I used the edit filter very(!) rarely. (So I can't confirm the opposite, too.)

Feb 22 2018, 6:41 PM · Performance-Team (Radar), Wikimedia-Site-requests, AbuseFilter

Jan 28 2018

seth added a comment to T184483: Expose spamblacklist log type on wiki replica servers.

That sounds great! Do we/I have to trigger anyone? Or do I just have to wait now?

Jan 28 2018, 7:23 PM · cloud-services-team (Kanban), Data-Services, MediaWiki-Logevents, SpamBlacklist

Jan 9 2018

seth added a comment to T184483: Expose spamblacklist log type on wiki replica servers.

I'm afraid, bdb808's answer does not help here, because the first part is just a repetition of what has been said already. And the API will not help, as Umherirrender already said.

Jan 9 2018, 8:07 PM · cloud-services-team (Kanban), Data-Services, MediaWiki-Logevents, SpamBlacklist

Jan 8 2018

seth created T184483: Expose spamblacklist log type on wiki replica servers.
Jan 8 2018, 10:31 PM · cloud-services-team (Kanban), Data-Services, MediaWiki-Logevents, SpamBlacklist

Jan 15 2017

seth added a comment to T119463: Automatically convert spaces after section markers (§) into non-breaking spaces.

Change 332037 had a related patch set uploaded (by Harjotsingh): [...]

https://gerrit.wikimedia.org/r/332037

Jan 15 2017, 8:03 AM · Patch-Needs-Improvement, Parsoid, MediaWiki-Parser

Dec 24 2016

seth added a comment to T15619: Add non-breaking spaces in additional places automatically.

I think such a content processing should be done on edit time not on parser time.

Dec 24 2016, 8:48 AM · Patch-Needs-Improvement, Parsoid, MediaWiki-Parser

Sep 20 2016

seth added a comment to T6459: Create a special page to handle additions, removals, changes and logging of spam blacklist entries.

as said already (above in this thread), you could use

https://tools.wmflabs.org/searchsbl
Sep 20 2016, 8:22 AM · Stewards-and-global-tools, SpamBlacklist

Mar 20 2016

seth added a comment to T15619: Add non-breaking spaces in additional places automatically.

I used the Parser.php changes mentioned at https://de.wikipedia.org/wiki/Wikipedia:Typografie/Automatische_Leerzeichen#Regexps and it seems to work. I had to replace the order of a few entries (I've already done that at the wiki page).

Mar 20 2016, 11:30 PM · Patch-Needs-Improvement, Parsoid, MediaWiki-Parser
seth created T130506: Automatically convert referer links copied from Google search results into actual/proper URLs.
Mar 20 2016, 2:27 PM · MediaWiki-General

Mar 8 2016

seth added a comment to T129171: increase $wgAbuseFilterConditionLimit for dewiki.

If there is a problem with condition limit, you should see whether there is room for improvement in the filters before changing this.

Mar 8 2016, 10:22 PM · Performance-Team (Radar), Wikimedia-Site-requests, AbuseFilter

Mar 7 2016

seth created T129171: increase $wgAbuseFilterConditionLimit for dewiki.
Mar 7 2016, 10:32 PM · Performance-Team (Radar), Wikimedia-Site-requests, AbuseFilter

Mar 4 2016

seth added a comment to T128907: AbuseFilter needs a history (not only per rule).

great, thanks! :-)

Mar 4 2016, 10:31 PM · AbuseFilter
seth added a comment to T128907: AbuseFilter needs a history (not only per rule).

However, it would be great, to set the number of printed changes per page...

Mar 4 2016, 10:23 PM · AbuseFilter
seth created T128907: AbuseFilter needs a history (not only per rule).
Mar 4 2016, 9:03 PM · AbuseFilter

Jan 19 2016

seth created T124117: IP::isInRange() returns true on nonsense input.
Jan 19 2016, 11:33 PM · MW-1.32-notes (WMF-deploy-2018-06-12 (1.32.0-wmf.8)), MW-1.27-release (WMF-deploy-2016-04-12_(1.27.0-wmf.21)), MW-1.27-release-notes, AbuseFilter, MediaWiki-General
seth added a comment to T120850: Investigation: Migrate dead external links to archives.

The method I use with CamelBot is quite simple:

Jan 19 2016, 8:41 PM · Community-Tech, Community-Wishlist-Survey-2015, Internet-Archive

Nov 6 2015

seth added a comment to T23895: Change the name of the AbuseFilter extension.

The extension was meant to stop abusing behaviour, but it's also used to stop e.g. edits, that would have been destructive by accident. And of course, sometimes there are false positive hits, that cause a harmless edit by a harmless user to be logged forever. There are many users who dislike any public entry of theirselves in an "abuse log".

Nov 6 2015, 10:24 AM · AbuseFilter

May 17 2015

seth added a comment to T99281: [Migrated] ru.elderscrolls.wikia.

I'm not the 'seth', who started this topic. :-)

May 17 2015, 7:14 AM · AutoWikiBrowser

Feb 17 2015

seth created T89699: Already used URLs on a page should not trigger spam blacklist.
Feb 17 2015, 8:54 AM · Contributors-Team, SpamBlacklist
seth added a comment to T66541: regex expressions starting with caret (^) not functioning as per instructions say.

Actually the instructions at mw:Extension:SpamBlacklist#Blacklist syntax <s>are</s>were wrong, because all blacklisted domains will be php-joined like https?://[a-z0-9.-]*(sbl0_|sbl_1|...|sbl_n).
So if one want's to block an exact domain, the code
(?<=//|\.)t\.co\b
can be used, i.e., "a 't\.co\b' which is preceded either by two slashes or a dot".
We do that already at w:de, w:en and at meta for years now. I agree, it's better to fix the instructions rather than changing the code. I've done that now. -> won't fix?

Feb 17 2015, 8:36 AM · Stewards-and-global-tools, SpamBlacklist

Dec 4 2014

seth added a comment to T15619: Add non-breaking spaces in additional places automatically.

we made a few regexps for the German part of the problem:
see https://de.wikipedia.org/wiki/Wikipedia:Typografie/Automatische_Leerzeichen#Regexps

Dec 4 2014, 9:49 PM · Patch-Needs-Improvement, Parsoid, MediaWiki-Parser