Green_Cardamom (GreenC)
User

User Details

User Since
Jun 4 2016, 1:17 PM (364 w, 2 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
GreenC [ Global Accounts ]

Recent Activity

Tue, May 2

Green_Cardamom created T335771: "Found on" section is missing pages.
Tue, May 2, 12:46 PM · InternetArchiveBot

Apr 9 2023

Green_Cardamom created T334345: url-status is not updated from live to dead.
Apr 9 2023, 4:27 AM · InternetArchiveBot

Dec 17 2022

Green_Cardamom created T325432: webarchive conversion to cite web.
Dec 17 2022, 1:07 PM · InternetArchiveBot (v3.0)

Oct 27 2022

Green_Cardamom updated the task description for T317471: Spaces in URL.
Oct 27 2022, 1:55 PM · InternetArchiveBot

Oct 19 2022

Green_Cardamom added a comment to T321146: IABot: new domain for webrecorder.io (conifer.rhizome.org).

Until resolved, you can add {{cbignore}} to keep the bot off the citation. That's what I did for the few I added. BTW, ghostarchive.org uses the same Webrecorder technology on the backend. It won't work in every case as a substitute for Conifer, but in many it will.

Oct 19 2022, 2:29 AM · InternetArchiveBot

Sep 21 2022

Green_Cardamom created T318201: Ghostarchive.org availability API.
Sep 21 2022, 4:12 AM · InternetArchiveBot

Sep 19 2022

Green_Cardamom created T318119: GhostArchive.org and Instagram short URLs.
Sep 19 2022, 8:35 PM · InternetArchiveBot

Sep 11 2022

Green_Cardamom created T317487: Priority placement for bot: unknown.
Sep 11 2022, 2:35 AM · InternetArchiveBot

Sep 10 2022

Green_Cardamom created T317475: nlwiki: changing to unfit.
Sep 10 2022, 3:03 PM · InternetArchiveBot
Green_Cardamom created T317474: mlwiki: problem with cite video.
Sep 10 2022, 2:40 PM · InternetArchiveBot
Green_Cardamom created T317473: mlwiki template:Dlw.
Sep 10 2022, 2:35 PM · InternetArchiveBot
Green_Cardamom updated the task description for T317471: Spaces in URL.
Sep 10 2022, 2:19 PM · InternetArchiveBot
Green_Cardamom updated the task description for T317471: Spaces in URL.
Sep 10 2022, 2:10 PM · InternetArchiveBot
Green_Cardamom updated the task description for T317471: Spaces in URL.
Sep 10 2022, 2:07 PM · InternetArchiveBot
Green_Cardamom created T317471: Spaces in URL.
Sep 10 2022, 1:58 PM · InternetArchiveBot

Aug 27 2022

Green_Cardamom created T316438: Feature: replace archive URL on-wiki.
Aug 27 2022, 4:32 PM · InternetArchiveBot (v3.0)

Aug 19 2022

Green_Cardamom created T315628: IABot converts archive.today to archive.is.
Aug 19 2022, 12:23 AM · InternetArchiveBot

Aug 11 2022

Green_Cardamom added a comment to T290211: EventStreams sending same data over and over (page links change).

My workaround is to compare the date of the diff (via the MW API) with the date in the JSON and, if they are too far apart, assume the JSON is buggy data, ignore it and log it. There is a massive log now.
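
A minimal sketch of that kind of check, not the actual bot code. The one-hour threshold and the function names are assumptions made for illustration; the comment does not say how far apart is "too far", and fetching the revision timestamp from the MediaWiki API is not shown.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical threshold; the comment does not state the real value.
MAX_SKEW = timedelta(hours=1)


def parse_ts(ts: str) -> datetime:
    """Parse an ISO 8601 UTC timestamp such as '2021-09-01T23:30:50Z'."""
    return datetime.strptime(ts, "%Y-%m-%dT%H:%M:%SZ").replace(tzinfo=timezone.utc)


def looks_buggy(event_dt: str, revision_dt: str, max_skew: timedelta = MAX_SKEW) -> bool:
    """Return True when the event and revision timestamps differ by more than max_skew."""
    return abs(parse_ts(event_dt) - parse_ts(revision_dt)) > max_skew


# event_dt is meta.dt from the stream; revision_dt would come from the MW API.
print(looks_buggy("2021-09-01T23:30:50Z", "2021-09-01T23:30:45Z"))  # False
print(looks_buggy("2021-09-01T23:30:50Z", "2021-03-14T08:00:00Z"))  # True
```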

Aug 11 2022, 7:40 PM · Data-Engineering-Planning, Platform Engineering, Analytics, Event-Platform Value Stream

Aug 3 2022

Green_Cardamom created T314513: url-status = deviated.
Aug 3 2022, 6:06 PM · InternetArchiveBot

May 17 2022

Green_Cardamom closed T261300: https://translation-server.toolforge.org/ is a HTTP 502 error since DNS name change as Resolved.
May 17 2022, 4:28 PM · Tools
Green_Cardamom added a comment to T261300: https://translation-server.toolforge.org/ is a HTTP 502 error since DNS name change.

Thanks @bd808 - tool deletion request: https://phabricator.wikimedia.org/T308587

May 17 2022, 4:28 PM · Tools
Green_Cardamom created T308587: Archive/delete tool translation-server.
May 17 2022, 4:25 PM · Projects-Cleanup, Tools, Toolforge (Tools to be deleted)

May 16 2022

Green_Cardamom added a comment to T261300: https://translation-server.toolforge.org/ is a HTTP 502 error since DNS name change.

Hi, I would like to delete the tool entirely. If someone wants to have this service, it would be best to start over with a new account and install.

May 16 2022, 5:43 PM · Tools

Mar 28 2022

Green_Cardamom added a comment to T277698: InternetArchiveBot adds links to archives that have been excluded from the Wayback Machine.

This is a hard problem, as the status can flip back and forth. As noted above by Mark Graham (Director of the Wayback Machine), an archive that shows as excluded is behind something like a curtain: it still exists in the Wayback Machine and could flip back to active in the future based on a policy decision. The reason these links are being added to the wiki anyway is that IABot has a separate cache database, and the URL was active when the bot first detected it. As a friend recently noted, one of the hardest things in computing is keeping caches accurate. IABot is designed to use caching rather than query the Wayback Machine for every URL it encounters, which has pros and cons.
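
The trade-off can be illustrated with a small sketch. This is not IABot's implementation: the class, the `lookup` callable and the `max_age` setting are all hypothetical. It only shows why a cached "active" status can outlive a change on the live service, and how a maximum age bounds that window at the cost of more remote queries.

```python
import time
from typing import Callable, Dict, Tuple


class StatusCache:
    """Cache an archive status per URL, re-checking entries older than max_age seconds."""

    def __init__(self, lookup: Callable[[str], str], max_age: float,
                 clock: Callable[[], float] = time.time) -> None:
        self._lookup = lookup      # hypothetical function that queries the live service
        self._max_age = max_age
        self._clock = clock
        self._entries: Dict[str, Tuple[str, float]] = {}

    def status(self, url: str) -> str:
        now = self._clock()
        cached = self._entries.get(url)
        if cached is not None and now - cached[1] <= self._max_age:
            return cached[0]       # possibly stale, but avoids a remote query
        fresh = self._lookup(url)  # entry missing or too old: ask the live service
        self._entries[url] = (fresh, now)
        return fresh
```

With a large `max_age` the cache answers quickly but can keep reporting an archive as active after it has been excluded; with a small one it behaves more like querying every time.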

Mar 28 2022, 11:42 PM · Internet-Archive, InternetArchiveBot

Mar 16 2022

Green_Cardamom created T303907: EventStream (page-links-change) is not accurate.
Mar 16 2022, 1:35 AM · Internet-Archive, EventStreams

Mar 14 2022

Green_Cardamom added a comment to T291588: IABot overwriting Ghostarchive.

Testcases page: https://en.wikipedia.org/wiki/User:GreenC/testcases/ghostarchive

Mar 14 2022, 9:37 PM · InternetArchiveBot
Green_Cardamom created T303692: Transfering files to Toolforge.
Mar 14 2022, 4:51 AM · Toolforge

Feb 2 2022

Green_Cardamom added a comment to T274050: Users trying to analyze pages are being told they are blocked when they are not.

Block 12946111 https://en.wikipedia.org/wiki/Special:BlockList?wpTarget=%2312946111&blockType=&limit=50&wpFormIdentifier=blocklist is for User:Lallint .. confirming they have used the IABot tool before, last active 22:45 30 January 2022

Feb 2 2022, 5:53 AM · InternetArchiveBot

Jan 18 2022

Green_Cardamom created T299438: archive.today #select feature.
Jan 18 2022, 6:53 PM · InternetArchiveBot

Jan 16 2022

Green_Cardamom created T299296: arquivo.pt additional syntax.
Jan 16 2022, 3:27 AM · InternetArchiveBot

Jan 12 2022

Green_Cardamom added a comment to T293324: Bot no longer expands "archive.today" URLs from short format to long format.

Bot is converting short-form archive.today to Wayback links:
https://en.wikipedia.org/w/index.php?title=Elizabeth_Holmes&type=revision&diff=1065246449&oldid=1065173325

Jan 12 2022, 3:52 PM · InternetArchiveBot

Dec 21 2021

Green_Cardamom updated the task description for T298004: Pandora has changed format.
Dec 21 2021, 6:32 PM · InternetArchiveBot

Dec 20 2021

Green_Cardamom updated the task description for T298004: Pandora has changed format.
Dec 20 2021, 3:58 AM · InternetArchiveBot

Dec 19 2021

Green_Cardamom created T298004: Pandora has changed format.
Dec 19 2021, 5:02 AM · InternetArchiveBot

Nov 9 2021

Green_Cardamom added a comment to T267992: Provide mechanism to detect name clashed media between Commons and a Local project, without needing to join tables across wiki-db's.

This is awesome. Confirming I deployed AntiCompositeNumber's shadows.py to produce the list, and the GreenC bot Job 10 ("shadows.awk") is back running, having just tagged 25 pages.

Nov 9 2021, 2:32 AM · cloud-services-team, Tools

Sep 7 2021

Green_Cardamom closed T284412: Global domain whitelist overrides URL blacklist as Invalid.
Sep 7 2021, 6:42 PM · InternetArchiveBot
Green_Cardamom added a comment to T284412: Global domain whitelist overrides URL blacklist.

I don't remember what prompted creation of this ticket. I'll make a new ticket if seen again, with an example.

Sep 7 2021, 6:42 PM · InternetArchiveBot

Sep 2 2021

Green_Cardamom added a comment to T290211: EventStreams sending same data over and over (page links change).

No idea. Feel free to adjust for the right audience; I wasn't sure.

Sep 2 2021, 1:27 AM · Data-Engineering-Planning, Platform Engineering, Analytics, Event-Platform Value Stream
Green_Cardamom added a comment to T290211: EventStreams sending same data over and over (page links change).
{"$schema":"/mediawiki/page/links-change/1.0.0","meta":{"uri":"https://arz.wikipedia.org/wiki/%D8%B1%D9%88%D8%AF%D9%8A%D9%88%D9%85","request_id":"bc403e26b8b72080c369aa66","id":"26739413-d570-4363-af37-af690a94f501","dt":"2021-09-01T23:30:50Z","domain":"arz.wikipedia.org","stream":"mediawiki.page-links-change","topic":"codfw.mediawiki.page-links-change","partition":0,"offset":203083041},"database":"arzwiki","page_id":1389768,"page_title":"روديوم","page_namespace":0,"page_is_redirect":false,"rev_id":5641431,"performer":{"user_text":"InternetArchiveBot","user_groups":["bot","*","user","autoconfirmed"],"user_is_bot":true,"user_id":142851,"user_registration_dt":"2020-12-18T16:05:11Z","user_edit_count":20253},"added_links":[{"link":"/wiki/%25D8%25AA%25D8%25B5%25D9%2586%25D9%258A%25D9%2581:%25D9%2585%25D9%2582%25D8%25A7%25D9%2584%25D8%25A7%25D8%25AA_%25D9%2581%25D9%258A%25D9%2587%25D8%25A7_%25D9%2585%25D8%25B9%25D8%25B1%25D9%2581%25D8%25A7%25D8%25AA_BNF","external":false},{"link":"/wiki/%25D8%25AA%25D8%25B5%25D9%2586%25D9%258A%25D9%2581:%25D9%2585%25D9%2582%25D8%25A7%25D9%2584%25D8%25A7%25D8%25AA_%25D9%2581%25D9%258A%25D9%2587%25D8%25A7_%25D9%2585%25D8%25B9%25D8%25B1%25D9%2581%25D8%25A7%25D8%25AA_GND","external":false},{"link":"/wiki/%25D8%25AA%25D8%25B5%25D9%2586%25D9%258A%25D9%2581:%25D9%2585%25D9%2582%25D8%25A7%25D9%2584%25D8%25A7%25D8%25AA_%25D9%2581%25D9%258A%25D9%2587%25D8%25A7_%25D9%2585%25D8%25B9%25D8%25B1%25D9%2581%25D8%25A7%25D8%25AA_LCCN","external":false},{"link":"/wiki/%25D8%25AA%25D8%25B5%25D9%2586%25D9%258A%25D9%2581:%25D9%2585%25D9%2582%25D8%25A7%25D9%2584%25D8%25A7%25D8%25AA_%25D9%2581%25D9%258A%25D9%2587%25D8%25A7_%25D9%2585%25D8%25B9%25D8%25B1%25D9%2581%25D8%25A7%25D8%25AA_LNB","external":false},{"link":"/wiki/%25D8%25AA%25D8%25B5%25D9%2586%25D9%258A%25D9%2581:%25D9%2585%25D9%2582%25D8%25A7%25D9%2584%25D8%25A7%25D8%25AA_%25D9%2581%25D9%258A%25D9%2587%25D8%25A7_%25D9%2585%25D8%25B9%25D8%25B1%25D9%2581%25D8%25A7%25D8%25AA_NDL","external":false},{"link":"/wiki/%25D8%25AA%25D8%25B5%25D9%2586%25D9%258A%25D9%2581:CS1_maint:_uses_authors_parameter","external":false},{"link":"/wiki/International_Standard_Book_Number","external":false},{"link":"/wiki/National_Library_of_Latvia","external":false},{"link":"/wiki/Oxford_University_Press","external":false},{"link":"/wiki/%25D9%2585%25D9%2583%25D8%25AA%25D8%25A8%25D8%25A9_%25D8%25A7%25D9%2584%25D9%258A%25D8%25A7%25D8%25A8%25D8%25A7%25D9%2586_%25D8%25A7%25D9%2584%25D9%2588%25D8%25B7%25D9%2586%25D9%258A%25D9%2587","external":false},{"link":"/wiki/%25D9%2585%25D9%2583%25D8%25AA%25D8%25A8%25D8%25A9_%25D9%2581%25D8%25B1%25D9%2586%25D8%25B3%25D8%25A7_%25D8%25A7%25D9%2584%25D9%2588%25D8%25B7%25D9%2586%25D9%258A%25D9%2587","external":false},{"link":"/wiki/%25D9%2585%25D9%2584%25D9%2581_%25D8%25A7%25D8%25B3%25D8%25AA%25D9%2586%25D8%25A7%25D8%25AF%25D9%2589_%25D9%2585%25D8%25AA%25D9%2583%25D8%25A7%25D9%2585%25D9%2584","external":false},{"link":"/wiki/%25D9%2586%25D9%2585%25D8%25B1%25D8%25A9_%25D8%25AA%25D8%25AD%25D9%2583%25D9%2585_%25D9%2585%25D9%2583%25D8%25AA%25D8%25A8%25D8%25A9_%25D8%25A7%25D9%2584%25D9%2583%25D9%2588%25D9%2586%25D8%25AC%25D8%25B1%25D8%25B3","external":false},{"link":"/wiki/Hamish_Hamilton_Ltd","external":false},{"link":"/wiki/%25D9%2585%25D8%25B9%25D8%25B1%25D9%2581_%25D8%25A7%25D9%2584%25D8%25BA%25D8%25B1%25D8%25B6_%25D8%25A7%25D9%2584%25D8%25B1%25D9%2582%25D9%2585%25D9%2589","external":false},{"link":"/wiki/%25D9%2585%25D8%25B3%25D8%25A7%25D8%25B9%25D8%25AF%25D8%25A9:CS1_errors","external":false},{"link":"https://www.wikidata.org/wiki/Q1087","external":true},{"link":"https://commons.wikimedia.org/wiki/Category:Rhodium","external":true},{"link":"https://www.quora.com/topic/Rhodium-1","external":true},{"link":"https://www.google.com/search%3Fkgmid%3D/m/025scm0","external":true},{"link":"https://catalogue.bnf.fr/ark:/12148/cb12218903f","external":true},{"link":"https://academic.microsoft.com/v2/detail/521398313","external":true},{"link":"https://academic.microsoft.com/v2/detail/2910290644","external":true},{"link":"https://id.loc.gov/authorities/sh85113755","external":true},{"link":"https://kopkatalogs.lv/F/%3Ffunc%3Ddirect%26local_base%3Dlnc10%26doc_number%3D000307942","external":true},{"link":"https://d-nb.info/gnd/4178038-3","external":true},{"link":"https://archive.org/details/naturesbuildingb0000emsl","external":true},{"link":"https://archive.org/details/elementsvisualex0000gray","external":true},{"link":"https://archive.org/details/periodictableits0000scer","external":true},{"link":"//doi.org/10.1351%252Fgoldbook","external":true},{"link":"//doi.org/10.1351%252Fgoldbook","external":true},{"link":"https://data.bnf.fr/ark:/12148/cb12218903f","external":true},{"link":"https://id.loc.gov/authorities/subjects/sh85113755","external":true},{"link":"https://kopkatalogs.lv/F%3Ffunc%3Ddirect%26local_base%3Dlnc10%26doc_number%3D000307942%26P_CON_LNG%3DENG","external":true},{"link":"https://id.ndl.go.jp/auth/ndlna/00569786","external":true}]}
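
One way a consumer could spot a payload like this arriving repeatedly is to fingerprint the fields that describe the change and ignore the per-delivery metadata. The sketch below is only an illustration under stated assumptions: the function names are hypothetical, and `removed_links` is assumed to have the same shape as the `added_links` array shown above.

```python
import hashlib
import json
from typing import Set


def fingerprint(event: dict) -> str:
    """Hash the fields that identify the change, ignoring per-delivery data in 'meta'."""
    key = {
        "database": event.get("database"),
        "rev_id": event.get("rev_id"),
        "added_links": sorted(l["link"] for l in event.get("added_links", [])),
        # Assumed field: treated as optional and shaped like added_links.
        "removed_links": sorted(l["link"] for l in event.get("removed_links", [])),
    }
    return hashlib.sha256(json.dumps(key, sort_keys=True).encode("utf-8")).hexdigest()


def is_repeat(event: dict, seen: Set[str]) -> bool:
    """Return True if an event with the same fingerprint was already seen."""
    fp = fingerprint(event)
    if fp in seen:
        return True
    seen.add(fp)
    return False
```

Two deliveries with different `meta.id` values but the same database, revision and link lists would produce the same fingerprint, so the second one could be logged and skipped.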
Sep 2 2021, 1:16 AM · Data-Engineering-Planning, Platform Engineering, Analytics, Event-Platform Value Stream
Green_Cardamom created T290211: EventStreams sending same data over and over (page links change).
Sep 2 2021, 1:15 AM · Data-Engineering-Planning, Platform Engineering, Analytics, Event-Platform Value Stream

Aug 13 2021

Green_Cardamom added a comment to T279207: http://thomas.loc.gov/ is now dead.

Domain now blacklisted. We could also lift the global live state to "none" and let IABot do dead link detection and fix normally, but blacklisting will resolve faster.

Aug 13 2021, 9:27 PM · InternetArchiveBot
Green_Cardamom closed T286338: All URLs on domain www.hindustantimes.com set to Alive as Resolved.
Aug 13 2021, 9:18 PM · InternetArchiveBot

Aug 7 2021

Green_Cardamom added a comment to T286338: All URLs on domain www.hindustantimes.com set to Alive.

I will process this domain for dead links, set them blacklisted (locks it in so global live state won't matter), then remove the global whitelist. Takes a while.

Aug 7 2021, 3:41 PM · InternetArchiveBot
Green_Cardamom closed T287837: URL validation of archive.today links should also check the redirect URL as Resolved.
Aug 7 2021, 3:34 PM · InternetArchiveBot
Green_Cardamom added a comment to T287837: URL validation of archive.today links should also check the redirect URL.

Fixed. https://ca.wikipedia.org/w/index.php?title=Cas_Peatge&type=revision&diff=27979994&oldid=27979974

Aug 7 2021, 3:34 PM · InternetArchiveBot

Aug 1 2021

Green_Cardamom created T287837: URL validation of archive.today links should also check the redirect URL.
Aug 1 2021, 5:32 PM · InternetArchiveBot

Jun 24 2021

Green_Cardamom added a comment to T267992: Provide mechanism to detect name clashed media between Commons and a Local project, without needing to join tables across wiki-db's.

It wouldn't need to be up to date in real time, so it is fine if the data is a bit stale, like updated weekly or monthly.

Jun 24 2021, 11:55 PM · cloud-services-team, Tools

Jun 6 2021

Green_Cardamom created T284412: Global domain whitelist overrides URL blacklist.
Jun 6 2021, 9:37 PM · InternetArchiveBot

May 19 2021

Green_Cardamom updated the task description for T283211: InternetArchiveBot adding archive URLs on wiki when the URLs are permalive in the db.
May 19 2021, 11:47 PM · Chinese-Sites, InternetArchiveBot
Green_Cardamom created T283211: InternetArchiveBot adding archive URLs on wiki when the URLs are permalive in the db.
May 19 2021, 11:46 PM · Chinese-Sites, InternetArchiveBot

May 9 2021

Green_Cardamom added a comment to T282322: VisualEditor leaving nowiki on enwiki.

@Amire80 that is a fascinating taxonomy you created. Glad you found this thread :) I'm having trouble understanding some of the categories as there are no examples. Intended space does have an example, so I understand where nowiki appears and can find those with a regex, e.g. []]{2}<nowiki/>[[]{2} - in fact there are 8 cases of intended space on enwiki:

If you ever decide to expand the description or add a new column to include (more) examples where possible, it would be very helpful towards developing tools and reports. It should be possible to detect many of them universally (like intended space), potentially making for a global bot/tool/report across all wikis.
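
As a rough illustration of detecting the intended-space case, here is a minimal Python sketch. The function name and sample text are made up for the example; the pattern is the one quoted above, rewritten with escapes because Python warns about "[[" inside a character class.

```python
import re

# Equivalent to []]{2}<nowiki/>[[]{2}: two closing brackets, <nowiki/>, two opening brackets.
INTENDED_SPACE = re.compile(r"\]\]<nowiki/>\[\[")


def find_intended_space(wikitext: str) -> list:
    """Return the start offset of every ']]<nowiki/>[[' sequence in the text."""
    return [m.start() for m in INTENDED_SPACE.finditer(wikitext)]


sample = "[[Paris]]<nowiki/>[[France]] and [[Rome]] [[Italy]]"
print(find_intended_space(sample))  # [7]
```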

May 9 2021, 5:27 PM · Parsoid, Parsoid-Nowiki, VisualEditor

May 8 2021

Green_Cardamom created T282322: VisualEditor leaving nowiki on enwiki.
May 8 2021, 7:38 PM · Parsoid, Parsoid-Nowiki, VisualEditor

Apr 20 2021

Green_Cardamom updated the task description for T280607: IABot deleting URLs on eswiki.
Apr 20 2021, 12:45 PM · Spanish-Sites, InternetArchiveBot
Green_Cardamom updated the task description for T280607: IABot deleting URLs on eswiki.
Apr 20 2021, 1:00 AM · Spanish-Sites, InternetArchiveBot
Green_Cardamom updated the task description for T280607: IABot deleting URLs on eswiki.
Apr 20 2021, 12:48 AM · Spanish-Sites, InternetArchiveBot
Green_Cardamom updated the task description for T280607: IABot deleting URLs on eswiki.
Apr 20 2021, 12:24 AM · Spanish-Sites, InternetArchiveBot
Green_Cardamom updated the task description for T280607: IABot deleting URLs on eswiki.
Apr 20 2021, 12:17 AM · Spanish-Sites, InternetArchiveBot
Green_Cardamom created T280607: IABot deleting URLs on eswiki.
Apr 20 2021, 12:09 AM · Spanish-Sites, InternetArchiveBot

Apr 6 2021

Green_Cardamom merged task T279437: EventStreams producing non-existent/ghost events into T216504: page-links-change stream is assigning template propagation events to the wrong edits.
Apr 6 2021, 3:29 PM · Internet-Archive, EventStreams
Green_Cardamom merged T279437: EventStreams producing non-existent/ghost events into T216504: page-links-change stream is assigning template propagation events to the wrong edits.
Apr 6 2021, 3:29 PM · Data-Engineering, Event-Platform Value Stream, Patch-For-Review, Platform Team Workboards (Clinic Duty Team), The-Wikipedia-Library, Internet-Archive
Green_Cardamom added a comment to T216504: page-links-change stream is assigning template propagation events to the wrong edits.

Hi - I am also having trouble. In one day, it falsely reported over 2,000 edits on ukwiki as made by InternetArchiveBot - not a small number of false positives.

Apr 6 2021, 3:15 PM · Data-Engineering, Event-Platform Value Stream, Patch-For-Review, Platform Team Workboards (Clinic Duty Team), The-Wikipedia-Library, Internet-Archive
Green_Cardamom added a comment to T279437: EventStreams producing non-existent/ghost events.

@Samwalton9 that's it. Thank you.

Apr 6 2021, 3:06 PM · Internet-Archive, EventStreams
Green_Cardamom added a comment to T279437: EventStreams producing non-existent/ghost events.

I see it was transcluded in from this edit to a template used on the page: https://ca.wikiquote.org/w/index.php?title=Plantilla%3ARingler&type=revision&diff=130519&oldid=98762

Apr 6 2021, 2:54 PM · Internet-Archive, EventStreams
Green_Cardamom created T279437: EventStreams producing non-existent/ghost events.
Apr 6 2021, 2:31 PM · Internet-Archive, EventStreams

Mar 20 2021

Green_Cardamom added a comment to T264843: InternetArchiveBot is creating new articles.

I believe it is an old bug long since fixed. If you see anything like it again it's not fixed, but the bot has edited cswiki a lot since then which is a good sign.

Mar 20 2021, 12:37 AM · InternetArchiveBot

Mar 18 2021

Green_Cardamom created T277765: IABot goes into infinite loop when encountering a template inside a square link.
Mar 18 2021, 3:07 PM · Chinese-Sites, InternetArchiveBot

Feb 24 2021

Green_Cardamom added a comment to T274050: Users trying to analyze pages are being told they are blocked when they are not.

On zhwiki this turned out to be an IP filter.

Feb 24 2021, 10:35 PM · InternetArchiveBot

Feb 8 2021

Green_Cardamom added a comment to T274050: Users trying to analyze pages are being told they are blocked when they are not.

This is now happening to me on zhwiki as of today.

Feb 8 2021, 10:19 PM · InternetArchiveBot

Feb 1 2021

Green_Cardamom added a comment to T272410: Fix missing static resource "autocomplete_light" in ToolsAdmin causing broken webpage.

Reporting same problem. Unable to add maintainers. This is urgent, unable to get a team of developers going on a project. Thanks.

Feb 1 2021, 8:17 PM · Patch-For-Review, cloud-services-team (Kanban), Striker

Dec 22 2020

Green_Cardamom edited P13620 extlnks.awk.
Dec 22 2020, 3:57 PM

Dec 21 2020

Green_Cardamom edited P13620 extlnks.awk.
Dec 21 2020, 8:26 PM
Green_Cardamom created P13620 extlnks.awk.
Dec 21 2020, 8:08 PM

Dec 7 2020

Green_Cardamom added a comment to T261300: https://translation-server.toolforge.org/ is a HTTP 502 error since DNS name change.

I'm not really a JS person but suspect this might be solved by simply installing the software from scratch following the directions at https://wikitech.wikimedia.org/wiki/Help:Toolforge/Web/Node.js and making sure everything is in the correct directories. Ideally by someone familiar with JS and/or Node. Perhaps delete everything and start over with a fresh install. What I noticed is that the current install is not in the right directories per the linked directions.

Dec 7 2020, 2:58 AM · Tools

Nov 20 2020

Green_Cardamom added a comment to T267992: Provide mechanism to detect name clashed media between Commons and a Local project, without needing to join tables across wiki-db's.

Reposting bd808's comment: "Holding open an SQL select for hours (or even minutes) while you page through it for secondary lookups (the in-memory join as it were) is prone to a lot of interruption vectors." An SQL query that takes hours to complete, and is run daily, might run into aborts and lost data?
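
One common way to avoid holding a single SELECT open for hours is keyset pagination: many short queries, each resuming after the last title seen. The sketch below names that technique plainly as a substitute for illustration; it is not the method used by the shadows tooling. It uses SQLite and a hypothetical `page(title)` table so that it is self-contained.

```python
import sqlite3
from typing import Iterator, List


def titles_in_batches(conn: sqlite3.Connection, batch_size: int) -> Iterator[List[str]]:
    """Yield titles in sorted batches, one short query per batch (keyset pagination)."""
    last = ""  # every non-empty title sorts after the empty string
    while True:
        rows = conn.execute(
            "SELECT title FROM page WHERE title > ? ORDER BY title LIMIT ?",
            (last, batch_size),
        ).fetchall()
        if not rows:
            return
        batch = [r[0] for r in rows]
        yield batch
        last = batch[-1]  # resume point; could be saved to disk to survive an abort


# Demo with an in-memory database and a hypothetical table layout.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE page (title TEXT PRIMARY KEY)")
conn.executemany("INSERT INTO page VALUES (?)", [(t,) for t in "edcba"])
print(list(titles_in_batches(conn, 2)))  # [['a', 'b'], ['c', 'd'], ['e']]
```

If a batch fails, only that batch has to be retried from the saved resume point rather than the whole multi-hour query.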

Nov 20 2020, 5:53 PM · cloud-services-team, Tools

Nov 18 2020

Green_Cardamom added a comment to T267992: Provide mechanism to detect name clashed media between Commons and a Local project, without needing to join tables across wiki-db's.

@Bstorm what happens when a new File: is uploaded to Commons that overlaps with a pre-existing File: on enwiki, for example one uploaded years ago?

Nov 18 2020, 7:49 PM · cloud-services-team, Tools
Green_Cardamom added a comment to T267992: Provide mechanism to detect name clashed media between Commons and a Local project, without needing to join tables across wiki-db's.

OK thinking it through.. for example monitor recentchanges on the Commons side (the larger) and compare against the full corpus on the Enwiki side. This is great because it reduces the size of Commons to a few entries. Thus if the file pre-exists on Enwiki and is then added to Commons it works. But if the file pre-exists on Commons and is then added to Enwiki, recentchanges on Commons would not see it. So there would have to be recentchanges for both Enwiki and Commons, in both directions. But at the same time, you would also need the full corpus in both directions, leading back to the original problem of a very large list for Commons. Unless the idea is to build that list from scratch on Day 1, save it to disk, and each day add to it from recentchanges. Or maybe I am missing something.
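
The "build once, then update daily in both directions" idea can be sketched with two saved title sets. This is a simplified illustration with hypothetical names: it treats recentchanges as a plain list of newly uploaded titles and ignores deletions and renames.

```python
from typing import Iterable, Set


def daily_update(commons: Set[str], local: Set[str],
                 new_on_commons: Iterable[str],
                 new_on_local: Iterable[str]) -> Set[str]:
    """Fold one day's new titles into both saved sets and return the new clashes."""
    new_c, new_l = set(new_on_commons), set(new_on_local)
    # A new Commons file clashes with an existing (or same-day) local file,
    # and a new local file clashes with an existing (or same-day) Commons file.
    clashes = (new_c & (local | new_l)) | (new_l & (commons | new_c))
    commons |= new_c  # update the saved full lists in place
    local |= new_l
    return clashes


commons = {"A.jpg", "B.jpg"}   # full list built on Day 1
local = {"C.jpg"}
print(sorted(daily_update(commons, local, ["C.jpg"], ["A.jpg", "D.jpg"])))  # ['A.jpg', 'C.jpg']
```

Only the daily deltas are fetched after Day 1, but the full Commons list still has to be built and stored once, which is the cost the comment points out.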

Nov 18 2020, 7:31 PM · cloud-services-team, Tools
Green_Cardamom added a comment to T267992: Provide mechanism to detect name clashed media between Commons and a Local project, without needing to join tables across wiki-db's.

Shadows bot currently runs every 24 hours, so the AntiCompositeNumber SQL method, which takes under 6 hours, is workable in that sense. It is slower than the 90-second join method. It seems like a lot, 6 hours a day on a SQL query paging through, and it is more error-prone given the time exposure and networking. I can confirm using an API method would take around 70 hours; the pywikibot method would probably be similar if API based. I estimate the data size of 65 million page titles at 1-3 gigabytes. It's a lot to request, retrieve and process daily, even in the same datacenter.
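
The figures in this comment can be put side by side with some quick arithmetic. The sketch below only uses the numbers quoted above (65 million titles, 1-3 GB, 70 hours, 6 hours); the function names are made up, and 1 GB is taken as 10^9 bytes.

```python
TITLES = 65_000_000   # page titles, from the comment
API_HOURS = 70        # estimated API run time, from the comment
SQL_HOURS = 6         # upper bound for the SQL paging method, from the comment


def bytes_per_title(total_gigabytes: float, titles: int = TITLES) -> float:
    """Average bytes per title implied by a total data size."""
    return total_gigabytes * 1e9 / titles


def titles_per_second(hours: float, titles: int = TITLES) -> float:
    """Average throughput needed to cover all titles in the given time."""
    return titles / (hours * 3600)


print(round(bytes_per_title(1)), round(bytes_per_title(3)))  # 15 46
print(round(titles_per_second(API_HOURS)))                   # 258
print(round(titles_per_second(SQL_HOURS)))                   # 3009
```

So the 1-3 GB estimate corresponds to roughly 15-46 bytes per title, and the 70-hour API run to about 258 titles per second on average.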

Nov 18 2020, 4:17 PM · cloud-services-team, Tools

Oct 31 2020

Green_Cardamom created T266932: TARB: Holocaust Memorial Museum [enwiki].
Oct 31 2020, 3:04 PM · InternetArchiveBot, Internet-Archive
Green_Cardamom created T266931: TARB: missed books with ISBN match.
Oct 31 2020, 2:56 PM · InternetArchiveBot
Green_Cardamom created T266930: TARB: missing url field.
Oct 31 2020, 2:54 PM · InternetArchiveBot
Green_Cardamom created T266929: TARB: Roman numerals.
Oct 31 2020, 2:52 PM · InternetArchiveBot
Green_Cardamom created T266928: TARB: url conflict with title-link generate red error.
Oct 31 2020, 2:49 PM · InternetArchiveBot

Sep 3 2020

Green_Cardamom added a comment to T261300: https://translation-server.toolforge.org/ is a HTTP 502 error since DNS name change.

Since learned it should be installed per: https://wikitech.wikimedia.org/wiki/Help:Toolforge/Web/Node.js
I have not had a moment's time to try it, but suspect this convention would allow the proxy to see it.

Sep 3 2020, 5:41 PM · Tools

Sep 1 2020

Green_Cardamom added a comment to T261300: https://translation-server.toolforge.org/ is a HTTP 502 error since DNS name change.

After trying various things, it now reports it is running:

> translation-server@2.0.3 start /mnt/nfs/labstore-secondary-tools-project/translation-server/www
> node src/server.js
Sep 1 2020, 4:51 AM · Tools
Green_Cardamom added a comment to T261300: https://translation-server.toolforge.org/ is a HTTP 502 error since DNS name change.

If anyone would like shell access to look around let me know.

Sep 1 2020, 3:54 AM · Tools
Green_Cardamom added a comment to T261300: https://translation-server.toolforge.org/ is a HTTP 502 error since DNS name change.

The cronjob is throwing this error:

Sep 1 2020, 3:29 AM · Tools

Aug 28 2020

Green_Cardamom closed T241910: Extra forward-slash added as Resolved.
Aug 28 2020, 4:13 PM · InternetArchiveBot
Green_Cardamom added a comment to T241910: Extra forward-slash added.

This might just be bad data in the db; it seems OK after correcting the db.

Aug 28 2020, 4:13 PM · InternetArchiveBot
Green_Cardamom updated the task description for T241910: Extra forward-slash added.
Aug 28 2020, 4:07 PM · InternetArchiveBot
Green_Cardamom closed T247008: Incorrectly adds ISBN into quote field of ref as Resolved.
Aug 28 2020, 3:56 PM · InternetArchiveBot
Green_Cardamom added a comment to T247008: Incorrectly adds ISBN into quote field of ref.

It no longer does this (move the &dq text into the |quote field).

Aug 28 2020, 3:56 PM · InternetArchiveBot
Green_Cardamom added a comment to T261300: https://translation-server.toolforge.org/ is a HTTP 502 error since DNS name change.

I've initiated an abandoned tool procedure so we can add additional maintainers and hopefully fix the tool. smith609 is reportedly busy IRL right now.

Aug 28 2020, 3:19 PM · Tools

Jul 22 2020

Green_Cardamom closed T258498: Login not working - Oauth error - IA Upload tool on toolforge as Invalid.
Jul 22 2020, 3:52 PM · Community-Tech, IA Upload
Green_Cardamom added a comment to T258498: Login not working - Oauth error - IA Upload tool on toolforge.

@ifried ok thanks, my mistake, I had the wrong URL. Closing tix.

Jul 22 2020, 3:52 PM · Community-Tech, IA Upload

Jul 21 2020

Green_Cardamom updated the task description for T258498: Login not working - Oauth error - IA Upload tool on toolforge.
Jul 21 2020, 2:49 PM · Community-Tech, IA Upload
Green_Cardamom created T258498: Login not working - Oauth error - IA Upload tool on toolforge.
Jul 21 2020, 2:48 PM · Community-Tech, IA Upload

Jun 13 2020

Green_Cardamom closed T255338: Adds unrelated url to book review as Resolved.
Jun 13 2020, 1:12 PM · InternetArchiveBot
Green_Cardamom added a comment to T255338: Adds unrelated url to book review.

The bot is patched. Closing ticket; if you still want to discuss, reopen.

Jun 13 2020, 1:12 PM · InternetArchiveBot
Green_Cardamom added a comment to T255338: Adds unrelated url to book review.

Checked the logs and found one other case at A. J. B. Johnston.

Jun 13 2020, 1:09 PM · InternetArchiveBot
Green_Cardamom added a comment to T255338: Adds unrelated url to book review.

It appears "title=none" is an undocumented bypass, otherwise gives a red error. It will skip on title=none.

Jun 13 2020, 1:01 PM · InternetArchiveBot