Green_Cardamom (GreenC)
User

Projects

User does not belong to any projects.

User Details

User Since
Jun 4 2016, 1:17 PM (72 w, 2 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
Green Cardamom

Recent Activity

Sun, Oct 22

Green_Cardamom closed T178751: Provide an API so external tools can run SQL queries via Quarry as Declined.
Sun, Oct 22, 8:54 PM · Quarry
Green_Cardamom added a comment to T178751: Provide an API so external tools can run SQL queries via Quarry.

OK, I'll close this. It doesn't sound like it will work, but thanks for the explanations; it was good to have explored the idea anyway.

Sun, Oct 22, 8:54 PM · Quarry
Green_Cardamom added a comment to T178751: Provide an API so external tools can run SQL queries via Quarry.

As background I've written a unix command-line tool called Wikiget for accessing certain Wikimedia API functions to generate lists of article titles

Sun, Oct 22, 3:35 AM · Quarry
Green_Cardamom created T178751: Provide an API so external tools can run SQL queries via Quarry.
Sun, Oct 22, 2:39 AM · Quarry

Thu, Oct 12

Green_Cardamom updated the task description for T178106: Templates that support archiveurl/archivedat arguments.
Thu, Oct 12, 7:31 PM · InternetArchiveBot
Green_Cardamom added a comment to T178106: Templates that support archiveurl/archivedat arguments.

Complete list incorporating original and new templates:

Thu, Oct 12, 7:28 PM · InternetArchiveBot
Green_Cardamom added a comment to T178106: Templates that support archiveurl/archivedat arguments.

New templates discovered

Thu, Oct 12, 7:27 PM · InternetArchiveBot
Restricted Application assigned T178106: Templates that support archiveurl/archivedat arguments to Cyberpower678.
Thu, Oct 12, 7:26 PM · InternetArchiveBot

Wed, Oct 11

Green_Cardamom added a comment to T177907: {{webarchive |format=...}}.

It has a transclusion count of 250. Boutique templates are hard to find,

Wed, Oct 11, 11:58 PM · InternetArchiveBot (v1.5)
Restricted Application assigned T177907: {{webarchive |format=...}} to Cyberpower678.
Wed, Oct 11, 4:39 AM · InternetArchiveBot (v1.5)

Sat, Oct 7

Green_Cardamom added a comment to T177185: {{cite constitution}}.

I don't understand {{cite constitution}} because it doesn't appear to support a URL argument, so I'm beginning to think it is a mistake

Sat, Oct 7, 7:48 PM · InternetArchiveBot (v1.5)

Wed, Oct 4

Green_Cardamom added a comment to T177270: API:Search maxes out at 10000.

we won't be extending this limitation

Wed, Oct 4, 4:21 PM · Discovery-Search, CirrusSearch, Discovery, MediaWiki-API
Green_Cardamom added a comment to T177270: API:Search maxes out at 10000.

@Betacommand - yes wikiget supports linksearch. I was using a link in the example search but it could be anything.
@debt - from what I understand the limit is an Elasticsearch configuration unrelated to the API. There aren't really any API workarounds, except a local install of Elasticsearch and fresh copies of the index (or the dump).

Wed, Oct 4, 3:04 PM · Discovery-Search, CirrusSearch, Discovery, MediaWiki-API

Tue, Oct 3

Green_Cardamom added a comment to T177270: API:Search maxes out at 10000.

Agreed that checking totalhits against total results is a good idea anyway. Understood about the limit .. what it is. I'll add something to the API doc page so future editors are aware. I didn't know about cirrusdumps; that's probably more than I want to do, but I will keep it in mind for the future.

Tue, Oct 3, 2:41 PM · Discovery-Search, CirrusSearch, Discovery, MediaWiki-API
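
As an illustration of the totalhits check mentioned in the comment above, here is a minimal Python sketch. It assumes the standard list=search response layout, where the reported total sits at query.searchinfo.totalhits; the function name and the sample response are hypothetical and are not taken from Wikiget.

```python
# Hypothetical helper: compare the number of titles actually collected
# against the total the search backend says it matched.
def search_is_complete(first_response, titles_collected):
    totalhits = first_response["query"]["searchinfo"]["totalhits"]
    return len(titles_collected) >= totalhits

# Hand-written sample, not real API output.
sample_response = {"query": {"searchinfo": {"totalhits": 25000}, "search": []}}
collected = ["Title"] * 10000  # paging stopped at the 10000 cap

complete = search_is_complete(sample_response, collected)
```

When the check fails, the caller knows the list was cut off by the limit rather than exhausted, and can warn the user instead of silently returning a partial list.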
Green_Cardamom added a comment to T177270: API:Search maxes out at 10000.

@dcausse thanks for the info, did not know this.

Tue, Oct 3, 2:14 PM · Discovery-Search, CirrusSearch, Discovery, MediaWiki-API
Green_Cardamom created T177270: API:Search maxes out at 10000.
Tue, Oct 3, 4:37 AM · Discovery-Search, CirrusSearch, Discovery, MediaWiki-API

Mon, Oct 2

Restricted Application assigned T177185: {{cite constitution}} to Cyberpower678.
Mon, Oct 2, 2:08 AM · InternetArchiveBot (v1.5)

Mon, Sep 25

Green_Cardamom added a comment to T176562: Autogenerate false positive reports when the bot detects it's been reverted.

I'll leave some as I think of them.

Mon, Sep 25, 11:52 PM · InternetArchiveBot (v1.6)

Sun, Sep 24

Green_Cardamom added a comment to T176579: Investigate supporting MMS protocol on the checkIfDead class.

https://stackoverflow.com/questions/4778195/check-if-mms-stream-exists-or-not-using-php

Sun, Sep 24, 3:43 PM · InternetArchiveBot (v1.6)

Sep 22 2017

Restricted Application assigned T176455: Odd edits Book of Common Prayer to Cyberpower678.
Sep 22 2017, 1:38 AM · InternetArchiveBot

Sep 5 2017

Green_Cardamom added a comment to T174945: Spacing in expanded templates.

Great! Hope it wasn't too difficult. Occurred to me the easiest method would be 1) determine if there is padded spacing involved and if so 2) change all to the length of the longest argument + 1 space.

Sep 5 2017, 4:02 PM · InternetArchiveBot (v1.5)
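
The two-step method suggested in the comment above could be sketched roughly as follows in Python. This is an illustration only, not InternetArchiveBot's code; the function names, the padding heuristic, and the sample parameters are all assumptions.

```python
import re

def uses_padding(lines):
    # Step 1 (assumed heuristic): the template is treated as padded
    # if any line has two or more spaces before its "=".
    return any(re.search(r" {2,}=", line) for line in lines)

def align_template_params(params):
    # Step 2: pad every name to the length of the longest name + 1 space.
    width = max(len(name) for name, _ in params) + 1
    return [f"| {name.ljust(width)}= {value}" for name, value in params]

lines = align_template_params([
    ("url", "http://example.com"),
    ("archive-url", "https://web.archive.org/web/2017/http://example.com"),
    ("title", "Example"),
])
```

With these inputs every "=" ends up in the same column, so a newly added argument would not break the existing alignment.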

Sep 4 2017

Green_Cardamom closed T174832: aolnews.com not being rescued as Resolved.
Sep 4 2017, 2:43 PM · InternetArchiveBot
Green_Cardamom added a comment to T174832: aolnews.com not being rescued.

Ok.. I set the domain to dead and reran the bot on the 440 articles

Sep 4 2017, 2:28 PM · InternetArchiveBot
Restricted Application assigned T174945: Spacing in expanded templates to Cyberpower678.
Sep 4 2017, 2:24 PM · InternetArchiveBot (v1.5)

Sep 2 2017

Green_Cardamom added a comment to T174832: aolnews.com not being rescued.

Sorry missed that :) Here's another for Evander Holyfield

Sep 2 2017, 4:47 PM · InternetArchiveBot

Sep 1 2017

Restricted Application assigned T174832: aolnews.com not being rescued to Cyberpower678.
Sep 1 2017, 9:47 PM · InternetArchiveBot

Aug 29 2017

Restricted Application assigned T174436: False detection of wayback link to Cyberpower678.
Aug 29 2017, 1:09 PM · InternetArchiveBot

Aug 23 2017

Green_Cardamom added a comment to T173843: webcite url's truncated.

I forgot. Glad it's fixed .. yeah, sure enough IMP fixed it in the database on July 2

Aug 23 2017, 1:52 AM · InternetArchiveBot
Restricted Application assigned T173843: webcite url's truncated to Cyberpower678.
Aug 23 2017, 1:46 AM · InternetArchiveBot

Aug 22 2017

Green_Cardamom closed T173830: Embeded cite web in cite web as Resolved.
Aug 22 2017, 7:42 PM · InternetArchiveBot
Green_Cardamom added a comment to T173830: Embeded cite web in cite web.

This is a rare case; closing.

Aug 22 2017, 7:42 PM · InternetArchiveBot
Green_Cardamom added a comment to T173830: Embeded cite web in cite web.

I removed the two {{date}} templates

Aug 22 2017, 7:39 PM · InternetArchiveBot
Green_Cardamom added a comment to T173847: Double {{webarchive}}.

It's always like this pattern, with the first one at the end of the ref, and the second added one at the end of the cite. That's why I thought it might be due to a detection problem.

Aug 22 2017, 7:33 PM · InternetArchiveBot (v1.5)
Restricted Application assigned T173847: Double {{webarchive}} to Cyberpower678.
Aug 22 2017, 4:49 PM · InternetArchiveBot (v1.5)
Green_Cardamom created T173843: webcite url's truncated.
Aug 22 2017, 4:25 PM · InternetArchiveBot
Restricted Application assigned T173830: Embeded cite web in cite web to Cyberpower678.
Aug 22 2017, 2:08 PM · InternetArchiveBot

Aug 20 2017

Green_Cardamom added a comment to T172313: Detect robots.txt exclusion on archive.org.

GreenC bot detects robots.txt and will try to find a different archive if available. If none available it will keep the robots.txt snapshot, because Wayback management said they plan to remove that policy block sometime in the near future, hopefully.

Aug 20 2017, 7:06 PM · Internet-Archive, InternetArchiveBot
Green_Cardamom added a comment to T172055: InternetArchiveBot adds dead archive (diff included).

GreenC bot does some light soft 404 checking of Wayback links, but mostly that is only done for archive.is .. if it did it for Wayback I'm afraid the false positive rate would outweigh the benefit, since Wayback has a low rate of soft 404s anyway. With archive.is the soft 404 rate is > 50%, so false positives are less of a concern; the benefits outweigh the loss.

Aug 20 2017, 7:03 PM · Internet-Archive, InternetArchiveBot
Green_Cardamom added a comment to T173036: IABot couldn't find archive for archived URL.

The Wayback API says no snapshot is available:

Aug 20 2017, 6:56 PM · Internet-Archive, InternetArchiveBot

Aug 16 2017

Green_Cardamom added a comment to T172737: Compare HTTP responses for NO RESPONSE FROM SERVER with external API.

Good timing as I just completed IMP yesterday (verified 4 million URLs in 2 months) and now back to running WaybackMedic on enwiki for the moment.

Aug 16 2017, 4:48 PM · InternetArchiveBot (v1.5)

Jul 25 2017

Green_Cardamom added a comment to T171023: {{webarchive}} and __FORMAT__.

Removed from enwiki, but there are some in other languages.

Jul 25 2017, 2:24 PM · InternetArchiveBot (v1.4)

Jul 19 2017

Restricted Application assigned T171057: Non-archive archive.org URLs in database to Cyberpower678.
Jul 19 2017, 2:18 PM · InternetArchiveBot, Internet-Archive
Restricted Application assigned T171023: {{webarchive}} and __FORMAT__ to Cyberpower678.
Jul 19 2017, 2:26 AM · InternetArchiveBot (v1.4)

Jul 13 2017

Green_Cardamom added a comment to T170413: Unable to update URL even though API reports success.

True, because WebCite drops anything beyond ?url .. and theoretically it should work if it's "+" or "%20" in the query. But what if it is a site that is not flexible in these ways?

Jul 13 2017, 8:43 PM · InternetArchiveBot
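
To illustrate the "+" versus "%20" point in the comment above: Python's standard form-style query parser treats both as a space, while plain percent-decoding (as applied to a path) leaves "+" untouched. This only shows how one common decoder behaves; the sample strings are made up, and a given site may not be this flexible.

```python
from urllib.parse import parse_qs, unquote

# Form-style query decoding: "+" and "%20" both become a space.
plus_form = parse_qs("q=new+york")
percent_form = parse_qs("q=new%20york")

# Plain percent-decoding: "+" stays a literal plus sign.
path_plus = unquote("/new+york")
path_percent = unquote("/new%20york")
```

A server that decodes its query string the way unquote does would therefore see two different values, which is why rewriting one form to the other is not always safe.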

Jul 12 2017

Restricted Application assigned T170413: Unable to update URL even though API reports success to Cyberpower678.
Jul 12 2017, 12:17 PM · InternetArchiveBot

Jul 10 2017

Green_Cardamom added a comment to T170142: IABot API - truncates at %20 with modifyurl.

Some of them are not taking.

Jul 10 2017, 4:13 PM · InternetArchiveBot, Internet-Archive
Green_Cardamom added a comment to T170142: IABot API - truncates at %20 with modifyurl.

Ugh that's a small oversight. Fortunately it won't be difficult to go back and rerun all the ones that had a % in the URL. For some reason I was intentionally decoding the URL before encoding, I don't remember why, but probably works to do a single encoding with no pre-decode.

Jul 10 2017, 3:46 PM · InternetArchiveBot, Internet-Archive
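
A small Python sketch of why decoding before encoding can matter, as discussed in the comment above. It is one plausible reading of the problem, not the actual IMP or IABot code; the function names and the example URL are assumptions.

```python
from urllib.parse import quote, unquote

def encode_once(url):
    # Single encoding: the literal "%20" in the URL is protected as "%2520".
    return quote(url, safe="")

def decode_then_encode(url):
    # Pre-decoding first turns "%20" into a space, then encodes the space.
    return quote(unquote(url), safe="")

original = "http://example.com/a%20b"

# What a receiver would see after decoding the parameter once.
received_single = unquote(encode_once(original))
received_predecoded = unquote(decode_then_encode(original))
```

With a single encoding the receiver gets the original URL back unchanged. With the pre-decode it gets a URL containing a raw space, which a downstream parser could cut short at that point.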
Restricted Application assigned T170142: IABot API - truncates at %20 with modifyurl to Cyberpower678.
Jul 10 2017, 1:10 PM · InternetArchiveBot, Internet-Archive

Jun 26 2017

Green_Cardamom added a comment to T168794: Unable to manage URLs containing +.

Phew excellent.

Jun 26 2017, 11:30 PM · InternetArchiveBot (v1.4), Internet-Archive
Green_Cardamom added a comment to T168794: Unable to manage URLs containing +.

Yeah I've come across a lot like it.. would a db script change IDs? That is what IMP uses to track what it processed.

Jun 26 2017, 11:26 PM · InternetArchiveBot (v1.4), Internet-Archive

Jun 25 2017

Restricted Application assigned T168794: Unable to manage URLs containing + to Cyberpower678.
Jun 25 2017, 3:41 PM · InternetArchiveBot (v1.4), Internet-Archive

Jun 20 2017

Green_Cardamom updated the task description for T168351: v1.1 beta3 unable to update URLs.
Jun 20 2017, 2:42 AM · InternetArchiveBot (v1.4), Internet-Archive
Restricted Application assigned T168351: v1.1 beta3 unable to update URLs to Cyberpower678.
Jun 20 2017, 2:40 AM · InternetArchiveBot (v1.4), Internet-Archive

Jun 18 2017

Restricted Application assigned T168202: https:/// in database to Cyberpower678.
Jun 18 2017, 1:22 PM · InternetArchiveBot (v1.4)

Jun 15 2017

Green_Cardamom added a comment to T167605: Unable to override archive validation check in Tool/API.

Is v1.1beta3 live? How do I check the running version?

Jun 15 2017, 3:32 PM · InternetArchiveBot (v1.4), Internet-Archive
Green_Cardamom added a comment to T167512: IABot API wishlist.

Great thank you #1. For #2 the thinking is it needs to be able to track a link has been verified, most of the time it makes no change to the database because it verified OK. Being able to track this is important so it can go back and re-process the database to catch any new archive URLs that were added without re-processing the same URLs it already verified which is very time consuming. Unless they passed an expiration date and need to be re-verified thus the need for a date of verification. Maybe can track it locally not sure, what do you think. I thought if it was tracked in the IABot database anyone could then run IMP.

Jun 15 2017, 3:25 PM · InternetArchiveBot (v1.4)

Jun 12 2017

Green_Cardamom added a comment to T167512: IABot API wishlist.

Oh no! Well the more I think about #2 it might make more sense to track it local but still thinking about it. #4 is just an idea with no immediate application/need but could open possibilities. #1 a reverse lookup needed for debugging (needed it today for example). #3 will be key to saving links.

Jun 12 2017, 12:54 AM · InternetArchiveBot (v1.4)

Jun 11 2017

Green_Cardamom updated the task description for T167605: Unable to override archive validation check in Tool/API.
Jun 11 2017, 11:42 PM · InternetArchiveBot (v1.4), Internet-Archive
Green_Cardamom updated the task description for T167605: Unable to override archive validation check in Tool/API.
Jun 11 2017, 8:50 PM · InternetArchiveBot (v1.4), Internet-Archive
Green_Cardamom renamed T167605: Unable to override archive validation check in Tool/API from Unable to override archive validatio check in Tool/API to Unable to override archive validation check in Tool/API.
Jun 11 2017, 5:41 PM · InternetArchiveBot (v1.4), Internet-Archive
Green_Cardamom updated the task description for T167605: Unable to override archive validation check in Tool/API.
Jun 11 2017, 5:40 PM · InternetArchiveBot (v1.4), Internet-Archive
Restricted Application assigned T167605: Unable to override archive validation check in Tool/API to Cyberpower678.
Jun 11 2017, 5:39 PM · InternetArchiveBot (v1.4), Internet-Archive

Jun 9 2017

Restricted Application assigned T167512: IABot API wishlist to Cyberpower678.
Jun 9 2017, 2:08 PM · InternetArchiveBot (v1.4)
Green_Cardamom closed T167411: Protocol-relative URLs and SSL as Invalid.
Jun 9 2017, 2:07 AM · InternetArchiveBot
Green_Cardamom added a comment to T167411: Protocol-relative URLs and SSL.

Started a BOTREQ

Jun 9 2017, 2:07 AM · InternetArchiveBot

Jun 8 2017

Green_Cardamom added a comment to T167411: Protocol-relative URLs and SSL.

I started a discussion at Village Pump technical maybe it will answer some questions and/or lead to a bot that does conversions

Jun 8 2017, 3:15 PM · InternetArchiveBot
Green_Cardamom added a comment to T167411: Protocol-relative URLs and SSL.

Huh you're right it's forcing https. That's crazy. It breaks the whole point of PR and breaks many URLs.

Jun 8 2017, 3:00 PM · InternetArchiveBot
Green_Cardamom updated the task description for T167411: Protocol-relative URLs and SSL.
Jun 8 2017, 2:19 PM · InternetArchiveBot
Restricted Application assigned T167411: Protocol-relative URLs and SSL to Cyberpower678.
Jun 8 2017, 1:37 PM · InternetArchiveBot

Jun 5 2017

Green_Cardamom added a comment to T166928: Blank archiveurl/archivedate/url arguments in CS1|2 templates.

You're right I assumed that was VE,

Jun 5 2017, 1:49 PM · VisualEditor

Jun 3 2017

Green_Cardamom placed T166928: Blank archiveurl/archivedate/url arguments in CS1|2 templates up for grabs.
Jun 3 2017, 2:29 AM · VisualEditor
Green_Cardamom updated the task description for T166928: Blank archiveurl/archivedate/url arguments in CS1|2 templates.
Jun 3 2017, 2:25 AM · VisualEditor
Restricted Application assigned T166928: Blank archiveurl/archivedate/url arguments in CS1|2 templates to Cyberpower678.
Jun 3 2017, 2:22 AM · VisualEditor

Jun 1 2017

Green_Cardamom added a comment to T166791: Double {{webarchive}}.

In the first diff, it added the same {{webarchive}} twice, even though the source URLs are different.

Jun 1 2017, 4:25 PM · InternetArchiveBot (v1.4)
Restricted Application assigned T166792: Archive.is date format to Cyberpower678.
Jun 1 2017, 2:49 PM · InternetArchiveBot (v1.4)
Restricted Application assigned T166791: Double {{webarchive}} to Cyberpower678.
Jun 1 2017, 2:31 PM · InternetArchiveBot (v1.4)

May 26 2017

Green_Cardamom added a comment to T165504: Double wayback URL.

It's definitely a lot fewer - between the time of the fix and May 22 this is the only occurrence.

May 26 2017, 3:13 PM · InternetArchiveBot (v1.3)
Green_Cardamom reopened T165504: Double wayback URL as "Open".
May 26 2017, 2:29 PM · InternetArchiveBot (v1.3)
Green_Cardamom added a comment to T165504: Double wayback URL.

It appears in 1.3.2 on May 21

May 26 2017, 2:29 PM · InternetArchiveBot (v1.3)

May 23 2017

Green_Cardamom added a comment to T165827: Truncated webcite URL.

I processed about 8000 webcite links randomly .. rechecking the API with the wikitext .. and found they mismatch in about 5% of cases, most of the time due to being truncated at the &. This is not an easy problem because the WebCite API is slow, and there are so many WebCite URLs in the database and wikitext. But I think this should be fixed somehow.

May 23 2017, 1:46 PM · InternetArchiveBot

May 22 2017

Green_Cardamom added a comment to T166008: Missing {{dead link}} tags.

It fixed 6 out of 8

May 22 2017, 2:02 PM · InternetArchiveBot (v1.3)

May 21 2017

Green_Cardamom closed T162722: Converting + to %20 as Resolved.
May 21 2017, 7:43 PM · InternetArchiveBot (v1.3)
Green_Cardamom added a comment to T162722: Converting + to %20 .

Number 3 is a variant of 2. When bare links are converted to Wayback links they get sanitized. If the Wayback link stops working in the future, one needs to extract the original URL from the Wayback URL to search other services, and since it was previously sanitized it may not be found at the new service. This is mostly true for archive.is, since they crawled Wikipedia saving URLs as they were found at the time of the crawl.

May 21 2017, 7:43 PM · InternetArchiveBot (v1.3)
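
Extracting the original URL from a Wayback URL, as described in the comment above, might look like this in Python. The pattern is a simplified assumption covering the common web.archive.org/web/<timestamp>/<url> form with an optional two-letter modifier such as id_; the names are hypothetical and this is not GreenC bot's code.

```python
import re

# Simplified pattern: scheme, host, numeric timestamp, optional modifier, then the source URL.
WAYBACK_RE = re.compile(
    r"^https?://web\.archive\.org/web/(\d{1,14})(?:[a-z]{2}_)?/(.+)$"
)

def split_wayback(url):
    # Return (timestamp, original_url), or None if the URL does not match.
    match = WAYBACK_RE.match(url)
    if match is None:
        return None
    return match.group(1), match.group(2)

result = split_wayback(
    "https://web.archive.org/web/20170521000000/http://example.com/?q=a+b"
)
```

The extracted URL is whatever was stored in the Wayback link. If that link was sanitized when it was created (for example "+" rewritten to "%20"), the sanitized form is what gets recovered, which is the concern raised in the comment.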
Green_Cardamom reopened T162722: Converting + to %20 as "Open".
May 21 2017, 3:27 PM · InternetArchiveBot (v1.3)
Green_Cardamom added a comment to T162722: Converting + to %20 .

Sample ongoing bot wars over the %20/+ in query string.. not a complete list

May 21 2017, 3:27 PM · InternetArchiveBot (v1.3)
Restricted Application assigned T166008: Missing {{dead link}} tags to Cyberpower678.
May 21 2017, 2:31 PM · InternetArchiveBot (v1.3)

May 20 2017

Green_Cardamom closed T165912: GreenC bot edit war: duplicate {{webarchive}} as Invalid.
May 20 2017, 5:53 PM · InternetArchiveBot
Green_Cardamom added a comment to T165912: GreenC bot edit war: duplicate {{webarchive}}.

Actually forget this.. found other edit warring cases and will open a new ticket later.

May 20 2017, 5:53 PM · InternetArchiveBot
Restricted Application assigned T165912: GreenC bot edit war: duplicate {{webarchive}} to Cyberpower678.
May 20 2017, 5:23 PM · InternetArchiveBot
Green_Cardamom closed T165827: Truncated webcite URL as Resolved.
May 20 2017, 1:56 PM · InternetArchiveBot
Green_Cardamom added a comment to T165827: Truncated webcite URL.

Ok .. found it manually by random check. What I'll do is verify them all in the next batch and see how many if any show up and determine by those numbers.

May 20 2017, 1:55 PM · InternetArchiveBot
Green_Cardamom added a comment to T165827: Truncated webcite URL.

Ok got it re: how to test for bug in code vs database.

May 20 2017, 1:48 PM · InternetArchiveBot
Green_Cardamom reopened T165827: Truncated webcite URL as "Open".
May 20 2017, 1:37 PM · InternetArchiveBot
Green_Cardamom added a comment to T165827: Truncated webcite URL.

Is there a way to tell when the URL was added to the database? I assumed it was just added by IABot but from what you're saying it was already in the database. This would help before reporting problems to determine if the link already existed in the database.

May 20 2017, 1:36 PM · InternetArchiveBot
Green_Cardamom closed T165826: bot:unknown in conversion from webarchive to citeweb as Invalid.
May 20 2017, 1:20 PM · InternetArchiveBot
Green_Cardamom added a comment to T165826: bot:unknown in conversion from webarchive to citeweb.

I see ok.

May 20 2017, 1:20 PM · InternetArchiveBot
Green_Cardamom added a comment to T165827: Truncated webcite URL.

I checked the WebCite API results and it looks correct,

May 20 2017, 2:38 AM · InternetArchiveBot
Restricted Application assigned T165827: Truncated webcite URL to Cyberpower678.
May 20 2017, 2:36 AM · InternetArchiveBot
Restricted Application assigned T165826: bot:unknown in conversion from webarchive to citeweb to Cyberpower678.
May 20 2017, 2:01 AM · InternetArchiveBot

May 16 2017

Green_Cardamom added a comment to T164048: Revert 225 bot edits adding https:///.

Fix done on svwiki.

May 16 2017, 10:53 PM · InternetArchiveBot (v1.3), Internet-Archive