Green_Cardamom (GreenC)
User

Projects

User does not belong to any projects.

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Tuesday

  • Clear sailing ahead.

User Details

User Since
Jun 4 2016, 1:17 PM (47 w, 1 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
Green Cardamom

Recent Activity

Today

Green_Cardamom added a comment to T164119: Recognize freezepage.com.

I couldn't figure anything out. Does it need the date? So long as it doesn't delete, skip processing those but leave in place. Another option is WaybackMedic can add the date to the URL with a &date=20161001 .. because the query portion with ?url= was contrived by me (using the example of WebCite), it makes no difference to the URL working.

Sun, Apr 30, 4:31 PM · InternetArchiveBot
Green_Cardamom added a comment to T164119: Recognize freezepage.com.

web scrape page for first occurrence of string "as of 16-Oct-2016"

Sun, Apr 30, 4:21 PM · InternetArchiveBot
Green_Cardamom added a comment to T164048: Revert 225 bot edits adding https:///.

The ones on enwiki are actively being fixed by WaybackMedic. That includes the non-archive.org links mentioned.

Sun, Apr 30, 3:44 PM · InternetArchiveBot (v1.3), Internet-Archive

Yesterday

Green_Cardamom added a comment to T164048: Revert 225 bot edits adding https:///.

@Cyberpower678 .. sometimes there is no wayback snapshot available and the {{wayback}} is bogus. To clean these in the database suggest deleting the record entirely. IABot will create a new record when it encounters the URL again.

Sat, Apr 29, 5:06 PM · InternetArchiveBot (v1.3), Internet-Archive
Green_Cardamom added a comment to T164048: Revert 225 bot edits adding https:///.

Thanks Josve05A .. that's a useful regex method I didn't know about.

Sat, Apr 29, 2:11 PM · InternetArchiveBot (v1.3), Internet-Archive
Green_Cardamom reopened T162722: Converting + to %20 as "Open".

The database has URLs with "+" instead of "%20" due to the above bug.

Sat, Apr 29, 2:04 PM · InternetArchiveBot (v1.3)

Fri, Apr 28

Restricted Application assigned T164129: Extra https:// in wayback URL to Cyberpower678.
Fri, Apr 28, 11:35 PM · InternetArchiveBot
Restricted Application assigned T164119: Recognize freezepage.com to Cyberpower678.
Fri, Apr 28, 9:42 PM · InternetArchiveBot
Green_Cardamom added a comment to T164048: Revert 225 bot edits adding https:///.

Submitted a BRFA on svwiki see how long it takes

Fri, Apr 28, 8:49 PM · InternetArchiveBot (v1.3), Internet-Archive
Green_Cardamom added a comment to T164048: Revert 225 bot edits adding https:///.

Looked at this again and they are all limited to {{wayback}} which makes it easier. A simple bot can get the URL from the preceding URL in the []. Just need bot or AWB access.

Fri, Apr 28, 7:38 PM · InternetArchiveBot (v1.3), Internet-Archive
Green_Cardamom added a comment to T164048: Revert 225 bot edits adding https:///.

You may want to pause the Swedish bot.

Fri, Apr 28, 7:18 PM · InternetArchiveBot (v1.3), Internet-Archive
Green_Cardamom added a comment to T164048: Revert 225 bot edits adding https:///.

Josve05a added me to this ticket. I looked at the problem and don't have a good solution because the original data is gone and my bot doesn't do revision handling. Probably the best solution is to to revert those pages and rerun the bot. There is a script for mass revert but has to be done soon before many new edits are added. Possible check with User:xaosflux on enwiki if he can run the script on svwiki .. he would need a list of target articles.

Fri, Apr 28, 7:10 PM · InternetArchiveBot (v1.3), Internet-Archive

Wed, Apr 26

Green_Cardamom added a comment to T163750: sul-swap-prod.stanford.edu.

I thought maybe the dead link checker was a separate process running ahead of IABot, and IABot follows behind by about 30 days. But it sounds like IABot feeds links to the DLC and when IABot process it a second time, whatever fixes were added to the database by the DLC are corrected. Is that kind of how it works?

Wed, Apr 26, 4:35 PM · InternetArchiveBot

Mon, Apr 24

Green_Cardamom added a comment to T163750: sul-swap-prod.stanford.edu.

Another question with the diff: It rescued 17 links, but IABot ran 9 days prior when it rescued 7 sources. Wouldn't it rescue all sources (17 + 7) at the same time, or is there a difference in how IABot runs in auto vs manual mode?

Mon, Apr 24, 10:38 PM · InternetArchiveBot
Restricted Application assigned T163750: sul-swap-prod.stanford.edu to Cyberpower678.
Mon, Apr 24, 10:33 PM · InternetArchiveBot

Wed, Apr 19

Restricted Application assigned T163335: Deleting URLs that contain http%3A to Cyberpower678.
Wed, Apr 19, 3:03 PM · InternetArchiveBot (v1.3)

Tue, Apr 18

Green_Cardamom added a comment to T163052: Not recognizing archive.wikiwix.com.

Agree it's not a good archive service due to lack of archivedate, but IABot shouldn't tag them all dead .. can I suggest IABot ignore wikiwix links if possible?

Tue, Apr 18, 4:29 PM · InternetArchiveBot
Green_Cardamom reopened T161940: Webcite date conversion incorrect as "Open".
Tue, Apr 18, 2:16 AM · InternetArchiveBot (v1.3)
Green_Cardamom added a comment to T161940: Webcite date conversion incorrect.

The problem is actually large, involving thousands of articles, too big via the MI. I believer there was another bot a few years ago that was using local instead of GMT time and so it populated thousands of articles with wrong archivedate data. This bad data then got imported into the IAB database.

Tue, Apr 18, 2:16 AM · InternetArchiveBot (v1.3)

Sat, Apr 15

Restricted Application assigned T163052: Not recognizing archive.wikiwix.com to Cyberpower678.
Sat, Apr 15, 5:58 PM · InternetArchiveBot

Fri, Apr 14

Restricted Application assigned T162999: No recognizing wayback.archive-it.org to Cyberpower678.
Fri, Apr 14, 2:07 PM · InternetArchiveBot

Thu, Apr 13

Green_Cardamom added a comment to T162722: Converting + to %20 .

Also the web tool won't allow changing %20 to + because it automatically converts + to %20 meaning it's impossible to update the database

Thu, Apr 13, 9:51 PM · InternetArchiveBot (v1.3)
Restricted Application assigned T162899: Not recognizing classic-web to Cyberpower678.
Thu, Apr 13, 3:36 PM · InternetArchiveBot, Internet-Archive

Wed, Apr 12

Green_Cardamom added a comment to T162722: Converting + to %20 .

Medic now has a check and fix for these cases as it comes across them but hopefully he problem can be identified at the source.

Wed, Apr 12, 4:30 PM · InternetArchiveBot (v1.3)
Green_Cardamom added a comment to T162722: Converting + to %20 .

Here's another example demonstrates the problem more clearly

Wed, Apr 12, 2:55 PM · InternetArchiveBot (v1.3)

Tue, Apr 11

Restricted Application assigned T162722: Converting + to %20 to Cyberpower678.
Tue, Apr 11, 5:51 PM · InternetArchiveBot (v1.3)

Mon, Apr 10

Green_Cardamom added a comment to T161940: Webcite date conversion incorrect.

Date is off by years:

Mon, Apr 10, 5:14 PM · InternetArchiveBot (v1.3)

Sat, Apr 1

Green_Cardamom added a comment to T161940: Webcite date conversion incorrect.

Bot war :)

Sat, Apr 1, 2:04 AM · InternetArchiveBot (v1.3)
Green_Cardamom edited the description of T161940: Webcite date conversion incorrect.
Sat, Apr 1, 1:54 AM · InternetArchiveBot (v1.3)
Restricted Application assigned T161940: Webcite date conversion incorrect to Cyberpower678.
Sat, Apr 1, 1:53 AM · InternetArchiveBot (v1.3)

Mar 31 2017

Green_Cardamom added a comment to T161430: Mangled refs being more mangled .

Great. A citation that requires nowiki tags is doing something fundamentally wrong.

Mar 31 2017, 1:21 AM · InternetArchiveBot (v1.3)
Green_Cardamom reopened T161432: Deleted a citation as "Open".
Mar 31 2017, 1:17 AM · InternetArchiveBot (v1.3)
Green_Cardamom added a comment to T161432: Deleted a citation.

Not sure what happened looks Like I copied the wrong link.

Mar 31 2017, 1:15 AM · InternetArchiveBot (v1.3)
Green_Cardamom closed T161432: Deleted a citation as "Resolved".
Mar 31 2017, 1:14 AM · InternetArchiveBot (v1.3)
Green_Cardamom reopened T161432: Deleted a citation as "Open".
Mar 31 2017, 1:13 AM · InternetArchiveBot (v1.3)

Mar 26 2017

Restricted Application assigned T161432: Deleted a citation to Cyberpower678.
Mar 26 2017, 1:36 AM · InternetArchiveBot (v1.3)
Restricted Application assigned T161431: Deleted opening markup of a wikicomment to Cyberpower678.
Mar 26 2017, 1:04 AM · InternetArchiveBot (v1.3)
Restricted Application assigned T161430: Mangled refs being more mangled to Cyberpower678.
Mar 26 2017, 1:00 AM · InternetArchiveBot (v1.3)
Restricted Application assigned T161429: webarchive refs being deleted to Cyberpower678.
Mar 26 2017, 12:54 AM · InternetArchiveBot

Mar 23 2017

Green_Cardamom added a comment to T161162: Pandora link broken.

I figured it has to do with being on the opposite side of the planet.

Mar 23 2017, 12:30 AM · InternetArchiveBot
Green_Cardamom added a comment to T161162: Pandora link broken.

It is indeed the worst.

Mar 23 2017, 12:29 AM · InternetArchiveBot
Green_Cardamom added a comment to T161162: Pandora link broken.

Removing the "S" breaks it.

Mar 23 2017, 12:27 AM · InternetArchiveBot
Green_Cardamom reopened T161162: Pandora link broken as "Open".
Mar 23 2017, 12:26 AM · InternetArchiveBot
Green_Cardamom added a comment to T161162: Pandora link broken.

That could be it looked like different sub-divisions of the Australian government.

Mar 23 2017, 12:19 AM · InternetArchiveBot

Mar 22 2017

Green_Cardamom added a comment to T161162: Pandora link broken.

The Australia links can't be sanitized. That could be true for others also. We know Wayback can be sanitized, but service each has its own requirements.

Mar 22 2017, 10:19 PM · InternetArchiveBot
Green_Cardamom edited the description of T161162: Pandora link broken.
Mar 22 2017, 9:58 PM · InternetArchiveBot
Restricted Application assigned T161162: Pandora link broken to Cyberpower678.
Mar 22 2017, 9:54 PM · InternetArchiveBot
Green_Cardamom added a comment to T160174: archivedate format .

Still happening March 22 v1.3beta2

Mar 22 2017, 5:32 PM · InternetArchiveBot (v1.3)
Green_Cardamom reopened T160174: archivedate format as "Open".
Mar 22 2017, 5:32 PM · InternetArchiveBot (v1.3)

Mar 18 2017

Restricted Application assigned T160826: Feature request: garbage checker to Cyberpower678.
Mar 18 2017, 2:02 PM · InternetArchiveBot

Mar 16 2017

Restricted Application assigned T160641: Refs deleted to Cyberpower678.
Mar 16 2017, 3:25 PM · InternetArchiveBot (v1.3)

Mar 15 2017

Green_Cardamom added a comment to T160564: Saving links that are not dead.

The problem is common. There's something wrong. Users can't be expected to deal manually with such a high percentage of false positives. Look through this page:

Mar 15 2017, 9:34 PM · InternetArchiveBot
Green_Cardamom reopened T160564: Saving links that are not dead as "Open".
Mar 15 2017, 9:28 PM · InternetArchiveBot
Restricted Application assigned T160564: Saving links that are not dead to Cyberpower678.
Mar 15 2017, 7:18 PM · InternetArchiveBot

Mar 10 2017

Green_Cardamom assigned T160175: Stanford University archive to Cyberpower678.
Mar 10 2017, 8:52 PM · InternetArchiveBot (v1.3)
Green_Cardamom assigned T159975: Icelandic Archives to Cyberpower678.
Mar 10 2017, 8:52 PM · InternetArchiveBot (v1.3)
Green_Cardamom assigned T159833: webarchive.nationalarchives.gov.uk to Cyberpower678.
Mar 10 2017, 8:52 PM · InternetArchiveBot (v1.3)
Green_Cardamom assigned T159832: Wrong archive-date to Cyberpower678.
Mar 10 2017, 8:52 PM · InternetArchiveBot (v1.3)
Green_Cardamom assigned T160174: archivedate format to Cyberpower678.
Mar 10 2017, 8:49 PM · InternetArchiveBot (v1.3)
Green_Cardamom created T160175: Stanford University archive.
Mar 10 2017, 3:03 PM · InternetArchiveBot (v1.3)
Green_Cardamom created T160174: archivedate format .
Mar 10 2017, 2:56 PM · InternetArchiveBot (v1.3)

Mar 8 2017

Green_Cardamom created T159975: Icelandic Archives.
Mar 8 2017, 7:18 PM · InternetArchiveBot (v1.3)

Mar 7 2017

Green_Cardamom edited the description of T159833: webarchive.nationalarchives.gov.uk.
Mar 7 2017, 6:37 PM · InternetArchiveBot (v1.3)
Green_Cardamom added a comment to T159833: webarchive.nationalarchives.gov.uk.
Mar 7 2017, 4:00 PM · InternetArchiveBot (v1.3)
Green_Cardamom created T159833: webarchive.nationalarchives.gov.uk.
Mar 7 2017, 2:22 PM · InternetArchiveBot (v1.3)
Green_Cardamom created T159832: Wrong archive-date.
Mar 7 2017, 2:10 PM · InternetArchiveBot (v1.3)

Feb 23 2017

Restricted Application assigned T158887: Backslash in URL not ended correctly to Cyberpower678.
Feb 23 2017, 6:43 PM · InternetArchiveBot (v1.3)

Feb 14 2017

Green_Cardamom added a comment to T158065: Archive.is incorrect encoding.

Another where XML wasn't decoded

Feb 14 2017, 5:35 PM · InternetArchiveBot
Restricted Application assigned T158065: Archive.is incorrect encoding to Cyberpower678.
Feb 14 2017, 3:52 PM · InternetArchiveBot

Feb 9 2017

Restricted Application assigned T157696: Archive.is and urlencoding to Cyberpower678.
Feb 9 2017, 4:33 PM · InternetArchiveBot

Jan 31 2017

Restricted Application assigned T156809: Wikicomment in cite causes wrong parsing to Cyberpower678.
Jan 31 2017, 3:30 PM · InternetArchiveBot (v1.3)

Jan 28 2017

Green_Cardamom added a comment to T154597: UI archive services.

I've actually never seen those in the wild they were listed in the Memento list (linked above) so I included them. The ones I often see besides LOC are

Jan 28 2017, 5:49 AM · InternetArchiveBot

Jan 27 2017

Green_Cardamom added a comment to T154597: UI archive services.

In the FAQ #4

Jan 27 2017, 3:26 AM · InternetArchiveBot
Green_Cardamom added a comment to T154597: UI archive services.

LOC is part of the Memento network of archive sites.

Jan 27 2017, 3:14 AM · InternetArchiveBot

Jan 23 2017

Green_Cardamom added a comment to T155947: Exponential rescues.

WaybackMedic has a function to fix it in the wikitext.

Jan 23 2017, 8:24 PM · InternetArchiveBot (v1.3)

Jan 22 2017

Green_Cardamom created T155947: Exponential rescues.
Jan 22 2017, 5:27 PM · InternetArchiveBot (v1.3)
Green_Cardamom added a comment to T155061: BOT incorrectly places {{Wayback}} template.

This is because {{cite sports-reference}} is not a recognized template.. I added it to User:InternetArchiveBot/Dead-links.js

Jan 22 2017, 3:13 AM · InternetArchiveBot (v1.3)

Jan 19 2017

Green_Cardamom added a comment to T150472: Location of {{dead link}} .

The links will be invisible except in wikitext which defeats the purpose. It seems like there should be three categories of templates: mutable, immutable and ignore. With immutable, the template is recognized and the URL processed, but any changes are done outside using {{webarchive}} and {{dead link}} - the template itself remains untouched. This would solve a lot of problems because the url= field probably should remain as the original URL for reasons of interaction with Wikidata and display. The {{official}} is a good example, the URL is meant to be the original URL, which is then exported into Wikidata as the original URL, from there exported elsewhere as the original. When it's transformed into a wayback link it's confusing the downstream to what the original URL is. There are many other templates like this that assume the url= field to be the original URL and have no built-in mechanism for dead links. Using {{webarchive}} / {{deadlink}} outside the template would solve those problems.

Jan 19 2017, 2:00 PM · Community-Tech, InternetArchiveBot (v1.3)
Green_Cardamom added a comment to T150472: Location of {{dead link}} .

Is that also for non-CS templates without support archiveurl etc..?

Jan 19 2017, 3:53 AM · Community-Tech, InternetArchiveBot (v1.3)
Green_Cardamom added a comment to T150472: Location of {{dead link}} .

Note: the embedded dead link is caused by rare templates such as {{cite court}} which are not listed in the bot configuration.. this may change with 1.3 new procedure.

Jan 19 2017, 1:20 AM · Community-Tech, InternetArchiveBot (v1.3)
Green_Cardamom added a comment to T150472: Location of {{dead link}} .

For the embedded dead links:

Jan 19 2017, 12:55 AM · Community-Tech, InternetArchiveBot (v1.3)

Jan 9 2017

Green_Cardamom edited the description of T154887: Template names containing edge case characters.
Jan 9 2017, 4:02 AM · InternetArchiveBot (v1.3)
Green_Cardamom renamed T154887: Template names containing edge case characters from "Linefeed in template name causes template to be be unrecognized and get mangled" to "Template names containing edge case characters".
Jan 9 2017, 3:26 AM · InternetArchiveBot (v1.3)
Green_Cardamom added a comment to T154543: Compact cites are changed to expanded cites.

One possible solution... if the first key=value pair ends with a CR then likely it's an expanded template. If not, likely other CRs are stray and it could be collapsed. The rationale is because editors who care about expanded templates will make sure the first key|value has a CR after it .. but a template where the first key|value doesn't have a CR then editors are less likely to be concerned and safer to collapse since that is the more common layout.

Jan 9 2017, 3:16 AM · InternetArchiveBot (v1.3)
Restricted Application assigned T154887: Template names containing edge case characters to Cyberpower678.
Jan 9 2017, 3:00 AM · InternetArchiveBot (v1.3)
Green_Cardamom added a comment to T154884: Recognizing exotic cite templates.

WM is now logging the template names that cause these errors so I can discover and add them to the configuration page.

Jan 9 2017, 2:40 AM · InternetArchiveBot
Green_Cardamom added a comment to T154884: Recognizing exotic cite templates.

There's a fix in WaybackMedic function fixembedway() and it will go back over past articles since August to current.

Jan 9 2017, 2:09 AM · InternetArchiveBot
Restricted Application assigned T154884: Recognizing exotic cite templates to Cyberpower678.
Jan 9 2017, 2:04 AM · InternetArchiveBot

Jan 6 2017

Green_Cardamom added a comment to T154541: Stray dead link template.

In regards to the stray dead link problem: I've written a module to WaybackMedic which checks for and removes dead link templates when there is an existing archive URL. It completed a run backwards against most of the edits by IABot since August, with some more to go. It will be part of the WM checks going forward. Module called straydt.awk

Jan 6 2017, 7:43 PM · InternetArchiveBot (v1.3)
Restricted Application assigned T154734: Unable to modify URL via UI to Cyberpower678.
Jan 6 2017, 2:14 AM · InternetArchiveBot (v1.3), Internet-Archive

Jan 5 2017

Green_Cardamom added a comment to T151182: InternetArchiveBot should not use the deprecated {{Wayback}} template..

Excellent thank you. I already wrote code to convert old templates to new, there are some tricky things with weird data inputs. Already converted over 100000 - Just need IABot to only add webarchive and not wayback or webcite or memento otherwise the conversion process is endless.

Jan 5 2017, 6:39 PM · InternetArchiveBot (v1.3)

Jan 4 2017

Restricted Application assigned T154597: UI archive services to Cyberpower678.
Jan 4 2017, 5:49 PM · InternetArchiveBot
Green_Cardamom added a comment to T154541: Stray dead link template.

It's not recognizing liveweb.archive.org (an alias to archive.org/web - who knew!)

Jan 4 2017, 5:26 PM · InternetArchiveBot (v1.3)
Green_Cardamom added a comment to T154541: Stray dead link template.

Also has trouble with rare archive services

Jan 4 2017, 5:20 PM · InternetArchiveBot (v1.3)
Green_Cardamom added a comment to T154541: Stray dead link template.

Another cause is when the archive.org date is a star (index page):

Jan 4 2017, 5:15 PM · InternetArchiveBot (v1.3)
Green_Cardamom added a comment to T154541: Stray dead link template.

I believe this is caused by Webcite redirects. In the first example for Ellon Castle the API XML returns:

Jan 4 2017, 3:40 PM · InternetArchiveBot (v1.3)

Jan 3 2017

Restricted Application assigned T154543: Compact cites are changed to expanded cites to Cyberpower678.
Jan 3 2017, 11:54 PM · InternetArchiveBot (v1.3)
Restricted Application assigned T154541: Stray dead link template to Cyberpower678.
Jan 3 2017, 11:50 PM · InternetArchiveBot (v1.3)

Jan 2 2017

Green_Cardamom added a comment to T154119: Template inserted into br tag.

Checked 100000 recent edits by IABot and none have this bug, it looks like a 1-time .. fixbrbug() will keep checking as part of WM.

Jan 2 2017, 5:07 PM · InternetArchiveBot (v1.3)
Green_Cardamom added a comment to T150729: Handle "cite__web" (template name with multiple spaces between the words) properly.

Added a function to waybackMedic fixembway() - deletes wayback and webcite templates embedded in cite templates.

Jan 2 2017, 4:58 AM · InternetArchiveBot (v1.3)