Page MenuHomePhabricator

Audiodude (Travis Briggs)
User

Today

  • No visible events.

Tomorrow

  • No visible events.

Wednesday

  • No visible events.

User Details

User Since
Mar 30 2017, 2:19 AM (463 w, 4 d)
Availability
Available
LDAP User
Audiodude
MediaWiki User
Audiodude [ Global Accounts ]

Recent Activity

Yesterday

Audiodude changed the status of T417477: User-agent policy failures should produce better diagnostics from Invalid to Resolved.

Glad you were able to figure it out, at least somewhat. I think your intuition is correct. There a lot of layers/middleware in your stack and one of them is probably dripping the response body and just re-wrapping as a generic 403.

Sun, Feb 15, 8:14 PM · SRE, MediaWiki-Platform-Team, MediaWiki-extensions-OAuth

Sat, Feb 14

Audiodude added a comment to T417477: User-agent policy failures should produce better diagnostics.

I think security best practices for 403s are to return as little information as possible. You don't really want to advise bad actors how to get around the 403.

Sat, Feb 14, 5:00 AM · SRE, MediaWiki-Platform-Team, MediaWiki-extensions-OAuth

Fri, Feb 13

Audiodude added a comment to T415237: etherpad table size is 233GB / plan to delete all etherpads in April 2026.

My notice (https://meta.wikimedia.org/wiki/Talk:Etherpad#Etherpad_wipe/deletion/clean_slate_proposal) has been posted on Village Pump misc:
https://en.wikipedia.org/wiki/Wikipedia:Village_pump_(miscellaneous)#etherpad.wikimedia.org_cleanup:_April_30,_Save_your_pads!

Fri, Feb 13, 7:14 PM · Patch-For-Review, User-notice, collaboration-services, Wikimedia-Etherpad, Data-Persistence

Tue, Feb 10

Audiodude added a comment to T415237: etherpad table size is 233GB / plan to delete all etherpads in April 2026.

My pleasure. How about we take another 24 hours for technical/ops folks on this thread to comment on that meta page and then I'll disseminate a final copy on the Village Pump?

Tue, Feb 10, 6:35 PM · Patch-For-Review, User-notice, collaboration-services, Wikimedia-Etherpad, Data-Persistence
Audiodude added a comment to T415237: etherpad table size is 233GB / plan to delete all etherpads in April 2026.

Yes I was thinking Villiage Pump. As a technical editor myself, I don't know where the non-technical editors "hang out" other than potentially Village Pump.

Tue, Feb 10, 6:22 PM · Patch-For-Review, User-notice, collaboration-services, Wikimedia-Etherpad, Data-Persistence
Audiodude added a comment to T415237: etherpad table size is 233GB / plan to delete all etherpads in April 2026.

And one of the reasons is the expectation of "etherpad is forever" has been built up is that we haven't ever purged it.

Tue, Feb 10, 6:04 PM · Patch-For-Review, User-notice, collaboration-services, Wikimedia-Etherpad, Data-Persistence
Audiodude added a comment to T415237: etherpad table size is 233GB / plan to delete all etherpads in April 2026.

I acknowledge the unwieldy size of the database and support a truncation. However, I think there needs to be a longer timeline (maybe more like 90 days instead of less than 30?) in order to properly communicate to affected users and give them time to retrieve their data.

Tue, Feb 10, 5:19 PM · Patch-For-Review, User-notice, collaboration-services, Wikimedia-Etherpad, Data-Persistence

Tue, Jan 20

Audiodude added a comment to T414897: Incorporate overflow RSP table rows in the conversion.

I created redirect Wikipedia:Reliable sources/Perennial sources/9 ⟶ */X, so no need to special code that part.

Cool, I just need to update 9 -> 10 in the code now.

Tue, Jan 20, 5:12 AM · English-Wikipedia-RSP-restructuring
Audiodude added a comment to T414897: Incorporate overflow RSP table rows in the conversion.

I created redirect Wikipedia:Reliable sources/Perennial sources/9 ⟶ */X, so no need to special code that part.

Tue, Jan 20, 5:10 AM · English-Wikipedia-RSP-restructuring
Audiodude added a comment to T414897: Incorporate overflow RSP table rows in the conversion.

Sorry the code parses pages from 1-8, where the pages actually are (I forgot how many pages there were, sorry, I wrote it last year!). The 9 only appears in the code as a boundary (off by one error anyone?).

Tue, Jan 20, 4:29 AM · English-Wikipedia-RSP-restructuring
Audiodude added a comment to T414897: Incorporate overflow RSP table rows in the conversion.

Oh sorry, I think there was a bit of a misunderstanding. I read "Perennial sources/X" and interpreted X as a variable, like my N from above.

Tue, Jan 20, 4:25 AM · English-Wikipedia-RSP-restructuring

Mon, Jan 19

Audiodude added a comment to T414897: Incorporate overflow RSP table rows in the conversion.

Currently the bot fetches tables from Wikipedia:Reliable_sources/Perennial_sources/N with N from 1-9. Is that sufficient? It doesn't use the "main table" at all.

Mon, Jan 19, 6:39 PM · English-Wikipedia-RSP-restructuring

Jan 17 2026

Audiodude closed T414769: Can't create pages with blacklisted/spam links (even though those ARE the sources) as Resolved.

As mentioned in this thread, the bot was spuriously adding http:// in front of "domain" fields extracted from the table. As long as we don't do that, this issue is fixed.

Jan 17 2026, 5:01 PM · English-Wikipedia-RSP-restructuring
Audiodude added a comment to T414770: Bot: Track which pages fail due to spam blacklist [ed: denylist].

I wrote the bot, and it's mostly a prototype for now as we discuss and organize a solution to the larger issue here: https://en.wikipedia.org/wiki/Wikipedia_talk:Reliable_sources/Perennial_sources

Jan 17 2026, 4:56 PM · English-Wikipedia-RSP-restructuring

Jan 16 2026

Audiodude added a comment to T414769: Can't create pages with blacklisted/spam links (even though those ARE the sources).

Ah okay, thanks for the clarification.

Jan 16 2026, 4:13 PM · English-Wikipedia-RSP-restructuring
Audiodude added a comment to T414770: Bot: Track which pages fail due to spam blacklist [ed: denylist].

Sorry, I haven't used phabricator before to organize tasks for a specific project inside of the Wikimedia movement. Maybe the assumption is that any task I file is a "general" task, or that this is meant to apply beyond the specific tag and context in which it is filed?

Jan 16 2026, 6:44 AM · English-Wikipedia-RSP-restructuring
Audiodude added a comment to T414769: Can't create pages with blacklisted/spam links (even though those ARE the sources).

Interesting suggestion. I'm not sure. The bot templates are currently written as wikitext, so my intuition is that wouldn't work. I also don't know how to publish in "plain text" but have things like links, styling and templates?

Jan 16 2026, 6:40 AM · English-Wikipedia-RSP-restructuring
Audiodude added a comment to T414741: RSP cutover.

I think two separate "co-tasks" that are jointly the parent of all tasks is preferable, yes. What I was getting at is that we don't need to check if the "Do the whole thing" task is complete to know that the thing is done. If all the tasks with this tag are marked done, or "not needed", we know the thing is done. Going further, as long as every task in this project has the project tag, it's not necessary to define a strict or rigid task hierarchy/tree of parent/subtask (and could even get cumbersome if people think they can't work on a task because the subtasks aren't all done). I don't mean to be overly crtical, especially if you find the scheme useful, and I'm not trying to start a yak shaving contest, so of course be bold.

Jan 16 2026, 6:26 AM · English-Wikipedia-RSP-restructuring
Audiodude added a comment to T414753: Design and create RSP Index page.

+1 for separate tasks, because it is possible that one person is doing the designing and another the creating.

Jan 16 2026, 6:20 AM · English-Wikipedia-RSP-restructuring
Audiodude added a comment to T414741: RSP cutover.

In my experience, a "root" supertask like this is not useful. It is by definition the parent of all tasks (master of none?) and therefore has no utility. It will only be completed when the project is done, at which point closing it is the equivalent of turning off the light switch after you've finished moving out.

Jan 16 2026, 5:41 AM · English-Wikipedia-RSP-restructuring
Audiodude added a comment to T414764: Design RSP source/subpage categorization.

My advice from the perspective of someone who has done many technical/data "migrations" is that it's best to make the first migrated version as similar to the original in terms of scope, functionality and schema as possible.

Jan 16 2026, 5:36 AM · English-Wikipedia-RSP-restructuring
Audiodude created T414770: Bot: Track which pages fail due to spam blacklist [ed: denylist].
Jan 16 2026, 5:31 AM · English-Wikipedia-RSP-restructuring
Audiodude added a comment to T414769: Can't create pages with blacklisted/spam links (even though those ARE the sources).

So just to clarify, what's happening is that the bot is trying to create a "source page" (subpage) for Dotdash Meredith which is the current owner of the site "about.com". The wiki software sees "about.com" and assumes I'm trying to add a link to that site which is blacklisted (their term, should be denylisted). Technically (in both senses of the word), I am. But it is clearly a case where an exception is warranted. The error message clearly indicates what to do (asking for an exception on the "spam whitelist [their term, should be allowlist] talk page".

Jan 16 2026, 5:25 AM · English-Wikipedia-RSP-restructuring
Audiodude added a comment to T414769: Can't create pages with blacklisted/spam links (even though those ARE the sources).

Migrated from https://github.com/audiodude/wiki-reliable-sources-parser/issues/1

Jan 16 2026, 5:18 AM · English-Wikipedia-RSP-restructuring
Audiodude created T414769: Can't create pages with blacklisted/spam links (even though those ARE the sources).
Jan 16 2026, 5:18 AM · English-Wikipedia-RSP-restructuring

Jan 12 2026

Audiodude added a comment to T413709: Create project tag for English WP:RSP Restructuring.

Thank you so much!

Jan 12 2026, 12:48 PM · Essential-Work, Release-Engineering-Team (Doing 😎), Project-Admins

Jan 6 2026

Audiodude added a comment to T413709: Create project tag for English WP:RSP Restructuring.

Great idea! Yes I'll migrate the lone issue once we have a phab tag (or I can do it now and open a task, but I don't want it to be confusing and mis-triaged if we don't have the tag). Then I'll disable issues on the project.

Jan 6 2026, 2:39 PM · Essential-Work, Release-Engineering-Team (Doing 😎), Project-Admins

Sep 2 2025

Audiodude added a comment to T403410: Mwoffliner: cloud vps object storage size quota over 90%.

Honestly I feel like we can just ignore this task, or delete these objects, or both.

Sep 2 2025, 6:40 AM · VPS-Projects
Audiodude added a comment to T403410: Mwoffliner: cloud vps object storage size quota over 90%.

Make sure you have mwoffliner selected as your project:

Sep 2 2025, 6:40 AM · VPS-Projects

Sep 1 2025

Audiodude added a comment to T403410: Mwoffliner: cloud vps object storage size quota over 90%.

You're right, it's for database backups originating from Horizon. You're also right about the date it was enabled. Of course, since we have our own database backups, these are useless. The actual mystery to me is why we suddenly got this message. Either way, it's not something we are using so it can be deleted or just ignored.

Sep 1 2025, 4:18 PM · VPS-Projects

Jul 23 2025

Audiodude added a comment to T400018: Rate limiting/status code 429 for mwclient?.

Thanks again @Scott_French for the extremely helpful analysis! I plan to submit a PR to mwclient to update the docs for that method to indicate which parameters are ignored when pool is set.

Jul 23 2025, 4:32 AM · Traffic, SRE, affects-Kiwix-and-openZIM

Jul 21 2025

Audiodude added a comment to T400018: Rate limiting/status code 429 for mwclient?.

Oh nevermind, we do use a connection pool in order to re-use the login cookies! So it seems according to your analysis (which I just confirmed), because we are setting the pool, we are ending up with a null UA? And that is causing us to get rate limited, presumably because of new policy changes?

Jul 21 2025, 11:20 PM · Traffic, SRE, affects-Kiwix-and-openZIM
Audiodude added a comment to T400018: Rate limiting/status code 429 for mwclient?.

Thanks for that. I think you might have incorrect assumptions about the wp1 code, though. We do not attempt to set any custom "WP 1.0 Bot" user agent, and we are not using a connection pool. Since 2018, we have relied on the mwclient/0.* UA, which has worked.

Jul 21 2025, 11:18 PM · Traffic, SRE, affects-Kiwix-and-openZIM
Audiodude added a comment to T400018: Rate limiting/status code 429 for mwclient?.

Thanks for taking the time and looking into what mwclient version we use in production. Upgrading to 0.11.0 was the first thing I did when attempting to run the test code, but I immediately ran into the issue described above.

Jul 21 2025, 9:24 PM · Traffic, SRE, affects-Kiwix-and-openZIM
Audiodude added a comment to T400018: Rate limiting/status code 429 for mwclient?.

This is relatively high priority because the bot is currently offline pending resolution.

Jul 21 2025, 8:05 PM · Traffic, SRE, affects-Kiwix-and-openZIM
Audiodude created T400018: Rate limiting/status code 429 for mwclient?.
Jul 21 2025, 3:49 AM · Traffic, SRE, affects-Kiwix-and-openZIM

May 23 2025

Audiodude added a comment to T394429: enwiki_p query returned empty results on May 14 from ~UTC 0:00 - 05:00.

Thank you for carefully considering the issue. There haven't been any recent code changes in the tool so I'm stumped. I'll keep debugging.

May 23 2025, 3:19 PM · affects-Kiwix-and-openZIM, cloud-services-team, Data-Services

May 15 2025

Audiodude added a comment to T394429: enwiki_p query returned empty results on May 14 from ~UTC 0:00 - 05:00.

It has the talk page articles in it, which is what we are looking for (WP 1.0 articles are categorized on their talk pages, not their article pages).

May 15 2025, 6:29 PM · affects-Kiwix-and-openZIM, cloud-services-team, Data-Services
Audiodude created T394429: enwiki_p query returned empty results on May 14 from ~UTC 0:00 - 05:00.
May 15 2025, 3:34 PM · affects-Kiwix-and-openZIM, cloud-services-team, Data-Services

Feb 5 2025

Audiodude added a comment to T379017: API returning completely empty contents from mobile-html endpoint for some articles.

As a stakeholder, my main concern is resolving the issue. That said, my (unsolicited) technical advice is that this is a great temporary workaround. Not only do we mitigate the immediate bug, but the logging especially is probably necessary anyway to help properly diagnose the bug in the future.

Feb 5 2025, 4:11 PM · Essential-Work, Content-Transform-Team (Work In Progress), affects-Kiwix-and-openZIM, MediaWiki-Engineering, MediaWiki-REST-API, Page Content Service

Jan 16 2025

Audiodude added a comment to T379017: API returning completely empty contents from mobile-html endpoint for some articles.

They only eventually work on retry. Immediate retries, even with a delay, do not work, as the issue seems to persist for multiple days before eventually resolving itself (which as @Benoit74 points out, seems to point to a caching issue).

Jan 16 2025, 7:33 PM · Essential-Work, Content-Transform-Team (Work In Progress), affects-Kiwix-and-openZIM, MediaWiki-Engineering, MediaWiki-REST-API, Page Content Service
Audiodude added a comment to T379017: API returning completely empty contents from mobile-html endpoint for some articles.

The latest examples still "work" (ie they're broken!)

Jan 16 2025, 3:02 AM · Essential-Work, Content-Transform-Team (Work In Progress), affects-Kiwix-and-openZIM, MediaWiki-Engineering, MediaWiki-REST-API, Page Content Service

Jan 8 2025

Audiodude added a comment to T379017: API returning completely empty contents from mobile-html endpoint for some articles.

Here's one that exhibits the initial behavior mentioned in this ticket (blank page from API): https://ca.wikipedia.org/api/rest_v1/page/mobile-html/Maylandia

Jan 8 2025, 5:09 PM · Essential-Work, Content-Transform-Team (Work In Progress), affects-Kiwix-and-openZIM, MediaWiki-Engineering, MediaWiki-REST-API, Page Content Service

Jan 7 2025

Audiodude added a comment to T379017: API returning completely empty contents from mobile-html endpoint for some articles.
{
type: "https://mediawiki.org/wiki/HyperSwitch/errors/not_found#page_revisions",
title: "Not found.",
method: "get",
detail: "Page was deleted",
uri: "/es.wiktionary.org/v1/page/mobile-html/magnetita"
}
Jan 7 2025, 9:38 PM · Essential-Work, Content-Transform-Team (Work In Progress), affects-Kiwix-and-openZIM, MediaWiki-Engineering, MediaWiki-REST-API, Page Content Service
Audiodude added a comment to T379017: API returning completely empty contents from mobile-html endpoint for some articles.

These claim that the page was deleted from the API, but they are clearly still there:

Jan 7 2025, 9:37 PM · Essential-Work, Content-Transform-Team (Work In Progress), affects-Kiwix-and-openZIM, MediaWiki-Engineering, MediaWiki-REST-API, Page Content Service
Audiodude added a comment to T379017: API returning completely empty contents from mobile-html endpoint for some articles.

Unfortunately, I don't have any special way of finding these other than running full mwoffliner scrapes of suspect wikis.

Jan 7 2025, 4:46 PM · Essential-Work, Content-Transform-Team (Work In Progress), affects-Kiwix-and-openZIM, MediaWiki-Engineering, MediaWiki-REST-API, Page Content Service

Dec 11 2024

Audiodude added a comment to T379017: API returning completely empty contents from mobile-html endpoint for some articles.

Thanks for looking into this! Let me know if you need me to keep providing examples. For now, here's another one: https://ca.wikipedia.org/api/rest_v1/page/mobile-html/Dan_Georgiadis

Dec 11 2024, 4:46 PM · Essential-Work, Content-Transform-Team (Work In Progress), affects-Kiwix-and-openZIM, MediaWiki-Engineering, MediaWiki-REST-API, Page Content Service
Audiodude renamed T379017: API returning completely empty contents from mobile-html endpoint for some articles from Article in Egyptian Arabic wiki (arz) returning completely empty contents from mobile-html endpoint to API returning completely empty contents from mobile-html endpoint for some articles.
Dec 11 2024, 1:56 AM · Essential-Work, Content-Transform-Team (Work In Progress), affects-Kiwix-and-openZIM, MediaWiki-Engineering, MediaWiki-REST-API, Page Content Service
Audiodude added a comment to T379017: API returning completely empty contents from mobile-html endpoint for some articles.

Here's another one on ca:

Dec 11 2024, 1:56 AM · Essential-Work, Content-Transform-Team (Work In Progress), affects-Kiwix-and-openZIM, MediaWiki-Engineering, MediaWiki-REST-API, Page Content Service

Dec 10 2024

Audiodude added a comment to T379017: API returning completely empty contents from mobile-html endpoint for some articles.

Couldn't find anything on arz (though mwoffliner failed for a different reason), but just came across this in cawiki:

Dec 10 2024, 12:45 AM · Essential-Work, Content-Transform-Team (Work In Progress), affects-Kiwix-and-openZIM, MediaWiki-Engineering, MediaWiki-REST-API, Page Content Service

Dec 6 2024

Audiodude added a comment to T379017: API returning completely empty contents from mobile-html endpoint for some articles.

I'll re-run the scrape of arz and es wiktionary to see if I can find more reproducible cases.

Dec 6 2024, 4:58 PM · Essential-Work, Content-Transform-Team (Work In Progress), affects-Kiwix-and-openZIM, MediaWiki-Engineering, MediaWiki-REST-API, Page Content Service

Nov 14 2024

Audiodude added a project to T379017: API returning completely empty contents from mobile-html endpoint for some articles: API Platform.
Nov 14 2024, 6:56 PM · Essential-Work, Content-Transform-Team (Work In Progress), affects-Kiwix-and-openZIM, MediaWiki-Engineering, MediaWiki-REST-API, Page Content Service
Audiodude added a comment to T379017: API returning completely empty contents from mobile-html endpoint for some articles.

Yes, that page is still broken. The original article that was reported is now working.

Nov 14 2024, 4:50 PM · Essential-Work, Content-Transform-Team (Work In Progress), affects-Kiwix-and-openZIM, MediaWiki-Engineering, MediaWiki-REST-API, Page Content Service
Audiodude created T379937: Where can I find the code for resizing images in the mobile-sections API?.
Nov 14 2024, 4:48 PM · Content-Transform-Team, affects-Kiwix-and-openZIM, Page Content Service, Mobile-Content-Service

Nov 5 2024

Audiodude added a comment to T379017: API returning completely empty contents from mobile-html endpoint for some articles.

Happening with this article from Spanish Wiktionary too: https://es.wiktionary.org/api/rest_v1/page/mobile-html/awalk

Nov 5 2024, 7:19 PM · Essential-Work, Content-Transform-Team (Work In Progress), affects-Kiwix-and-openZIM, MediaWiki-Engineering, MediaWiki-REST-API, Page Content Service

Nov 4 2024

Audiodude added a comment to T379017: API returning completely empty contents from mobile-html endpoint for some articles.

Also, we (Kiwix) have observed this issue in the past on arz, but it was with a different article that is no longer broken (see https://github.com/openzim/mwoffliner/issues/2003)

Nov 4 2024, 6:27 PM · Essential-Work, Content-Transform-Team (Work In Progress), affects-Kiwix-and-openZIM, MediaWiki-Engineering, MediaWiki-REST-API, Page Content Service
Audiodude updated the task description for T379017: API returning completely empty contents from mobile-html endpoint for some articles.
Nov 4 2024, 6:16 PM · Essential-Work, Content-Transform-Team (Work In Progress), affects-Kiwix-and-openZIM, MediaWiki-Engineering, MediaWiki-REST-API, Page Content Service
Audiodude created T379017: API returning completely empty contents from mobile-html endpoint for some articles.
Nov 4 2024, 6:14 PM · Essential-Work, Content-Transform-Team (Work In Progress), affects-Kiwix-and-openZIM, MediaWiki-Engineering, MediaWiki-REST-API, Page Content Service

Oct 13 2024

Audiodude added a comment to T375977: Temporary (1-2 weeks) quota increase for disaster recovery exercise.

@fnegri Now we're getting a message that we don't have enough RAM quota. The original server was g4.cores2.ram4.disk20, so we'd like to replicate that if possible. Thanks!

Oct 13 2024, 7:33 PM · Cloud-VPS (Quota-requests)

Oct 1 2024

Audiodude added a comment to T375977: Temporary (1-2 weeks) quota increase for disaster recovery exercise.

Thanks so much!

Oct 1 2024, 6:31 PM · Cloud-VPS (Quota-requests)

Sep 29 2024

Audiodude created T375977: Temporary (1-2 weeks) quota increase for disaster recovery exercise.
Sep 29 2024, 2:14 PM · Cloud-VPS (Quota-requests)

Jul 23 2024

Audiodude closed T370650: Allow Wikimedia Maps usage on mwoffliner as Resolved.

Thanks for the attention and explanation. We have implemented using the base URL of the wiki being scraped as the Referer header, and https://github.com/openzim/mwoffliner/issues/2067 been closed.

Jul 23 2024, 11:16 PM · serviceops-radar, affects-Kiwix-and-openZIM, Maps

Jul 22 2024

Audiodude added a comment to T370650: Allow Wikimedia Maps usage on mwoffliner.

mwoffliner issue: https://github.com/openzim/mwoffliner/issues/2061

Jul 22 2024, 3:07 PM · serviceops-radar, affects-Kiwix-and-openZIM, Maps
Audiodude created T370650: Allow Wikimedia Maps usage on mwoffliner.
Jul 22 2024, 3:07 PM · serviceops-radar, affects-Kiwix-and-openZIM, Maps

Jul 17 2024

Audiodude added a comment to T369723: Update all trove VMs to a modern guest image.

It would be best for mwoffliner (which runs the WP 1.0 Bot) if the maintenance wasn't between 0:00 - 4:00 UTC, because that's when the bot is running

Jul 17 2024, 4:05 AM · Goal, Data-Services, Cloud-VPS, cloud-services-team, User-Marostegui

Jun 24 2024

Audiodude added a comment to T368265: Disk volumes of cloud instances are completely mixed-up.

Just to be completely clear, I don't really feel comfortable editing /fstab on mwcurator. If one of you could do it that would be great! Thanks!

Jun 24 2024, 5:49 PM · Cloud-VPS, Cloud-Services-Origin-User, affects-Kiwix-and-openZIM
Audiodude added a comment to T368265: Disk volumes of cloud instances are completely mixed-up.

I can try, but I'm not sure I know what I'm doing. Where do I get the UUIDs from?

Jun 24 2024, 4:29 PM · Cloud-VPS, Cloud-Services-Origin-User, affects-Kiwix-and-openZIM

Jun 9 2024

Audiodude created T367010: WIkidata Query Service example queries are missing.
Jun 9 2024, 9:29 PM · Wikidata Query UI, Wikidata

May 25 2024

Audiodude closed T365321: Pageviews complete dumps have lots of rows with article name = '-' as Resolved.

Awesome, thank you so much for the explanation! It might be worth adding that to the README here: https://dumps.wikimedia.org/other/pageview_complete/readme.html

May 25 2024, 5:29 PM · Test Kitchen, Data-Engineering, Analytics, Data-Engineering-Wikistats

May 18 2024

Audiodude created T365321: Pageviews complete dumps have lots of rows with article name = '-'.
May 18 2024, 3:21 PM · Test Kitchen, Data-Engineering, Analytics, Data-Engineering-Wikistats

May 3 2024

Audiodude added a comment to T363901: Project WP1.0/mwoffliner requests Trove instance with 75 GB.

Yes it seems to have worked! Thanks so much.

May 3 2024, 3:45 PM · affects-Kiwix-and-openZIM, User-aborrero, cloud-services-team, Cloud-VPS (Quota-requests)

May 1 2024

Audiodude created T363901: Project WP1.0/mwoffliner requests Trove instance with 75 GB.
May 1 2024, 9:05 AM · affects-Kiwix-and-openZIM, User-aborrero, cloud-services-team, Cloud-VPS (Quota-requests)

Mar 8 2024

Audiodude added a comment to T359564: Advice needed: creating a row for every article across every language Wikipedia in ToolsDB.

Awesome, thanks for the feedback. I'll look into Trove.

Mar 8 2024, 5:05 PM · cloud-services-team, Data-Services

Mar 7 2024

Audiodude created T359564: Advice needed: creating a row for every article across every language Wikipedia in ToolsDB.

The Cloud-Services project tag is not intended to have any tasks. Please check the list on https://phabricator.wikimedia.org/project/profile/832/ and replace it with a more specific project tag to this task. Thanks!

Mar 7 2024, 4:40 PM · cloud-services-team, Data-Services

Oct 9 2023

Audiodude added a comment to T301469: Create minikube deploy for quarry.

I'm completely new to Kubernetes but have been reading through https://wikitech.wikimedia.org/wiki/Kubernetes/Kubernetes_Workshop. Does WM Cloud provide k8s clusters, or is it expected that we would provision our own cluster on individual cloud VPS instances?

Oct 9 2023, 10:45 PM · Patch-For-Review, Quarry
Audiodude added a comment to T348184: Add maintainers to quarry.

Thank you for all the information, it is very helpful! We can stick to asynchronous communication if that's what works best, no problem. I guess we can keep using this ticket for Q&A?

Oct 9 2023, 9:47 PM · Quarry
Audiodude updated subscribers of T348184: Add maintainers to quarry.

Looking at that wiki page I linked, it seems at least somewhat out of date. I'd like to work on upgrading Python to at least 3.11, since 3.7 is EOL since June of 2023. Of course this might require upgrading dependencies as well. I see that @Framawiki has some quarry-dev-* instances with a puppet "skip" note of:

Oct 9 2023, 5:25 PM · Quarry

Oct 7 2023

Audiodude closed T348364: Allow Quarry to use arbitrary hostnames for the replica DB as Resolved.
Oct 7 2023, 10:09 PM · Quarry
Audiodude added a comment to T348364: Allow Quarry to use arbitrary hostnames for the replica DB.

https://github.com/toolforge/quarry/pull/25

Oct 7 2023, 6:27 PM · Quarry
Audiodude added a comment to T348184: Add maintainers to quarry.

@SD0001 I found: https://wikitech.wikimedia.org/wiki/Portal:Data_Services/Admin/Quarry, but I don't think I fully understand it

Oct 7 2023, 6:01 PM · Quarry
Audiodude added a comment to T348364: Allow Quarry to use arbitrary hostnames for the replica DB.

I thought I was going crazy because REPLICA_HOST does in fact exist in default_config.yaml, but it turns out it isn't used anywhere in the repo so it must be a vestige of an old way of calculating the hostname.

Oct 7 2023, 2:17 AM · Quarry
Audiodude updated the task description for T348364: Allow Quarry to use arbitrary hostnames for the replica DB.
Oct 7 2023, 1:16 AM · Quarry
Audiodude updated the task description for T348364: Allow Quarry to use arbitrary hostnames for the replica DB.
Oct 7 2023, 12:52 AM · Quarry
Audiodude added a comment to T348184: Add maintainers to quarry.

Forked discussion to T348364

Oct 7 2023, 12:45 AM · Quarry
Audiodude created T348364: Allow Quarry to use arbitrary hostnames for the replica DB.
Oct 7 2023, 12:45 AM · Quarry
Audiodude added a comment to T348184: Add maintainers to quarry.

Shouldn't REPLICA_DOMAIN be set to analytics.db.svc.wikimedia.cloud for this to work? I haven't tried it myself yet. Then you would get enwiki.analytics.db.svc.wikimedia.cloud which would be correct right?

Oct 7 2023, 12:36 AM · Quarry

Oct 6 2023

Audiodude added a comment to T348184: Add maintainers to quarry.

FWIW I set up the dev environment without any issue and was able to run queries against mywiki.

Oct 6 2023, 7:56 PM · Quarry

Oct 5 2023

Audiodude added a comment to T348184: Add maintainers to quarry.

Confirmed: I got the github invite. I can also access the instances with my wikitech account, thanks!

Oct 5 2023, 5:30 PM · Quarry
Audiodude added a comment to T348184: Add maintainers to quarry.

I'm audiodude on github. Thanks!

Oct 5 2023, 3:59 PM · Quarry
Audiodude added a comment to T348184: Add maintainers to quarry.

I assume we need some kind of access to the Github repo too? https://github.com/toolforge/quarry

Oct 5 2023, 4:34 AM · Quarry
Audiodude added a comment to T151106: Quarry exports integers as floats to wikitable.

Another puzzling part is that MariaDB doesn't appear to be returning results as floats. I exposed the mywiki MariaDB in docker and ran this:

Oct 5 2023, 4:33 AM · Quarry
Audiodude added a comment to T151106: Quarry exports integers as floats to wikitable.

Documenting my investigation (no solution found).

Oct 5 2023, 4:29 AM · Quarry

Oct 3 2023

Audiodude added a comment to T169452: Replace Quarry with an installation of Superset.

So is it correct that we're looking for a new maintainer, but only in the capacity of migrating all usage of Quarry to Superset? That is, no new features are planned for or expected of Quarry and we expect to turn it down once Superset has feature and use case parity?

Oct 3 2023, 11:23 PM · cloud-services-team (FY2024/2025-Q1-Q2), superset.wmcloud.org, Quarry

Jul 19 2023

Audiodude added a comment to T342144: singleuser container not building.

My pleasure. Did I save mwdiffs and mwpersistence? That's the goal. If so should one of us update the announcement to [Cloud-announce]?

Jul 19 2023, 5:32 PM · PAWS

Jul 18 2023

Audiodude added a comment to T342144: singleuser container not building.

I successfully built the singleuser image with jupyterlab=3.6.3 and this PR: https://github.com/toolforge/paws/pull/309

Jul 18 2023, 10:03 PM · PAWS
Audiodude added a comment to T342144: singleuser container not building.

Maybe not quite that. Looks like mwpersistence requires deltas -> yamlconf -> PyYAML 5.4.1:

Jul 18 2023, 9:52 PM · PAWS
Audiodude added a comment to T342144: singleuser container not building.

My guess is that the implicated libraries (mwdiffs and mwpersistence) are written in Python 2 and can't upgrade to the latest version of PyYAML, but that's just a guess.

Jul 18 2023, 9:45 PM · PAWS
Audiodude added a comment to T342144: singleuser container not building.

I think removing those libraries simply causes pip to resolve the dependency to a later version of PyYAML which doesn't have the issue, as seen here: https://github.com/flyteorg/flytekit/pull/1752/files

Jul 18 2023, 9:44 PM · PAWS
Audiodude added a comment to T342144: singleuser container not building.

I tried bumping jupyterlab to 3.6.3 as seen in this commit: https://github.com/toolforge/paws/pull/308/commits/425a53a6449198beca9e3466c32f3604bdfbe31e

Jul 18 2023, 9:35 PM · PAWS

Dec 17 2022

Audiodude added a comment to T325392: Account recovery help needed for Developer account tmoney.

Thank you!

Dec 17 2022, 3:25 AM · User-bd808, wikitech.wikimedia.org, Trust-and-Safety, cloud-services-team (Kanban)