Gilles (Gilles Dubuc)
Senior Performance Engineer, WMF

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Wednesday

  • Clear sailing ahead.

User Details

User Since
Oct 7 2014, 8:24 AM (213 w, 6 d)
Availability
Available
IRC Nick
gilles
LDAP User
Gilles
MediaWiki User
Unknown

Recent Activity

Today

Gilles moved T208999: [Bug] lastVisualChange osciallating by ~1s and fullyLoaded oscillating by ~.5s on Obama enwiki prod desktop ~Nov 4 2018 from Inbox to Doing on the Performance-Team board.
Mon, Nov 12, 9:04 PM · WebPageTest, Performance, Readers-Web-Backlog (Tracking), Performance-Team
Gilles assigned T208999: [Bug] lastVisualChange osciallating by ~1s and fullyLoaded oscillating by ~.5s on Obama enwiki prod desktop ~Nov 4 2018 to Peter.
Mon, Nov 12, 9:04 PM · WebPageTest, Performance, Readers-Web-Backlog (Tracking), Performance-Team
Gilles added a comment to T208020: Consider using Chrome origin trials to try new APIs on real users early.

I've clarified one last point, which is how they consider that a feature is used (counting towards the quota). It depends on the feature, but for JS APIs, mere existence checks won't count as usage. Eg.

Mon, Nov 12, 9:53 AM · Performance-Team

Fri, Nov 9

Gilles triaged T204174: FileOperation error "SwiftFileBackend::addMissingMetadata: {path} was not stored with SHA-1 metadata." as Low priority.
Fri, Nov 9, 7:56 AM · Patch-For-Review, Performance-Team, Thumbor, MediaWiki-File-management, Wikimedia-production-error
Gilles reassigned T204174: FileOperation error "SwiftFileBackend::addMissingMetadata: {path} was not stored with SHA-1 metadata." from Gilles to aaron.
Fri, Nov 9, 7:56 AM · Patch-For-Review, Performance-Team, Thumbor, MediaWiki-File-management, Wikimedia-production-error

Thu, Nov 8

Gilles added a comment to T204174: FileOperation error "SwiftFileBackend::addMissingMetadata: {path} was not stored with SHA-1 metadata.".

Header name + data is 55 bytes longs. We have 1.2 billion thumbnails in Swift. That's 66 GiB of data. Which represents 0.02% of our Swift storage space. Not earth-shattering savings, I'll grant you that, but I think we should get rid of data we don't need. It does affect speed a little as well when fetching thumbnails from Swift.

Thu, Nov 8, 3:35 PM · Patch-For-Review, Performance-Team, Thumbor, MediaWiki-File-management, Wikimedia-production-error
Gilles closed T209047: Oversampling can cause double measurement on a given pageload as Invalid.

This seems intentional and I can finally understand why. You need to keep recording the non-oversampled hits, otherwise that lowers your record rate of non-oversampled pageviews.

Thu, Nov 8, 12:52 PM · Patch-For-Review, MediaWiki-extensions-NavigationTiming, Performance-Team
Gilles moved T208020: Consider using Chrome origin trials to try new APIs on real users early from Next-up to Doing on the Performance-Team board.
Thu, Nov 8, 12:49 PM · Performance-Team
Gilles claimed T208020: Consider using Chrome origin trials to try new APIs on real users early.
Thu, Nov 8, 12:48 PM · Performance-Team
Gilles triaged T208020: Consider using Chrome origin trials to try new APIs on real users early as Normal priority.
Thu, Nov 8, 12:48 PM · Performance-Team
Gilles added a comment to T208020: Consider using Chrome origin trials to try new APIs on real users early.

Looking at how it works, it seems safe for privacy. We would expose a fixed token generated for the trial we want to perform, which automatically expires after 6 weeks. The token can be served as a response header or a meta tag. This means that we would be serving the same token to all visitors.

Thu, Nov 8, 12:42 PM · Performance-Team
Gilles updated the task description for T209047: Oversampling can cause double measurement on a given pageload.
Thu, Nov 8, 12:02 PM · Patch-For-Review, MediaWiki-extensions-NavigationTiming, Performance-Team
Gilles triaged T209047: Oversampling can cause double measurement on a given pageload as Normal priority.
Thu, Nov 8, 12:02 PM · Patch-For-Review, MediaWiki-extensions-NavigationTiming, Performance-Team
Gilles created T209047: Oversampling can cause double measurement on a given pageload.
Thu, Nov 8, 11:55 AM · Patch-For-Review, MediaWiki-extensions-NavigationTiming, Performance-Team
Gilles closed T131894: Collect Backend-Timing in Prometheus as Resolved.

The basic functionality is there. If we want to iterate on that, it should be the subject of a new task.

Thu, Nov 8, 10:32 AM · Patch-For-Review, Wikimedia-Incident, Operations, Traffic, Performance-Team
Gilles moved T207862: Use Server-Timing to collect Varnish hit/miss information in NavigationTiming from Backlog to Doing on the Performance-Team board.
Thu, Nov 8, 10:30 AM · Patch-For-Review, Performance-Team
Gilles triaged T207862: Use Server-Timing to collect Varnish hit/miss information in NavigationTiming as Normal priority.
Thu, Nov 8, 10:23 AM · Patch-For-Review, Performance-Team
Gilles closed T207723: Chrome 70: RUM changes as Resolved.
Thu, Nov 8, 9:50 AM · Performance-Team
Gilles added a comment to T207723: Chrome 70: RUM changes.

Looking at Grafana, it appears that Chrome 69 is prone to sending extremely high values for metrics like firstPaint:

Thu, Nov 8, 9:50 AM · Performance-Team

Wed, Nov 7

Gilles closed T207411: Using mw.eventLog.inSample for subsampling gets in the way of randomness as Resolved.
Wed, Nov 7, 5:24 AM · MW-1.33-notes (1.33.0-wmf.1; 2018-10-23), Patch-For-Review, Performance-Team, MediaWiki-extensions-NavigationTiming

Tue, Nov 6

Gilles added a comment to T207315: Investigate 200-300ms increase in responseStart.p75.

1.31.0-wmf.6 was deployed on 2017-11-01 to group 1 and on 2017-11-02 to group 2

Tue, Nov 6, 11:02 AM · Operations, Traffic, Performance-Team
Gilles added a comment to T207315: Investigate 200-300ms increase in responseStart.p75.

This 2017-11-01 SAL entry seems noteworthy:

Tue, Nov 6, 10:49 AM · Operations, Traffic, Performance-Team
Gilles added a comment to T207315: Investigate 200-300ms increase in responseStart.p75.

I've found something very interesting. If you plot both navtiming and navtiming2, navtiming2 for mobile is in direct continuation of the old trend, without any regression (it's improving over time, even!):

Tue, Nov 6, 10:37 AM · Operations, Traffic, Performance-Team

Mon, Nov 5

Jdlrobson awarded Blog Post: Mobile web performance: the importance of the device a Like token.
Mon, Nov 5, 11:16 PM · Performance-Team
Gilles created T208782: Graphite error causing breakage of Graphite-backed Grafana dashboards.
Mon, Nov 5, 8:28 PM · Patch-For-Review, monitoring, User-fgiunchedi, Graphite
Gilles added a comment to T208752: webrequest data loss 2018-11-05 on upload partition .

Exactly, the way the webp support works if that it restarts the Varnish transaction after rewriting the request URL. This is a Varnish feature that hasn't been used much in production before. Sorry for the mess it created in the kafka pipeline, I had not anticipated that it could cause something like this. We can easily turn the feature off if you guys have some fixing to do, and turn it back on later.

Mon, Nov 5, 8:04 PM · Analytics-Kanban, Patch-For-Review, Operations, Traffic, Analytics
TheDJ awarded T27611: Support optimized WebP thumbnails as alternative to JPEG, PNG a Barnstar token.
Mon, Nov 5, 1:54 PM · Patch-For-Review, Performance-Team, Commons, Multimedia, MediaWiki-File-management

Sun, Nov 4

rafidaslam awarded Blog Post: Machine learning: how to undersample the wrong way a Like token.
Sun, Nov 4, 1:38 PM · Performance-Team
Gilles added a comment to T207615: Varnish won't purge thumbnails of specific file.

This is really bizarre. Second time it happens, and the previous affected file didn't seem to have special characters besides a dash (there could be more than one bug involved, though): https://upload.wikimedia.org/wikipedia/commons/thumb/d/d8/PL_Jean_de_La_Fontaine_-_Bajki.djvu/page657-1024px-PL_Jean_de_La_Fontaine_-_Bajki.djvu.jpg

Sun, Nov 4, 10:11 AM · Traffic, Operations

Mon, Oct 29

Gilles moved T207723: Chrome 70: RUM changes from Inbox to Doing on the Performance-Team board.
Mon, Oct 29, 9:29 PM · Performance-Team
Gilles moved T207860: Collect client network errors, deprecation, intervention and crash reports from Inbox to Backlog on the Performance-Team board.
Mon, Oct 29, 9:29 PM · Performance-Team
Gilles moved T207862: Use Server-Timing to collect Varnish hit/miss information in NavigationTiming from Inbox to Backlog on the Performance-Team board.
Mon, Oct 29, 9:27 PM · Patch-For-Review, Performance-Team
Gilles claimed T207862: Use Server-Timing to collect Varnish hit/miss information in NavigationTiming.
Mon, Oct 29, 9:27 PM · Patch-For-Review, Performance-Team
Gilles claimed T208015: Preload top article image on desktop.
Mon, Oct 29, 9:25 PM · Performance-Team
Gilles moved T208020: Consider using Chrome origin trials to try new APIs on real users early from Inbox to Next-up on the Performance-Team board.
Mon, Oct 29, 9:23 PM · Performance-Team
Gilles closed T208088: Add ability to randomize display order of answer options in QuickSurveys as Resolved.
Mon, Oct 29, 8:46 AM · MW-1.33-notes (1.33.0-wmf.1; 2018-10-23), Readers-Web-Backlog (Tracking), Patch-For-Review, QuickSurveys, Performance-Team

Sun, Oct 28

Gilles committed rEQS50aa1bbe3e9c: Add ability to shuffle answers display order (authored by Gilles).
Add ability to shuffle answers display order
Sun, Oct 28, 9:03 PM
Gilles committed rEQS32818cee5c1d: Add ability to shuffle answers display order (authored by Gilles).
Add ability to shuffle answers display order
Sun, Oct 28, 1:46 PM
Gilles committed rEQS2c2b307bfbad: Add ability to shuffle answers display order (authored by Gilles).
Add ability to shuffle answers display order
Sun, Oct 28, 1:44 PM

Fri, Oct 26

Gilles committed rEQS244d035017c9: Add ability to shuffle answers display order (authored by Gilles).
Add ability to shuffle answers display order
Fri, Oct 26, 8:47 PM
Gilles committed rEQS654161467a9e: Add ability to shuffle answers display order (authored by Gilles).
Add ability to shuffle answers display order
Fri, Oct 26, 8:47 PM
Gilles raised the priority of T208088: Add ability to randomize display order of answer options in QuickSurveys from Normal to High.
Fri, Oct 26, 7:48 PM · MW-1.33-notes (1.33.0-wmf.1; 2018-10-23), Readers-Web-Backlog (Tracking), Patch-For-Review, QuickSurveys, Performance-Team
Gilles triaged T208088: Add ability to randomize display order of answer options in QuickSurveys as Normal priority.
Fri, Oct 26, 6:57 PM · MW-1.33-notes (1.33.0-wmf.1; 2018-10-23), Readers-Web-Backlog (Tracking), Patch-For-Review, QuickSurveys, Performance-Team
Gilles created T208088: Add ability to randomize display order of answer options in QuickSurveys.
Fri, Oct 26, 6:57 PM · MW-1.33-notes (1.33.0-wmf.1; 2018-10-23), Readers-Web-Backlog (Tracking), Patch-For-Review, QuickSurveys, Performance-Team
Gilles closed T204921: Rename EventLogging column surveyInstanceToken to pageviewToken in QuickSurveysResponses for consistency as Resolved.
SELECT COUNT(*) FROM event.quicksurveyinitiation WHERE year = 2018 and month = 10 AND day = 26 AND event.pageviewToken IS NOT NULL;
Fri, Oct 26, 10:08 AM · MW-1.33-notes (1.33.0-wmf.1; 2018-10-23), Performance-Team, MW-1.32-notes (WMF-deploy-2018-10-16 (1.32.0-wmf.26)), Readers-Web-Backlog (Tracking), Patch-For-Review, QuickSurveys
Gilles created T208020: Consider using Chrome origin trials to try new APIs on real users early.
Fri, Oct 26, 9:08 AM · Performance-Team
Gilles renamed T208015: Preload top article image on desktop from Preload top article image to Preload top article image on desktop.
Fri, Oct 26, 7:44 AM · Performance-Team
Gilles created T208015: Preload top article image on desktop.
Fri, Oct 26, 7:32 AM · Performance-Team
Gilles added a comment to T182314: Analyze results of enwiki and dewiki page previews a/b test.

Some browsers are targeting 60fps, which means around 16ms between paint frames to execute stuff. I don't know if somehow those 60fps targets could be somehow synced to round times, which would explain a 8/16ms cycle, and the spikes would be clients that are up to date with NTP and experiencing smooth execution, others that have a clock skew, that are currently experiencing jank or that are coming from browsers that would have a different behaviour.

Fri, Oct 26, 6:49 AM · Product-Analytics, Readers-Web-Backlog (Tracking), Reading-analysis, Page-Previews

Wed, Oct 24

Gilles renamed T207862: Use Server-Timing to collect Varnish hit/miss information in NavigationTiming from User Server-Timing to collect Varnish hit/miss information in NavigationTiming to Use Server-Timing to collect Varnish hit/miss information in NavigationTiming.
Wed, Oct 24, 4:28 PM · Patch-For-Review, Performance-Team
Gilles created T207862: Use Server-Timing to collect Varnish hit/miss information in NavigationTiming.
Wed, Oct 24, 4:28 PM · Patch-For-Review, Performance-Team
Gilles created T207860: Collect client network errors, deprecation, intervention and crash reports.
Wed, Oct 24, 4:21 PM · Performance-Team

Tue, Oct 23

Gilles added a comment to T207723: Chrome 70: RUM changes.

Now that we have bucketed RUM data in Turnilo, it makes comparing between Chrome 69 and Chrome 70 easy. The visualisations are limited, but opening 2 tabs I can easily switch back and forth to see the difference.

Tue, Oct 23, 8:30 AM · Performance-Team
Gilles triaged T207723: Chrome 70: RUM changes as Normal priority.
Tue, Oct 23, 8:12 AM · Performance-Team
Gilles created T207723: Chrome 70: RUM changes.
Tue, Oct 23, 8:12 AM · Performance-Team
Gilles added a comment to T166414: Explore NavigationTiming by faceted properties - EventLogging refine.

Looks great! Already I'm finding interesting facts about Chrome 69 vs Chrome 70

Tue, Oct 23, 8:11 AM · Analytics-Kanban, Performance-Team (Radar), Analytics, Patch-For-Review
Gilles added a comment to T198946: Add Schema property 'sameAs' pointing to Wikidata entries.

In fact there are already other Wikimedia logos in there: https://github.com/wikimedia/operations-mediawiki-config/tree/master/static/images

Tue, Oct 23, 7:31 AM · Performance-Team (Radar), MW-1.33-notes (1.33.0-wmf.1; 2018-10-23), Wikidata-Campsite, Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q2), Wikidata, MediaWiki-extensions-WikibaseClient, SEO
Gilles added a comment to T198946: Add Schema property 'sameAs' pointing to Wikidata entries.

https://www.wikidata.org/extensions/Wikibase/client/assets/wikimedia.png This is a very unusual location for a static image. Was this vetted by Traffic ? This image being consumed by bots/crawlers means a long-term commitment to that URL working. I would have expected it to be housed in /static/ where all the logos are, including wikidata's own. I.e. something like https://www.wikidata.org/static/images/project-logos/wikimedia.png or https://www.wikidata.org/static/images/wikimedia.png

Tue, Oct 23, 7:27 AM · Performance-Team (Radar), MW-1.33-notes (1.33.0-wmf.1; 2018-10-23), Wikidata-Campsite, Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q2), Wikidata, MediaWiki-extensions-WikibaseClient, SEO
Gilles added a comment to T198946: Add Schema property 'sameAs' pointing to Wikidata entries.

It seems like this extra content would likely repeat strings present elsewhere in the HTML, which means that this additional content should compress well. Could you look at how much extra weight it adds to the page when gzipped on a couple of articles (big and small)? This should help put the cost into perspective.

Tue, Oct 23, 7:19 AM · Performance-Team (Radar), MW-1.33-notes (1.33.0-wmf.1; 2018-10-23), Wikidata-Campsite, Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q2), Wikidata, MediaWiki-extensions-WikibaseClient, SEO

Mon, Oct 22

Gilles moved T204921: Rename EventLogging column surveyInstanceToken to pageviewToken in QuickSurveysResponses for consistency from Inbox to Doing on the Performance-Team board.
Mon, Oct 22, 8:40 PM · MW-1.33-notes (1.33.0-wmf.1; 2018-10-23), Performance-Team, MW-1.32-notes (WMF-deploy-2018-10-16 (1.32.0-wmf.26)), Readers-Web-Backlog (Tracking), Patch-For-Review, QuickSurveys
Gilles moved T207411: Using mw.eventLog.inSample for subsampling gets in the way of randomness from Inbox to Doing on the Performance-Team board.
Mon, Oct 22, 8:39 PM · MW-1.33-notes (1.33.0-wmf.1; 2018-10-23), Patch-For-Review, Performance-Team, MediaWiki-extensions-NavigationTiming
Gilles renamed T207615: Varnish won't purge thumbnails of specific file from Outdated Copyright violation thumbnails for djvu file on Commons to Varnish won't purge thumbnails of specific file.
Mon, Oct 22, 1:39 PM · Traffic, Operations
Gilles added a comment to T207615: Varnish won't purge thumbnails of specific file.

This seems to be an issue with Varnish purging. Purging that file with debugging turned on, I can clearly see MediaWiki issuing the order to purge those files, including the problematic thumbnails that remain old no matter what: https://logstash.wikimedia.org/app/kibana#/doc/logstash-*/logstash-2018.10.22/mediawiki?id=AWab6v-X00on8STvlYvw&_g=h@44136fa

Mon, Oct 22, 1:38 PM · Traffic, Operations
Gilles closed T206190: Outdated thumbnail for djvu file on Commons cannot be purged and does not update, a subtask of T43371: Thumbnail/imagescaler (tracking), as Resolved.
Mon, Oct 22, 11:25 AM · media-storage, Commons, Tracking
Gilles closed T206190: Outdated thumbnail for djvu file on Commons cannot be purged and does not update as Resolved.

No, I think it's impossible to reproduce the exact conditions that happened during upload and caused this. It seems like the various expiry mechanisms ultimately rectified that thumbnail, thanks for the update.

Mon, Oct 22, 11:25 AM · Thumbor, media-storage

Fri, Oct 19

Gilles triaged T204702: Move Thumbor Plugins code review from Differential to Gerrit as Normal priority.
Fri, Oct 19, 11:22 AM · Repository-Admins, Thumbor
Gilles added a comment to T204702: Move Thumbor Plugins code review from Differential to Gerrit.

@Aklapper who should I assign the second step to?

Fri, Oct 19, 11:22 AM · Repository-Admins, Thumbor
Gilles updated the task description for T204702: Move Thumbor Plugins code review from Differential to Gerrit.
Fri, Oct 19, 11:21 AM · Repository-Admins, Thumbor
Gilles placed T204702: Move Thumbor Plugins code review from Differential to Gerrit up for grabs.
Fri, Oct 19, 11:21 AM · Repository-Admins, Thumbor
Gilles closed T205580: Microbenchmark device power and record results in NavigationTiming as Resolved.

I've verified that the scores are being collected correctly and the values make sense when compared to device type on Android.

Fri, Oct 19, 11:15 AM · MW-1.32-notes (WMF-deploy-2018-10-16 (1.32.0-wmf.26)), Patch-For-Review, MediaWiki-extensions-NavigationTiming, Performance-Team
Gilles committed rEQSe29f28ac8747: Fix pageviewToken casing issues (authored by Gilles).
Fix pageviewToken casing issues
Fri, Oct 19, 10:41 AM
Gilles added a comment to T204921: Rename EventLogging column surveyInstanceToken to pageviewToken in QuickSurveysResponses for consistency.

Recent entries have a null surveyInstanceToken and no pageviewToken field in Hive. Triggering the survey manually and looking at the beacon call, the schema version is correct, however there isn't any surveyInstanceToken not pageviewToken parameter passed.

Fri, Oct 19, 10:11 AM · MW-1.33-notes (1.33.0-wmf.1; 2018-10-23), Performance-Team, MW-1.32-notes (WMF-deploy-2018-10-16 (1.32.0-wmf.26)), Readers-Web-Backlog (Tracking), Patch-For-Review, QuickSurveys

Thu, Oct 18

Gilles triaged T207411: Using mw.eventLog.inSample for subsampling gets in the way of randomness as Normal priority.
Thu, Oct 18, 7:56 PM · MW-1.33-notes (1.33.0-wmf.1; 2018-10-23), Patch-For-Review, Performance-Team, MediaWiki-extensions-NavigationTiming
Gilles renamed T207411: Using mw.eventLog.inSample for subsampling gets in the way of randomness from Reusing mw.eventLog.inSample gets in the way of randomness to Using mw.eventLog.inSample for subsampling gets in the way of randomness.
Thu, Oct 18, 7:52 PM · MW-1.33-notes (1.33.0-wmf.1; 2018-10-23), Patch-For-Review, Performance-Team, MediaWiki-extensions-NavigationTiming
Gilles created T207411: Using mw.eventLog.inSample for subsampling gets in the way of randomness.
Thu, Oct 18, 7:52 PM · MW-1.33-notes (1.33.0-wmf.1; 2018-10-23), Patch-For-Review, Performance-Team, MediaWiki-extensions-NavigationTiming
D3r1ck01 awarded Blog Post: Machine learning: how to undersample the wrong way a Like token.
Thu, Oct 18, 6:23 PM · Performance-Team

Tue, Oct 16

Gilles added a comment to T207165: eventlogging_db_sanitization script failed.
0: jdbc:hive2://an-coord1001.eqiad.wmnet:1000> SELECT COUNT(*) FROM event.resourcetiming WHERE year = 2018;
Tue, Oct 16, 12:55 PM · Analytics-Kanban, Analytics
Gilles added a comment to T207165: eventlogging_db_sanitization script failed.

It already is

Tue, Oct 16, 12:51 PM · Analytics-Kanban, Analytics

Mon, Oct 15

Gilles updated the post content for Blog Post: Machine learning: how to undersample the wrong way.
Mon, Oct 15, 3:18 PM · Performance-Team
Gilles updated the post content for Blog Post: Machine learning: how to undersample the wrong way.
Mon, Oct 15, 3:16 PM · Performance-Team
Gilles updated the post content for Blog Post: Machine learning: how to undersample the wrong way.
Mon, Oct 15, 1:37 PM · Performance-Team
Gilles created Blog Post: Machine learning: how to undersample the wrong way.
Mon, Oct 15, 1:37 PM · Performance-Team
Gilles added a subtask for T170817: Upgrade Thumbor servers to Stretch: T196961: Non-rendering of thumbnail of compressed pdf in Commons.
Mon, Oct 15, 12:58 PM · Patch-For-Review, User-fgiunchedi, Performance-Team (Radar), Operations, Thumbor
Gilles added a parent task for T196961: Non-rendering of thumbnail of compressed pdf in Commons: T170817: Upgrade Thumbor servers to Stretch.
Mon, Oct 15, 12:58 PM · Thumbor, Multimedia, MediaWiki-extensions-PdfHandler, Commons
Gilles added a comment to T196961: Non-rendering of thumbnail of compressed pdf in Commons.

That horse radish culture PDF is timing out when processed with ghostscript. Which means it's taking more than one minute on our production servers. It's an unreasonable amount of time for any thumbnail. The question is what's special about a 393KB PDF that it would take 1+ minute to extract a thumbnail from it.

Mon, Oct 15, 12:57 PM · Thumbor, Multimedia, MediaWiki-extensions-PdfHandler, Commons
Gilles closed T206207: Set up raspberry pi as ZNC bouncer as Resolved.
Mon, Oct 15, 11:51 AM · Performance-Team
Gilles added a comment to T206207: Set up raspberry pi as ZNC bouncer.

Steps taken were:

Mon, Oct 15, 11:31 AM · Performance-Team
Gilles claimed T206207: Set up raspberry pi as ZNC bouncer.
Mon, Oct 15, 10:59 AM · Performance-Team

Oct 11 2018

Gilles added a comment to T203135: ThumbnailRender job fails with 429 errors.

Can this job be configured to not retry at all? I think that would be ideal. At the end of the day it's a warmup script and it's fine for it to fail.

Oct 11 2018, 9:40 AM · Core Platform Team Kanban (Done with CPT), Services (done), MW-1.32-notes (WMF-deploy-2018-10-02 (1.32.0-wmf.24)), Thumbor, WMF-Blog-Social-Team, Multimedia

Oct 8 2018

Gilles added a comment to T131894: Collect Backend-Timing in Prometheus.

https://grafana.wikimedia.org/dashboard/db/apache-backend-timing getting something started there...

Oct 8 2018, 3:42 PM · Patch-For-Review, Wikimedia-Incident, Operations, Traffic, Performance-Team
Gilles edited P7650 firstPaint timing.
Oct 8 2018, 1:27 PM · Performance-Team
Gilles created P7650 firstPaint timing.
Oct 8 2018, 1:24 PM · Performance-Team
Gilles created P7649 firstPaint timing.
Oct 8 2018, 1:24 PM · Performance-Team

Oct 5 2018

Gilles changed the status of T131894: Collect Backend-Timing in Prometheus from Stalled to Open.
Oct 5 2018, 2:16 PM · Patch-For-Review, Wikimedia-Incident, Operations, Traffic, Performance-Team
Gilles closed T205533: Create views to simplify access to renamed columns on NavigationTiming and Quicksurveys schemas as Resolved.
Oct 5 2018, 1:37 PM · Performance-Team

Oct 4 2018

Gilles created T206207: Set up raspberry pi as ZNC bouncer.
Oct 4 2018, 10:29 AM · Performance-Team
Gilles added a comment to T203135: ThumbnailRender job fails with 429 errors.

The volume of errors for this job has gone way down. I've looked at what's left and they're legitimate errors where the files can't render. I think the job might be hitting the 429 for failing render either because the person is uploading the same file multiple times, or they've accessed them from their client before the job reaches that particular file. The client requests make the failure counter go up, and then the job is the one too many attempt to render it.

Oct 4 2018, 10:26 AM · Core Platform Team Kanban (Done with CPT), Services (done), MW-1.32-notes (WMF-deploy-2018-10-02 (1.32.0-wmf.24)), Thumbor, WMF-Blog-Social-Team, Multimedia
Gilles closed T195840: Track when a CentralNotice banner was displayed to the user in NavTiming as Resolved.
Oct 4 2018, 10:15 AM · MW-1.32-notes (WMF-deploy-2018-10-02 (1.32.0-wmf.24)), Patch-For-Review, Wikimedia-Fundraising, MediaWiki-extensions-NavigationTiming, Performance-Team
Gilles closed T205117: Collect navigator.deviceMemory in navtiming when available as Resolved.
Oct 4 2018, 10:13 AM · MW-1.32-notes (WMF-deploy-2018-10-02 (1.32.0-wmf.24)), Patch-For-Review, MediaWiki-extensions-NavigationTiming, Performance-Team
Gilles added a comment to T206190: Outdated thumbnail for djvu file on Commons cannot be purged and does not update.

It's the purge that's malfunctioning and not purging the thumbnails for that file at all.

Oct 4 2018, 9:24 AM · Thumbor, media-storage