
Implement an accurate and easy to understand status page for all wikis
Open, Medium, Public

Description

In T199816 we deprecated status.wikimedia.org, and replaced it with a static HTML page that points users to a Grafana dashboard. This is a reasonable intermediate step, but is not a particularly user-friendly way to indicate when a particular wiki, group of wikis, or partial functionality is unavailable.

We should implement a status page that tells users when some or all functionality is impacted, and that displays this information in a user-friendly, easy-to-understand way. Ease of use and accuracy are very important and should be taken into account by any implementation. (These were the biggest issues with the prior Watchmouse implementation, and resulted in that implementation being removed.)

For reference, the previous site looked like this: archive.org link and screenshot: F31755516

Event Timeline

Restricted Application added a subscriber: Aklapper. Aug 16 2018, 2:50 PM
waldyrious added a subscriber: waldyrious.
ArielGlenn triaged this task as Medium priority. Aug 29 2018, 10:30 AM
CDanis added a subscriber: CDanis. Dec 5 2018, 4:53 PM
TJH2018 added a subscriber: TJH2018. Edited Feb 12 2019, 4:08 AM

I think a good example to build on would be https://status.discordapp.com/, as it has the basics and explains why an issue happens. We could easily expand something like this to fit the needs of Wikimedia. Discord uses an external provider, but the look of the page itself is quite easy to understand.

Quiddity updated the task description. Apr 13 2020, 8:53 PM
Restricted Application added a project: Internet-Archive. Apr 13 2020, 8:53 PM
Quiddity added a subscriber: Quiddity.
Ladsgroup added a subscriber: Ladsgroup.

I just stumbled upon https://www.githubstatus.com/ (GitHub had an outage) and I quite liked the timeline of "green, yellow, red" (green = the whole day was okay, yellow = degradation below one hour, red = degradation above one hour) per day for the past ninety days.
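
For illustration only, the per-day color rule described above could be sketched roughly as follows. This is a minimal sketch, not how githubstatus.com actually computes its timeline; the names `classify_day` and `build_timeline` are hypothetical, and treating exactly one hour of degradation as "red" is an assumption, since the rule above does not cover that case.

```python
from datetime import timedelta


def classify_day(degraded: timedelta) -> str:
    """Map one day's total degraded time to a timeline color."""
    if degraded <= timedelta(0):
        return "green"   # the whole day was okay
    if degraded < timedelta(hours=1):
        return "yellow"  # degradation below one hour
    return "red"         # assumption: exactly one hour or more counts as red


def build_timeline(daily_degradation, days=90):
    """Return one color per day for the most recent `days` entries.

    `daily_degradation` is a hypothetical list of timedeltas, oldest first,
    holding the total degraded time recorded for each day.
    """
    recent = daily_degradation[-days:] if days > 0 else []
    return [classify_day(d) for d in recent]
```

A status page could then render `build_timeline(...)` as a row of ninety colored cells, one per day.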

fgiunchedi moved this task from Inbox to Backlog on the observability board. Mon, Jul 6, 1:52 PM