[EPIC] Enforce "no increase in log errors" during deployments
Closed, Resolved, Public

Assigned To

None

Authored By

	greg
	Oct 15 2015, 4:59 PM

Description

Essentially:

If we deploy and the log errors increase, revert immediately.
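
As a rough illustration of that rule (not the tooling actually used for deployments), the check could be sketched as a comparison of error rates before and after a deploy. The `ErrorWindow` type, the `should_revert` function, and the `tolerance` parameter are all hypothetical names introduced for this sketch.

```python
from dataclasses import dataclass


@dataclass
class ErrorWindow:
    """Hypothetical container: error count observed over a time window."""
    errors: int
    minutes: float

    def __post_init__(self) -> None:
        if self.minutes <= 0:
            raise ValueError("window length must be positive")

    @property
    def rate(self) -> float:
        # Errors per minute, so windows of different lengths are comparable.
        return self.errors / self.minutes


def should_revert(before: ErrorWindow, after: ErrorWindow,
                  tolerance: float = 0.0) -> bool:
    """Return True when the post-deploy rate exceeds the baseline.

    `tolerance` is an assumed knob: 0.0 means any increase triggers a revert,
    0.1 allows a 10% increase before reverting.
    """
    return after.rate > before.rate * (1.0 + tolerance)


baseline = ErrorWindow(errors=10, minutes=10)
post_deploy = ErrorWindow(errors=12, minutes=10)
print(should_revert(baseline, post_deploy))
```

With `tolerance` left at zero this matches the strict reading of the rule; a real check would also need to decide how long to observe after the deploy, which the task does not specify.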

Related Objects

Status	Assigned	Task
Resolved	None	T115629 [EPIC] Enforce "no increase in log errors" during deployments
Resolved	None	T115630 [EPIC] Reduce production log errors to zero*
Resolved	• demon	T108749 Implement "WMF Log Errors count" KPI
Declined	None	T115633 Proposal: Force any WARNINGs on Beta Cluster to fail completely
Resolved	• demon	T81030 gdash reports for php/apache errors

Event Timeline

greg created this task. Oct 15 2015, 4:59 PM

greg raised the priority of this task from to Needs Triage.

greg updated the task description. (Show Details)

greg added a project: Release-Engineering-Team.

greg subscribed.

Restricted Application added a subscriber: Aklapper. Oct 15 2015, 4:59 PM

greg mentioned this in T115630: [EPIC] Reduce production log errors to zero*. Oct 15 2015, 5:00 PM

greg added a subscriber: MaxSem. Oct 15 2015, 5:02 PM

greg moved this task from INBOX to Backlog (ARCHIVED) on the Release-Engineering-Team board. Dec 3 2015, 9:07 PM

Restricted Application added a subscriber: StudiesWorld. Dec 3 2015, 9:07 PM

greg triaged this task as Medium priority. Dec 3 2015, 9:07 PM

@greg: The problem is the new branch cut on Tuesday. Since it incorporates a lot of new code, it's difficult to avoid new errors sneaking in.

That's why we need to shoot these errors while they're still on beta.

The errors need to be a lot more visible, honestly. If mediawiki-vagrant and the beta cluster surfaced the errors in a way that's not easily ignored, they would be a lot more likely to be fixed before they hold up a deployment.

I'd like to see something like http://phpdebugbar.com/ enabled by default on vagrant and the beta cluster. Perhaps it could even be offered as a per-user preference on production.

Danny_B added a project: Epic. May 6 2016, 7:47 PM

greg moved this task from Backlog (ARCHIVED) to Epics (ARCHIVED) on the Release-Engineering-Team board. May 31 2016, 3:36 PM

• Phabricator_maintenance edited projects, added Release-Engineering-Team-TODO; removed Release-Engineering-Team. Jun 12 2019, 11:40 PM

• Phabricator_maintenance moved this task from Should be empty (use Release-Engineering-Team) to Epics on the Release-Engineering-Team-TODO board. Jun 12 2019, 11:41 PM

greg added a project: Release-Engineering-Team. Jun 21 2019, 10:35 PM

greg edited projects, added Release-Engineering-Team (Deployment services); removed Release-Engineering-Team. Aug 1 2019, 11:17 PM

thcipriani removed a project: Release-Engineering-Team (Deployment services). Apr 20 2021, 1:10 AM

thcipriani edited projects, added Release-Engineering-Team (thcipriani-workboard-fiddling); removed Release-Engineering-Team-TODO. Apr 20 2021, 3:42 AM

thcipriani moved this task from thcipriani-workboard-fiddling to Seen (ARCHIVE) on the Release-Engineering-Team board. Apr 20 2021, 4:05 AM

thcipriani edited projects, added Release-Engineering-Team; removed Release-Engineering-Team (thcipriani-workboard-fiddling).

thcipriani edited projects, added Release-Engineering-Team (Seen); removed Release-Engineering-Team. Apr 20 2021, 3:23 PM

This task was filed back in 2015, when Release Engineering had plans to improve the overall quality of deployments. After several years of effort, we have collectively improved our logging system (Monolog, ELK), we have logging dashboards that we track closely, and we have a process to triage all those errors (e.g. Wikimedia-production-error).

We now enforce Zero* logs by blocking the train whenever new errors show up in the logs, so I am claiming this goal as a success.
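
For readers unfamiliar with the idea, "blocking on new logs" can be pictured as a set difference between error signatures seen before a deploy and those seen after. The sketch below is only an illustration under that assumption; the function names and the digit-collapsing normalization are hypothetical and do not describe the actual triage process or dashboards.

```python
import re


def normalize(message: str) -> str:
    # Collapse digit runs so messages that differ only by ids or timings
    # count as the same signature.
    return re.sub(r"\d+", "<N>", message).strip()


def new_error_signatures(baseline, candidate):
    """Return signatures present in `candidate` but not in `baseline`."""
    known = {normalize(m) for m in baseline}
    return sorted({normalize(m) for m in candidate} - known)


def train_blocked(baseline, candidate) -> bool:
    # Any signature not seen before the deploy blocks the train.
    return bool(new_error_signatures(baseline, candidate))


before = ["Timeout after 30 ms", "Undefined index 5"]
after = ["Timeout after 45 ms", "Call to undefined method Foo::bar()"]
print(new_error_signatures(before, after))
print(train_blocked(before, after))
```

Comparing signatures rather than raw counts means a single new kind of error blocks the rollout even when overall volume is flat, which is closer to "no new errors" than to "no increase in error volume".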
