Page MenuHomePhabricator
Feed Advanced Search

Jul 5 2019

Marostegui closed T227251: Phabricator release/2019-07-03/1 from wmf/stable creating lag on codfw hosts as Resolved.

Just to clarify, we have lowered the priority because the slaves are no longer lagging.
A few minutes ago the master went back to normal INSERT values - normal meaning before the upgrade.
Resolving this - thanks @mmodell!

Jul 5 2019, 12:50 PM · SRE, Phabricator
Marostegui updated subscribers of T197531: Data model for dbconfig.

@Joe @CDanis is this task still valid?

Jul 5 2019, 5:41 AM · MediaWiki-Configuration, SRE, DBA
Marostegui added a comment to T227251: Phabricator release/2019-07-03/1 from wmf/stable creating lag on codfw hosts.

Now the graphs look better. Unfortunately, puppet will set the config back to 10 taskmasters unless we make a commit to rOPUP Wikimedia Puppet

Jul 5 2019, 5:35 AM · SRE, Phabricator
Marostegui assigned T71222: list=logevents slow for users with last log action long time ago to Anomie.
Jul 5 2019, 5:34 AM · mariadb-optimizer-bug, User-Marostegui, MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), DBA, Performance Issue, MediaWiki-Action-API
Marostegui placed T71222: list=logevents slow for users with last log action long time ago up for grabs.
Jul 5 2019, 5:31 AM · mariadb-optimizer-bug, User-Marostegui, MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), DBA, Performance Issue, MediaWiki-Action-API
Marostegui closed T71222: list=logevents slow for users with last log action long time ago as Resolved.
Jul 5 2019, 5:29 AM · mariadb-optimizer-bug, User-Marostegui, MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), DBA, Performance Issue, MediaWiki-Action-API
Marostegui added a comment to T226952: Failover m2 master db1065 to db1132.

Note: db2044 needs upgrading

Jul 5 2019, 5:21 AM · SRE-tools, Znuny, Recommendation-API, SRE, DBA
Marostegui updated the task description for T217396: Decommission db1061-db1073.
Jul 5 2019, 5:11 AM · SRE, DBA
Marostegui added a comment to T227166: decommission db1069.

This host is ready for DC-Ops to decommission

Jul 5 2019, 5:10 AM · SRE, ops-eqiad, DC-Ops, decommission-hardware
Marostegui reassigned T227166: decommission db1069 from Marostegui to RobH.
Jul 5 2019, 5:10 AM · SRE, ops-eqiad, DC-Ops, decommission-hardware
Marostegui updated the task description for T227166: decommission db1069.
Jul 5 2019, 5:06 AM · SRE, ops-eqiad, DC-Ops, decommission-hardware
Marostegui updated the task description for T227166: decommission db1069.
Jul 5 2019, 5:02 AM · SRE, ops-eqiad, DC-Ops, decommission-hardware

Jul 4 2019

Marostegui updated the task description for T222978: Compress and defragment tables on labsdb hosts.
Jul 4 2019, 2:39 PM · Data-Services, DBA
Marostegui added a comment to T227251: Phabricator release/2019-07-03/1 from wmf/stable creating lag on codfw hosts.

From what I can see now, the UPDATEs have stopped, but the INSERT rate is still at the same level on the master: https://grafana.wikimedia.org/d/000000273/mysql?panelId=2&fullscreen&orgId=1&var-dc=eqiad%20prometheus%2Fops&var-server=db1072&var-port=9104&from=now-24h&to=now

Jul 4 2019, 2:38 PM · SRE, Phabricator
Marostegui added a comment to T227251: Phabricator release/2019-07-03/1 from wmf/stable creating lag on codfw hosts.

I have restored the defaults after db2065 caught up

Jul 4 2019, 12:43 PM · SRE, Phabricator
Marostegui added a comment to T227251: Phabricator release/2019-07-03/1 from wmf/stable creating lag on codfw hosts.

Mentioned in SAL (#wikimedia-operations) [2019-07-04T10:47:29Z] <marostegui> Ease replication consistency option on db2065 to allow it to catch a bit - T227251

Jul 4 2019, 10:48 AM · SRE, Phabricator
Marostegui added a comment to T227251: Phabricator release/2019-07-03/1 from wmf/stable creating lag on codfw hosts.

@mmodell there has not been any significant change to the amount of INSERTs the master is getting
https://grafana.wikimedia.org/d/000000273/mysql?panelId=2&fullscreen&orgId=1&var-dc=eqiad%20prometheus%2Fops&var-server=db1072&var-port=9104&from=1562198027198&to=1562237067984

Jul 4 2019, 10:45 AM · SRE, Phabricator
Marostegui added a subtask for T186188: Failover DB masters in row D: T227062: Failover s8 (wikidatawiki) db primary master db1071 to db1104 (read-only required).
Jul 4 2019, 10:40 AM · DBA
Marostegui added a parent task for T227062: Failover s8 (wikidatawiki) db primary master db1071 to db1104 (read-only required): T186188: Failover DB masters in row D.
Jul 4 2019, 10:40 AM · DBA
Marostegui added a comment to T226952: Failover m2 master db1065 to db1132.

The etherpad is ready with the procedure and ready for a review.
The patch is also ready for review: https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/519975/

Jul 4 2019, 9:20 AM · SRE-tools, Znuny, Recommendation-API, SRE, DBA
Marostegui triaged T227063: Database primary master failover on s8 (wikidatawiki) as Medium priority.

Thank you!

Jul 4 2019, 9:19 AM · User-notice-archive, User-Johan, MoveComms-Support (Jul-Sep-2019), Wikidata
Marostegui updated the task description for T227166: decommission db1069.
Jul 4 2019, 7:50 AM · SRE, ops-eqiad, DC-Ops, decommission-hardware
Marostegui updated the task description for T208323: Predictive failures on disk S.M.A.R.T. status.
Jul 4 2019, 5:02 AM · SRE, DBA
Marostegui created T227251: Phabricator release/2019-07-03/1 from wmf/stable creating lag on codfw hosts.
Jul 4 2019, 5:00 AM · SRE, Phabricator
Marostegui updated the task description for T222978: Compress and defragment tables on labsdb hosts.
Jul 4 2019, 4:51 AM · Data-Services, DBA
Marostegui added a comment to T222978: Compress and defragment tables on labsdb hosts.

labsdb1011 is fully done:

root@labsdb1011:~# df -hT /srv
Filesystem            Type  Size  Used Avail Use% Mounted on
/dev/mapper/tank-data xfs    12T  6.2T  5.5T  53% /srv
Jul 4 2019, 4:51 AM · Data-Services, DBA
Marostegui closed T227107: Degraded RAID on db2049 as Resolved.

All good now - thanks!

root@db2049:~# hpssacli controller all show config
Jul 4 2019, 4:44 AM · DBA, SRE, ops-codfw

Jul 3 2019

Marostegui closed T226826: Drop old oathauth_users columns as Resolved.
Jul 3 2019, 1:06 PM · MediaWiki-extensions-OATHAuth, DBA, Schema-change
Marostegui added a comment to T226826: Drop old oathauth_users columns.

All done

Jul 3 2019, 1:06 PM · MediaWiki-extensions-OATHAuth, DBA, Schema-change
Marostegui added a comment to T226826: Drop old oathauth_users columns.

centralauth progress

Jul 3 2019, 12:58 PM · MediaWiki-extensions-OATHAuth, DBA, Schema-change
Marostegui closed T226358: Failover x1 master: db1069 to db1120 3rd July at 06:00 UTC as Resolved.
Jul 3 2019, 12:51 PM · Wikidata, User-notice-archive, Product-Infrastructure-Team-Backlog-Deprecated, WikimediaEditorTasks, Reading List Service, ContentTranslation, MediaWiki-extensions-BounceHandler, StructuredDiscussions, MediaWiki-extensions-UrlShortener, Cognate, Language-Team, Growth-Team, SRE, DBA
Marostegui closed T226358: Failover x1 master: db1069 to db1120 3rd July at 06:00 UTC, a subtask of T217396: Decommission db1061-db1073, as Resolved.
Jul 3 2019, 12:51 PM · SRE, DBA
Marostegui closed T226358: Failover x1 master: db1069 to db1120 3rd July at 06:00 UTC, a subtask of T220170: Address Database hardware infrastructure blockers on datacenter switchover & multi-dc deployment, as Resolved.
Jul 3 2019, 12:51 PM · Goal, DBA
Marostegui updated the task description for T226826: Drop old oathauth_users columns.
Jul 3 2019, 10:07 AM · MediaWiki-extensions-OATHAuth, DBA, Schema-change
Marostegui added a parent task for T226952: Failover m2 master db1065 to db1132: T220170: Address Database hardware infrastructure blockers on datacenter switchover & multi-dc deployment.
Jul 3 2019, 8:47 AM · SRE-tools, Znuny, Recommendation-API, SRE, DBA
Marostegui added subtasks for T220170: Address Database hardware infrastructure blockers on datacenter switchover & multi-dc deployment: T227062: Failover s8 (wikidatawiki) db primary master db1071 to db1104 (read-only required), T226952: Failover m2 master db1065 to db1132.
Jul 3 2019, 8:47 AM · Goal, DBA
Marostegui added a parent task for T227062: Failover s8 (wikidatawiki) db primary master db1071 to db1104 (read-only required): T220170: Address Database hardware infrastructure blockers on datacenter switchover & multi-dc deployment.
Jul 3 2019, 8:47 AM · DBA
Marostegui added a comment to T227113: rack/setup/install db21[21-30].codfw.wmnet.

@RobH @Papaul I have merged: https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/520379/
The only changes pending from your side to be able to install these hosts once they arrive would be:

Jul 3 2019, 8:34 AM · Goal, SRE, DBA, ops-codfw
Marostegui added a comment to T227062: Failover s8 (wikidatawiki) db primary master db1071 to db1104 (read-only required).

All codfw is now running 10.1.39 (which is the version the new master will run) - will keep upgrading eqiad now.

Jul 3 2019, 8:15 AM · DBA
Marostegui moved T226952: Failover m2 master db1065 to db1132 from Pending comment to In progress on the DBA board.
Jul 3 2019, 7:20 AM · SRE-tools, Znuny, Recommendation-API, SRE, DBA
Marostegui added a project to T227113: rack/setup/install db21[21-30].codfw.wmnet: Goal.
Jul 3 2019, 7:14 AM · Goal, SRE, DBA, ops-codfw
Marostegui updated the task description for T226826: Drop old oathauth_users columns.
Jul 3 2019, 7:11 AM · MediaWiki-extensions-OATHAuth, DBA, Schema-change
Marostegui updated the task description for T226826: Drop old oathauth_users columns.
Jul 3 2019, 7:07 AM · MediaWiki-extensions-OATHAuth, DBA, Schema-change
Marostegui updated the task description for T227166: decommission db1069.
Jul 3 2019, 6:36 AM · SRE, ops-eqiad, DC-Ops, decommission-hardware
Marostegui updated the task description for T227166: decommission db1069.
Jul 3 2019, 6:33 AM · SRE, ops-eqiad, DC-Ops, decommission-hardware
Marostegui updated the task description for T217396: Decommission db1061-db1073.
Jul 3 2019, 6:29 AM · SRE, DBA
Marostegui added a parent task for T227166: decommission db1069: T217396: Decommission db1061-db1073.
Jul 3 2019, 6:28 AM · SRE, ops-eqiad, DC-Ops, decommission-hardware
Marostegui added a subtask for T217396: Decommission db1061-db1073: T227166: decommission db1069.
Jul 3 2019, 6:28 AM · SRE, DBA
Marostegui created T227166: decommission db1069.
Jul 3 2019, 6:28 AM · SRE, ops-eqiad, DC-Ops, decommission-hardware
Marostegui updated the task description for T208323: Predictive failures on disk S.M.A.R.T. status.
Jul 3 2019, 6:25 AM · SRE, DBA
Marostegui moved T227113: rack/setup/install db21[21-30].codfw.wmnet from Triage to Blocked external/Not db team on the DBA board.
Jul 3 2019, 6:25 AM · Goal, SRE, DBA, ops-codfw
Marostegui updated the task description for T227113: rack/setup/install db21[21-30].codfw.wmnet.
Jul 3 2019, 6:21 AM · Goal, SRE, DBA, ops-codfw
Marostegui added a comment to T226358: Failover x1 master: db1069 to db1120 3rd July at 06:00 UTC.

@Ladsgroup do you want to run the script?

Jul 3 2019, 6:10 AM · Wikidata, User-notice-archive, Product-Infrastructure-Team-Backlog-Deprecated, WikimediaEditorTasks, Reading List Service, ContentTranslation, MediaWiki-extensions-BounceHandler, StructuredDiscussions, MediaWiki-extensions-UrlShortener, Cognate, Language-Team, Growth-Team, SRE, DBA
Marostegui added a comment to T226358: Failover x1 master: db1069 to db1120 3rd July at 06:00 UTC.

This was done.
Read only start: 06:00:36 UTC
Read only stop: 06:01:56 UTC
Total read only time: 01:20 min

Jul 3 2019, 6:09 AM · Wikidata, User-notice-archive, Product-Infrastructure-Team-Backlog-Deprecated, WikimediaEditorTasks, Reading List Service, ContentTranslation, MediaWiki-extensions-BounceHandler, StructuredDiscussions, MediaWiki-extensions-UrlShortener, Cognate, Language-Team, Growth-Team, SRE, DBA
Marostegui triaged T227107: Degraded RAID on db2049 as Medium priority.
Jul 3 2019, 4:54 AM · DBA, SRE, ops-codfw
Marostegui assigned T227107: Degraded RAID on db2049 to Papaul.

Let's replace the disk please!
Thanks

Jul 3 2019, 4:54 AM · DBA, SRE, ops-codfw

Jul 2 2019

Marostegui updated the task description for T227062: Failover s8 (wikidatawiki) db primary master db1071 to db1104 (read-only required).
Jul 2 2019, 8:24 AM · DBA
Marostegui added a subtask for T217396: Decommission db1061-db1073: T227062: Failover s8 (wikidatawiki) db primary master db1071 to db1104 (read-only required).
Jul 2 2019, 8:20 AM · SRE, DBA
Marostegui added a parent task for T227062: Failover s8 (wikidatawiki) db primary master db1071 to db1104 (read-only required): T217396: Decommission db1061-db1073.
Jul 2 2019, 8:20 AM · DBA
Marostegui added a parent task for T227062: Failover s8 (wikidatawiki) db primary master db1071 to db1104 (read-only required): T221764: Overview of wb_terms redesign.
Jul 2 2019, 8:18 AM · DBA
Marostegui added a subtask for T221764: Overview of wb_terms redesign: T227062: Failover s8 (wikidatawiki) db primary master db1071 to db1104 (read-only required).
Jul 2 2019, 8:18 AM · User-Addshore, Wikidata, wb_terms - Tool Builders Migration
Marostegui created T227063: Database primary master failover on s8 (wikidatawiki).
Jul 2 2019, 8:17 AM · User-notice-archive, User-Johan, MoveComms-Support (Jul-Sep-2019), Wikidata
Marostegui triaged T227062: Failover s8 (wikidatawiki) db primary master db1071 to db1104 (read-only required) as Medium priority.
Jul 2 2019, 8:14 AM · DBA
Marostegui created T227062: Failover s8 (wikidatawiki) db primary master db1071 to db1104 (read-only required).
Jul 2 2019, 8:14 AM · DBA
Marostegui closed T227030: hi.wikisource added to labs replicas? as Declined.
Jul 2 2019, 5:28 AM · DBA, Analytics
Marostegui added a project to T219374: Prepare and check storage layer for hi.wikisource: Analytics.

Adding Analytics as they are interested in knowing when this wiki finally gets created so they can sqoop data from it T227030: hi.wikisource added to labs replicas?

Jul 2 2019, 5:27 AM · Platform Team Workboards (Clinic Duty Team), cloud-services-team, Analytics, Data-Services, DBA
Marostegui added a comment to T227030: hi.wikisource added to labs replicas? .

As @Reedy points out, hi.wiksource isn't created yet, not even its database T219374: Prepare and check storage layer for hi.wikisource.
As the wiki is marked as a public wiki, the process is as follows:

Jul 2 2019, 5:26 AM · DBA, Analytics
Marostegui added a comment to T226050: Wiki Replicas are very slow and timing out.

How long do you expect labsdb1011 to be depooled for. Is this going to be a regular thing?

Jul 2 2019, 5:21 AM · Data-Services

Jul 1 2019

Marostegui added a comment to T226826: Drop old oathauth_users columns.

Cool, I will give it some more 24h - so far nothing on logstash for db1094

Jul 1 2019, 3:28 PM · MediaWiki-extensions-OATHAuth, DBA, Schema-change
Marostegui added a comment to T226704: Setup es4 and es5 replica sets for new read-write external store service.

Should we define new HW specs for these hosts?
Currently es1015 specs:
128GB RAM
12x1.819 TB SATA HDD

Jul 1 2019, 3:01 PM · Goal, Epic, DBA
Marostegui added a comment to T169440: Pending global renames in need of sysadmin supervision (tracking).

Well, that's the question. Do we need to keep asking on Phabricator before renaming users with big edit counts locally and/or globally? I don't think any of us wants to break the sites doing a heavy rename (DBA tag or not).

Jul 1 2019, 1:42 PM · MediaWiki-extensions-CentralAuth, GlobalRename, Tracking-Neverending
Marostegui added a comment to T169440: Pending global renames in need of sysadmin supervision (tracking).

Renames probably don't need DBA monitoring anymore - since we replaced our old hardware we haven't seen any replication delay showing up.

Thanks @Marostegui - Could you please discuss this with DBA/Ops/etc. and get back to us with a definitive answer on this issue? (that is: if renames for global accounts with more than X edits locally or globally needs Phabricator ticket). Thank you!

Jul 1 2019, 11:00 AM · MediaWiki-extensions-CentralAuth, GlobalRename, Tracking-Neverending
Marostegui added a comment to T169440: Pending global renames in need of sysadmin supervision (tracking).

Renames probably don't need DBA monitoring anymore - since we replaced our old hardware we haven't seen any replication delay showing up.

Jul 1 2019, 9:58 AM · MediaWiki-extensions-CentralAuth, GlobalRename, Tracking-Neverending
Marostegui removed a project from T224916: Global rename of B dash → A1Cafel: supervision needed: DBA.

This doesn't really need a DBA there is no lag replication lag showing up since we replaced all the old hardware

Jul 1 2019, 9:57 AM · Wikimedia-Site-requests
Marostegui edited projects for T224348: Global rename of Fiona B. → Fiona*: supervision needed, added: SRE; removed DBA.

This doesn't really need a DBA there is no lag replication lag showing up since we replaced all the old hardware

Jul 1 2019, 9:56 AM · SRE, Wikimedia-Site-requests
Marostegui edited projects for T225370: Global rename of Waldir → Waldyrious: supervision needed, added: SRE; removed DBA.

This doesn't really need a DBA there is no lag replication lag showing up since we replaced all the old hardware

Jul 1 2019, 9:56 AM · SRE, Wikimedia-Site-requests
Marostegui added a comment to T226952: Failover m2 master db1065 to db1132.

Note: db2044 needs upgrading

Jul 1 2019, 9:09 AM · SRE-tools, Znuny, Recommendation-API, SRE, DBA
Marostegui moved T85000: Investigate query planning in MariaDB 10 from Triage to Blocked external/Not db team on the DBA board.
Jul 1 2019, 8:53 AM · Platform Team Workboards (Clinic Duty Team), DBA, MediaWiki-General
Marostegui changed the status of T226851: Drop abuse_filter_log.afl_log_id in production from Open to Stalled.

Thanks, I am going to stall this until then.

Jul 1 2019, 8:40 AM · AbuseFilter, DBA
Marostegui updated the task description for T226952: Failover m2 master db1065 to db1132.
Jul 1 2019, 8:35 AM · SRE-tools, Znuny, Recommendation-API, SRE, DBA
Marostegui added a comment to T226952: Failover m2 master db1065 to db1132.

For debmonitor it connects to m2-master.eqiad.wmnet and I'm not sure if Django's connection pooling would be smart enough to reconnect given that the old one will still work, just RO. It might need a:

sudo cumin 'A:debmonitor' 'systemctl restart uwsgi-debmonitor.service'

just after the switch.

Jul 1 2019, 8:35 AM · SRE-tools, Znuny, Recommendation-API, SRE, DBA
Marostegui added a comment to T226952: Failover m2 master db1065 to db1132.

Let's leave it aside for now :-)

Jul 1 2019, 8:20 AM · SRE-tools, Znuny, Recommendation-API, SRE, DBA
Marostegui updated the task description for T226952: Failover m2 master db1065 to db1132.
Jul 1 2019, 7:54 AM · SRE-tools, Znuny, Recommendation-API, SRE, DBA
Marostegui added a comment to T226952: Failover m2 master db1065 to db1132.

Because of the TTL mention, are you planning a failover of proxy at the same time?

Jul 1 2019, 7:53 AM · SRE-tools, Znuny, Recommendation-API, SRE, DBA
Marostegui updated subscribers of T226952: Failover m2 master db1065 to db1132.
Jul 1 2019, 7:23 AM · SRE-tools, Znuny, Recommendation-API, SRE, DBA
Marostegui added a comment to T226826: Drop old oathauth_users columns.

Also changed on an eqiad host (so we can check if there is something reading from those):

Jul 1 2019, 7:05 AM · MediaWiki-extensions-OATHAuth, DBA, Schema-change
Marostegui triaged T226952: Failover m2 master db1065 to db1132 as Medium priority.
Jul 1 2019, 6:16 AM · SRE-tools, Znuny, Recommendation-API, SRE, DBA
Marostegui created T226952: Failover m2 master db1065 to db1132.
Jul 1 2019, 6:16 AM · SRE-tools, Znuny, Recommendation-API, SRE, DBA
Marostegui triaged T226950: Move more wikis from s3 to s5 as Medium priority.
Jul 1 2019, 5:35 AM · DBA
Marostegui created T226950: Move more wikis from s3 to s5.
Jul 1 2019, 5:35 AM · DBA
Marostegui updated the task description for T226689: decommission db1068.
Jul 1 2019, 5:25 AM · DC-Ops, ops-eqiad, decommission-hardware, SRE
Marostegui claimed T226826: Drop old oathauth_users columns.
Jul 1 2019, 5:19 AM · MediaWiki-extensions-OATHAuth, DBA, Schema-change
Marostegui triaged T226826: Drop old oathauth_users columns as Medium priority.
Jul 1 2019, 5:18 AM · MediaWiki-extensions-OATHAuth, DBA, Schema-change
Marostegui added a comment to T226826: Drop old oathauth_users columns.

I have altered this table on db2054 on centralauth and will leave the columns renamed for a few days to make sure nothing uses it.

root@db2054.codfw.wmnet[centralauth]> alter table oathauth_users change secret TO_DROP_secret varbinary(255) DEFAULT NULL, change scratch_tokens TO_DROP_scratch_tokens varbinary(511) DEFAULT NULL;
Query OK, 0 rows affected (0.09 sec)
Records: 0  Duplicates: 0  Warnings: 0
Jul 1 2019, 5:17 AM · MediaWiki-extensions-OATHAuth, DBA, Schema-change
Marostegui added a comment to T85000: Investigate query planning in MariaDB 10.

Not sure what's the actionable here for the DBAs.
This looks like another case of the optimizer not doing what expected. I have tested the original two queries on the new 10.3 and they also filesort. (and considering this is a 5 years old ticket and nothing has changed from 5.5 to 10.3.... I guess we cannot have much hopes on MariaDB's optimizer doing the right thing).
We can always report it as a bug, but I don't think it will make much difference.
Thoughts?

Jul 1 2019, 5:03 AM · Platform Team Workboards (Clinic Duty Team), DBA, MediaWiki-General
Marostegui triaged T226851: Drop abuse_filter_log.afl_log_id in production as Medium priority.
Jul 1 2019, 4:59 AM · AbuseFilter, DBA
Marostegui moved T226851: Drop abuse_filter_log.afl_log_id in production from Triage to Pending comment on the DBA board.

@Daimona as per https://tools.wmflabs.org/versions/ we are on .11 everywhere, this is safe to proceed then?

Jul 1 2019, 4:58 AM · AbuseFilter, DBA
Marostegui added a comment to T224348: Global rename of Fiona B. → Fiona*: supervision needed.

@Anomie might be able to give us more information of the timeline for the deployment.

Jul 1 2019, 4:43 AM · SRE, Wikimedia-Site-requests
Marostegui updated the task description for T208323: Predictive failures on disk S.M.A.R.T. status.
Jul 1 2019, 4:40 AM · SRE, DBA
Marostegui added a comment to T226050: Wiki Replicas are very slow and timing out.

Yes, the host that was out for maintenance, labsdb1011 was repooled. However, we still need to continue with the maintenance for T222978: Compress and defragment tables on labsdb hosts so I am going to depool labsdb1011 again. I know this is unfortunate, but there is nothing else we can do to reduce disk space, and we have to do that no matter what, or else, the replicas will get fully filled.

Jul 1 2019, 4:39 AM · Data-Services