Page MenuHomePhabricator

Eevans (Eric Evans)
Senior Software Engineer

Projects (13)

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Tuesday

  • Clear sailing ahead.

User Details

User Since
Feb 27 2015, 10:47 PM (265 w, 1 d)
Availability
Available
IRC Nick
urandom
LDAP User
Eevans
MediaWiki User
Unknown

Recent Activity

Thu, Mar 26

Eevans updated the task description for T248543: Evaluate Envoy proxy for API gateway (and rate-limiter).
Thu, Mar 26, 6:38 PM · CPT Initiatives (API Gateway)
Eevans updated subscribers of T248543: Evaluate Envoy proxy for API gateway (and rate-limiter).
Thu, Mar 26, 12:56 AM · CPT Initiatives (API Gateway)
Eevans added a subtask for T235270: Wikimedia API Gateway: T248543: Evaluate Envoy proxy for API gateway (and rate-limiter).
Thu, Mar 26, 12:54 AM · Core Platform Team Workboards (Initiatives), CPT Initiatives (API Gateway)
Eevans added a parent task for T248543: Evaluate Envoy proxy for API gateway (and rate-limiter): T235270: Wikimedia API Gateway.
Thu, Mar 26, 12:54 AM · CPT Initiatives (API Gateway)
Eevans triaged T248543: Evaluate Envoy proxy for API gateway (and rate-limiter) as Medium priority.
Thu, Mar 26, 12:53 AM · CPT Initiatives (API Gateway)
Eevans created T248543: Evaluate Envoy proxy for API gateway (and rate-limiter).
Thu, Mar 26, 12:51 AM · CPT Initiatives (API Gateway)

Wed, Mar 25

Eevans added a comment to T248018: Drop Cassandra keyspaces for /page/references.

Ok, these keyspaces have been removed from production, dev, and deployment-prep. Out of an abundance of caution, I will leave this open until Friday, and close after cleaning up the snapshots.

Wed, Mar 25, 9:52 PM · Core Platform Team Workboards (Clinic Duty Team), Page Content Service, Product-Infrastructure-Team-Backlog
Eevans added a comment to T248018: Drop Cassandra keyspaces for /page/references.

LGTM. Beta cluster has one as well

Wed, Mar 25, 9:48 PM · Core Platform Team Workboards (Clinic Duty Team), Page Content Service, Product-Infrastructure-Team-Backlog
Eevans triaged T248018: Drop Cassandra keyspaces for /page/references as Medium priority.
Wed, Mar 25, 9:33 PM · Core Platform Team Workboards (Clinic Duty Team), Page Content Service, Product-Infrastructure-Team-Backlog
Eevans added a comment to T248018: Drop Cassandra keyspaces for /page/references.

This has been deployed, so we can drop those key spaces!

Wed, Mar 25, 9:33 PM · Core Platform Team Workboards (Clinic Duty Team), Page Content Service, Product-Infrastructure-Team-Backlog

Fri, Mar 13

Eevans added a comment to T239856: Fold services recommendations into Standards for services RfC.

Is there anything left to do/review here?

Fri, Mar 13, 8:12 PM · Core Platform Team Workboards (Clinic Duty Team)

Fri, Mar 6

Eevans added a comment to T243544: Cassandra PHP language driver packaging (Debian).

I've overhauled things and moved stuff to a more Debian-compliant layout here: https://github.com/nosmo/cpp-driver/tree/debian/debian
Still not sure if this is up to snuff though, needs some testing.

Fri, Mar 6, 1:44 PM · Core Platform Team Workboards (Initiatives), User-Eevans

Tue, Mar 3

Eevans placed T137419: Investigate aberrant disk read throughput in Cassandra (affects 2.2.x and 3.x) up for grabs.

This is actually something we should be following up on with upstream more aggressively.

@Eevans: Hi, do you plan to do this? Asking as you you have been task assignee for a while now.

Tue, Mar 3, 3:51 PM · Core Platform Team (Icebox), User-Eevans, Services (later), Cassandra
Eevans updated subscribers of T243000: Bootstrap new Cassandra instances: restbase202[123]-{a,b,c}.

Yeah, it's ready to be closed, but AFAIK, we're supposed to wait for the PM (@CCicalese_WMF) to close it after moving it to Done on the workboard.

Tue, Mar 3, 3:47 PM · Core Platform Team Workboards (Clinic Duty Team), ops-codfw, Operations
Eevans placed T92471: enable authenticated access to Cassandra JMX up for grabs.
Tue, Mar 3, 3:43 PM · Core Platform Team (Icebox), User-Eevans, Cassandra, Operations, Patch-For-Review

Feb 27 2020

Eevans created T246379: Research rate limiter implementations and rate limiter-capable HTTP reverse proxies.
Feb 27 2020, 8:59 PM · CPT Initiatives (API Gateway), Core Platform Team Workboards (Green)

Feb 25 2020

zeljkofilipin awarded T243123: Login to at least en.wikipedia.beta.wmflabs.org and commons.wikimedia.beta.wmflabs.org sometimes fails with `There seems to be a problem with your login session` a Party Time token.
Feb 25 2020, 10:49 AM · Core Platform Team Workboards (Clinic Duty Team), MediaWiki-extensions-CentralAuth, Beta-Cluster-Infrastructure, MediaWiki-User-login-and-signup, MediaWiki-Core-Testing, User-zeljkofilipin, Quality-and-Test-Engineering-Team (QTE)

Feb 24 2020

Cparle awarded T243123: Login to at least en.wikipedia.beta.wmflabs.org and commons.wikimedia.beta.wmflabs.org sometimes fails with `There seems to be a problem with your login session` a Like token.
Feb 24 2020, 4:30 PM · Core Platform Team Workboards (Clinic Duty Team), MediaWiki-extensions-CentralAuth, Beta-Cluster-Infrastructure, MediaWiki-User-login-and-signup, MediaWiki-Core-Testing, User-zeljkofilipin, Quality-and-Test-Engineering-Team (QTE)
Eevans added a comment to T243123: Login to at least en.wikipedia.beta.wmflabs.org and commons.wikimedia.beta.wmflabs.org sometimes fails with `There seems to be a problem with your login session`.

I believe this is pending https://gerrit.wikimedia.org/r/c/operations/mediawiki-config/+/574034 (T224712).

Feb 24 2020, 4:13 PM · Core Platform Team Workboards (Clinic Duty Team), MediaWiki-extensions-CentralAuth, Beta-Cluster-Infrastructure, MediaWiki-User-login-and-signup, MediaWiki-Core-Testing, User-zeljkofilipin, Quality-and-Test-Engineering-Team (QTE)

Feb 21 2020

Eevans reassigned T245875: Parsoid REST endpoint not working on en.wikipedia.beta.wmflabs.org from Eevans to Pchelolo.
Feb 21 2020, 10:02 PM · Core Platform Team Workboards (Clinic Duty Team)
Eevans closed T245875: Parsoid REST endpoint not working on en.wikipedia.beta.wmflabs.org as Resolved.
Feb 21 2020, 10:02 PM · Core Platform Team Workboards (Clinic Duty Team)

Feb 19 2020

Kaartic awarded T243123: Login to at least en.wikipedia.beta.wmflabs.org and commons.wikimedia.beta.wmflabs.org sometimes fails with `There seems to be a problem with your login session` a Heartbreak token.
Feb 19 2020, 5:51 PM · Core Platform Team Workboards (Clinic Duty Team), MediaWiki-extensions-CentralAuth, Beta-Cluster-Infrastructure, MediaWiki-User-login-and-signup, MediaWiki-Core-Testing, User-zeljkofilipin, Quality-and-Test-Engineering-Team (QTE)
Eevans moved T243123: Login to at least en.wikipedia.beta.wmflabs.org and commons.wikimedia.beta.wmflabs.org sometimes fails with `There seems to be a problem with your login session` from Done to Inbox on the Core Platform Team Workboards (Clinic Duty Team) board.
Feb 19 2020, 1:59 PM · Core Platform Team Workboards (Clinic Duty Team), MediaWiki-extensions-CentralAuth, Beta-Cluster-Infrastructure, MediaWiki-User-login-and-signup, MediaWiki-Core-Testing, User-zeljkofilipin, Quality-and-Test-Engineering-Team (QTE)

Feb 18 2020

Eevans added a comment to T243123: Login to at least en.wikipedia.beta.wmflabs.org and commons.wikimedia.beta.wmflabs.org sometimes fails with `There seems to be a problem with your login session`.

I don't think this is sessionstore, (at least, it's not the timeout issue with Cassandra that we saw before).

Feb 18 2020, 2:37 PM · Core Platform Team Workboards (Clinic Duty Team), MediaWiki-extensions-CentralAuth, Beta-Cluster-Infrastructure, MediaWiki-User-login-and-signup, MediaWiki-Core-Testing, User-zeljkofilipin, Quality-and-Test-Engineering-Team (QTE)

Feb 13 2020

WMDE-Fisch awarded T243123: Login to at least en.wikipedia.beta.wmflabs.org and commons.wikimedia.beta.wmflabs.org sometimes fails with `There seems to be a problem with your login session` a Heartbreak token.
Feb 13 2020, 9:27 AM · Core Platform Team Workboards (Clinic Duty Team), MediaWiki-extensions-CentralAuth, Beta-Cluster-Infrastructure, MediaWiki-User-login-and-signup, MediaWiki-Core-Testing, User-zeljkofilipin, Quality-and-Test-Engineering-Team (QTE)
awight awarded T243123: Login to at least en.wikipedia.beta.wmflabs.org and commons.wikimedia.beta.wmflabs.org sometimes fails with `There seems to be a problem with your login session` a Heartbreak token.
Feb 13 2020, 9:27 AM · Core Platform Team Workboards (Clinic Duty Team), MediaWiki-extensions-CentralAuth, Beta-Cluster-Infrastructure, MediaWiki-User-login-and-signup, MediaWiki-Core-Testing, User-zeljkofilipin, Quality-and-Test-Engineering-Team (QTE)

Feb 10 2020

Eevans updated the task description for T243106: Phased rollout of sessionstore to production fleet.
Feb 10 2020, 7:41 PM · CPT Initiatives (Session Management Service (CDP2)), serviceops-radar, Patch-For-Review, TPG-Epics (Team Practices Group Coaching Clinic), User-Clarakosi, User-Eevans
Eevans awarded T244508: Request for +2 access to mediawiki-config a Party Time token.
Feb 10 2020, 7:37 PM · Release-Engineering-Team, Operations, SRE-Access-Requests, Gerrit-Privilege-Requests
Eevans added a comment to T244508: Request for +2 access to mediawiki-config.

@Eevans You should have +2 on the mw-config repo now. Probably after logging out and back in.

Feb 10 2020, 7:37 PM · Release-Engineering-Team, Operations, SRE-Access-Requests, Gerrit-Privilege-Requests

Feb 6 2020

Eevans created T244508: Request for +2 access to mediawiki-config.
Feb 6 2020, 5:51 PM · Release-Engineering-Team, Operations, SRE-Access-Requests, Gerrit-Privilege-Requests
Eevans edited P10322 Masterwork From Distant Lands.
Feb 6 2020, 5:41 PM

Feb 5 2020

Eevans updated the task description for T243106: Phased rollout of sessionstore to production fleet.
Feb 5 2020, 4:45 PM · CPT Initiatives (Session Management Service (CDP2)), serviceops-radar, Patch-For-Review, TPG-Epics (Team Practices Group Coaching Clinic), User-Clarakosi, User-Eevans

Feb 4 2020

Eevans updated the task description for T243106: Phased rollout of sessionstore to production fleet.
Feb 4 2020, 7:54 PM · CPT Initiatives (Session Management Service (CDP2)), serviceops-radar, Patch-For-Review, TPG-Epics (Team Practices Group Coaching Clinic), User-Clarakosi, User-Eevans
Eevans moved T243000: Bootstrap new Cassandra instances: restbase202[123]-{a,b,c} from Doing to Done on the Core Platform Team Workboards (Clinic Duty Team) board.

This is complete.

Feb 4 2020, 4:45 PM · Core Platform Team Workboards (Clinic Duty Team), ops-codfw, Operations
Eevans updated the task description for T243000: Bootstrap new Cassandra instances: restbase202[123]-{a,b,c}.
Feb 4 2020, 4:45 PM · Core Platform Team Workboards (Clinic Duty Team), ops-codfw, Operations
Eevans added a comment to T242461: restrouter.svc.{eqiad,codfw}.wmnet in a failed state.

I believe we have consensus around de-deploying restrouter from k8s, @WDoranWMF can you confirm?

Feb 4 2020, 12:26 AM · Patch-For-Review, serviceops, Core Platform Team Workboards (Clinic Duty Team)
Eevans updated the task description for T243106: Phased rollout of sessionstore to production fleet.
Feb 4 2020, 12:12 AM · CPT Initiatives (Session Management Service (CDP2)), serviceops-radar, Patch-For-Review, TPG-Epics (Team Practices Group Coaching Clinic), User-Clarakosi, User-Eevans
Eevans updated the task description for T243106: Phased rollout of sessionstore to production fleet.
Feb 4 2020, 12:11 AM · CPT Initiatives (Session Management Service (CDP2)), serviceops-radar, Patch-For-Review, TPG-Epics (Team Practices Group Coaching Clinic), User-Clarakosi, User-Eevans

Feb 3 2020

Eevans added a comment to T244178: Deploy restbase to restbase202[123].

All these steps should be done after Cassandra is bootstrapped. See T219404 for the ticket where fresh deploy was done the previous time.

Feb 3 2020, 9:59 PM · Patch-For-Review, Core Platform Team Workboards (Clinic Duty Team)
Eevans triaged T244178: Deploy restbase to restbase202[123] as Medium priority.
Feb 3 2020, 9:09 PM · Patch-For-Review, Core Platform Team Workboards (Clinic Duty Team)
Eevans created T244178: Deploy restbase to restbase202[123].
Feb 3 2020, 9:09 PM · Patch-For-Review, Core Platform Team Workboards (Clinic Duty Team)
Eevans committed rDEPLOYCHARTS6751955304bc: Upgrade sessionstore production to Kask v1.0.6 (authored by Eevans).
Upgrade sessionstore production to Kask v1.0.6
Feb 3 2020, 5:34 PM
Eevans committed rDEPLOYCHARTS121eba82cd0c: Upgrade staging to Kask v1.0.6 (authored by Eevans).
Upgrade staging to Kask v1.0.6
Feb 3 2020, 5:34 PM
Eevans added a comment to T243544: Cassandra PHP language driver packaging (Debian).

Interestingly there is already a packaging script in the repo for the driver, we can either reuse or adapt this - there doesn't seem to be much custom stuff in it https://github.com/datastax/cpp-driver/blob/master/packaging/build_deb.sh

Feb 3 2020, 5:30 PM · Core Platform Team Workboards (Initiatives), User-Eevans

Jan 25 2020

Eevans committed rMSKS662f71ebf9bc: Configurable query and connect timeouts (authored by Eevans).
Configurable query and connect timeouts
Jan 25 2020, 2:07 AM

Jan 24 2020

Eevans closed T243123: Login to at least en.wikipedia.beta.wmflabs.org and commons.wikimedia.beta.wmflabs.org sometimes fails with `There seems to be a problem with your login session` as Resolved.
Jan 24 2020, 10:28 PM · Core Platform Team Workboards (Clinic Duty Team), MediaWiki-extensions-CentralAuth, Beta-Cluster-Infrastructure, MediaWiki-User-login-and-signup, MediaWiki-Core-Testing, User-zeljkofilipin, Quality-and-Test-Engineering-Team (QTE)
Eevans added a comment to T243123: Login to at least en.wikipedia.beta.wmflabs.org and commons.wikimedia.beta.wmflabs.org sometimes fails with `There seems to be a problem with your login session`.

Kask has been updated with higher (default) Cassandra timeouts, and deployment-prep has been updated. I'm going to close this, feel free to re-open if this happens again.

Jan 24 2020, 10:27 PM · Core Platform Team Workboards (Clinic Duty Team), MediaWiki-extensions-CentralAuth, Beta-Cluster-Infrastructure, MediaWiki-User-login-and-signup, MediaWiki-Core-Testing, User-zeljkofilipin, Quality-and-Test-Engineering-Team (QTE)

Jan 23 2020

Eevans triaged T243544: Cassandra PHP language driver packaging (Debian) as Medium priority.
Jan 23 2020, 8:10 PM · Core Platform Team Workboards (Initiatives), User-Eevans
Eevans created T243544: Cassandra PHP language driver packaging (Debian).
Jan 23 2020, 8:09 PM · Core Platform Team Workboards (Initiatives), User-Eevans
Eevans added a comment to T243123: Login to at least en.wikipedia.beta.wmflabs.org and commons.wikimedia.beta.wmflabs.org sometimes fails with `There seems to be a problem with your login session`.

The default timeouts in the Cassandra Go driver, both Timeout and ConnectTimeout are 600ms. This seems quite low, by comparison the Java and NodeJS drivers both use 12s and 5s respectively. I propose we make these values configurable in Kask (with defaults of 12s and 5s).

Jan 23 2020, 1:01 AM · Core Platform Team Workboards (Clinic Duty Team), MediaWiki-extensions-CentralAuth, Beta-Cluster-Infrastructure, MediaWiki-User-login-and-signup, MediaWiki-Core-Testing, User-zeljkofilipin, Quality-and-Test-Engineering-Team (QTE)

Jan 22 2020

Eevans lowered the priority of T243123: Login to at least en.wikipedia.beta.wmflabs.org and commons.wikimedia.beta.wmflabs.org sometimes fails with `There seems to be a problem with your login session` from Unbreak Now! to Medium.

It looks like Cassandra queries from Kask have been intermittently timing out. Both Kask and Cassandra are co-located on the same VM, and it is pretty resource constrained, but AFAIK it has been working OK to this point; We can probably begin with a restart and go from there

Jan 22 2020, 11:23 PM · Core Platform Team Workboards (Clinic Duty Team), MediaWiki-extensions-CentralAuth, Beta-Cluster-Infrastructure, MediaWiki-User-login-and-signup, MediaWiki-Core-Testing, User-zeljkofilipin, Quality-and-Test-Engineering-Team (QTE)

Jan 17 2020

Eevans updated the task description for T243000: Bootstrap new Cassandra instances: restbase202[123]-{a,b,c}.
Jan 17 2020, 11:51 PM · Core Platform Team Workboards (Clinic Duty Team), ops-codfw, Operations
Eevans updated the task description for T243000: Bootstrap new Cassandra instances: restbase202[123]-{a,b,c}.
Jan 17 2020, 10:02 PM · Core Platform Team Workboards (Clinic Duty Team), ops-codfw, Operations
Eevans updated the task description for T243000: Bootstrap new Cassandra instances: restbase202[123]-{a,b,c}.
Jan 17 2020, 9:56 PM · Core Platform Team Workboards (Clinic Duty Team), ops-codfw, Operations
Eevans updated the task description for T243106: Phased rollout of sessionstore to production fleet.
Jan 17 2020, 9:41 PM · CPT Initiatives (Session Management Service (CDP2)), serviceops-radar, Patch-For-Review, TPG-Epics (Team Practices Group Coaching Clinic), User-Clarakosi, User-Eevans
Eevans updated the task description for T243106: Phased rollout of sessionstore to production fleet.
Jan 17 2020, 9:40 PM · CPT Initiatives (Session Management Service (CDP2)), serviceops-radar, Patch-For-Review, TPG-Epics (Team Practices Group Coaching Clinic), User-Clarakosi, User-Eevans
Eevans created T243106: Phased rollout of sessionstore to production fleet.
Jan 17 2020, 9:36 PM · CPT Initiatives (Session Management Service (CDP2)), serviceops-radar, Patch-For-Review, TPG-Epics (Team Practices Group Coaching Clinic), User-Clarakosi, User-Eevans
Eevans updated the task description for T243000: Bootstrap new Cassandra instances: restbase202[123]-{a,b,c}.
Jan 17 2020, 8:07 PM · Core Platform Team Workboards (Clinic Duty Team), ops-codfw, Operations
Eevans updated the task description for T243000: Bootstrap new Cassandra instances: restbase202[123]-{a,b,c}.
Jan 17 2020, 5:13 PM · Core Platform Team Workboards (Clinic Duty Team), ops-codfw, Operations
Eevans updated the task description for T243000: Bootstrap new Cassandra instances: restbase202[123]-{a,b,c}.
Jan 17 2020, 2:32 PM · Core Platform Team Workboards (Clinic Duty Team), ops-codfw, Operations
Eevans updated the task description for T243000: Bootstrap new Cassandra instances: restbase202[123]-{a,b,c}.
Jan 17 2020, 2:45 AM · Core Platform Team Workboards (Clinic Duty Team), ops-codfw, Operations
Eevans updated the task description for T243000: Bootstrap new Cassandra instances: restbase202[123]-{a,b,c}.
Jan 17 2020, 12:30 AM · Core Platform Team Workboards (Clinic Duty Team), ops-codfw, Operations

Jan 16 2020

Eevans updated the task description for T243000: Bootstrap new Cassandra instances: restbase202[123]-{a,b,c}.
Jan 16 2020, 10:21 PM · Core Platform Team Workboards (Clinic Duty Team), ops-codfw, Operations
Eevans updated the task description for T243000: Bootstrap new Cassandra instances: restbase202[123]-{a,b,c}.
Jan 16 2020, 9:30 PM · Core Platform Team Workboards (Clinic Duty Team), ops-codfw, Operations
Eevans added a comment to T234286: Multi-DC Echo Notification Storage.

TTBMK, everything here is done.

Jan 16 2020, 7:21 PM · Growth-Team, Notifications, Core Platform Team, CPT Initiatives (Multi-DC Echo Notification Storage)
Eevans claimed T234296: Completed migration.

Done.

Jan 16 2020, 7:21 PM · Growth-Team, Notifications, Core Platform Team Workboards (User Stories), Story, CPT Initiatives (Multi-DC Echo Notification Storage)
Eevans claimed T234963: Deploy final configuration.

Done.

Jan 16 2020, 7:20 PM · Core Platform Team Workboards (Clinic Duty Team), Notifications, Growth-Team, CPT Initiatives (Multi-DC Echo Notification Storage)
Eevans edited projects for T241784: (Need by: TBD) rack/setup/install restbase1029, restbase1029, restbase1030, added: Core Platform Team Workboards (Clinic Duty Team); removed Core Platform Team.
Jan 16 2020, 6:23 PM · Core Platform Team Workboards (Clinic Duty Team), ops-eqiad, Operations
Eevans edited projects for T241790: (No Need By Date Provided) rack/setup/install restbase202[123], added: Core Platform Team Workboards (Clinic Duty Team); removed Core Platform Team.
Jan 16 2020, 6:23 PM · Core Platform Team Workboards (Clinic Duty Team), ops-codfw, Operations
Eevans triaged T243000: Bootstrap new Cassandra instances: restbase202[123]-{a,b,c} as Medium priority.
Jan 16 2020, 5:56 PM · Core Platform Team Workboards (Clinic Duty Team), ops-codfw, Operations
Eevans moved T243000: Bootstrap new Cassandra instances: restbase202[123]-{a,b,c} from Inbox to Doing on the Core Platform Team Workboards (Clinic Duty Team) board.
Jan 16 2020, 5:56 PM · Core Platform Team Workboards (Clinic Duty Team), ops-codfw, Operations
Eevans edited projects for T243000: Bootstrap new Cassandra instances: restbase202[123]-{a,b,c}, added: Core Platform Team Workboards (Clinic Duty Team); removed Core Platform Team.
Jan 16 2020, 5:55 PM · Core Platform Team Workboards (Clinic Duty Team), ops-codfw, Operations
Eevans created T243000: Bootstrap new Cassandra instances: restbase202[123]-{a,b,c}.
Jan 16 2020, 5:55 PM · Core Platform Team Workboards (Clinic Duty Team), ops-codfw, Operations

Jan 15 2020

Eevans moved T234963: Deploy final configuration from Doing to Waiting for Review on the Core Platform Team Workboards (Clinic Duty Team) board.
Jan 15 2020, 9:14 PM · Core Platform Team Workboards (Clinic Duty Team), Notifications, Growth-Team, CPT Initiatives (Multi-DC Echo Notification Storage)
Eevans moved T234963: Deploy final configuration from Inbox to Doing on the Core Platform Team Workboards (Clinic Duty Team) board.
Jan 15 2020, 9:14 PM · Core Platform Team Workboards (Clinic Duty Team), Notifications, Growth-Team, CPT Initiatives (Multi-DC Echo Notification Storage)
Eevans triaged T234963: Deploy final configuration as Medium priority.
Jan 15 2020, 9:13 PM · Core Platform Team Workboards (Clinic Duty Team), Notifications, Growth-Team, CPT Initiatives (Multi-DC Echo Notification Storage)

Jan 13 2020

Eevans added a comment to T242461: restrouter.svc.{eqiad,codfw}.wmnet in a failed state.

Since (long-term) we aim to replace all of this, is abandoning it entirely an option?

Is it possible to take it out for now until we either prioritize it again or drop it entirely?

You mean undeploy? Sure we can undeploy it. The only caveat being that redeploying it will take some time as we will need to create the necessary resources again (LVS entries, DNS, kubernetes namespaces etc).

We're running CI for RESTBase in both RESTBase and RESTRouter modes, so it will be in mostly deployable state if we want to put it back online, however maintaining an unused production deployment seems like a waste.

Indeed.

A lot has changed since we began this migration, including https://www.mediawiki.org/wiki/Core_Platform_Team/Decisions_Architecture_Research_Documentation/Services_Architecture_Recommendations_(2019), which is expected be a lengthly process, but will ultimately result in REST{Router,Base}-less world. I guess the question we should be asking is: Is this still something we should do in the meantime (and schedule and resource to complete), or should we cut bait, undeploy from k8s, and leave things as they are?
@WDoranWMF ?

Jan 13 2020, 8:51 PM · Patch-For-Review, serviceops, Core Platform Team Workboards (Clinic Duty Team)
Eevans added a comment to T242461: restrouter.svc.{eqiad,codfw}.wmnet in a failed state.

Since (long-term) we aim to replace all of this, is abandoning it entirely an option?

Is it possible to take it out for now until we either prioritize it again or drop it entirely?

You mean undeploy? Sure we can undeploy it. The only caveat being that redeploying it will take some time as we will need to create the necessary resources again (LVS entries, DNS, kubernetes namespaces etc).

We're running CI for RESTBase in both RESTBase and RESTRouter modes, so it will be in mostly deployable state if we want to put it back online, however maintaining an unused production deployment seems like a waste.

Indeed.

Jan 13 2020, 4:43 PM · Patch-For-Review, serviceops, Core Platform Team Workboards (Clinic Duty Team)

Jan 10 2020

Eevans triaged T242461: restrouter.svc.{eqiad,codfw}.wmnet in a failed state as Medium priority.

It's not clear to me what the status of this is. Do we need to deploy the latest code here? Since (long-term) we aim to replace all of this, is abandoning it entirely an option?

Jan 10 2020, 8:39 PM · Patch-For-Review, serviceops, Core Platform Team Workboards (Clinic Duty Team)
Eevans created T242461: restrouter.svc.{eqiad,codfw}.wmnet in a failed state.
Jan 10 2020, 8:35 PM · Patch-For-Review, serviceops, Core Platform Team Workboards (Clinic Duty Team)
Eevans added a comment to T242344: Remove Parsoid-JS tables from Cassandra.

The tables have been dropped in all 3 environments. The only thing remaining is to clear the snapshots (and actually reclaim the space). Out of an abundance of caution, I'll sit on this for a couple days and close the ticket once complete.

Jan 10 2020, 8:27 PM · Core Platform Team Workboards (Clinic Duty Team), Parsoid-PHP, RESTBase
Eevans added a comment to T242344: Remove Parsoid-JS tables from Cassandra.

OK, here is what I propose applying; Review appreciated!

Jan 10 2020, 7:51 PM · Core Platform Team Workboards (Clinic Duty Team), Parsoid-PHP, RESTBase
Eevans created P10118 deployment-prep.yaml.
Jan 10 2020, 7:50 PM
Eevans created P10116 dev.yaml.
Jan 10 2020, 7:49 PM
Eevans created P10115 production.yaml.
Jan 10 2020, 7:47 PM
Eevans updated the task description for T242344: Remove Parsoid-JS tables from Cassandra.
Jan 10 2020, 7:35 PM · Core Platform Team Workboards (Clinic Duty Team), Parsoid-PHP, RESTBase
Eevans triaged T242344: Remove Parsoid-JS tables from Cassandra as Medium priority.
Jan 10 2020, 7:34 PM · Core Platform Team Workboards (Clinic Duty Team), Parsoid-PHP, RESTBase
Eevans edited projects for T241068: Restrouter health checks fail when local wikifeeds instance is not pool in discovery records, added: Core Platform Team Workboards (Clinic Duty Team); removed Core Platform Team.
Jan 10 2020, 5:47 PM · Core Platform Team (Icebox), serviceops-radar
Eevans edited projects for T178445: flapping monitoring for recommendation_api on scb, added: Core Platform Team Workboards (Clinic Duty Team); removed Core Platform Team.
Jan 10 2020, 5:43 PM · Core Platform Team (Icebox), Discovery, Recommendation-API, Wikidata, Services (watching), Operations, observability
Eevans edited projects for T241905: Investigate JobQueue outage from 2020-01-04 22:00 UTC, added: Core Platform Team Workboards (Clinic Duty Team); removed Core Platform Team.
Jan 10 2020, 5:41 PM · Core Platform Team Workboards (Clinic Duty Team), Wikimedia-Incident, WMF-JobQueue
Eevans edited projects for T241940: No option to continue querying for more results in globalallusers API, added: Core Platform Team Workboards (Clinic Duty Team); removed Core Platform Team.
Jan 10 2020, 5:40 PM · Core Platform Team (Icebox), MediaWiki-extensions-CentralAuth, MediaWiki-API
Eevans edited projects for T242249: Unclear MCR replacement for WikiPage::prepareContentForEdit, added: Core Platform Team Workboards (Clinic Duty Team); removed Core Platform Team.
Jan 10 2020, 5:40 PM · Documentation, CPT Initiatives (MCR)
Eevans edited projects for T242409: languageinfo API returns a TypeError if you request fallbacks, added: Core Platform Team Workboards (Clinic Duty Team); removed Core Platform Team.
Jan 10 2020, 5:40 PM · MW-1.35-notes (1.35.0-wmf.15; 2020-01-14), Core Platform Team Workboards (Clinic Duty Team), Wikimedia-production-error, MediaWiki-API, Regression
Eevans removed a project from T224425: MW Job consumers sometimes pause for several minutes: Core Platform Team.
Jan 10 2020, 5:39 PM · Core Platform Team Workboards (Clinic Duty Team), CPT Initiatives (Modern Event Platform (TEC2)), WMF-JobQueue, Discovery-Search (Current work)
Eevans added a project to T224425: MW Job consumers sometimes pause for several minutes: Core Platform Team Workboards (Clinic Duty Team).
Jan 10 2020, 5:38 PM · Core Platform Team Workboards (Clinic Duty Team), CPT Initiatives (Modern Event Platform (TEC2)), WMF-JobQueue, Discovery-Search (Current work)
Eevans triaged T240307: Hook container with strong types and DI as Medium priority.
Jan 10 2020, 5:34 PM · Dependency injection, Core Platform Team Workboards (Clinic Duty Team), MW-1.35-notes (1.35.0-wmf.23; 2020-03-10), Patch-For-Review, TechCom-RFC (TechCom-RFC-Closed), User-Daniel, Core Platform Team
Eevans triaged T170603: API Edit Requires a Captcha, but on Wiki edit does not as Medium priority.
Jan 10 2020, 5:33 PM · MediaWiki-extensions-OAuth, ConfirmEdit (CAPTCHA extension), MediaWiki-API
Eevans triaged T192023: Allowing seaching the archive table for titles of deleted pages through the API as Medium priority.
Jan 10 2020, 5:25 PM · MediaWiki-API
Eevans triaged T241940: No option to continue querying for more results in globalallusers API as Medium priority.
Jan 10 2020, 5:23 PM · Core Platform Team (Icebox), MediaWiki-extensions-CentralAuth, MediaWiki-API
Eevans triaged T242249: Unclear MCR replacement for WikiPage::prepareContentForEdit as Medium priority.
Jan 10 2020, 5:22 PM · Documentation, CPT Initiatives (MCR)