Maniphest T208381

mediawiki - node SSR HTTP request follow-up [2*2h]
Closed, Resolved, Public
Assigned To

Authored By

	• Pablo-WMDE
	Oct 31 2018, 10:17 AM

Description

T206200 implemented the basics for getting the node SSR result into Wikibase. Before this can be called ready for production, we should invest more time in looking at measurability, edge cases, requirements by operations, ...

~~Braindump~~ Topics

timeout => T215912
log failed connections => T215913
what happens on failure? (e.g. render the root element and trust in client-side re-rendering 1, 2)
- how do we avoid storing a suboptimal result in the ParserCache?
TLS => answer from ops: right now this isn't supported by them. Should we make a ticket to propose this?
custom user agent incl. version information of the mw/wb system performing the request T217399
log download time => right now we cannot get these for free from the network level. In the future, maybe yes. For now we will have to log from the service.
graceful service shutdown
- add healthcheck T215920
- configure helm T215921 question to ops: helm help, anyone? Configure /healthcheck to yield a useful result (e.g. during start, on graceful shutdown)
caching => until we decide we need to add a caching layer in between mediawiki and the SSR service this should be covered by T214679
node service performance
- gzip T215917
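
Several of the topics above (timeout, logging failed connections, custom user agent, and falling back on failure without caching the degraded result) can be illustrated together. The following is a minimal sketch in Python, not the actual MediaWiki/Wikibase code. The function name, the fallback markup, the default timeout, and the user-agent string are all hypothetical.

```python
import http.client
import logging
import urllib.request
from dataclasses import dataclass
from typing import Any, Callable

logger = logging.getLogger("ssr_client")

# Hypothetical placeholder: an empty root element that client-side code re-renders.
FALLBACK_HTML = '<div class="termbox-root"></div>'


@dataclass
class RenderResult:
    html: str
    cacheable: bool  # False: the caller should not store this in a long-lived cache


def fetch_ssr_html(
    url: str,
    timeout_s: float = 3.0,  # assumed value, not taken from the task
    user_agent: str = "ExampleWiki/0.0 (hypothetical version string)",
    opener: Callable[..., Any] = urllib.request.urlopen,
) -> RenderResult:
    """Request server-side rendered HTML; fall back to the root element on failure."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    try:
        with opener(request, timeout=timeout_s) as response:
            body = response.read().decode("utf-8")
        return RenderResult(html=body, cacheable=True)
    except (OSError, http.client.HTTPException, UnicodeDecodeError) as error:
        # OSError covers URLError/HTTPError, timeouts and refused connections.
        logger.warning("SSR request to %s failed: %s", url, error)
        return RenderResult(html=FALLBACK_HTML, cacheable=False)
```

A caller would check `cacheable` before writing the output to something like the ParserCache, so that a fallback rendering is never stored as if it were the real result.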

Comments

Traffic in "the opposite direction" is discussed in T209961

Related Objects

Status	Assigned	Task
Resolved	Lydia_Pintscher	T214899 Deploying Wikidata Termbox
Open	None	T214901 Show mobile termbox on Wikidata item and property pages
Resolved	Tarrow	T214902 Show mobile termbox on Wikidata test wiki
Resolved	Tarrow	T208381 mediawiki - node SSR HTTP request follow-up [2*2h]
Resolved	• Matthias_Geisler_WMDE	T215912 mediawiki - node: add timeout
Resolved	• Pablo-WMDE	T215913 mediawiki - node: log (failed) connections
Resolved	None	T215917 use compression to serve termbox http response
Resolved	Tarrow	T215920 Add health monitoring as required for deployment: use service-runner
Resolved	Tarrow	T215921 Configure helm chart

Event Timeline

• Pablo-WMDE created this task. Oct 31 2018, 10:17 AM

Restricted Application added a subscriber: Aklapper. Oct 31 2018, 10:17 AM

Addshore subscribed. Oct 31 2018, 11:00 AM

Lydia_Pintscher added a project: Wikidata. Oct 31 2018, 7:53 PM

• Pablo-WMDE renamed this task from wikimedia - node SSR request follow-up to wikimedia - node SSR HTTP request follow-up. Nov 1 2018, 9:33 AM

• Pablo-WMDE updated the task description.

• Pablo-WMDE updated the task description. Nov 7 2018, 12:03 PM

• Pablo-WMDE renamed this task from wikimedia - node SSR HTTP request follow-up to mediawiki - node SSR HTTP request follow-up. Nov 20 2018, 3:12 PM

• Pablo-WMDE updated the task description. Nov 20 2018, 3:15 PM

• Pablo-WMDE updated the task description.

• Pablo-WMDE updated the task description. Dec 5 2018, 11:04 AM

• Pablo-WMDE updated the task description. Jan 14 2019, 2:39 PM

• Pablo-WMDE updated the task description. Feb 5 2019, 2:41 PM

• Hanna_Petruschat_WMDE renamed this task from mediawiki - node SSR HTTP request follow-up to mediawiki - node SSR HTTP request follow-up [2*2h]. Feb 6 2019, 3:15 PM

• Hanna_Petruschat_WMDE moved this task from Backlog to MVP things: Ready for pickup on the Wikidata-Termbox board. Feb 6 2019, 3:18 PM

• Hanna_Petruschat_WMDE added a parent task: T214902: Show mobile termbox on Wikidata test wiki.

• Lea_WMDE triaged this task as Medium priority. Feb 11 2019, 11:52 AM

• Lea_WMDE edited projects, added Wikidata-Termbox-Iteration-9; removed Wikidata-Termbox.

• Pablo-WMDE moved this task from To Do to Doing on the Wikidata-Termbox-Iteration-9 board. Feb 12 2019, 2:21 PM

Jakob_WMDE updated the task description. Feb 12 2019, 2:57 PM

• Pablo-WMDE updated the task description. Feb 12 2019, 3:02 PM

• Pablo-WMDE updated the task description. Feb 12 2019, 3:14 PM

• Pablo-WMDE updated the task description. Feb 12 2019, 3:17 PM

• Pablo-WMDE updated the task description. Feb 12 2019, 3:22 PM

• Pablo-WMDE updated the task description.

• Pablo-WMDE updated the task description. Feb 12 2019, 3:27 PM

• Pablo-WMDE updated the task description. Feb 12 2019, 3:29 PM

• Pablo-WMDE updated the task description. Feb 12 2019, 3:40 PM

• Pablo-WMDE updated the task description. Feb 12 2019, 3:49 PM

• Pablo-WMDE updated the task description. Feb 12 2019, 3:57 PM

• Pablo-WMDE updated the task description. Feb 12 2019, 3:59 PM

Hi @WMDE-leszek and @Addshore,
we thought about the Topics, added to them, and created (superficial) tasks for the things we deem necessary before going live (mind you, this ticket is restricted to the traffic between mw and the node service, explicitly _not_ the other direction).
Could you please have a look at the questions indicated in red and see if you already have answers, or can point us to someone at the WMF with whom we can dive into these issues?

Thanks

• Pablo-WMDE updated the task description. Feb 12 2019, 4:02 PM

I had a quick chat with @Addshore, and I'll try to point to the WMF teams we believe are best suited to provide information on those topics, the team member who is, to our knowledge, the best suited and most approachable (e.g. based in Europe) in each topic area, and the IRC channels where the teams in question are generally present. This of course does not mean that IRC is the only way to get answers, or that the mentioned individuals are the ones you must be in contact with.
Email addresses of WMF staff can be found through the staff page https://wikimediafoundation.org/role/staff-contractors/

TLS => question to ops: how is this configured?
log download time => question to ops: can we get these metrics somewhere on the network level?

Those two should be the responsibility of the "Site Reliability Engineering" team. A possible contact person is Giuseppe L. He goes by _joe_ on IRC, and by Joe here in Phabricator. The team's IRC channel is #wikimedia-serviceops.

configure helm T215921 question to ops: helm help, anyone? Configure /healthcheck to yield a useful result (e.g. during start, on graceful shutdown)

That seems like the responsibility of the "Release Engineering" team. The person with helm expertise is definitely Tyler C. He goes by thcipriani on IRC and Phabricator. Unfortunately, as far as I know, he is not based in a European timezone. The team's IRC channel is #wikimedia-releng.
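
To make the "/healthcheck yields a useful result during start and on graceful shutdown" question concrete, here is a minimal sketch in Python of how a health endpoint could map the service lifecycle to a response. It is an illustration only, not the service-runner behaviour from T215920; the state names and the choice of 200/503 status codes are assumptions.

```python
from enum import Enum
from typing import Tuple


class ServiceState(Enum):
    # Hypothetical lifecycle states for the SSR service.
    STARTING = "starting"
    READY = "ready"
    SHUTTING_DOWN = "shutting down"


def healthcheck_response(state: ServiceState) -> Tuple[int, str]:
    """Return (HTTP status code, body) for a /healthcheck request."""
    if state is ServiceState.READY:
        return 200, "ok"
    # While starting or draining, report "unavailable" so that an
    # orchestrator can stop routing new requests to this instance.
    return 503, state.value
```

With this shape, an instance that has begun a graceful shutdown keeps serving in-flight requests while its health endpoint already signals that no new traffic should be sent to it.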

• Lea_WMDE changed the status of subtask T215920: Add health monitoring as required for deployment: use service-runner from Open to Stalled. Feb 25 2019, 11:08 AM

• Lea_WMDE changed the status of subtask T215921: Configure helm chart from Open to Stalled.

• Lea_WMDE added a project: Wikidata-Termbox-Iteration-10. Feb 25 2019, 11:52 AM

• Lea_WMDE moved this task from To Do to Doing on the Wikidata-Termbox-Iteration-10 board. Feb 25 2019, 11:57 AM

• Pablo-WMDE closed subtask T215917: use compression to serve termbox http response as Resolved. Feb 27 2019, 10:26 AM

Tarrow claimed this task. Feb 27 2019, 11:20 AM

Tarrow updated the task description. Feb 27 2019, 4:08 PM

Tarrow updated the task description. Mar 1 2019, 10:01 AM

Tarrow moved this task from Doing to Done on the Wikidata-Termbox-Iteration-10 board. Mar 1 2019, 2:08 PM

Tarrow changed the status of subtask T215920: Add health monitoring as required for deployment: use service-runner from Stalled to Open. Mar 1 2019, 4:01 PM

Tarrow closed subtask T215920: Add health monitoring as required for deployment: use service-runner as Resolved. Mar 18 2019, 10:22 AM

• Lea_WMDE closed this task as Resolved. Apr 17 2019, 1:51 PM