Page MenuHomePhabricator

add thumbor to production infrastructure
Closed, ResolvedPublic

Description

tracking here what needs to happen on the production infrastructure to happen for thumbor:

  • hardware provisioning
  • LVS setup
  • modify rewrite.py to duplicate a fraction of thumbnail requests to thumbor (some of it already in mediawiki-vagrant) T139484
  • thumbor puppetization (includes module, role, etc)
  • grant thumbor-admins access https://gerrit.wikimedia.org/r/#/c/302471/
  • provision swift account for thumbor and provide write access to relevant containers
  • monitoring and alarming
  • setup firejail for thumbor, similar to imagescaler

HW provisioning

thumbor is expected to be purely CPU/network bound. With ideally less, but at least similar, requirements as the current image scalers. We can order dedicated hardware and in the mean time re-use some of recently decommissioned hardware.

rewrite.py integration

Implementation is part of [[https://phabricator.wikimedia.org/rMWVA9971189bbc715e7df2d7f19a5ad5edb6ca4c2399 | mediawiki-vagrant ]] and should be ported to production and its configuration puppetized: T139484

Related Objects

StatusSubtypeAssignedTask
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
ResolvedNone
Resolvedfgiunchedi
Resolved Gilles
Resolvedfgiunchedi
Resolvedfgiunchedi
Resolved Gilles
Resolved Gilles
Declined Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Declined Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Invalid Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Duplicate Gilles
Resolved Gilles
Duplicate Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Duplicate Gilles
Duplicate Gilles
Declined Gilles
Resolved Gilles
Declined Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles
Resolved Gilles

Event Timeline

There are a very large number of changes, so older changes are hidden. Show Older Changes

"monitoring and alarming" is still unchecked on this task. Is that true? Is there still something to do there? I thought we had paging alarms raised before when disk usage filled up.

I'm going to move all the remaining tasks to the parent, since for all intents and purposes, Thumbor has been running in production for some time now, just not serving its results yet.