akosiaris (Alexandros Kosiaris)
Senior Site Reliability Engineer

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Friday

  • Clear sailing ahead.

User Details

User Since
Oct 3 2014, 8:40 AM (219 w, 4 d)
Availability
Available
IRC Nick
akosiaris
LDAP User
Alexandros Kosiaris
MediaWiki User
AKosiaris (WMF) [ Global Accounts ]

Blurb

Recent Activity

Today

akosiaris added a comment to T212129: Use a multi-dc aware store for ObjectCache's MainStash if needed..

The current callers don't assume the level of durability as with mysql, just that the data will likely not be randomly removed (e.g. high eviction rate, power outage, network blips).

Wed, Dec 19, 6:43 AM · Performance-Team (Radar), Core Platform Team, Operations, MediaWiki-Cache, serviceops

Yesterday

akosiaris added a comment to T211750: Introduce Python code formatters usage.

I like black too but from but from https://black.readthedocs.io/en/stable/installation_and_usage.html it tied to having python 3.6 installed.

Tue, Dec 18, 4:01 PM · Operations, Operations-Software-Development
akosiaris added a comment to T212212: eqiad: 1-2 VM requests for docker-registry-beta.wikimedia.org.

Add codfw in the mix as well, no reason to cap this to eqiad. Everything else LGTM

Tue, Dec 18, 2:40 PM · serviceops, Operations, vm-requests
akosiaris committed rDEPLOYCHARTSdf938e395cf8: blubberoid: Bump CPU limit to 1800m (authored by akosiaris).
blubberoid: Bump CPU limit to 1800m
Tue, Dec 18, 1:33 PM
akosiaris committed rDEPLOYCHARTSeb112cf79703: blubberoid: Bump CPU limit to 1800m (authored by akosiaris).
blubberoid: Bump CPU limit to 1800m
Tue, Dec 18, 1:28 PM

Mon, Dec 17

akosiaris updated the task description for T203963: Convert makevm to spicerack cookbook.
Mon, Dec 17, 3:04 PM · serviceops, Operations-Software-Development, User-jijiki, User-Joe, Operations

Fri, Dec 14

akosiaris reopened T210260: Stretch in docker registry forces ascii encoding as "Open".

Oops, closed this by mistake. Re-opened, feel free to close when the issue is indeed resolved.

Fri, Dec 14, 12:09 PM · Release Pipeline, Patch-For-Review, Release-Engineering-Team (Backlog), Wikibase-Containers, Scoring-platform-team, Wikilabels, Wikidata
akosiaris closed T210260: Stretch in docker registry forces ascii encoding as Resolved.

Following the merge of https://gerrit.wikimedia.org/r/478200 , can you possibly rebuild the two images please? :)

RepositoryTagImage idCreatedSize
docker-registry.wikimedia.org/wikimedia-stretchlatestac576ceda67113 months ago56.1MB
docker-registry.wikimedia.org/wikimedia-jessielatesta81cc7ec799813 months ago80.4MB
Fri, Dec 14, 12:08 PM · Release Pipeline, Patch-For-Review, Release-Engineering-Team (Backlog), Wikibase-Containers, Scoring-platform-team, Wikilabels, Wikidata

Thu, Dec 13

akosiaris added a comment to T211708: Blubberoid - Create Helm Chart.

Chart merged and is available at https://releases.wikimedia.org/charts/

Thu, Dec 13, 3:42 PM · Patch-For-Review, Release-Engineering-Team (Kanban), Core Platform Team Backlog (Watching / External), Services (watching), Release Pipeline, Operations
akosiaris changed the status of T203091: Move Graphoid to Kubernetes via the deployment pipeline from Open to Stalled.

The migration uncovered a number of issues in graphoid that make it worthwhile to consider a Code Stewardship request. That is done in T211881, stalling this until it is resolved.

Thu, Dec 13, 3:38 PM · Patch-For-Review, Services (watching), Operations, Release-Engineering-Team (Kanban), Release Pipeline
akosiaris changed the status of T203091: Move Graphoid to Kubernetes via the deployment pipeline, a subtask of T205919: TEC3:O3:O3.1:Q2 Goal - Move Blubberoid, ZoteroV2, and Graphoid through the production CD Pipeline, from Open to Stalled.
Thu, Dec 13, 3:38 PM · Patch-For-Review, Core Platform Team Backlog (Watching / External), Services (watching), Release Pipeline, Operations, Release-Engineering-Team
akosiaris added a subtask for T203091: Move Graphoid to Kubernetes via the deployment pipeline: T211881: graphoid: Code stewardship request.
Thu, Dec 13, 3:37 PM · Patch-For-Review, Services (watching), Operations, Release-Engineering-Team (Kanban), Release Pipeline
akosiaris added a parent task for T211881: graphoid: Code stewardship request: T203091: Move Graphoid to Kubernetes via the deployment pipeline.
Thu, Dec 13, 3:37 PM · Release-Engineering-Team (Kanban), Services, Operations, Code-Stewardship-Reviews, Graphoid
akosiaris changed the status of T203092: Create Graphoid .pipeline files from Open to Stalled.

Stalling until T211811 is done

Thu, Dec 13, 3:37 PM · Patch-For-Review, Core Platform Team Backlog (Watching / External), Services (watching), Operations, Release-Engineering-Team (Kanban), Release Pipeline
akosiaris changed the status of T203092: Create Graphoid .pipeline files, a subtask of T203091: Move Graphoid to Kubernetes via the deployment pipeline, from Open to Stalled.
Thu, Dec 13, 3:37 PM · Patch-For-Review, Services (watching), Operations, Release-Engineering-Team (Kanban), Release Pipeline
akosiaris added a subtask for T203092: Create Graphoid .pipeline files: T211881: graphoid: Code stewardship request.
Thu, Dec 13, 3:37 PM · Patch-For-Review, Core Platform Team Backlog (Watching / External), Services (watching), Operations, Release-Engineering-Team (Kanban), Release Pipeline
akosiaris added a parent task for T211881: graphoid: Code stewardship request: T203092: Create Graphoid .pipeline files.
Thu, Dec 13, 3:37 PM · Release-Engineering-Team (Kanban), Services, Operations, Code-Stewardship-Reviews, Graphoid
akosiaris created T211881: graphoid: Code stewardship request.
Thu, Dec 13, 2:06 PM · Release-Engineering-Team (Kanban), Services, Operations, Code-Stewardship-Reviews, Graphoid
akosiaris added a comment to T207804: Upgrade calico in production to version 2.4+.

I 've had to deannotate the zotero namespace with commands like the one below

Thu, Dec 13, 12:51 PM · Operations, Patch-For-Review
akosiaris committed rDEPLOYCHARTSbacc760f3893: Package blubberoid and update repo (authored by akosiaris).
Package blubberoid and update repo
Thu, Dec 13, 12:23 PM

Wed, Dec 12

akosiaris edited P7909 blubberoid apache benchmark (ab) tests.
Wed, Dec 12, 3:04 PM
akosiaris created P7909 blubberoid apache benchmark (ab) tests.
Wed, Dec 12, 1:52 PM

Tue, Dec 11

akosiaris closed T211382: Requesting access to Proton for pmiazga, bearND, Mholloway, MSantos, Tgr as Resolved.

I 've slightly amended the patch to remove the now defunct sc-admins group and merged the patch per the SRE meeting's approval. Resolving this

Tue, Dec 11, 11:12 AM · Patch-For-Review, Proton, Operations, SRE-Access-Requests
akosiaris closed T211382: Requesting access to Proton for pmiazga, bearND, Mholloway, MSantos, Tgr, a subtask of T210652: Handoff Proton service to Reading Infrastructure, as Resolved.
Tue, Dec 11, 11:12 AM · Readers-Web-Backlog (Readers-Web-Kanbanana-Board-2018-19-Q2), Reading-Infrastructure-Team-Backlog, Proton

Wed, Dec 5

akosiaris added a comment to T197242: Transition citoid to use Zotero's translation-server-v2.

FWIW we 've had a number of minor outages and alerts resulting in increased latency for results. The corresponding graph can be seen here https://grafana.wikimedia.org/dashboard/db/restbase-external-overview?panelId=17&fullscreen&orgId=1&from=1544017415835&to=1544026700314

Wed, Dec 5, 4:26 PM · Patch-For-Review, Services (done), VisualEditor (Current work), Citoid, Operations
akosiaris added a comment to T197242: Transition citoid to use Zotero's translation-server-v2.

Seems like after this has been done the citation alerts started flapping much more then they used to. Also, the mean latency for citations endpoint went up from seconds to minutes.

Wed, Dec 5, 11:41 AM · Patch-For-Review, Services (done), VisualEditor (Current work), Citoid, Operations

Tue, Dec 4

akosiaris added a comment to T122676: Implement sentinel for ORES production Redis.

Hey,

  • @akosiaris tested twemproxy in prod and it fails because celery issues redis transactions and twemproxy doesn't support redis transactions. Same goes with dynomite.
Tue, Dec 4, 10:53 AM · User-Ladsgroup, Scoring-platform-team (Current), ORES
akosiaris closed T210720: Logrotate should restart services when more people are around as Resolved.

I 'll do so, thanks

Tue, Dec 4, 7:10 AM · ORES, Operations, Puppet, Wikimedia-Incident, Scoring-platform-team

Mon, Dec 3

akosiaris added a comment to T210890: Loading full versions of larger images from Commons stucks / repeatedly gets interrupted after a few MBs.

I can reproduce it as well. Received sizes and execution times are not consistent, ranging from a few hundreds of byes to a couple of megabytes and a few secs respectively. This and more importantly the test done above by @fgiunchedi indicate something going awry in the communication between varnish and swift.

Mon, Dec 3, 11:35 AM · Patch-For-Review, Operations, media-storage, Traffic, Wikimedia-General-or-Unknown

Fri, Nov 30

akosiaris added a comment to T210260: Stretch in docker registry forces ascii encoding.

I did just do a quick check on wikimedia-stretch image for this

Fri, Nov 30, 12:09 PM · Release Pipeline, Patch-For-Review, Release-Engineering-Team (Backlog), Wikibase-Containers, Scoring-platform-team, Wikilabels, Wikidata

Thu, Nov 29

akosiaris added a project to T210582: New node request: oresrdb[12]003: vm-requests.
Thu, Nov 29, 5:41 PM · Operations, vm-requests, Scoring-platform-team (Current), ORES
akosiaris committed rDEPLOYCHARTS8b8d7680a2c4: First draft of a graphoid helm chart (authored by akosiaris).
First draft of a graphoid helm chart
Thu, Nov 29, 2:49 PM
akosiaris added a comment to T210720: Logrotate should restart services when more people are around.

I am afraid we can't really change it. It's been at 06:25am (UTC in our case) forever and people expect that. Changing it would break the current expectations of people. Note that this is true for all services and software and it hasn't really caused an issue for a long time. So we should make a better job of surfacing and fixing the issues, not changing the logrotate schedule

Thu, Nov 29, 1:27 PM · ORES, Operations, Puppet, Wikimedia-Incident, Scoring-platform-team
akosiaris added a comment to T210260: Stretch in docker registry forces ascii encoding.

C.UTF8 does not exist. In every other locale I try, a UTF8 suffix is an alias to the UTF-8 suffix (with the dash).

This works: docker run unitest env LC_ALL=C.UTF-8 python3 -c "print('étoile')"

I'd suggest that we use the C.UTF-8 locale, but I see no strong reason to prefer it over the en_US.UTF-8 locale.

Thu, Nov 29, 1:25 PM · Release Pipeline, Patch-For-Review, Release-Engineering-Team (Backlog), Wikibase-Containers, Scoring-platform-team, Wikilabels, Wikidata
akosiaris added a comment to T210704: Migrate node-based services in production to node10.

Does this mean he have a hard deadline of 2019-04-01 for completing the migrations? Or per the "I can backport security fixes for a while" we have a couple of more months? The current goal is that by July 2019 all scb services, restbase (and probably aqs as well), proton, parsoid will be in kubernetes. That will leave turnilo and aphlict I guess.

Thu, Nov 29, 10:01 AM · Patch-For-Review, Core Platform Team Backlog (Next), Services (next), Operations

Wed, Nov 28

akosiaris added a comment to T210584: Make celery queues transient.

This has already been implemented (albeit not in celery but redis). https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/394022/2/modules/ores/manifests/redis.pp

Wed, Nov 28, 9:13 AM · Performance, ORES, Scoring-platform-team (Current), User-Ladsgroup
akosiaris added a comment to T196478: rack/setup/install backup1001.

@Cmjohnson, I think we can proceed with this. I did just try to reimage the server but mgmt is not responding

Wed, Nov 28, 9:08 AM · Patch-For-Review, Operations, ops-eqiad

Tue, Nov 27

akosiaris added a comment to T210467: codfw row D recable and add QFX.

@akosiaris for ores2008

Tue, Nov 27, 4:32 PM · User-jijiki, Patch-For-Review, ops-codfw, netops, Operations
akosiaris added a comment to T210447: codfw row A recable and add QFX.

ores2001, 2*ganeti, 15*mw
cc @akosiaris to know what specific actions need to be taken for Ores and Ganeti

Tue, Nov 27, 4:31 PM · Patch-For-Review, ops-codfw, netops, Operations
akosiaris updated the task description for T196477: rack/setup/install backup2001.
Tue, Nov 27, 3:55 PM · Patch-For-Review, ops-codfw, Operations
akosiaris closed T196477: rack/setup/install backup2001 as Resolved.

Box is reimaged and is up and running. megacli seems the controller and the disks

Tue, Nov 27, 3:55 PM · Patch-For-Review, ops-codfw, Operations
akosiaris reopened T196478: rack/setup/install backup1001 as "Open".

Solved the wrong task. I mean to resolve T196477

Tue, Nov 27, 3:54 PM · Patch-For-Review, Operations, ops-eqiad
akosiaris closed T196478: rack/setup/install backup1001 as Resolved.
Tue, Nov 27, 3:54 PM · Patch-For-Review, Operations, ops-eqiad
akosiaris updated the task description for T196478: rack/setup/install backup1001.
Tue, Nov 27, 3:52 PM · Patch-For-Review, Operations, ops-eqiad
akosiaris closed T191648: uwsgi::app sorts config keys, but the .ini file behavior depends on order as Resolved.

Child task resolved, resolving this as well

Tue, Nov 27, 1:49 PM · Patch-For-Review, Operations, Puppet
akosiaris closed T192102: deprecate and remove --autoload in uwsgi puppet class as Resolved.

Finally resolved

Tue, Nov 27, 1:49 PM · Patch-For-Review, Operations, Puppet
akosiaris closed T192102: deprecate and remove --autoload in uwsgi puppet class, a subtask of T191648: uwsgi::app sorts config keys, but the .ini file behavior depends on order, as Resolved.
Tue, Nov 27, 1:49 PM · Patch-For-Review, Operations, Puppet
akosiaris updated subscribers of T210485: Investigate high usage of Apertium and V2 endpoint.

I 've helped with the debugging. Starting from apertium it was clear something automated was POSTing a lot of requests to it. It turned out they were mostly for the rus|bel langpair but that was a red herring as it was just the snapshot in time I looked at. Moving from apertium to cxserver it became clear something was POSTing to /v2/translate endpoint. The things I noted were mostly about another language pair ca|oc but again that was a snapshot in time. Then a VM IP caught my eye, one that was of wcdo.wcdo.eqiad.wmflabs. I 've jumped into said VM and stopped a process that was clearly heavily hitting the cxserver API

Tue, Nov 27, 10:08 AM · CX-deployments, Language-Team (Language-2018-October-December)

Mon, Nov 26

akosiaris committed rDEPLOYCHARTS695ccd1b58af: First draft of a graphoid helm chart (authored by akosiaris).
First draft of a graphoid helm chart
Mon, Nov 26, 4:44 PM
akosiaris added a comment to T208426: Deploy Zuul 2.5.1-wmf5.

Should we also merge https://gerrit.wikimedia.org/r/#/c/integration/zuul/+/465324/ and release 2.5.1-wmf6 ? I 'll upload it to apt.wikimedia.org and I would rather do this once.

Mon, Nov 26, 12:09 PM · Release-Engineering-Team (Kanban), Continuous-Integration-Infrastructure, Gerrit, Zuul
akosiaris added a comment to T201611: Deploy translation-server-v2.

@akosiaris: I think your last paste is slightly broken (most of the URL got muddled with the data)

Mon, Nov 26, 6:51 AM · User-Ryasmeen, Services (watching), Patch-For-Review, User-mobrovac, Service-deployment-requests, VisualEditor (Current work), Citoid, Operations

Fri, Nov 23

akosiaris closed T201611: Deploy translation-server-v2 as Resolved.

Finally deployed to production

Fri, Nov 23, 9:07 AM · User-Ryasmeen, Services (watching), Patch-For-Review, User-mobrovac, Service-deployment-requests, VisualEditor (Current work), Citoid, Operations
akosiaris closed T201611: Deploy translation-server-v2, a subtask of T197242: Transition citoid to use Zotero's translation-server-v2, as Resolved.
Fri, Nov 23, 9:07 AM · Patch-For-Review, Services (done), VisualEditor (Current work), Citoid, Operations

Tue, Nov 20

akosiaris added a comment to T209088: Design pipeline image versioning scheme.

I think we should support multiple tags per image (docker anyway does support that and they cost next to nothing on the registry level AFAIK)

  • Keep the ${timestamp}-production (+1 to @LarsWirzenius about splitting date and time btw) as it's nice and monotonically increasing

+1

  • Add a tag based on the zuul.commit SHA1, but only if we can be reasonable sure that it's immutable. My memory fails me, but I remember some objection to this in the last meeting, does anyone remember the specifics?

I don't recall any specific objections, but I may be mis-remembering. Since we have this running as a postmerge job it should be fine, I think.

  • Possibly allow the developer to influence part of the process by supporting adding a tag on git commits that are tagged, allowing developers to implement SemVer (or any other kind of versioning scheme) if they so wish. That might or might not be the wisest decision on their part, but I think we should allow people to make that decision

For now we could do something like git tag --points-at HEAD and just add a tag based on that. In future we may want something fancier.

Tue, Nov 20, 4:13 PM · Patch-For-Review, Release-Engineering-Team (Backlog), Operations, Release Pipeline

Mon, Nov 19

akosiaris committed rDEPLOYCHARTS5f8b61230ce0: First draft of a zotero helm chart (authored by akosiaris).
First draft of a zotero helm chart
Mon, Nov 19, 12:36 PM

Nov 17 2018

akosiaris added a comment to T201611: Deploy translation-server-v2.

This has now been deployed to the kubernetes staging cluster.

Nov 17 2018, 4:28 PM · User-Ryasmeen, Services (watching), Patch-For-Review, User-mobrovac, Service-deployment-requests, VisualEditor (Current work), Citoid, Operations

Nov 16 2018

akosiaris created P7817 zotero translation server logs.
Nov 16 2018, 2:56 PM
akosiaris closed T209691: Upgrade to OTRS version 5.0.32 as Resolved.

Upgrade done, resolving

Nov 16 2018, 10:49 AM · Operations, OTRS
akosiaris created T209691: Upgrade to OTRS version 5.0.32.
Nov 16 2018, 10:32 AM · Operations, OTRS
akosiaris added a comment to T182222: Create Grafana graph to show number of ORES API requests per user-agent.

FWIW, I 'll echo @Ladsgroup and @fgiunchedi. Having the data is obviously useful. Representing them in grafana on the other hand it probably not so practical. I also have my doubts as to whether a graph would help identify the culprits of load spikes, mostly due to the nature of the service, but I am be at fault here.

Nov 16 2018, 9:00 AM · Wikimedia-Incident, ORES, monitoring, Scoring-platform-team

Nov 15 2018

akosiaris renamed T209184: Upgrade to OTRS version 5.0.31 from Upgrade to OTRS version 5.0.30 to Upgrade to OTRS version 5.0.31.
Nov 15 2018, 4:42 PM · Operations, OTRS
akosiaris added a comment to T209517: Upgrade/reboot labsdb* servers.

@akosiaris labsdb1006/7 are involved -- and if we do this with a failover, I want to make sure the additional tables for T201544 are replicated? Can we confirm that?

Nov 15 2018, 1:44 PM · User-Banyek, Patch-For-Review, Data-Services, cloud-services-team (Kanban), DBA

Nov 14 2018

akosiaris added a comment to T209088: Design pipeline image versioning scheme.

I think we should support multiple tags per image (docker anyway does support that and they cost next to nothing on the registry level AFAIK)

Nov 14 2018, 11:25 AM · Patch-For-Review, Release-Engineering-Team (Backlog), Operations, Release Pipeline

Nov 13 2018

akosiaris added a comment to T209271: improve docker registry architecture.

After looking into it a little bit, packaging harbor would be challenging. Harbor is a set of microservices published as containers. The installation and dev guide refers to docker-compose as a strong requirement for running harbor components, in order to run this docker-compose we need to build the container images and hosted them in our own docker registry which seems a sort of catch-22 problem (other people could rely on downloading it from DockerHub).

Nov 13 2018, 1:31 PM · serviceops, Prod-Kubernetes, Continuous-Integration-Infrastructure (shipyard), Kubernetes, Operations
akosiaris added a comment to T209265: Validate no namespaced keys are present in hieradata/*.yaml.

IIRC this is because of the expand_data directive in https://github.com/wikimedia/puppet/blob/production/modules/puppetmaster/files/production.hiera.yaml#L8

It's confusing indeed. That being said @Joe has expressed an interest in unifying the hiera backends so maybe we can get rid of this behavior.

Not really, the idea was to *avoid* having huge files. We might get rid of the behaviour where it won't make the files unmanageable.

So doing some easy calculations:

$ find hieradata/common -type f | xargs wc -l 
...
 3082 total
Nov 13 2018, 1:14 PM · Patch-For-Review, Puppet
akosiaris updated subscribers of T209265: Validate no namespaced keys are present in hieradata/*.yaml.

IIRC this is because of the expand_data directive in https://github.com/wikimedia/puppet/blob/production/modules/puppetmaster/files/production.hiera.yaml#L8

Nov 13 2018, 12:52 PM · Patch-For-Review, Puppet
akosiaris added a comment to T122676: Implement sentinel for ORES production Redis.

Which now brings us to the question of what's the next step?

Nov 13 2018, 11:32 AM · User-Ladsgroup, Scoring-platform-team (Current), ORES

Nov 9 2018

akosiaris closed T209184: Upgrade to OTRS version 5.0.31 as Resolved.

Upgrade completed successfully. Also checked with a SELECT * FROM version of the 2 sql statements displayed in https://community.otrs.com/security-advisory-2018-09-security-update-for-otrs-framework/ and no results were returned so no issues there.

Nov 9 2018, 9:13 PM · Operations, OTRS
akosiaris created T209184: Upgrade to OTRS version 5.0.31.
Nov 9 2018, 9:08 PM · Operations, OTRS
akosiaris updated subscribers of T206909: Degraded RAID on heze-array1 .

@Papaul I 'd say ignore it. That system+disk self/array is scheduled for decomission, to be replaced with backup2001 (T196477). The data in it is a copy of the data from helium so we ain't gonna lose something if more disks fail. There is no point in maintaining. After talking with @MoritzMuehlenhoff on IRC it seems like we can do a fresh reinstall of backup2001/backup1001 next week with the new stretch point release and set up the service on them and then decomission this

Nov 9 2018, 12:53 PM · Operations, ops-codfw

Nov 8 2018

akosiaris added a comment to T122676: Implement sentinel for ORES production Redis.

For what is worth, the upstream task is https://github.com/celery/celery/issues/3500. Closed WONTFIX apparently.

Nov 8 2018, 4:29 PM · User-Ladsgroup, Scoring-platform-team (Current), ORES
akosiaris added a comment to T182249: Diagnose and fix 4.5k req/min ceiling for ores* requests.

I like the proposal of depooling one datacenter. What do you think @akosiaris? Is this crazy?

Nov 8 2018, 11:49 AM · Patch-For-Review, Scoring-platform-team, Operations, Performance, ORES
akosiaris updated the task description for T203964: Create a spicerack cookbook to empty a ganeti node from VMs.
Nov 8 2018, 11:36 AM · Operations-Software-Development, User-jijiki, User-Joe, Operations

Nov 7 2018

akosiaris added a comment to T199853: Increase webperf1002/webperf2002 space from 50GB to 150GB (Ganeti).

For the record, if https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/472032/ is merged, the space occupied by xenon logs will increase (upto 2X).

Xenon occupies a mostly constant amount of space, but that mount may double from 25G upto 50G. This will essentially chip away at the space reserved for XHGui, which I estimated (in the task description) as being able to fit current and next 5 years of data. With this change in estimate for Xenon, that would instead accomodate (100G/2G per month) about 4 years instead.

Nov 7 2018, 5:27 PM · Performance-Team (Radar), vm-requests, Operations

Nov 6 2018

akosiaris committed rDEPLOYCHARTS2a0adb26e9e1: First draft of a zotero helm chart (authored by akosiaris).
First draft of a zotero helm chart
Nov 6 2018, 3:09 PM
akosiaris added a watcher for Wikimedia-Incident: akosiaris.
Nov 6 2018, 2:33 PM
akosiaris awarded T198699: Monitoring of MT services a Like token.
Nov 6 2018, 2:28 PM · CX-deployments, Language-Team (Language-2018-October-December), User-KartikMistry, CX-cxserver
akosiaris awarded T208715: Onboard Fabián Sellés Rosa to SRE a Like token.
Nov 6 2018, 2:28 PM · Patch-For-Review, Operations

Nov 5 2018

akosiaris added a comment to T198699: Monitoring of MT services.

FWIW, metrics_host: in config-vars.yaml, which is used by scap to build config.yaml, specifically the

Nov 5 2018, 8:39 PM · CX-deployments, Language-Team (Language-2018-October-December), User-KartikMistry, CX-cxserver
akosiaris updated the task description for T208715: Onboard Fabián Sellés Rosa to SRE.
Nov 5 2018, 5:32 PM · Patch-For-Review, Operations
akosiaris added a member for WMF-NDA-Requests: fselles.
Nov 5 2018, 5:32 PM
akosiaris added a member for WMF-NDA: fselles.
Nov 5 2018, 5:31 PM
akosiaris added a member for acl*sre-team: fselles.
Nov 5 2018, 5:31 PM
akosiaris updated the task description for T208715: Onboard Fabián Sellés Rosa to SRE.
Nov 5 2018, 5:30 PM · Patch-For-Review, Operations
akosiaris updated the task description for T208715: Onboard Fabián Sellés Rosa to SRE.
Nov 5 2018, 5:28 PM · Patch-For-Review, Operations
akosiaris added a comment to P7763 Parse a pcap file having poolcounter traffic in it.

This help generate F27065389

Nov 5 2018, 4:09 PM · ORES
akosiaris created P7763 Parse a pcap file having poolcounter traffic in it.
Nov 5 2018, 4:06 PM · ORES
akosiaris created T208715: Onboard Fabián Sellés Rosa to SRE.
Nov 5 2018, 11:05 AM · Patch-For-Review, Operations

Nov 2 2018

akosiaris closed T147872: Rename rhodium to puppetmaster1003 as Declined.

Indeed. Thanks!

Nov 2 2018, 2:23 PM · Operations
akosiaris added a comment to T204558: cloudvps: puppet project trusty deprecation.

Sorry, reopening this one because one was missed (on account of being inaccessible when Cumin was being run to find all trusty instances): compiler.puppet.eqiad.wmflabs - it's still not responding to ping or SSH.
@akosiaris, apparently you set this up 4 years ago, any chance you can shed any light on why it's not responding despite being status active instead of shutoff?

Nov 2 2018, 2:21 PM · Cloud-VPS (Ubuntu Trusty Deprecation), Puppet

Oct 26 2018

akosiaris closed T206068: Wikimedia Technical Conference 2018 Session - Identifying the requirements and goals for dependency tracking and events as Resolved.

Session notes added to
https://www.mediawiki.org/wiki/Wikimedia_Technical_Conference/2018/Session_notes/Identifying_the_requirements_and_goals_for_dependency_tracking_and_events

Oct 26 2018, 12:39 AM · Wikimedia-Technical-Conference-2018
akosiaris committed rDEPLOYCHARTS0b6ca570118d: Support canary functionality (authored by akosiaris).
Support canary functionality
Oct 26 2018, 12:36 AM
akosiaris committed rDEPLOYCHARTS49b53bd02289: Add chart to pod labels (authored by akosiaris).
Add chart to pod labels
Oct 26 2018, 12:36 AM

Oct 25 2018

akosiaris committed rDEPLOYCHARTS5b39c49c1095: Support canary functionality (authored by akosiaris).
Support canary functionality
Oct 25 2018, 11:51 PM
akosiaris committed rDEPLOYCHARTS95b270ba51b3: Add chart to pod labels (authored by akosiaris).
Add chart to pod labels
Oct 25 2018, 11:51 PM
akosiaris committed rDEPLOYCHARTS30095f0adbaa: WIP: Support canary functionality (authored by akosiaris).
WIP: Support canary functionality
Oct 25 2018, 6:12 PM
akosiaris committed rDEPLOYCHARTSa7b48fbb8dd9: Add chartid to pod labels (authored by akosiaris).
Add chartid to pod labels
Oct 25 2018, 6:12 PM
akosiaris committed rDEPLOYCHARTSb3ff819c5aaf: scaffold: Invert the externalIPs inclusion logic (authored by akosiaris).
scaffold: Invert the externalIPs inclusion logic
Oct 25 2018, 6:12 PM
akosiaris committed rDEPLOYCHARTS8fe01154a3a0: Invert logic for specifying externalIPs (authored by akosiaris).
Invert logic for specifying externalIPs
Oct 25 2018, 6:12 PM
akosiaris committed rDEPLOYCHARTS5d4dcd436704: mathoid: Add various informational chart values (authored by akosiaris).
mathoid: Add various informational chart values
Oct 25 2018, 6:12 PM