akosiaris (Alexandros Kosiaris)
Senior Site Reliability Engineer

User Details

User Since
Oct 3 2014, 8:40 AM (228 w, 4 d)
Availability
Available
IRC Nick
akosiaris
LDAP User
Alexandros Kosiaris
MediaWiki User
AKosiaris (WMF) [ Global Accounts ]

Recent Activity

Yesterday

akosiaris added a comment to T213566: Transferring data from Hadoop to production MySQL database.

For the record, just pointing out that the question of a new VM versus mwmaint1002 is probably irrelevant here. We can do both with what looks like minimal repercussions.

Mon, Feb 18, 4:07 PM · serviceops, Core Platform Team Backlog (Watching / External), Services (watching), User-Marostegui, Operations, Article-Recommendation, Analytics, Research
akosiaris added a comment to T216238: eqiad: (1) Ganeti VM for testing Kerberos in Production.

So after reading https://wikitech.wikimedia.org/wiki/Ganeti#Create_a_VM this is what I'd do:

  1. Review/Merge https://gerrit.wikimedia.org/r/491219 to add a private IP allocation in row A (after checking gnt-group list on ganeti1003 it seems to me that either A or C is fine, but lemme know if there is more to verify)
  2. Use makevm on ganeti1003 to create the VM and annotate the MAC address
  3. Create a puppet code change to add the node to DHCP and partman configs. Run puppet on install[12]002 before proceeding.
  4. Run gnt-instance start kerberos1001.eqiad.wnet on ganeti1003, and then attach to the console via gnt-instance console kerberos1001.eqiad.wnet
  5. Wait for the OS install to finish, and then before the end execute gnt-instance modify --hypervisor-parameters=boot_order=disk kerberos1001.eqiad.wnet
Mon, Feb 18, 4:03 PM · Patch-For-Review, vm-requests, Operations
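
As an illustration only, steps 4 and 5 of the plan in the comment above could be scripted roughly as follows. The sketch merely assembles the gnt-instance invocations quoted in that comment and prints them as a dry run. The helper name install_commands is hypothetical, and makevm, the puppet change, and waiting for the installer are left out because their details are not given here.

```python
import shlex


def install_commands(fqdn):
    """Return the gnt-instance invocations from steps 4 and 5, in order."""
    return [
        # Step 4: boot the VM, then attach to its console to watch the install.
        ["gnt-instance", "start", fqdn],
        ["gnt-instance", "console", fqdn],
        # Step 5: before the installer finishes, switch the boot order to disk.
        ["gnt-instance", "modify",
         "--hypervisor-parameters=boot_order=disk", fqdn],
    ]


if __name__ == "__main__":
    # Dry run: print the commands instead of executing them.
    for argv in install_commands("kerberos1001.eqiad.wnet"):
        print(shlex.join(argv))
```

Printing rather than executing keeps the sketch safe to run anywhere; on a real Ganeti master the same argument lists could be handed to subprocess.run once the earlier steps are done.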

Fri, Feb 15

akosiaris awarded T216234: Clarify and document our docker image building process and policies. a Like token.
Fri, Feb 15, 2:18 PM · docker-pkg, MediaWiki-Docker, serviceops
akosiaris added a comment to T213566: Transferring data from Hadoop to production MySQL database.

they will also not allow them to send the SYN/ACK packet required for the second (of the three) phase of the TCP handshake,

? Is that true? I'm pretty sure it isn't...unless we have some special rules I'm forgetting.

Fri, Feb 15, 1:59 PM · serviceops, Core Platform Team Backlog (Watching / External), Services (watching), User-Marostegui, Operations, Article-Recommendation, Analytics, Research
akosiaris added a parent task for T207804: Upgrade calico in production to version 2.4+: Unknown Object (Task).
Fri, Feb 15, 11:30 AM · User-fsero, serviceops, Kubernetes, Operations, Patch-For-Review
akosiaris added a comment to T215319: Base Blubber policy file for CI.

Clarification needed from serviceops folks: is it only the base-image for the production variant we want to restrict?

For example: blubber uses the golang image to build blubberoid and then copies that artifact to the production image based on docker-registry.wikimedia.org/wikimedia-stretch

Fri, Feb 15, 9:42 AM · Release Pipeline (Blubber), serviceops, Release-Engineering-Team
akosiaris added a comment to T215320: Generate per-repo pipeline jobs.

Bonus points if these jobs could be created by someone adding a .pipeline/config.yaml to their project.

At least 100000 bonus points for this! It would be really really useful if these semi-custom jobs could be created automatically for people without human intervention and/or git commits and deploys.

Fri, Feb 15, 9:19 AM · Release-Engineering-Team, Release Pipeline
akosiaris added a comment to T177868: Define pipeline failure developer feedback.

In talking with @dduvall the ideal would be, on failure, providing a url that could be passed to docker pull that lets the developer pull down an image that didn't pass muster.

Fri, Feb 15, 9:18 AM · Patch-For-Review, Release-Engineering-Team (Next), Release Pipeline

Thu, Feb 14

akosiaris updated subscribers of T213566: Transferring data from Hadoop to production MySQL database.
Thu, Feb 14, 5:17 PM · serviceops, Core Platform Team Backlog (Watching / External), Services (watching), User-Marostegui, Operations, Article-Recommendation, Analytics, Research
akosiaris added a comment to T211881: graphoid: Code stewardship request.

In any case, and at the risk of repeating myself, the service is under a code stewardship request cause it does not currently have a maintainer. If we want to keep it around, we need first and foremost a maintainer.

@akosiaris From what I gathered in this discussion, the course of action on this should be to find a Code Steward for graphoid.

Thu, Feb 14, 3:30 PM · Core Platform Team Backlog (Watching / External), Services (watching), Release-Engineering-Team (Kanban), Operations, Code-Stewardship-Reviews, Graphoid
akosiaris triaged T216140: Investigating using CI to automate testing VCL changes against all cluster/dc combos as Low priority.
Thu, Feb 14, 2:12 PM · Operations, Traffic
akosiaris created T216140: Investigating using CI to automate testing VCL changes against all cluster/dc combos.
Thu, Feb 14, 2:11 PM · Operations, Traffic
akosiaris committed rDEPLOYCHARTS866d21d9825a: Add GPLv3 license to the repo to be used by all charts (authored by akosiaris).
Add GPLv3 license to the repo to be used by all charts
Thu, Feb 14, 10:09 AM
Gerrit Code Review <gerrit@wikimedia.org> committed rDEPLOYCHARTS699f8faccfd9: Merge "Add a simple README.md" (authored by akosiaris).
Merge "Add a simple README.md"
Thu, Feb 14, 10:04 AM

Wed, Feb 13

akosiaris committed rDEPLOYCHARTScdc6558d3803: Bump eventgate-analytics version (authored by akosiaris).
Bump eventgate-analytics version
Wed, Feb 13, 4:07 PM
akosiaris committed rDEPLOYCHARTS1a7556677070: Bump eventgate-analytics version number (authored by akosiaris).
Bump eventgate-analytics version number
Wed, Feb 13, 3:22 PM
akosiaris committed rDEPLOYCHARTS39b2b45563c8: eventgate: Add single quotes and don't chomp whitespace (authored by akosiaris).
eventgate: Add single quotes and don't chomp whitespace
Wed, Feb 13, 3:22 PM
akosiaris added a comment to T196478: rack/setup/install backup1001.

@akosiaris Sorry for the really late response to this....the task got buried. No, I don't know why mgmt would not be working now unless it's disconnected or the cable is bad. I will check it next week after all hands.

Wed, Feb 13, 11:42 AM · Patch-For-Review, Operations, ops-eqiad
akosiaris committed rDEPLOYCHARTS19acfb57d825: Add a simple README.md (authored by akosiaris).
Add a simple README.md
Wed, Feb 13, 10:56 AM

Tue, Feb 12

akosiaris added a comment to T213116: Investigate: why are we getting overload errors during ORES deployments?.

Thanks for the blogpost @akosiaris. I had a complete misunderstanding of what blue/green deployment means. What I wanted to suggest for now was to have half/one third of the nodes as canary, basically having a rather big canary. Do you think it would work?

Tue, Feb 12, 3:10 PM · Scap, ORES, Scoring-platform-team
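
To make the "big canary" idea in the comment above concrete, here is a minimal sketch of splitting a node list into a canary group and the remainder. The function name, the host names, and the rounding choice are illustrative assumptions, not how Scap or the ORES deployment actually selects canaries.

```python
import math


def split_canary(nodes, fraction):
    """Split nodes into (canary, rest), with the canary holding ~fraction of them."""
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    # Round up so that a non-empty node list always yields at least one canary.
    count = math.ceil(len(nodes) * fraction)
    return nodes[:count], nodes[count:]


if __name__ == "__main__":
    hosts = [f"node{i}" for i in range(1, 10)]  # hypothetical host names
    canary, rest = split_canary(hosts, 1 / 3)
    print("deploy first to:", canary)
    print("then to:", rest)
```

A deployment would go to the first group, be checked for overload errors, and only then continue to the second group.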
akosiaris added a comment to T213475: Wikimedia varnish rules no longer exempt all Cloud VPS/Toolforge IPs from rate limits (HTTP 429 response).

Change has been deployed across the fleet. WMCS IP space 172.16.0.0/12 should now be exempt from rate limiting rules. @Cyberpower678, @Kelson could you please confirm?

Tue, Feb 12, 2:23 PM · Patch-For-Review, Toolforge, Operations, Traffic, Cloud-VPS
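
For readers unsure which addresses the 172.16.0.0/12 range in the comment above covers, the following sketch checks membership with Python's standard ipaddress module. The real exemption lives in the Varnish configuration; the function name is_exempt and the sample addresses are hypothetical and only illustrate the extent of the /12.

```python
import ipaddress

# The range named in the comment above.
WMCS_NET = ipaddress.ip_network("172.16.0.0/12")


def is_exempt(client_ip):
    """Return True if client_ip falls inside the exempted WMCS range."""
    return ipaddress.ip_address(client_ip) in WMCS_NET


if __name__ == "__main__":
    for ip in ("172.16.1.10", "172.31.255.254", "172.32.0.1", "10.0.0.1"):
        print(ip, is_exempt(ip))
```

A /12 starting at 172.16.0.0 spans 172.16.0.0 through 172.31.255.255, so the first two sample addresses fall inside the range and the last two do not.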
akosiaris committed rDEPLOYCHARTS0575e61f9e76: Add a default Apache-2 license to the repo (authored by akosiaris).
Add a default Apache-2 license to the repo
Tue, Feb 12, 11:27 AM
akosiaris committed rDEPLOYCHARTS1025e429a565: Add a simple README.md (authored by akosiaris).
Add a simple README.md
Tue, Feb 12, 11:27 AM
akosiaris committed rDEPLOYCHARTS8761715ebd4f: Package eventgate-analytics chart (authored by akosiaris).
Package eventgate-analytics chart
Tue, Feb 12, 11:27 AM
akosiaris committed rDEPLOYCHARTS108be84bd9ba: Package eventgate-analytics chart (authored by akosiaris).
Package eventgate-analytics chart
Tue, Feb 12, 11:12 AM
akosiaris committed rDEPLOYCHARTS1ac42967b0d2: Package kafka-dev helm chart (authored by akosiaris).
Package kafka-dev helm chart
Tue, Feb 12, 10:34 AM
Gerrit Code Review <gerrit@wikimedia.org> committed rDEPLOYCHARTSdd6b980998fb: Merge changes from topics "eventgate-analytics", "kafka-single-node" (authored by akosiaris).
Merge changes from topics "eventgate-analytics", "kafka-single-node"
Tue, Feb 12, 10:29 AM

Mon, Feb 11

akosiaris created P8067 (An Untitled Masterwork).
Mon, Feb 11, 5:11 PM
akosiaris added a comment to T213475: Wikimedia varnish rules no longer exempt all Cloud VPS/Toolforge IPs from rate limits (HTTP 429 response).

Question, when will this patch go live?

Mon, Feb 11, 1:37 PM · Patch-For-Review, Toolforge, Operations, Traffic, Cloud-VPS
akosiaris updated the task description for T170108: Operations Q1 goal: Streamlined Service Delivery.
Mon, Feb 11, 12:18 PM · Kubernetes, Operations, Goal
akosiaris closed T170108: Operations Q1 goal: Streamlined Service Delivery as Resolved.

Seems like we forgot to close this one

Mon, Feb 11, 12:18 PM · Kubernetes, Operations, Goal
akosiaris closed T170108: Operations Q1 goal: Streamlined Service Delivery, a subtask of T170453: FY2017/18 Program 6: Streamlined Service delivery, as Resolved.
Mon, Feb 11, 12:18 PM · Core Platform Team Backlog (Watching / External), Category, Epic, Release-Engineering-Team, Services (watching), Operations

Fri, Feb 8

akosiaris committed rDEPLOYCHARTScc81ba85d8c4: scaffolding: Fix deployment indentation (authored by akosiaris).
scaffolding: Fix deployment indentation
Fri, Feb 8, 2:08 PM

Thu, Feb 7

akosiaris added a comment to T213566: Transferring data from Hadoop to production MySQL database.

How is the data going to make it from Hadoop, which resides in the analytics cluster and is firewalled at the router level (aka network ACLs) to whichever machine is chosen for this? Has this been already worked out (cause I see no mention of this)?

@akosiaris: T213976

It hasn't been totally worked out, but a solution is rsync pull. The network firewalls don't allow analytics VLAN to initiate connections, but they will accept incoming ones.

Thu, Feb 7, 2:41 PM · serviceops, Core Platform Team Backlog (Watching / External), Services (watching), User-Marostegui, Operations, Article-Recommendation, Analytics, Research
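
The direction-based behaviour described in the reply above (analytics hosts cannot initiate connections, but can answer incoming ones) can be sketched as a filter on TCP flags. This is a simplified model under stated assumptions: the Packet type and the allowed function are hypothetical, and the actual router ACLs are not shown anywhere in this thread.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Packet:
    from_analytics: bool  # True if the sender is in the analytics VLAN
    syn: bool = False
    ack: bool = False


def allowed(pkt):
    """Drop connection attempts that originate in the analytics VLAN."""
    opens_connection = pkt.syn and not pkt.ack
    return not (pkt.from_analytics and opens_connection)


if __name__ == "__main__":
    # rsync pull: a production host connects to an analytics host.
    handshake = [
        Packet(from_analytics=False, syn=True),           # SYN
        Packet(from_analytics=True, syn=True, ack=True),  # SYN/ACK reply
        Packet(from_analytics=False, ack=True),           # ACK
    ]
    print(all(allowed(p) for p in handshake))
    # A push from analytics would start with a bare SYN and be dropped.
    print(allowed(Packet(from_analytics=True, syn=True)))
```

Under this model the SYN/ACK sent back by the analytics host passes, because only a bare SYN from that side counts as initiating a connection. That is why a pull from the production side can complete the handshake while a push from analytics cannot.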
akosiaris added a comment to T213566: Transferring data from Hadoop to production MySQL database.

How is the data going to make it from Hadoop, which resides in the analytics cluster and is firewalled at the router level (aka network ACLs) to whichever machine is chosen for this? Has this been already worked out (cause I see no mention of this)?

Thu, Feb 7, 2:25 PM · serviceops, Core Platform Team Backlog (Watching / External), Services (watching), User-Marostegui, Operations, Article-Recommendation, Analytics, Research
akosiaris closed T212955: Increase mwdebugXXXX hosts CPU as Resolved.

I've removed the memory part cause https://grafana.wikimedia.org/d/000000377/host-overview?refresh=5m&orgId=1&var-server=mwdebug1001&var-datasource=eqiad%20prometheus%2Fops&var-cluster=appserver&from=now-7d&to=now shows that mwdebug1002 is never pressed for more memory. I've also bumped the vCPU count to 4. I'll resolve this for now; if we need more resources, feel free to reopen.

Thu, Feb 7, 9:49 AM · vm-requests, Release-Engineering-Team (Watching / External), serviceops, Operations
akosiaris closed T212955: Increase mwdebugXXXX hosts CPU, a subtask of T203625: mwdebug1001 and mwdebug1002 are reliably the last two hosts to finish scap-cdb-rebuild , as Resolved.
Thu, Feb 7, 9:48 AM · Release-Engineering-Team (Watching / External), Scap, Operations
akosiaris closed T212955: Increase mwdebugXXXX hosts CPU, a subtask of T203664: scap timeout checking index.php/api.php mwdebug1001 / mwdebug1002, as Resolved.
Thu, Feb 7, 9:48 AM · Release-Engineering-Team (Watching / External), Scap
akosiaris renamed T212955: Increase mwdebugXXXX hosts CPU from Increase mwdebugXXXX hosts CPU and memory(?) to Increase mwdebugXXXX hosts CPU.
Thu, Feb 7, 9:46 AM · vm-requests, Release-Engineering-Team (Watching / External), serviceops, Operations
akosiaris added a comment to T213371: Document and possibly fine-tune how Proton interacts with Varnish.

@Tgr I assume you're still waiting for answers from @ema? Is there anything I can help you with?

Thu, Feb 7, 9:24 AM · Readers-Web-Backlog (Tracking), Services (watching), serviceops, Traffic, Reading-Infrastructure-Team-Backlog, Operations, Proton
akosiaris added a comment to T213475: Wikimedia varnish rules no longer exempt all Cloud VPS/Toolforge IPs from rate limits (HTTP 429 response).

I've added the capacity to varnish puppet code to augment the wikimedia_trust and wikimedia_nets constructs, followed by a patch adding the new WMCS IP space to wikimedia_nets in order to exempt that IP space from rate limiting. @BBlack lemme know what you think.

Thu, Feb 7, 7:29 AM · Patch-For-Review, Toolforge, Operations, Traffic, Cloud-VPS
akosiaris committed rDEPLOYCHARTS98a5010d86f7: mathoid: Remove mwapi_req/restbase_req (authored by akosiaris).
mathoid: Remove mwapi_req/restbase_req
Thu, Feb 7, 6:30 AM

Wed, Feb 6

akosiaris committed rDEPLOYCHARTS77bc000a2c57: mathoid: Update prometheus-stats.conf (authored by akosiaris).
mathoid: Update prometheus-stats.conf
Wed, Feb 6, 12:57 PM
akosiaris added a comment to T205911: Track and install additional npm packages for all service container images.

Assuming we go for option (1), how would we go about installing these packages? And how would we instruct the app to load them?
It seems like using NODE_PATH is discouraged these days[1] and would anyway require changes to blubber to set NODE_PATH. We used to have that variable set and have moved away from it in https://gerrit.wikimedia.org/r/#/c/blubber/+/460997/

Wed, Feb 6, 12:19 PM · Release-Engineering-Team (Watching / External), Core Platform Team Backlog (Watching / External), Services (watching), Operations, Release Pipeline
akosiaris closed T212251: Expose blubberoid to the public allowing CI in WMCS to be able to reach out as well to it as Resolved.
curl -s https://blubberoid.wikimedia.org/?spec |head -5
---
openapi: '3.0.0'
info:
  title: Blubberoid
  description: >
Wed, Feb 6, 10:11 AM · Patch-For-Review, serviceops, Release-Engineering-Team (Kanban), Release Pipeline
akosiaris closed T212251: Expose blubberoid to the public allowing CI in WMCS to be able to reach out as well to it, a subtask of T212247: Refactor integration/pipelinelib to use blubberoid.wikimedia.org, as Resolved.
Wed, Feb 6, 10:11 AM · Patch-For-Review, Release-Engineering-Team (Kanban), Release Pipeline
akosiaris lowered the priority of T215389: esams cache layer mangles downloads of specific url from High to Low.

The restart of varnish-frontend on cp3030 indeed resolved the issue. I'll lower priority but leave task open. Feel free to resolve however.

Wed, Feb 6, 10:08 AM · Operations, Traffic
akosiaris raised the priority of T215389: esams cache layer mangles downloads of specific url from Normal to High.
Wed, Feb 6, 10:00 AM · Operations, Traffic
akosiaris added a comment to T215389: esams cache layer mangles downloads of specific url.

cp3030 seems to be in some trouble since approximately 04:30 [1]

Wed, Feb 6, 9:59 AM · Operations, Traffic
akosiaris updated the task description for T215389: esams cache layer mangles downloads of specific url.
Wed, Feb 6, 9:49 AM · Operations, Traffic
akosiaris triaged T215389: esams cache layer mangles downloads of specific url as Normal priority.
Wed, Feb 6, 9:43 AM · Operations, Traffic
akosiaris created T215389: esams cache layer mangles downloads of specific url.
Wed, Feb 6, 9:42 AM · Operations, Traffic
akosiaris added a comment to T213116: Investigate: why are we getting overload errors during ORES deployments?.

@akosiaris I'd like your input here. We're running into a lot of "After moving to kubernetes, none of this would matter" for some maintenance tasks. For a problem like this one, do you think it is worthwhile to invest time into our current deployment process, or is it reasonable to wait for k8s?

Wed, Feb 6, 8:10 AM · Scap, ORES, Scoring-platform-team
akosiaris closed T214604: OTRS receiving flood of emails as Resolved.

Graphs in codfw mail[1] and eqiad mail[2] point out that this behavior has not reemerged since Jan 25, so I'll tentatively close this as resolved. Feel free to reopen.

Wed, Feb 6, 7:34 AM · Mail, Operations, OTRS

Tue, Feb 5

akosiaris committed rDEPLOYCHARTSc5bf59156e34: mathoid: Update prometheus-stats.conf (authored by akosiaris).
mathoid: Update prometheus-stats.conf
Tue, Feb 5, 2:43 PM

Wed, Jan 30

akosiaris committed rDEPLOYCHARTS479ec6b243ac: mathoid: Update prometheus-stats.conf (authored by akosiaris).
mathoid: Update prometheus-stats.conf
Wed, Jan 30, 7:59 PM
akosiaris updated the task description for T214975: proton experienced a period of high CPU usage, busy queue, lockups.
Wed, Jan 30, 5:46 PM · Proton, Operations
akosiaris triaged T214975: proton experienced a period of high CPU usage, busy queue, lockups as High priority.
Wed, Jan 30, 5:43 PM · Proton, Operations
akosiaris created T214975: proton experienced a period of high CPU usage, busy queue, lockups.
Wed, Jan 30, 5:43 PM · Proton, Operations

Fri, Jan 25

akosiaris added a comment to T214604: OTRS receiving flood of emails.

Cleaned up some 10k emails from 2 more hosts with the same pattern as yesterday and blocked them as well.

Fri, Jan 25, 3:06 PM · Mail, Operations, OTRS
akosiaris added a comment to T178690: Better organization for SRE grafana dashboards.

@akosiaris we had some chat about details, I don't mind the USE pattern, but a poor graph using USE doesn't mean it is good, if the chosen metrics are poor, like the above example. Note also they were probably in a worse state before my comments 0:-)

Fri, Jan 25, 3:06 PM · User-CDanis, Patch-For-Review, User-fgiunchedi, monitoring, Operations
akosiaris triaged T178690: Better organization for SRE grafana dashboards as Low priority.
Fri, Jan 25, 2:54 PM · User-CDanis, Patch-For-Review, User-fgiunchedi, monitoring, Operations
akosiaris added a comment to T178690: Better organization for SRE grafana dashboards.

Jaime, going to have to guess here; are you referring to "Prometheus machine stats" (marked for deletion) vs "Host overview"?

Yes.

Fri, Jan 25, 2:54 PM · User-CDanis, Patch-For-Review, User-fgiunchedi, monitoring, Operations

Thu, Jan 24

akosiaris committed rDEPLOYCHARTSce294a9a4f4a: mathoid: Update prometheus-stats.conf (authored by akosiaris).
mathoid: Update prometheus-stats.conf
Thu, Jan 24, 11:02 PM
akosiaris committed rDEPLOYCHARTS93105a24bc71: mathoid: Update prometheus-stats.conf (authored by akosiaris).
mathoid: Update prometheus-stats.conf
Thu, Jan 24, 11:02 PM
akosiaris lowered the priority of T214604: OTRS receiving flood of emails from Unbreak Now! to Low.

info-en-c seems to be down to 167 messages now and the hosts participating in the storm remain blocked. I'll lower priority for now.

Thu, Jan 24, 7:38 PM · Mail, Operations, OTRS
akosiaris added a comment to T214604: OTRS receiving flood of emails.

The email storm can be witnessed at https://grafana.wikimedia.org/d/000000451/mail?orgId=1&from=1548346803405&to=1548357438520&var-datasource=codfw%20prometheus%2Fops (this is the secondary DC) and https://grafana.wikimedia.org/d/000000451/mail?orgId=1&from=1548346740714&to=1548358041101&var-datasource=eqiad%20prometheus%2Fops (for the primary DC). The 2 distinct phases are there because of me freezing a ton of emails which later got thawed and eventually delivered.

Thu, Jan 24, 7:29 PM · Mail, Operations, OTRS
akosiaris committed rDEPLOYCHARTS885c0d8628c6: add statsd_exporter config to mathoid (authored by colewhite).
add statsd_exporter config to mathoid
Thu, Jan 24, 3:50 PM
akosiaris committed rDEPLOYCHARTS02981c753a61: scaffolding: Remove the logging container (authored by akosiaris).
scaffolding: Remove the logging container
Thu, Jan 24, 3:43 PM
akosiaris closed T182914: Add a graph of ORES Celery task queue length as Resolved.

Right. Only when we're approaching overload does the celery key contain entries. All workers need to be busy before we see anything.

Thu, Jan 24, 3:38 PM · Patch-For-Review, Scoring-platform-team, Graphite, ORES
akosiaris closed T212957: Adminship of MediaWiki-India Mailing List as Resolved.

Yes indeed. I've switched the ownership of the list to the email provided and issued a new password for it. It should arrive with an email to the new admin email. Let me know if anything goes wrong. I am closing this as resolved. Thanks!

Thu, Jan 24, 3:24 PM · Operations, Wikimedia-Mailing-lists
akosiaris committed rDEPLOYCHARTS0db40b33aaec: add statsd_exporter config to mathoid (authored by colewhite).
add statsd_exporter config to mathoid
Thu, Jan 24, 2:46 PM
akosiaris added a comment to T182914: Add a graph of ORES Celery task queue length.

Change has been merged and deployed but up to now no data has been exported. The celery key currently looks empty so I guess this is expected?

Thu, Jan 24, 1:05 PM · Patch-For-Review, Scoring-platform-team, Graphite, ORES
akosiaris committed rDEPLOYCHARTS94eaa67de774: add statsd_exporter config to mathoid (authored by colewhite).
add statsd_exporter config to mathoid
Thu, Jan 24, 9:52 AM

Wed, Jan 23

akosiaris committed rDEPLOYCHARTSa489245f71ba: Fix mathoid's prometheus-statsd.conf inclusion (authored by akosiaris).
Fix mathoid's prometheus-statsd.conf inclusion
Wed, Jan 23, 5:58 PM
akosiaris added a comment to T214459: Connection problem (Moscow ISP, 4G) with Beeline / Sovintel.

Adding that, from our esams DC, a traceroute to this IP seems to stop before beelive.ru

Wed, Jan 23, 11:51 AM · Traffic, Operations, netops

Tue, Jan 22

akosiaris committed rDEPLOYCHARTS877f14eff1e8: mathoid: Move config.yaml into a template (authored by akosiaris).
mathoid: Move config.yaml into a template
Tue, Jan 22, 8:19 PM
akosiaris committed rDEPLOYCHARTS59c608c3f387: Add an stdout log stanza to config (authored by akosiaris).
Add an stdout log stanza to config
Tue, Jan 22, 8:19 PM
akosiaris committed rDEPLOYCHARTScac9f5bcc354: zotero: Remove chartid from service as well (authored by akosiaris).
zotero: Remove chartid from service as well
Tue, Jan 22, 3:23 PM
akosiaris renamed T212251: Expose blubberoid to the public allowing CI in WMCS to be able to reach out as well to it from Allow access to blubberoid.discovery.wmnet:8748 to Expose blubberoid to the public allowing CI in WMCS to be able to reach out as well to it.
Tue, Jan 22, 2:06 PM · Patch-For-Review, serviceops, Release-Engineering-Team (Kanban), Release Pipeline
akosiaris added a comment to T182914: Add a graph of ORES Celery task queue length.

We can probably get away with reusing https://github.com/oliver006/redis_exporter, which we already use. It does have a check-keys parameter that allows us to count a list's elements. As an implementation it's a bit slower than in the next version, as it uses scan, but in my tests it did return within ~1s. I get the following in prometheus as an example

Tue, Jan 22, 1:40 PM · Patch-For-Review, Scoring-platform-team, Graphite, ORES
akosiaris committed rDEPLOYCHARTS64af8013fa5a: Remove chartid from deployments/services (authored by akosiaris).
Remove chartid from deployments/services
Tue, Jan 22, 1:28 PM
akosiaris added a reverting change for rDEPLOYCHARTS49b53bd02289: Add chart to pod labels: rDEPLOYCHARTS64af8013fa5a: Remove chartid from deployments/services.
Tue, Jan 22, 1:28 PM
akosiaris committed rDEPLOYCHARTSd23bc394adab: blubberoid/zotero: Remove the logging sidecar container (authored by akosiaris).
blubberoid/zotero: Remove the logging sidecar container
Tue, Jan 22, 12:55 PM
akosiaris committed rDEPLOYCHARTSfdecff321cc2: mathoid: Remove the logging sidecar container (authored by akosiaris).
mathoid: Remove the logging sidecar container
Tue, Jan 22, 12:55 PM
akosiaris committed rDEPLOYCHARTSbb5e09ff150a: Remove externalIP settings (authored by akosiaris).
Remove externalIP settings
Tue, Jan 22, 12:22 PM

Mon, Jan 21

akosiaris added a comment to T213475: Wikimedia varnish rules no longer exempt all Cloud VPS/Toolforge IPs from rate limits (HTTP 429 response).

I'm not sure I fully understand the technical explanation. Is the problem confirmed? If "yes", what is the plan to solve it?

Mon, Jan 21, 4:21 PM · Patch-For-Review, Toolforge, Operations, Traffic, Cloud-VPS

Jan 19 2019

akosiaris added a comment to T119043: Graph/Graphoid/Kartographer - data storage architecture.

Most of the time, Vega is used via a template, because otherwise you have a massive copy/paste of code without any benefit, while having no way to fix issues or improve the appearance of all graphs en masse. Thus, per what @Anomie said - MCR is orthogonal (in its current form) to the generated content. This actually has more similarities with the image thumb service than MCR (content is generated from "master" - wiki markup, and cached for usage by both the rendering service like Graphoid and directly from the client via the dynamic graph loading).

This does, however, contradict requirement "6. BonusB: When user looks at an older revision of an article, they should see the graphs for that revision." given above. Just noting it, effectively reiterating what I think Tim has phrased better in his comment at T119043#1868557

@akosiaris why is it a contradiction? The bonus B is similar to being able to see an older version of an article with every type of dependent resource, not just graph, i.e. older images, templates, Lua modules, and even data tables from Commons. On the other hand, the current (master) version should auto-refresh when dependencies are updated.

Jan 19 2019, 2:57 PM · Core Platform Team Backlog (Watching / External), Reading-Infrastructure-Team-Backlog, Services (watching), Maps (Kartographer), TechCom-RFC, Graphoid, Graphs, Service-Architecture, RESTBase-architecture

Jan 18 2019

akosiaris closed T214177: Upgrade to OTRS version 5.0.34 as Resolved.

Upgrade done.

Jan 18 2019, 4:22 PM · Security, OTRS, Operations
akosiaris created T214177: Upgrade to OTRS version 5.0.34.
Jan 18 2019, 4:22 PM · Security, OTRS, Operations
akosiaris moved T209691: Upgrade to OTRS version 5.0.32 from Incoming to Resolved on the OTRS board.
Jan 18 2019, 4:20 PM · Operations, OTRS
akosiaris moved T205540: Upgrade to OTRS version 5.0.30 from Incoming to Resolved on the OTRS board.
Jan 18 2019, 4:20 PM · OTRS, Operations
akosiaris moved T209184: Upgrade to OTRS version 5.0.31 from Incoming to Resolved on the OTRS board.
Jan 18 2019, 4:20 PM · Operations, OTRS

Jan 17 2019

akosiaris added a comment to T119043: Graph/Graphoid/Kartographer - data storage architecture.

The Graph extension could potentially use an MCR slot to store the Vega JSON rather than embedding it in the wikitext inside a <graph> tag. But that wouldn't support the existing uses where templates and modules are being used to generate the Vega JSON.

Jan 17 2019, 10:00 AM · Core Platform Team Backlog (Watching / External), Reading-Infrastructure-Team-Backlog, Services (watching), Maps (Kartographer), TechCom-RFC, Graphoid, Graphs, Service-Architecture, RESTBase-architecture

Jan 16 2019

akosiaris added a comment to T211881: graphoid: Code stewardship request.

@akosiaris, the logic in <An unorthodox architecture of the API of the service> is fundamentally flawed. The client only knows about the graph's hash because MediaWiki parser knew the exact graph data, calculated a hash, and stored that data under that hash in a key-value store (page_props), and included that hash in the HTML. Also, this structure is identical to the way maps function -- map data is calculated by the parser, stored in the key-value page_props with the hash as the key, and kartotherian service does exactly the same steps as graphoid - pulls that data out of page_props to render a static image (so that Leaflet libs are not downloaded until the user interacts with the map). In short - the ONLY component that knows what user wants to draw is MediaWiki parser.

[...]
By the way, page_props.pp_value field is a blob and hence bound to 65k. That does not sound like enough space to store the representation. In fact from what I see at T184128 this has already happened?

Considering that what do you think about giving a little push on the following discussion? T119043: Graph/Graphoid/Kartographer - data storage architecture

Jan 16 2019, 6:15 PM · Core Platform Team Backlog (Watching / External), Services (watching), Release-Engineering-Team (Kanban), Operations, Code-Stewardship-Reviews, Graphoid
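
The size concern raised in the comment above can be illustrated with a quick check against the MySQL BLOB limit of 65,535 bytes. This is only a sketch: it assumes the graph specification is stored as compact JSON, and the function name and sample specs are hypothetical. How MediaWiki actually serializes the value is not stated here.

```python
import json

BLOB_MAX_BYTES = 65_535  # maximum length of a MySQL BLOB column (2**16 - 1)


def fits_in_blob(spec):
    """Return True if the JSON-serialized spec fits in a BLOB column."""
    encoded = json.dumps(spec, separators=(",", ":")).encode("utf-8")
    return len(encoded) <= BLOB_MAX_BYTES


if __name__ == "__main__":
    small = {"width": 400, "height": 200, "data": [{"x": 1, "y": 2}]}
    large = {"data": [{"x": i, "y": i * i} for i in range(10_000)]}
    print(fits_in_blob(small), fits_in_blob(large))
```

A specification with a few thousand inline data points already exceeds the limit in this model, which is the kind of overflow the comment suspects.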
akosiaris changed the status of T119043: Graph/Graphoid/Kartographer - data storage architecture from Open to Stalled.

Does this still need to be open? Currently Graphs and Maps development is on hold and we may rethink some of how this is done later, but it's not on the immediate agenda.

Jan 16 2019, 6:11 PM · Core Platform Team Backlog (Watching / External), Reading-Infrastructure-Team-Backlog, Services (watching), Maps (Kartographer), TechCom-RFC, Graphoid, Graphs, Service-Architecture, RESTBase-architecture
akosiaris changed the status of T119043: Graph/Graphoid/Kartographer - data storage architecture, a subtask of T147907: Invalid ExternalData is returned from API, from Open to Stalled.
Jan 16 2019, 6:11 PM · Maps (Kartotherian), Discovery, Maps
akosiaris changed the status of T119043: Graph/Graphoid/Kartographer - data storage architecture, a subtask of T151665: Investigate how <mapframe/link> work with the Flagged Revisions extension, from Open to Stalled.
Jan 16 2019, 6:11 PM · Collaboration-Feature-Rollouts (Collaboration-Maps), Collaboration-Team-Triage (Collab-Team-This-Quarter), TCB-Team, German-Community-Wishlist, Discovery, Maps (Kartographer)
akosiaris added projects to T213934: Set up a beta feature offering the use of PHP7: Core Platform Team, Performance-Team.

Adding performance-team and core platform team per SoS recommendation to request help.

Jan 16 2019, 4:50 PM · MW-1.33-notes (1.33.0-wmf.16; 2019-02-05), Performance-Team (Radar), Core Platform Team (PHP7 (TEC4)), Core Platform Team Backlog (Watching / External), Patch-For-Review, User-Joe, serviceops, Operations
akosiaris committed rDEPLOYCHARTS17270f5b6205: Remove externalIP settings (authored by akosiaris).
Remove externalIP settings
Jan 16 2019, 4:42 PM