onimisionipe@cloudelastic1002:~$ sudo smartctl -H /dev/sdb
smartctl 6.6 2016-05-31 r4324 [x86_64-linux-4.9.0-9-amd64] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

Dec 5 2019, 10:32 PM · ops-eqiad, DC-Ops, SRE, cloud-services-team (Kanban)

Dec 4 2019

• Mathew.onipe added a comment to T239728: Re-import OSM data at eqiad and codfw to temporarily fix current OSM replication issues..

Dec 4 2019, 12:19 PM · Discovery-Search (Current work), SRE, Maps

Dec 3 2019

• Mathew.onipe created T239728: Re-import OSM data at eqiad and codfw to temporarily fix current OSM replication issues..

Dec 3 2019, 4:46 PM · Discovery-Search (Current work), SRE, Maps

Nov 28 2019

• Mathew.onipe updated subscribers of T239389: Grant merge permission to discovery-team members on https://gerrit.wikimedia.org/r/admin/projects/wikidata/query/deploy.

Nov 28 2019, 12:36 PM · Release-Engineering-Team, Gerrit-Privilege-Requests, Discovery-Search (Current work)

• Mathew.onipe triaged T239389: Grant merge permission to discovery-team members on https://gerrit.wikimedia.org/r/admin/projects/wikidata/query/deploy as Medium priority.

Nov 28 2019, 8:59 AM · Release-Engineering-Team, Gerrit-Privilege-Requests, Discovery-Search (Current work)

• Mathew.onipe created T239389: Grant merge permission to discovery-team members on https://gerrit.wikimedia.org/r/admin/projects/wikidata/query/deploy.

Nov 28 2019, 8:59 AM · Release-Engineering-Team, Gerrit-Privilege-Requests, Discovery-Search (Current work)

Nov 25 2019

• Mathew.onipe added a comment to T238822: XSS in Wikidata Query Service UI (i18n messages) - CVE-2019-19327.

I have deployed Lucas's patch. Kindly test this to confirm everything is fine.

Nov 25 2019, 9:31 AM · Vuln-XSS, Security, User-Addshore, Security-Team, Wikidata-Campsite (Wikidata-Campsite-Iteration-∞ (On Hold)), Wikidata, Wikidata Query UI

Nov 20 2019

• Mathew.onipe created T238733: Push rights on https://gerrit.wikimedia.org/r/admin/projects/wikidata/query/blazegraph for onimisionipe.

Nov 20 2019, 11:02 AM · Gerrit-Privilege-Requests, Wikidata, Release-Engineering-Team (Unit & Int & System Tooling), SRE, Wikidata-Query-Service

Nov 18 2019

• Mathew.onipe moved T238408: Metrics from the wdqs updater are no longer collected from Incoming to Needs Reporting on the Discovery-Search (Current work) board.

Nov 18 2019, 5:04 PM · Discovery-Search (Current work), Wikidata-Query-Service, Wikidata

Nov 15 2019

• Mathew.onipe moved T232297: Refactor Puppet WDQS module to make it usable for wdqs and cqs from Incoming to Needs Reporting on the Discovery-Search (Current work) board.

Nov 15 2019, 11:55 AM · Structured Data Engineering, Structured-Data-Backlog, SRE, Discovery-Search (Current work), SDC General, Wikidata

Nov 13 2019

• Mathew.onipe added a comment to T238232: blazegraph journal on wdqs1005 is oversized.

@Igorkim78 file has been uploaded.

Nov 13 2019, 5:28 PM · Wikidata-Query-Service, Discovery-Search (Current work), Wikidata

• Mathew.onipe added a comment to T238232: blazegraph journal on wdqs1005 is oversized.

wdqs1006-dumpjournal (44 KB)

Nov 13 2019, 5:26 PM · Wikidata-Query-Service, Discovery-Search (Current work), Wikidata

Nov 6 2019

Physikerwelt awarded T233213: XSS in Wikidata Query Service UI, DATATYPE_MATHML - CVE-2019-19329 a Dislike token.

Nov 6 2019, 9:13 AM · Security, Discovery-Search (Current work), User-Addshore, Wikidata-Campsite (Wikidata-Campsite-Iteration-∞ (On Hold)), Vuln-XSS, Wikidata, Wikidata Query UI

• Mathew.onipe added a comment to T233213: XSS in Wikidata Query Service UI, DATATYPE_MATHML - CVE-2019-19329.

Sorry, all. I just did another deployment and the patch has now been applied correctly. The test also works fine now.

Nov 6 2019, 9:12 AM · Security, Discovery-Search (Current work), User-Addshore, Wikidata-Campsite (Wikidata-Campsite-Iteration-∞ (On Hold)), Vuln-XSS, Wikidata, Wikidata Query UI

• Mathew.onipe added a comment to T233213: XSS in Wikidata Query Service UI, DATATYPE_MATHML - CVE-2019-19329.

I've deployed @Lucas_Werkmeister_WMDE's patch, but I can still run:

Nov 6 2019, 7:55 AM · Security, Discovery-Search (Current work), User-Addshore, Wikidata-Campsite (Wikidata-Campsite-Iteration-∞ (On Hold)), Vuln-XSS, Wikidata, Wikidata Query UI

Nov 5 2019

• Mathew.onipe added a comment to T231446: Reindex commonswiki as shards have grown beyond critical threshold.

T230746 is almost here, so we won't be resharding any index for now.

Nov 5 2019, 4:58 PM · Discovery-Search, SRE, Elasticsearch

Nov 4 2019

• Mathew.onipe added a comment to T237209: Alert SRE if Wikipedia Maps replication sync lag exceeds 1 day.

See https://phabricator.wikimedia.org/T237228 for the current OSM replication issues.

Nov 4 2019, 9:57 AM · SRE, Maps

• Mathew.onipe triaged T237228: OSM Replication failed at eqiad and codfw as High priority.

Nov 4 2019, 8:52 AM · SRE, Maps

• Mathew.onipe created T237228: OSM Replication failed at eqiad and codfw.

Nov 4 2019, 8:52 AM · SRE, Maps

• Mathew.onipe closed T237209: Alert SRE if Wikipedia Maps replication sync lag exceeds 1 day as Invalid.

Nov 4 2019, 8:45 AM · SRE, Maps

• Mathew.onipe added a comment to T237209: Alert SRE if Wikipedia Maps replication sync lag exceeds 1 day.

I'm closing this task, as there are already Icinga alerts for OSM sync.

Nov 4 2019, 8:44 AM · SRE, Maps

• Mathew.onipe updated subscribers of T237209: Alert SRE if Wikipedia Maps replication sync lag exceeds 1 day.

Nov 4 2019, 8:43 AM · SRE, Maps

• Mathew.onipe updated subscribers of T237209: Alert SRE if Wikipedia Maps replication sync lag exceeds 1 day.

Nov 4 2019, 8:42 AM · SRE, Maps

• Mathew.onipe raised the priority of T237209: Alert SRE if Wikipedia Maps replication sync lag exceeds 1 day from Medium to High.

Nov 4 2019, 8:41 AM · SRE, Maps

• Mathew.onipe added a comment to T237209: Alert SRE if Wikipedia Maps replication sync lag exceeds 1 day.

We have an alert for this. https://icinga.wikimedia.org/cgi-bin/icinga/extinfo.cgi?type=2&host=icinga1001&service=Maps+-+OSM+synchronization+lag+-+eqiad

Nov 4 2019, 8:41 AM · SRE, Maps

• Mathew.onipe lowered the priority of T237209: Alert SRE if Wikipedia Maps replication sync lag exceeds 1 day from High to Medium.

Nov 4 2019, 8:40 AM · SRE, Maps

• Mathew.onipe triaged T237209: Alert SRE if Wikipedia Maps replication sync lag exceeds 1 day as High priority.

Nov 4 2019, 8:39 AM · SRE, Maps

Nov 1 2019

• Mathew.onipe triaged T237089: Create CQS puppet configs by applying query_service module as Medium priority.

Nov 1 2019, 8:05 AM · Discovery-Search (Current work), Patch-For-Review, Structured-Data-Backlog, Structured Data Engineering, SRE, SDC General, Wikidata

• Mathew.onipe created T237089: Create CQS puppet configs by applying query_service module.

Nov 1 2019, 8:05 AM · Discovery-Search (Current work), Patch-For-Review, Structured-Data-Backlog, Structured Data Engineering, SRE, SDC General, Wikidata

• Mathew.onipe renamed T232297: Refactor Puppet WDQS module to make it usable for wdqs and cqs from Create puppet configs for SDC query to Refactor Puppet WDQS module to make it usable for wdqs and cqs.

Nov 1 2019, 8:01 AM · Structured Data Engineering, Structured-Data-Backlog, SRE, Discovery-Search (Current work), SDC General, Wikidata

Oct 29 2019

• Mathew.onipe added a comment to T233213: XSS in Wikidata Query Service UI, DATATYPE_MATHML - CVE-2019-19329.

In T233213#5584298, @Lucas_Werkmeister_WMDE wrote:

I’ve updated P9315 after a MathJax developer commented on the MathML incompatibility issue (it seems to be very limited in scope and can be worked around); I think it should be good enough to deploy as a first version that will fix the XSS.

@Mathew.onipe, @Gehel: I think the process to deploy this without making the fix public prematurely would be:

Turn P9315 into a patch against wikidata/query/gui-deploy

On the deployment server (is that deployment.eqiad.wmnet or a different one, btw?), apply that patch in the gui submodule of the wikidata/query/deploy checkout (/src/deployment/wdqs/wdqs?)

Commit that patch to the gui submodule, and then commit the submodule update in the main deploy repository? Not sure if this is necessary

Deploy the current tree (scap deploy?)

Verify that the vulnerability is fixed

Coordinate pre-announcements?

Upload the fix to Gerrit to the wikidata/query/gui repository

Merge it, causing a corresponding change to wikidata/query/gui-deploy to be built

Merge that change

Update gui submodule in wikidata/query/deploy repository, bringing it in line with the deployment server’s version

Announce the fix

Discuss any further changes to math rendering in T214980: Support mathematical formulae in Wikidata Query Service UI on all browsers (public)

Does that sound right to you?

Oct 29 2019, 9:22 AM · Security, Discovery-Search (Current work), User-Addshore, Wikidata-Campsite (Wikidata-Campsite-Iteration-∞ (On Hold)), Vuln-XSS, Wikidata, Wikidata Query UI

Oct 27 2019

• Mathew.onipe updated subscribers of T236601: Degraded RAID on elastic1039.

Oct 27 2019, 6:53 PM · Discovery-Search (Current work), ops-eqiad, SRE

• Mathew.onipe added a comment to T236601: Degraded RAID on elastic1039.

This is causing the mjolnir deploy directory to become unavailable or missing, and it is also causing Puppet to fail.

Oct 27 2019, 6:53 PM · Discovery-Search (Current work), ops-eqiad, SRE

Oct 21 2019

• Mathew.onipe edited projects for T233403: Unassigned shards in eqiad, added: Discovery-Search (Current work); removed Discovery-Search.

Oct 21 2019, 12:58 PM · Discovery-Search (Current work), SRE, Elasticsearch

• Mathew.onipe added a comment to T233403: Unassigned shards in eqiad.

The issue still persists: https://icinga.wikimedia.org/cgi-bin/icinga/extinfo.cgi?type=2&host=search.svc.eqiad.wmnet&service=ElasticSearch+unassigned+shard+check+-+9243

Oct 21 2019, 12:57 PM · Discovery-Search (Current work), SRE, Elasticsearch

Oct 16 2019

• Mathew.onipe added a comment to T235540: StackOverflowError when SPARQL query uses same variable name before and after aggregation.

Oct 16 2019, 12:09 PM · Wikidata, Wikidata-Query-Service

Oct 10 2019

• Mathew.onipe triaged T235159: Enable write access for Mathew.onipe(onimisionipe) and gehel on wikidata gui repo as Medium priority.

Oct 10 2019, 9:37 AM · Discovery-Search (Current work), Wikidata-Query-Service, Wikidata, Gerrit-Privilege-Requests

• Mathew.onipe created T235159: Enable write access for Mathew.onipe(onimisionipe) and gehel on wikidata gui repo.

Oct 10 2019, 9:37 AM · Discovery-Search (Current work), Wikidata-Query-Service, Wikidata, Gerrit-Privilege-Requests

Oct 9 2019

• Mathew.onipe added a comment to T233316: Deployment Pipeline fails with CPS error for Kartotherian.

@dduvall Thanks! I removed the test stage and also forced devdeps to install. We should definitely look at a better way to handle this later, but it's fine as it is.
Currently, the build is passing but not publishing yet. Do we need to enable the CI publish stage for the repo?

Oct 9 2019, 10:31 PM · Release-Engineering-Team-TODO (201910), Maps (Kartotherian), Release Pipeline, Release-Engineering-Team (Pipeline)

• Mathew.onipe added a comment to T233316: Deployment Pipeline fails with CPS error for Kartotherian.

@dduvall Thanks. I will implement this.

Oct 9 2019, 11:24 AM · Release-Engineering-Team-TODO (201910), Maps (Kartotherian), Release Pipeline, Release-Engineering-Team (Pipeline)

Oct 3 2019

• Mathew.onipe added a comment to T233403: Unassigned shards in eqiad.

This issue has come up again. Currently, only enwiki_content_1546970425 is unassigned, and _cluster/allocation/explain reports the error "too many shards [1] allocated to this node for index [enwiki_content_1546970425], index setting [index.routing.allocation.total_shards_per_node=1]".
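As a minimal sketch of how the allocation explanation quoted above can be read programmatically, the snippet below walks a response shaped like the output of `_cluster/allocation/explain` and lists every decider that vetoed allocation. The field names (`node_allocation_decisions`, `deciders`, `decision`, `explanation`) are assumed from the Elasticsearch allocation explain API; the sample payload, the decider name, and the node name `example-node-1` are hypothetical and trimmed for illustration.

```python
from typing import Any, Dict, List


def blocking_deciders(explain: Dict[str, Any]) -> List[Dict[str, str]]:
    """Collect every decider that answered NO, together with its node."""
    blocked = []
    for node in explain.get("node_allocation_decisions", []):
        for decider in node.get("deciders", []):
            if decider.get("decision") == "NO":
                blocked.append({
                    "node": node.get("node_name", "?"),
                    "decider": decider.get("decider", "?"),
                    "explanation": decider.get("explanation", ""),
                })
    return blocked


# Hypothetical, trimmed response; the node name is made up for illustration.
sample_explain = {
    "index": "enwiki_content_1546970425",
    "current_state": "unassigned",
    "node_allocation_decisions": [
        {
            "node_name": "example-node-1",
            "node_decision": "no",
            "deciders": [
                {
                    "decider": "shards_limit",
                    "decision": "NO",
                    "explanation": (
                        "too many shards [1] allocated to this node for index "
                        "[enwiki_content_1546970425], index setting "
                        "[index.routing.allocation.total_shards_per_node=1]"
                    ),
                }
            ],
        }
    ],
}

for entry in blocking_deciders(sample_explain):
    print(f"{entry['node']}: {entry['decider']}: {entry['explanation']}")
```

With `total_shards_per_node=1`, a node that already holds one shard of the index rejects any further shard of that index, which would explain why a shard can stay unassigned even though the cluster has free capacity.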

Oct 3 2019, 1:05 PM · Discovery-Search (Current work), SRE, Elasticsearch

Oct 1 2019

• Mathew.onipe added a comment to T233316: Deployment Pipeline fails with CPS error for Kartotherian.

Post-merge builds seem to fail.
https://gerrit.wikimedia.org/r/c/mediawiki/services/kartotherian/+/539209

Oct 1 2019, 12:31 AM · Release-Engineering-Team-TODO (201910), Maps (Kartotherian), Release Pipeline, Release-Engineering-Team (Pipeline)

Sep 24 2019

• Mathew.onipe added a comment to T225125: Migrate Elasticsearch from deprecated Gelf logstash input to rsyslog Kafka logging pipeline.

We should talk to Elastic to see how we can move this forward.
Currently, we require jackson-databind 2.8.11 and jackson-annotation 2.8.11 for JsonLayout to work when using SyslogAppender. Debian provides version 2.8.6 of these packages. We should use the correct version to make sure everything works as expected.
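The version gap in the comment above (2.8.6 packaged, 2.8.11 required) is easy to misjudge with a plain string comparison. The following is a small sketch, not part of any packaging tool, that compares dotted versions numerically; the helper names are hypothetical.

```python
from typing import Tuple


def parse_version(version: str) -> Tuple[int, ...]:
    """Turn a dotted version such as '2.8.11' into a tuple of integers."""
    return tuple(int(part) for part in version.split("."))


def meets_requirement(installed: str, required: str) -> bool:
    """True if the installed version is at least the required one."""
    return parse_version(installed) >= parse_version(required)


# Numeric comparison: 2.8.6 is older than 2.8.11.
print(meets_requirement("2.8.6", "2.8.11"))  # False

# Naive string comparison gets it wrong because "6" sorts after "1".
print("2.8.6" >= "2.8.11")  # True
```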

Sep 24 2019, 5:49 PM · SRE Observability, observability, Discovery-Search, Elasticsearch, SRE, Wikimedia-Logstash

• Mathew.onipe moved T225125: Migrate Elasticsearch from deprecated Gelf logstash input to rsyslog Kafka logging pipeline from Needs review to Blocked/Waiting on the Discovery-Search (Current work) board.

Sep 24 2019, 5:45 PM · SRE Observability, observability, Discovery-Search, Elasticsearch, SRE, Wikimedia-Logstash

• Mathew.onipe moved T232184: MIgrate WDQS to new logging pipeline from Waiting to Needs Reporting on the Discovery-Search (Current work) board.

Sep 24 2019, 5:40 PM · Discovery-Search (Current work), Wikimedia-Logstash, SRE, observability, Discovery-Analysis (Current work), Product-Analytics, Wikidata, Wikidata-Query-Service

Sep 23 2019

• Mathew.onipe triaged T233578: hw troubleshooting: Memory correctable errors -EDAC- for elastic1029.eqiad.wmnet as Medium priority.

Sep 23 2019, 8:06 AM · SRE, ops-eqiad, DC-Ops

• Mathew.onipe created T233578: hw troubleshooting: Memory correctable errors -EDAC- for elastic1029.eqiad.wmnet.

Sep 23 2019, 8:05 AM · SRE, ops-eqiad, DC-Ops

Sep 20 2019

• Mathew.onipe created T233403: Unassigned shards in eqiad.

Sep 20 2019, 12:51 PM · Discovery-Search (Current work), SRE, Elasticsearch

Sep 18 2019

• Mathew.onipe moved T232184: MIgrate WDQS to new logging pipeline from Incoming to Waiting on the Discovery-Search (Current work) board.

Sep 18 2019, 1:23 PM · Discovery-Search (Current work), Wikimedia-Logstash, SRE, observability, Discovery-Analysis (Current work), Product-Analytics, Wikidata, Wikidata-Query-Service

Sep 16 2019

• Mathew.onipe created T233039: hw troubleshooting: <type of hardware failre> for <fqhn of server>.

Sep 16 2019, 5:53 PM · DC-Ops

• Mathew.onipe closed T201991: Broken memory on elastic1029 as Resolved.

Sep 16 2019, 5:50 PM · SRE, ops-eqiad

• Mathew.onipe reopened T201991: Broken memory on elastic1029 as "Open".

Sep 16 2019, 5:46 PM · SRE, ops-eqiad

Sep 12 2019

• Mathew.onipe added a comment to T176875: Allow access to wdqs.svc.eqiad.wmnet on port 8888.

@Ladsgroup, there's no TLS termination on that port for now. We should have it, and I will work on it in the near future. Please use HTTP for now.

Sep 12 2019, 10:24 AM · Patch-For-Review, Traffic, Wikidata-Query-Service, SRE, WMDE-Analytics-Engineering, User-Addshore, Discovery-ARCHIVED, Wikidata

• Mathew.onipe added a comment to T176875: Allow access to wdqs.svc.eqiad.wmnet on port 8888.

@Addshore @Ladsgroup @WMDE-leszek, can you test that you can reach wdqs.svc.eqiad.wmnet on port 8888? LVS and the other required changes have been merged, so it should work. Thanks.
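One way to run the reachability test requested above is a plain TCP connection attempt. This is a minimal sketch using only the Python standard library; the function name `can_connect` and the default timeout are assumptions, and a successful TCP connect only shows that the port is open, not that the service answers queries correctly.

```python
import socket


def can_connect(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port opens within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers DNS failures, refused connections, and timeouts.
        return False


# Example call, only meaningful from a host that can resolve and route to
# the service address:
# can_connect("wdqs.svc.eqiad.wmnet", 8888)
```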

Sep 12 2019, 8:54 AM · Patch-For-Review, Traffic, Wikidata-Query-Service, SRE, WMDE-Analytics-Engineering, User-Addshore, Discovery-ARCHIVED, Wikidata

Sep 11 2019

• Mathew.onipe updated the task description for T232297: Refactor Puppet WDQS module to make it usable for wdqs and cqs.

Sep 11 2019, 6:45 AM · Structured Data Engineering, Structured-Data-Backlog, SRE, Discovery-Search (Current work), SDC General, Wikidata

Sep 10 2019

• Mathew.onipe moved T225125: Migrate Elasticsearch from deprecated Gelf logstash input to rsyslog Kafka logging pipeline from Incoming to Needs review on the Discovery-Search (Current work) board.

Sep 10 2019, 1:50 PM · SRE Observability, observability, Discovery-Search, Elasticsearch, SRE, Wikimedia-Logstash

• Mathew.onipe added a project to T232184: MIgrate WDQS to new logging pipeline: Discovery-Search (Current work).

Sep 10 2019, 3:12 AM · Discovery-Search (Current work), Wikimedia-Logstash, SRE, observability, Discovery-Analysis (Current work), Product-Analytics, Wikidata, Wikidata-Query-Service

Sep 9 2019

• Mathew.onipe updated the task description for T232297: Refactor Puppet WDQS module to make it usable for wdqs and cqs.

Sep 9 2019, 2:49 PM · Structured Data Engineering, Structured-Data-Backlog, SRE, Discovery-Search (Current work), SDC General, Wikidata

• Mathew.onipe updated the task description for T232297: Refactor Puppet WDQS module to make it usable for wdqs and cqs.

Sep 9 2019, 2:46 PM · Structured Data Engineering, Structured-Data-Backlog, SRE, Discovery-Search (Current work), SDC General, Wikidata

• Mathew.onipe updated the task description for T232297: Refactor Puppet WDQS module to make it usable for wdqs and cqs.

Sep 9 2019, 2:42 PM · Structured Data Engineering, Structured-Data-Backlog, SRE, Discovery-Search (Current work), SDC General, Wikidata

• Mathew.onipe added a project to T232297: Refactor Puppet WDQS module to make it usable for wdqs and cqs: SRE.

Sep 9 2019, 2:36 PM · Structured Data Engineering, Structured-Data-Backlog, SRE, Discovery-Search (Current work), SDC General, Wikidata

• Mathew.onipe updated the task description for T232297: Refactor Puppet WDQS module to make it usable for wdqs and cqs.

Sep 9 2019, 2:21 PM · Structured Data Engineering, Structured-Data-Backlog, SRE, Discovery-Search (Current work), SDC General, Wikidata

• Mathew.onipe triaged T232297: Refactor Puppet WDQS module to make it usable for wdqs and cqs as Medium priority.

Sep 9 2019, 7:42 AM · Structured Data Engineering, Structured-Data-Backlog, SRE, Discovery-Search (Current work), SDC General, Wikidata

• Mathew.onipe created T232297: Refactor Puppet WDQS module to make it usable for wdqs and cqs.

Sep 9 2019, 7:41 AM · Structured Data Engineering, Structured-Data-Backlog, SRE, Discovery-Search (Current work), SDC General, Wikidata

Sep 6 2019

• Mathew.onipe added a comment to T232224: September 2019 DoS attacks [Public].

This is a known issue. The SRE team is working on a quick solution to restore these services. Thanks.

Sep 6 2019, 6:22 PM · Sustainability (Incident Followup), SRE

• Mathew.onipe added a comment to T225125: Migrate Elasticsearch from deprecated Gelf logstash input to rsyslog Kafka logging pipeline.

JsonLayout requires additional dependencies on top of log4j, including jackson-databind. See https://logging.apache.org/log4j/2.x/runtime-dependencies.html.
Two options:

Rebuild log4j with these dependencies.
Fall back to shipping logs with PatternLayout.

Sep 6 2019, 11:19 AM · SRE Observability, observability, Discovery-Search, Elasticsearch, SRE, Wikimedia-Logstash

• Mathew.onipe updated subscribers of T228483: Delete (rather than archive) the maps/kartotherian and maps/tilerator repos.

Let's wait for @MSantos's or @Mholloway's opinion before deleting those repos, please.

Sep 6 2019, 7:40 AM · User-MarcoAurelio, Release-Engineering-Team-TODO, Release-Engineering-Team (Development services), Diffusion-Repository-Administrators, Maps, Projects-Cleanup

• Mathew.onipe triaged T232184: MIgrate WDQS to new logging pipeline as Medium priority.

Sep 6 2019, 7:36 AM · Discovery-Search (Current work), Wikimedia-Logstash, SRE, observability, Discovery-Analysis (Current work), Product-Analytics, Wikidata, Wikidata-Query-Service

• Mathew.onipe created T232184: MIgrate WDQS to new logging pipeline.

Sep 6 2019, 7:36 AM · Discovery-Search (Current work), Wikimedia-Logstash, SRE, observability, Discovery-Analysis (Current work), Product-Analytics, Wikidata, Wikidata-Query-Service

Sep 4 2019

• Mathew.onipe added a comment to T231928: CI service-pipeline-test-and-publish job assumes blubber config has a single production image.

Not sure, but it seems we are missing some configs in our config.yaml patch.

Sep 4 2019, 11:20 AM · Release-Engineering-Team (Radar), Maps, Product-Infrastructure-Team-Backlog-Deprecated

Sep 3 2019

• Mathew.onipe added a comment to T225125: Migrate Elasticsearch from deprecated Gelf logstash input to rsyslog Kafka logging pipeline.

rsyslog's JSON parsing requires the @cee token, which, according to the standard, must be provided when logging via profile::rsyslog::udp_localhost_compat. Let's use profile::rsyslog::udp_json_logback_compat instead, as it permits parsing JSON from log4j without the token.
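To illustrate the difference between the two behaviours described above, here is a rough Python sketch of a parser that either insists on the `@cee:` cookie or also accepts bare JSON. It is not rsyslog's implementation and does not model the Puppet profiles; the function name and the sample log message are hypothetical.

```python
import json
from typing import Any, Dict, Optional

CEE_COOKIE = "@cee:"


def parse_json_message(msg: str, require_cookie: bool) -> Optional[Dict[str, Any]]:
    """Parse the JSON payload of a syslog message body.

    With require_cookie=True the body must start with "@cee:"; otherwise a
    bare JSON object is accepted as well. Returns None when nothing parses.
    """
    body = msg.lstrip()
    if body.startswith(CEE_COOKIE):
        body = body[len(CEE_COOKIE):].lstrip()
    elif require_cookie:
        return None
    try:
        parsed = json.loads(body)
    except json.JSONDecodeError:
        return None
    return parsed if isinstance(parsed, dict) else None


# Hypothetical message as a JSON layout might emit it, without the cookie.
bare = '{"level": "INFO", "message": "started"}'

print(parse_json_message(bare, require_cookie=True))             # None
print(parse_json_message("@cee: " + bare, require_cookie=True))  # parsed dict
print(parse_json_message(bare, require_cookie=False))            # parsed dict
```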

Sep 3 2019, 3:50 PM · SRE Observability, observability, Discovery-Search, Elasticsearch, SRE, Wikimedia-Logstash