Page MenuHomePhabricator

EBernhardson (EBernhardson)
User

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Thursday

  • Clear sailing ahead.

User Details

User Since
Oct 7 2014, 4:49 PM (261 w, 6 d)
Availability
Available
LDAP User
EBernhardson
MediaWiki User
EBernhardson (WMF) [ Global Accounts ]

Recent Activity

Thu, Oct 10

EBernhardson moved T233677: Avoid eager loading of mediawiki.searchSuggest scripts and dependencies from in progress to Needs review on the Discovery-Search (Current work) board.
Thu, Oct 10, 8:35 PM · Discovery-Search (Current work), Patch-For-Review, Performance-Team (Radar), MediaWiki-Search, MediaWiki-Interface
EBernhardson moved T233677: Avoid eager loading of mediawiki.searchSuggest scripts and dependencies from Wikidata Search to Current work on the Discovery-Search board.
Thu, Oct 10, 8:35 PM · Discovery-Search (Current work), Patch-For-Review, Performance-Team (Radar), MediaWiki-Search, MediaWiki-Interface
EBernhardson moved T233677: Avoid eager loading of mediawiki.searchSuggest scripts and dependencies from needs triage to Wikidata Search on the Discovery-Search board.
Thu, Oct 10, 8:35 PM · Discovery-Search (Current work), Patch-For-Review, Performance-Team (Radar), MediaWiki-Search, MediaWiki-Interface
EBernhardson closed T233197: Elastica HTTPS support as Resolved.
Thu, Oct 10, 8:34 PM · Discovery-Search, Elasticsearch
EBernhardson moved T234782: Inconsistency between requested `gsbbox` values and `geosearch` query results from needs triage to Geodata on the Discovery-Search board.
Thu, Oct 10, 8:34 PM · GeoData, Discovery-Search
EBernhardson triaged T234782: Inconsistency between requested `gsbbox` values and `geosearch` query results as Normal priority.

This looks to be a problem with coordinate ordering, geodata needs to ensure when setting the top left and bottom right coordinates of the bounding box that it's actually using the left and right. It looks like geodata is intending for the input to be (lat_1,lon_1), (lat_2,lon_2). but then passes that directly as the bounding box edges without ensuring appropriate ordering.

Thu, Oct 10, 8:34 PM · GeoData, Discovery-Search
EBernhardson moved T235000: API responses are missing coordinates of images from needs triage to Current work on the Discovery-Search board.
Thu, Oct 10, 8:20 PM · Discovery-Search (Current work), GeoData
EBernhardson triaged T235000: API responses are missing coordinates of images as Normal priority.

search document building[1] is still finding the coordinates, so geodata is appropriately parsing the coordinates and injecting them into the parser output. I also verified in the prod db that geo_tags does not contain any rows for matching gt_page_id. This puts the error somewhere in the table updates. I'm not sure whats going on, but seems should be looked into.

Thu, Oct 10, 8:20 PM · Discovery-Search (Current work), GeoData

Wed, Oct 9

EBernhardson moved T234613: The experimental highlighter may break surrogate pairs from Needs review to Done on the Discovery-Search (Current work) board.
Wed, Oct 9, 6:32 PM · Discovery-Search (Current work), CirrusSearch

Tue, Oct 8

EBernhardson added a comment to T234954: 500k files in hdfs /tmp.

Poking through the list suggests this is mostly old stuff, only ~1k files are dated 2019.

Tue, Oct 8, 4:19 PM · Analytics-Kanban, Analytics-Cluster, Analytics
EBernhardson added a project to T234954: 500k files in hdfs /tmp: Analytics-Cluster.
Tue, Oct 8, 4:16 PM · Analytics-Kanban, Analytics-Cluster, Analytics
EBernhardson created T234954: 500k files in hdfs /tmp.
Tue, Oct 8, 4:16 PM · Analytics-Kanban, Analytics-Cluster, Analytics

Thu, Oct 3

EBernhardson added a comment to T230746: (Aug 30th, 2019) rack/setup/install elastic10[53-67].eqiad.wmnet.

The servers today will not be able to utilize 10G, so they could go in 1G racks for the time being. The cluster can't take advantage of 10G until all the nodes are on 10G.

Thu, Oct 3, 5:41 PM · Patch-For-Review, Operations, ops-eqiad

Wed, Oct 2

EBernhardson committed rECIRa650eb76612d: Fix Precondition failed: Must have a resultset set (authored by dcausse).
Fix Precondition failed: Must have a resultset set
Wed, Oct 2, 11:19 PM
EBernhardson committed rECIR9e6b49be7703: Fix Precondition failed: Must have a resultset set (authored by dcausse).
Fix Precondition failed: Must have a resultset set
Wed, Oct 2, 11:18 PM
EBernhardson renamed T234471: superset not showing data after 09/16 for some datasources from superset now showing data after 09/16 for some datasources to superset not showing data after 09/16 for some datasources .
Wed, Oct 2, 8:03 PM · Analytics-Kanban, Analytics
EBernhardson created P9230 superset test_search_satisfaction_hourly datasource export.
Wed, Oct 2, 6:38 PM

Mon, Sep 30

EBernhardson added a comment to T233718: High volume mediawiki analytics events camus import is lagging.

Data looks to have backfilled appropriately, thanks!

Mon, Sep 30, 5:26 PM · Patch-For-Review, Analytics-Kanban, Analytics
EBernhardson added a comment to T231861: Check home leftovers of smalyshev.

I don't see anything in here that we would be losing, this is safe to delete.

Mon, Sep 30, 4:00 PM · Analytics-Kanban, Analytics

Wed, Sep 25

EBernhardson added a comment to T233718: High volume mediawiki analytics events camus import is lagging.

Doesn't look like this is catching up. New data is arriving again from the new partitions, but the previous data does not appear to be backfilling.

Wed, Sep 25, 7:34 PM · Patch-For-Review, Analytics-Kanban, Analytics

Tue, Sep 24

EBernhardson created T233731: Recreate mjolnir.msearch-prod-request topic in kafka-jumbo.
Tue, Sep 24, 3:20 PM · Discovery-Search (Current work), Analytics

Thu, Sep 19

EBernhardson removed a project from T230862: Create a way to filter only WB-related changes from Commons recentchanges: Patch-For-Review.
Thu, Sep 19, 10:32 PM · Patch-For-Review, Structured Data Engineering, Structured-Data-Backlog, MediaWiki-API, Wikidata-Query-Service, SDC General, Commons, Wikidata
EBernhardson added a comment to T230862: Create a way to filter only WB-related changes from Commons recentchanges.

Indeed I've completely mixed the two, sorry for confusion!

Thu, Sep 19, 10:31 PM · Patch-For-Review, Structured Data Engineering, Structured-Data-Backlog, MediaWiki-API, Wikidata-Query-Service, SDC General, Commons, Wikidata
EBernhardson added a comment to T233197: Elastica HTTPS support.

To use https add 'transport' => 'Https' to your configuration array. Should do the trick.

Thu, Sep 19, 10:16 PM · Discovery-Search, Elasticsearch
EBernhardson added a comment to T230862: Create a way to filter only WB-related changes from Commons recentchanges.

I started writing a patch for this, but got stuck trying to get mw vagrant back into working order. In this patch MediaInfo essentially always provides its fields to the NS_FILE namespace, but when no mediainfo is present it provides appropriate empty values. I'm not clear on what tagging the revision would do, I assume that must trigger some other process? I've uploaded the patch as is, but I've been unable to test this.

Thu, Sep 19, 4:51 PM · Patch-For-Review, Structured Data Engineering, Structured-Data-Backlog, MediaWiki-API, Wikidata-Query-Service, SDC General, Commons, Wikidata

Mon, Sep 16

EBernhardson added a comment to T232495: selenium-daily-beta-CirrusSearch is broken.

cindy was recently upgraded (patch merged today). It still runs mwv, but has a hacked up node10+npm install. This should be unblocked now.

Mon, Sep 16, 8:11 PM · Discovery-Search (Current work), Release-Engineering-Team-TODO (201909), Release-Engineering-Team (CI & Testing services), User-zeljkofilipin, Continuous-Integration-Infrastructure, CirrusSearch
EBernhardson removed projects from T229033: wdio-cucumber-framework fails on NodeJS due to fibers@2.x under nodejs10: CirrusSearch, Discovery-Search.

CirrusSearch tests upgraded in https://gerrit.wikimedia.org/r/#/c/mediawiki/extensions/CirrusSearch/+/536653/

Mon, Sep 16, 8:10 PM · MW-1.34-notes (1.34.0-wmf.23; 2019-09-17), Release-Engineering-Team-TODO, Patch-For-Review
EBernhardson committed rECIR63dc3105a0f2: Upgrade cucumber to node10 compatible version (authored by EBernhardson).
Upgrade cucumber to node10 compatible version
Mon, Sep 16, 8:35 AM

Sep 13 2019

EBernhardson added a comment to T204737: Verify what Python 2 packages deployed to Analytics hosts are needed.

Everything in search should be running on 3, sadly that migration only happened in the last year. But it happened!

Sep 13 2019, 6:16 PM · Analytics-Kanban, Analytics

Sep 12 2019

EBernhardson moved T231980: Discernatron at https://discernatron.wmflabs.org/ not reachable from in progress to Done on the Discovery-Search (Current work) board.
Sep 12 2019, 8:18 PM · Discovery-Search (Current work)
EBernhardson moved T224425: MW Job consumers sometimes pause for several minutes from Waiting to Blocked on the Discovery-Search (Current work) board.
Sep 12 2019, 8:17 PM · CPT Initiatives (Modern Event Platform (TEC2)), WMF-JobQueue, Discovery-Search (Current work)
EBernhardson moved T197129: Increase sampling rates for search metrics on smaller language wikis from Waiting to Blocked on the Discovery-Search (Current work) board.
Sep 12 2019, 8:17 PM · MW-1.34-notes (1.34.0-wmf.15; 2019-07-23), Patch-For-Review, Discovery-Search (Current work), Product-Analytics
EBernhardson moved T229882: Point discovery dashboards at SearchSatisfaction eventlogging table from Waiting to Blocked on the Discovery-Search (Current work) board.
Sep 12 2019, 8:17 PM · Discovery-Search (Current work), Product-Analytics
EBernhardson moved T232495: selenium-daily-beta-CirrusSearch is broken from in progress to Waiting on the Discovery-Search (Current work) board.
Sep 12 2019, 8:17 PM · Discovery-Search (Current work), Release-Engineering-Team-TODO (201909), Release-Engineering-Team (CI & Testing services), User-zeljkofilipin, Continuous-Integration-Infrastructure, CirrusSearch
EBernhardson moved T232495: selenium-daily-beta-CirrusSearch is broken from elastic / cirrus to Current work on the Discovery-Search board.
Sep 12 2019, 8:17 PM · Discovery-Search (Current work), Release-Engineering-Team-TODO (201909), Release-Engineering-Team (CI & Testing services), User-zeljkofilipin, Continuous-Integration-Infrastructure, CirrusSearch
EBernhardson moved T232495: selenium-daily-beta-CirrusSearch is broken from needs triage to elastic / cirrus on the Discovery-Search board.
Sep 12 2019, 8:17 PM · Discovery-Search (Current work), Release-Engineering-Team-TODO (201909), Release-Engineering-Team (CI & Testing services), User-zeljkofilipin, Continuous-Integration-Infrastructure, CirrusSearch
EBernhardson moved T232565: case-sensitive equivalent of haswbstatement from needs triage to elastic / cirrus on the Discovery-Search board.
Sep 12 2019, 8:17 PM · Wikidata, Discovery-Search
EBernhardson moved T232589: Migrate CirrusSearch MediaWikiIntegrationTestCase tests to MediaWikiUnitTestCase from needs triage to Current work on the Discovery-Search board.
Sep 12 2019, 8:16 PM · Discovery-Search (Current work), MediaWiki-extensions-General, Release-Engineering-Team (Code Health), Code-Health
EBernhardson moved T232608: Delete selenium-daily-beta-EXTENSION Jenkins jobs that are broken more than 30 days from needs triage to elastic / cirrus on the Discovery-Search board.
Sep 12 2019, 8:16 PM · Wikidata, Two-Column-Edit-Conflict-Merge, MediaWiki-extensions-ORES, Scoring-platform-team, Electron-PDFs, Discovery-Search, CirrusSearch, TCB-Team, Advanced-Search, User-zeljkofilipin, Release-Engineering-Team-TODO (201909), Release-Engineering-Team (CI & Testing services)
EBernhardson claimed T231980: Discernatron at https://discernatron.wmflabs.org/ not reachable.
Sep 12 2019, 8:15 PM · Discovery-Search (Current work)
EBernhardson moved T231980: Discernatron at https://discernatron.wmflabs.org/ not reachable from needs triage to Current work on the Discovery-Search board.
Sep 12 2019, 8:15 PM · Discovery-Search (Current work)
EBernhardson added a comment to T231980: Discernatron at https://discernatron.wmflabs.org/ not reachable.

For whatever reason the container stopped, I started it back up again. This probably needs to move to our more managed tools collection rather than a custom container on a cloud instance.

Sep 12 2019, 8:14 PM · Discovery-Search (Current work)

Sep 9 2019

EBernhardson committed rECIR3c75185cec2b: Update glent method m0 -> m0run (authored by EBernhardson).
Update glent method m0 -> m0run
Sep 9 2019, 6:57 PM
EBernhardson added a comment to T231861: Check home leftovers of smalyshev.

I only see one directory in smalyshev hdfs home, looks safe to delete.

Sep 9 2019, 4:04 PM · Analytics-Kanban, Analytics

Sep 3 2019

EBernhardson added a comment to T231517: Investigate and fix GC issues on cloudelastic machines.

Reducing replica count from 2 to 1 had a dramatic effect on the cluster. Things are generally looking happy now.

Sep 3 2019, 4:43 PM · Discovery-Search
EBernhardson updated subscribers of T231023: Assert.php: Bad value for parameter $responses: must have as many responses as requests.

@dcausse as a percentage of requests this is exceptionally low, but suggests we are missing some edge case. Any ideas?

Sep 3 2019, 4:42 PM · MW-1.34-notes (1.34.0-wmf.25; 2019-10-01), Discovery-Search (Current work), CirrusSearch, Wikimedia-production-error
EBernhardson updated the task description for T231517: Investigate and fix GC issues on cloudelastic machines.
Sep 3 2019, 4:01 PM · Discovery-Search

Aug 29 2019

EBernhardson updated the task description for T231517: Investigate and fix GC issues on cloudelastic machines.
Aug 29 2019, 10:05 PM · Discovery-Search
EBernhardson updated the task description for T231517: Investigate and fix GC issues on cloudelastic machines.
Aug 29 2019, 7:31 PM · Discovery-Search
EBernhardson added a comment to T231517: Investigate and fix GC issues on cloudelastic machines.

Not sure it's helping or not, but i increased refresh_interval on all cloudelastic-chi indices to 5 minutes, and removed their index.merge.max_thread_count settings (was 1, now takes default of 3) to see if we could cut back on the number of tiny indices. Segment count reduced from 65k to 57k, about 10%. Might be a minor memory savings but likely very little as the tradeoff is an increase of IndexWriter buffers from ~200M/server to ~1GB/server.

Aug 29 2019, 7:29 PM · Discovery-Search
EBernhardson added a comment to T231517: Investigate and fix GC issues on cloudelastic machines.

Looking into the graphs, it seems to me that the underlying problem is that the min heap keeps growing over time. When the node gets to 2/3 (~30.8GB) heap used the old GC goes crazy. We can re-apply NewRatio with the 45G heap and the old gen will be able to grow by a few more GB, but unless we can figure out what the final steady state value is for the old gen we can only really keep trying larger values.

Aug 29 2019, 6:19 PM · Discovery-Search
EBernhardson moved T231446: Reindex commonswiki as shards have grown beyond critical threshold from in progress to Waiting on the Discovery-Search (Current work) board.
Aug 29 2019, 6:17 PM · Discovery-Search, Patch-For-Review, Operations, Elasticsearch
EBernhardson updated subscribers of T231038: hascaption includes files that have had their captions removed.
Aug 29 2019, 5:09 PM · MW-1.34-notes (1.34.0-wmf.25; 2019-10-01), Patch-For-Review, Discovery-Search (Current work), Structured-Data-Backlog
EBernhardson added a comment to T231038: hascaption includes files that have had their captions removed.

Not sure the right way to go about it, but the problem is essentially here:

Aug 29 2019, 5:06 PM · MW-1.34-notes (1.34.0-wmf.25; 2019-10-01), Patch-For-Review, Discovery-Search (Current work), Structured-Data-Backlog
EBernhardson moved T130329: Icinga should alert on free disk space < 15% (now < 12%) on Elasticsearch hosts from needs triage to Current work on the Discovery-Search board.
Aug 29 2019, 4:48 PM · Discovery-Search (Current work), Patch-For-Review, Operations, Discovery, Elasticsearch
EBernhardson moved T231038: hascaption includes files that have had their captions removed from needs triage to Current work on the Discovery-Search board.
Aug 29 2019, 4:47 PM · MW-1.34-notes (1.34.0-wmf.25; 2019-10-01), Patch-For-Review, Discovery-Search (Current work), Structured-Data-Backlog
EBernhardson moved T231446: Reindex commonswiki as shards have grown beyond critical threshold from needs triage to Current work on the Discovery-Search board.
Aug 29 2019, 4:47 PM · Discovery-Search, Patch-For-Review, Operations, Elasticsearch

Aug 28 2019

EBernhardson added a comment to P8995 Khmer samples.

ubuntu bionic, chrome 73.0

Aug 28 2019, 2:57 PM · Discovery-Search

Aug 27 2019

EBernhardson committed rEWCSe099c271e614: Support pure existence in haswbstatement (authored by EBernhardson).
Support pure existence in haswbstatement
Aug 27 2019, 11:36 PM
EBernhardson moved T229807: cookbook sre.elasticsearch.rolling-restart failed with cluster relforge from Needs review to Done on the Discovery-Search (Current work) board.
Aug 27 2019, 5:19 PM · Discovery-Search (Current work), Operations, SRE-tools, Elasticsearch
EBernhardson moved T227364: Adjust mjolnir bulk_daemon to import glent swift uploads from Needs review to Done on the Discovery-Search (Current work) board.
Aug 27 2019, 5:18 PM · Patch-For-Review, Discovery-Search (Current work)
EBernhardson claimed T228633: deepcat should not be case sensitive on first letter of category name.
Aug 27 2019, 4:44 PM · MW-1.34-notes (1.34.0-wmf.21; 2019-09-03), Discovery-Search (Current work), CirrusSearch
EBernhardson moved T230175: Provide search functionality to find all files that have at least 1 structured data statement from in progress to Needs review on the Discovery-Search (Current work) board.
Aug 27 2019, 4:44 PM · MW-1.34-notes (1.34.0-wmf.21; 2019-09-03), Discovery-Search (Current work), Structured-Data-Backlog, SDC General, Wikidata
EBernhardson moved T228633: deepcat should not be case sensitive on first letter of category name from in progress to Needs review on the Discovery-Search (Current work) board.
Aug 27 2019, 4:44 PM · MW-1.34-notes (1.34.0-wmf.21; 2019-09-03), Discovery-Search (Current work), CirrusSearch
EBernhardson added a comment to T230730: Better way to pause writes on elasticsearch.

Is it going to back off the same amount of time as the last back-off. So it will start processing messages in a timely manner, just not instantly. But I don't think that is really an issue here. If the cluster was read-only for 30 minutes, waiting for max 10 more for the processing to start is acceptable IMHO.

Aug 27 2019, 12:14 AM · ChangeProp, Core Platform Team Workboards (Clinic Duty Team), Services (designing), Event-Platform, Analytics, WMF-JobQueue, Discovery-Search

Aug 22 2019

EBernhardson triaged T231023: Assert.php: Bad value for parameter $responses: must have as many responses as requests as Normal priority.
Aug 22 2019, 4:55 PM · MW-1.34-notes (1.34.0-wmf.25; 2019-10-01), Discovery-Search (Current work), CirrusSearch, Wikimedia-production-error
EBernhardson moved T231023: Assert.php: Bad value for parameter $responses: must have as many responses as requests from needs triage to elastic / cirrus on the Discovery-Search board.
Aug 22 2019, 4:55 PM · MW-1.34-notes (1.34.0-wmf.25; 2019-10-01), Discovery-Search (Current work), CirrusSearch, Wikimedia-production-error

Aug 21 2019

EBernhardson moved T220625: Initialize CirrusSearch on cloudelastic from in progress to Done on the Discovery-Search (Current work) board.
Aug 21 2019, 4:40 PM · MW-1.34-notes (1.34.0-wmf.4; 2019-05-07), Discovery-Search (Current work), Cloud-Services, Elasticsearch, Discovery

Aug 20 2019

EBernhardson added a comment to T229882: Point discovery dashboards at SearchSatisfaction eventlogging table.

@mpopov Any movement here? No huge rush but this will let us stop generating all events twice

Aug 20 2019, 5:25 PM · Discovery-Search (Current work), Product-Analytics
EBernhardson added a comment to T230495: Partition CirrusSearch mediawiki jobs by cluster.

We need to rework our updater a little bit to share some expensive work before the partitioned jobs, but pull the ContentHandler data per-partition. Shouldn't be that much work, but needs to be done on our end so the cirrusSearchElasticaWrite job can be partitioned

Aug 20 2019, 3:47 PM · Core Platform Team Workboards (Clinic Duty Team), Discovery-Search, Cloud-Services, Elasticsearch, Discovery
EBernhardson committed rWDANa787f6bf7438: Rename glent partition spec part -> date (authored by EBernhardson).
Rename glent partition spec part -> date
Aug 20 2019, 9:02 AM

Aug 19 2019

EBernhardson added a comment to T230746: (Aug 30th, 2019) rack/setup/install elastic10[53-67].eqiad.wmnet.

Try to evenly space out elastic nodes in the row evenly in 1G racks.

Aug 19 2019, 8:25 PM · Patch-For-Review, Operations, ops-eqiad
EBernhardson committed rWDAN68db615cb2b9: Tweaks to glent swift upload for final subworkflow (authored by EBernhardson).
Tweaks to glent swift upload for final subworkflow
Aug 19 2019, 8:34 AM
EBernhardson committed rECIR1c4351bb011d: Implement a random sort order (authored by EBernhardson).
Implement a random sort order
Aug 19 2019, 7:38 AM

Aug 16 2019

EBernhardson committed rECIRfaef47a4a0bd: Fix highlighting of grapheme clusters in search snippets (authored by TJones).
Fix highlighting of grapheme clusters in search snippets
Aug 16 2019, 8:10 PM

Aug 15 2019

EBernhardson moved T229027: search of related images on wikidata (for structured data on commons) from needs triage to making others happy on the Discovery-Search board.
Aug 15 2019, 5:14 PM · Structured-Data-Backlog, Structured Data Engineering, SDC General, Wikidata, Discovery-Search
EBernhardson moved T230175: Provide search functionality to find all files that have at least 1 structured data statement from watching / waiting to Current work on the Discovery-Search board.
Aug 15 2019, 5:13 PM · MW-1.34-notes (1.34.0-wmf.21; 2019-09-03), Discovery-Search (Current work), Structured-Data-Backlog, SDC General, Wikidata
EBernhardson moved T230175: Provide search functionality to find all files that have at least 1 structured data statement from needs triage to watching / waiting on the Discovery-Search board.
Aug 15 2019, 5:13 PM · MW-1.34-notes (1.34.0-wmf.21; 2019-09-03), Discovery-Search (Current work), Structured-Data-Backlog, SDC General, Wikidata
EBernhardson moved T89151: SearchHighlighter::highlightText() should not be checking if Cite is installed from needs triage to elastic / cirrus on the Discovery-Search board.
Aug 15 2019, 5:12 PM · MW-1.34-notes (1.34.0-wmf.20; 2019-08-27), Discovery-Search, Technical-Debt, MediaWiki-Search
EBernhardson moved T230409: Spicerack: extend elasticsearch_cluster module by allowing us to wait for write queue to go empty from needs triage to Ops / SRE on the Discovery-Search board.
Aug 15 2019, 5:12 PM · Discovery-Search, Elasticsearch

Aug 14 2019

EBernhardson added a comment to T230472: Search index not updated for nl.wikipedia.org.

Known problem related to recent deployment, currently updates are backlogged about 12 hours. The plan to deal with this is T230495, while some short term hacks are being worked on to alleviate the current backlog.

Aug 14 2019, 9:20 PM · Discovery-Search (Current work)
EBernhardson created T230495: Partition CirrusSearch mediawiki jobs by cluster.
Aug 14 2019, 4:08 PM · Core Platform Team Workboards (Clinic Duty Team), Discovery-Search, Cloud-Services, Elasticsearch, Discovery
EBernhardson added a comment to T71658: Make subphrase matching the default search option on all Wikisources.

It looks like enwikisource has added a suggestion to Special:Search to enable subphrase completion matching. This has resulted in ~421 users that have turned it on, and we don't see any that have turned it back off for the default. This suggests we could possibly move forward with making subphrase matching the default on wikisource.

Aug 14 2019, 3:59 PM · Discovery-Search, Wikimedia-Site-requests, Discovery, Wikisource, CirrusSearch

Aug 8 2019

EBernhardson moved T229807: cookbook sre.elasticsearch.rolling-restart failed with cluster relforge from in progress to Needs review on the Discovery-Search (Current work) board.
Aug 8 2019, 5:14 PM · Discovery-Search (Current work), Operations, SRE-tools, Elasticsearch
EBernhardson closed T158627: oozie job "transfer_to_es" should send email on error as Invalid.
Aug 8 2019, 5:09 PM · Discovery-Search, Discovery
EBernhardson added a comment to T220625: Initialize CirrusSearch on cloudelastic.

All wikis are writing to cloudelastic now. Still be a few days to catchup on writes since july 29, the day the dump was made. Also somehow importing commonswiki_file only imported ~25M out of 50M items. The saneitizer is working on fixing that, but will take a bit.

Aug 8 2019, 3:22 PM · MW-1.34-notes (1.34.0-wmf.4; 2019-05-07), Discovery-Search (Current work), Cloud-Services, Elasticsearch, Discovery

Aug 7 2019

EBernhardson committed rECIR29774bd07f5b: Mark DeleteArchive jobs as handling private_data (authored by EBernhardson).
Mark DeleteArchive jobs as handling private_data
Aug 7 2019, 11:21 PM
EBernhardson moved T229937: Prometheus not collecting cloudelastic metrics from Needs review to Done on the Discovery-Search (Current work) board.
Aug 7 2019, 10:14 PM · Discovery-Search (Current work), Operations

Aug 6 2019

EBernhardson moved T229937: Prometheus not collecting cloudelastic metrics from in progress to Needs review on the Discovery-Search (Current work) board.
Aug 6 2019, 8:00 PM · Discovery-Search (Current work), Operations
EBernhardson moved T229937: Prometheus not collecting cloudelastic metrics from needs triage to Current work on the Discovery-Search board.
Aug 6 2019, 8:00 PM · Discovery-Search (Current work), Operations
EBernhardson claimed T229937: Prometheus not collecting cloudelastic metrics.
Aug 6 2019, 8:00 PM · Discovery-Search (Current work), Operations
EBernhardson added a comment to T229937: Prometheus not collecting cloudelastic metrics.

Think i found it:

Aug 6 2019, 8:00 PM · Discovery-Search (Current work), Operations
EBernhardson added a comment to T229937: Prometheus not collecting cloudelastic metrics.

Looked into this a little bit (on cloudelastic1001.wikimedia.org), no solution yet:

Aug 6 2019, 7:37 PM · Discovery-Search (Current work), Operations
EBernhardson moved T227364: Adjust mjolnir bulk_daemon to import glent swift uploads from in progress to Needs review on the Discovery-Search (Current work) board.
Aug 6 2019, 5:45 PM · Patch-For-Review, Discovery-Search (Current work)
EBernhardson moved T220625: Initialize CirrusSearch on cloudelastic from Waiting to in progress on the Discovery-Search (Current work) board.
Aug 6 2019, 5:44 PM · MW-1.34-notes (1.34.0-wmf.4; 2019-05-07), Discovery-Search (Current work), Cloud-Services, Elasticsearch, Discovery
EBernhardson updated the task description for T229937: Prometheus not collecting cloudelastic metrics.
Aug 6 2019, 4:04 PM · Discovery-Search (Current work), Operations
EBernhardson created T229937: Prometheus not collecting cloudelastic metrics.
Aug 6 2019, 4:03 PM · Discovery-Search (Current work), Operations
EBernhardson moved T224324: LB for cloudelastic from in progress to Done on the Discovery-Search (Current work) board.
Aug 6 2019, 3:53 PM · Discovery-Search (Current work), Cloud-Services, Elasticsearch, Discovery

Aug 5 2019

EBernhardson added a comment to T228925: Fix documentation of boolean operators.

I reviewed the draft on mw.org, everything there looks accurate as far as I'm aware. I didn't realize that implicit and explciit AND behave differently. The on-wiki documentation doesn't feel scary enough for what's really going on, but I'm not sure how to make it more explicit that this thing is funny and not what they think it is.

Aug 5 2019, 11:08 PM · Discovery-Search (Current work)
EBernhardson added a comment to T229861: Can't reach cloudelastic.wikimedia.org via IPv6.

Resolution: The ipv6 address I set, which was the ipv6 version of the ipv4 address, was incorrect. Rather a new ipv6 address within the LVS range needed to be used. Brandon applied a fix with https://gerrit.wikimedia.org/r/#/c/528215/ and https://gerrit.wikimedia.org/r/#/c/528216/ and everything now looks to be working as expected wrt ipv6

Aug 5 2019, 10:01 PM · Operations, Traffic, Discovery-Search (Current work)