Page MenuHomePhabricator
Feed Advanced Search

Thu, Apr 18

TJones edited projects for T117217: Redirect in the search box for Arabic projects, added: Discovery-Search (Current work); removed Discovery-Search.
Thu, Apr 18, 12:59 PM · Discovery-Search (Current work), MediaWiki-Search, I18n, Wikimania-Hackathon-2018, MediaWiki-Internationalization
TJones moved T220124: Update Analysis Analysis tools (& prep for Haystack) from in progress to Done on the Discovery-Search (Current work) board.
Thu, Apr 18, 12:56 PM · Patch-For-Review, Discovery-Search (Current work)

Fri, Apr 12

TJones moved T217806: Reindex Greek, Turkish, and Irish wikis to keep lang-specific lowercasing & enable empty-token filtering (Greek) from in progress to Done on the Discovery-Search (Current work) board.
Fri, Apr 12, 11:43 AM · Discovery-Search (Current work), Turkish-Sites

Thu, Apr 11

TJones triaged T217806: Reindex Greek, Turkish, and Irish wikis to keep lang-specific lowercasing & enable empty-token filtering (Greek) as Normal priority.
Thu, Apr 11, 6:50 PM · Discovery-Search (Current work), Turkish-Sites

Wed, Apr 10

TJones closed T216738: Reindex Korean-language wikis to enable Nori analyzer as Resolved.

Korean got reindexed incidentally as part of the ES 6 upgrade: some of our previous configuration for spaceless languages was deprecated so we had to upgrade them to BM25, and reindex, which picked up the Nori changes for Korean.

Wed, Apr 10, 3:15 PM · Discovery-Search
TJones closed T216738: Reindex Korean-language wikis to enable Nori analyzer, a subtask of T206874: Add Nori (Korean) configuration to AnalysisConfigBuilder, as Resolved.
Wed, Apr 10, 3:15 PM · Patch-For-Review, Discovery-Search (Current work), Discovery
TJones updated the task description for T147505: [Recurring task] CirrusSearch: what is updated during re-indexing.
Wed, Apr 10, 3:11 PM · Discovery-Search (Current work), Discovery
TJones added a comment to T220205: Define constraints for cloudelastic use cases.

This looks good, @Gehel. You brought up of some things we hadn't talked about before, so you covered more than 100% of the topics I had!

Wed, Apr 10, 2:06 PM · Discovery-Search (Current work)

Mon, Apr 8

TJones added a comment to T220124: Update Analysis Analysis tools (& prep for Haystack).

The code update is done, but I'm moving this back to "in progress" because I'm still working on my presentation.

Mon, Apr 8, 10:00 PM · Patch-For-Review, Discovery-Search (Current work)
TJones moved T220124: Update Analysis Analysis tools (& prep for Haystack) from Needs review to in progress on the Discovery-Search (Current work) board.
Mon, Apr 8, 9:59 PM · Patch-For-Review, Discovery-Search (Current work)

Fri, Apr 5

TJones moved T220124: Update Analysis Analysis tools (& prep for Haystack) from in progress to Needs review on the Discovery-Search (Current work) board.
Fri, Apr 5, 2:01 PM · Patch-For-Review, Discovery-Search (Current work)

Thu, Apr 4

TJones edited projects for T220124: Update Analysis Analysis tools (& prep for Haystack), added: Discovery-Search (Current work); removed Discovery-Analysis (Current work).
Thu, Apr 4, 4:09 PM · Patch-For-Review, Discovery-Search (Current work)
TJones moved T220124: Update Analysis Analysis tools (& prep for Haystack) from Backlog to In progress on the Discovery-Analysis (Current work) board.
Thu, Apr 4, 4:08 PM · Patch-For-Review, Discovery-Search (Current work)
TJones removed a project from T220124: Update Analysis Analysis tools (& prep for Haystack): Product-Analytics.
Thu, Apr 4, 4:08 PM · Patch-For-Review, Discovery-Search (Current work)
TJones edited projects for T220124: Update Analysis Analysis tools (& prep for Haystack), added: Discovery-Analysis (Current work); removed Discovery-Search.
Thu, Apr 4, 4:08 PM · Patch-For-Review, Discovery-Search (Current work)
TJones triaged T220124: Update Analysis Analysis tools (& prep for Haystack) as High priority.
Thu, Apr 4, 4:07 PM · Patch-For-Review, Discovery-Search (Current work)
TJones moved T220124: Update Analysis Analysis tools (& prep for Haystack) from needs triage to Language Stuff on the Discovery-Search board.
Thu, Apr 4, 4:07 PM · Patch-For-Review, Discovery-Search (Current work)
TJones added a project to T220124: Update Analysis Analysis tools (& prep for Haystack): Discovery-Search.
Thu, Apr 4, 4:07 PM · Patch-For-Review, Discovery-Search (Current work)
TJones created T220124: Update Analysis Analysis tools (& prep for Haystack).
Thu, Apr 4, 4:06 PM · Patch-For-Review, Discovery-Search (Current work)

Tue, Apr 2

TJones added a comment to T174116: Another look at multi-hyphen tokens on enwiki and zhwiki.

Created the following tasks and will prioritize them into the Language Stuff workboard column:

Tue, Apr 2, 6:27 PM · Discovery-Search (Current work), Chinese-Sites, Discovery
TJones moved T219915: Enable more of the unambiguous/less ambiguous scripts for language identification from needs triage to Language Stuff on the Discovery-Search board.
Tue, Apr 2, 6:24 PM · Discovery-Search
TJones triaged T219915: Enable more of the unambiguous/less ambiguous scripts for language identification as Normal priority.
Tue, Apr 2, 6:24 PM · Discovery-Search
TJones created T219915: Enable more of the unambiguous/less ambiguous scripts for language identification.
Tue, Apr 2, 6:24 PM · Discovery-Search
TJones removed a subtask for T174116: Another look at multi-hyphen tokens on enwiki and zhwiki: T219911: Retrain Chinese query-based language ID models.
Tue, Apr 2, 6:11 PM · Discovery-Search (Current work), Chinese-Sites, Discovery
TJones edited parent tasks for T219911: Retrain Chinese query-based language ID models, added: T118278: EPIC: Improve Language Identification for use in Cirrus Search; removed: T174116: Another look at multi-hyphen tokens on enwiki and zhwiki.
Tue, Apr 2, 6:11 PM · Chinese-Sites, Discovery-Search
TJones added a subtask for T118278: EPIC: Improve Language Identification for use in Cirrus Search: T219911: Retrain Chinese query-based language ID models.
Tue, Apr 2, 6:11 PM · Epic, Discovery
TJones removed a subtask for T174116: Another look at multi-hyphen tokens on enwiki and zhwiki: T219912: Loosen limit on DYM suggestions blocking cross-language results from < 3 to < 5.
Tue, Apr 2, 6:11 PM · Discovery-Search (Current work), Chinese-Sites, Discovery
TJones added a subtask for T118278: EPIC: Improve Language Identification for use in Cirrus Search: T219912: Loosen limit on DYM suggestions blocking cross-language results from < 3 to < 5.
Tue, Apr 2, 6:11 PM · Epic, Discovery
TJones edited parent tasks for T219912: Loosen limit on DYM suggestions blocking cross-language results from < 3 to < 5, added: T118278: EPIC: Improve Language Identification for use in Cirrus Search; removed: T174116: Another look at multi-hyphen tokens on enwiki and zhwiki.
Tue, Apr 2, 6:11 PM · Discovery-Search
TJones added subtasks for T174116: Another look at multi-hyphen tokens on enwiki and zhwiki: T219912: Loosen limit on DYM suggestions blocking cross-language results from < 3 to < 5, T219911: Retrain Chinese query-based language ID models.
Tue, Apr 2, 6:08 PM · Discovery-Search (Current work), Chinese-Sites, Discovery
TJones added a parent task for T219912: Loosen limit on DYM suggestions blocking cross-language results from < 3 to < 5: T174116: Another look at multi-hyphen tokens on enwiki and zhwiki.
Tue, Apr 2, 6:08 PM · Discovery-Search
TJones added a parent task for T219911: Retrain Chinese query-based language ID models: T174116: Another look at multi-hyphen tokens on enwiki and zhwiki.
Tue, Apr 2, 6:08 PM · Chinese-Sites, Discovery-Search
TJones triaged T219912: Loosen limit on DYM suggestions blocking cross-language results from < 3 to < 5 as Normal priority.
Tue, Apr 2, 6:07 PM · Discovery-Search
TJones created T219912: Loosen limit on DYM suggestions blocking cross-language results from < 3 to < 5.
Tue, Apr 2, 6:07 PM · Discovery-Search
TJones triaged T219911: Retrain Chinese query-based language ID models as Normal priority.
Tue, Apr 2, 6:00 PM · Chinese-Sites, Discovery-Search
TJones created T219911: Retrain Chinese query-based language ID models.
Tue, Apr 2, 5:59 PM · Chinese-Sites, Discovery-Search
TJones moved T174116: Another look at multi-hyphen tokens on enwiki and zhwiki from in progress to Needs review on the Discovery-Search (Current work) board.

The results are in! A brief summary:

Tue, Apr 2, 5:53 PM · Discovery-Search (Current work), Chinese-Sites, Discovery

Fri, Mar 29

TJones added a comment to T216055: Move backend for current search dashboard to pull data from Hadoop.

@mpopov are there any of these metrics we want to remove in the light of the classification that @TJones did on spreadsheet?

Fri, Mar 29, 4:12 PM · Discovery-Search (Current work), Patch-For-Review, Product-Analytics, Epic

Thu, Mar 28

TJones moved T219550: Harmonize language analysis across languages from needs triage to Language Stuff on the Discovery-Search board.
Thu, Mar 28, 7:47 PM · Discovery-Search
TJones added a project to T219550: Harmonize language analysis across languages: Discovery-Search.
Thu, Mar 28, 7:47 PM · Discovery-Search
TJones added a parent task for T180387: Enable hiragana/katakana mapping for other languages: T219550: Harmonize language analysis across languages.
Thu, Mar 28, 7:47 PM · Discovery-Search, Discovery, CirrusSearch
TJones added a parent task for T219108: Investigate applying aggressive_splitting everywhere, not just on English-language wikis: T219550: Harmonize language analysis across languages.
Thu, Mar 28, 7:47 PM · Discovery, CirrusSearch, Discovery-Search
TJones added subtasks for T219550: Harmonize language analysis across languages: T170625: Investigate disabling or modifying word_break_helper in language analyzers., T219108: Investigate applying aggressive_splitting everywhere, not just on English-language wikis, T180387: Enable hiragana/katakana mapping for other languages.
Thu, Mar 28, 7:47 PM · Discovery-Search
TJones added a parent task for T170625: Investigate disabling or modifying word_break_helper in language analyzers.: T219550: Harmonize language analysis across languages.
Thu, Mar 28, 7:47 PM · Discovery-Search
TJones created T219550: Harmonize language analysis across languages.
Thu, Mar 28, 7:46 PM · Discovery-Search
TJones renamed T219108: Investigate applying aggressive_splitting everywhere, not just on English-language wikis from Cross-wiki search tokenizer is better than local search one to Investigate applying aggressive_splitting everywhere, not just on English-language wikis.
Thu, Mar 28, 7:04 PM · Discovery, CirrusSearch, Discovery-Search
TJones added a comment to T219108: Investigate applying aggressive_splitting everywhere, not just on English-language wikis.

As I thought, this is a customization that was added to the English Language Analysis years ago before my time. It was originally limited to search on MediaWiki.org in 2013, and then expanded to all English-language wikis in 2014, but it was never expanded beyond that.

Thu, Mar 28, 6:59 PM · Discovery, CirrusSearch, Discovery-Search
TJones added a comment to T219108: Investigate applying aggressive_splitting everywhere, not just on English-language wikis.

I'll take a look today. I'm pretty sure I know what's happening, but will double check.

Thu, Mar 28, 5:31 PM · Discovery, CirrusSearch, Discovery-Search

Mar 20 2019

TJones renamed T212891: [EPIC-ish][Milestone 2] Implement NLP Search Suggestion Method 2 for CJK languages from [EPIC-ish][Milestone 3] Implement NLP Search Suggestion Method 2 for CJK languages to [EPIC-ish][Milestone 2] Implement NLP Search Suggestion Method 2 for CJK languages.
Mar 20 2019, 3:45 PM · Chinese-Sites, Discovery-Search, Epic
TJones renamed T212889: [EPIC-ish][Milestone 1] Implement NLP Search Suggestion Method 1 for 10 languages from [EPIC-ish][Milestone 2] Implement NLP Search Suggestion Method 1 for 10 languages to [EPIC-ish][Milestone 1] Implement NLP Search Suggestion Method 1 for 10 languages.
Mar 20 2019, 3:44 PM · Discovery-Search, Epic
TJones renamed T212888: [EPIC-ish][Milestone 0] Implement NLP Search Suggestion Method 0 for English from [EPIC-ish][Milestone 1] Implement NLP Search Suggestion Method 0 for English to [EPIC-ish][Milestone 0] Implement NLP Search Suggestion Method 0 for English.
Mar 20 2019, 3:44 PM · Patch-For-Review, Discovery-Search, Epic

Mar 19 2019

TJones updated the task description for T174116: Another look at multi-hyphen tokens on enwiki and zhwiki.
Mar 19 2019, 5:25 PM · Discovery-Search (Current work), Chinese-Sites, Discovery

Mar 12 2019

TJones moved T217602: Properly handle language-specific lowercasing in language analyzers from Needs review to Done on the Discovery-Search (Current work) board.
Mar 12 2019, 1:45 PM · MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Patch-For-Review, Discovery-Search (Current work)
TJones moved T203117: Greek language analysis generates unexpected empty tokens from Needs review to Done on the Discovery-Search (Current work) board.
Mar 12 2019, 1:44 PM · Patch-For-Review, Discovery-Search (Current work)

Mar 8 2019

TJones moved T216083: Update required version of TextCat in CirrusSearch from Needs review to Done on the Discovery-Search (Current work) board.
Mar 8 2019, 3:13 PM · MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Patch-For-Review, Discovery-Search (Current work), Discovery
TJones added a comment to T216083: Update required version of TextCat in CirrusSearch.

Thanks, @Smalyshev & @EBernhardson, for the vendor patch!

Mar 8 2019, 3:12 PM · MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Patch-For-Review, Discovery-Search (Current work), Discovery

Mar 7 2019

TJones claimed T174116: Another look at multi-hyphen tokens on enwiki and zhwiki.
Mar 7 2019, 6:10 PM · Discovery-Search (Current work), Chinese-Sites, Discovery
TJones moved T174116: Another look at multi-hyphen tokens on enwiki and zhwiki from Language Stuff to Current work on the Discovery-Search board.
Mar 7 2019, 6:10 PM · Discovery-Search (Current work), Chinese-Sites, Discovery
TJones moved T216083: Update required version of TextCat in CirrusSearch from in progress to Needs review on the Discovery-Search (Current work) board.
Mar 7 2019, 6:08 PM · MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Patch-For-Review, Discovery-Search (Current work), Discovery
TJones claimed T216083: Update required version of TextCat in CirrusSearch.
Mar 7 2019, 6:06 PM · MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Patch-For-Review, Discovery-Search (Current work), Discovery
TJones moved T216083: Update required version of TextCat in CirrusSearch from Language Stuff to Current work on the Discovery-Search board.
Mar 7 2019, 6:06 PM · MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Patch-For-Review, Discovery-Search (Current work), Discovery

Mar 6 2019

TJones added a comment to T217602: Properly handle language-specific lowercasing in language analyzers.

After refactoring the lowercase-to-ICU-normalization upgrade code for Greek (T203117) so that the lowercase filter is kept if it is language-specific, I needed to test it for the other language-specific cases: Turkish and Irish. The impact is positive but small because it is limited to the plain field and other fields besides the text field (where the lang-specific lowercasing is already in effect because the analyzers have not been unpacked). Full details on MediaWiki.

Mar 6 2019, 11:17 PM · MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Patch-For-Review, Discovery-Search (Current work)
TJones added a comment to T203117: Greek language analysis generates unexpected empty tokens.

Unpacking the Greek analyzer exposes the lowercase filter, which is upgraded to icu_normalizer, losing the Greek-specific processing therein! So, we need to keep the Greek lowercasing even if we do ICU normalization. After that, everything is copacetic. Full write up on MediaWiki.

Mar 6 2019, 11:14 PM · Patch-For-Review, Discovery-Search (Current work)
TJones updated the task description for T147505: [Recurring task] CirrusSearch: what is updated during re-indexing.
Mar 6 2019, 11:06 PM · Discovery-Search (Current work), Discovery
TJones renamed T217806: Reindex Greek, Turkish, and Irish wikis to keep lang-specific lowercasing & enable empty-token filtering (Greek) from Reindex Greek-language wikis to enable empty-token filtering to Reindex Greek, Turkish, and Irish wikis to keep lang-specific lowercasing & enable empty-token filtering (Greek).
Mar 6 2019, 11:05 PM · Discovery-Search (Current work), Turkish-Sites
TJones added a subtask for T217602: Properly handle language-specific lowercasing in language analyzers: T217806: Reindex Greek, Turkish, and Irish wikis to keep lang-specific lowercasing & enable empty-token filtering (Greek).
Mar 6 2019, 11:04 PM · MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Patch-For-Review, Discovery-Search (Current work)
TJones added a parent task for T217806: Reindex Greek, Turkish, and Irish wikis to keep lang-specific lowercasing & enable empty-token filtering (Greek): T217602: Properly handle language-specific lowercasing in language analyzers.
Mar 6 2019, 11:04 PM · Discovery-Search (Current work), Turkish-Sites
TJones moved T217806: Reindex Greek, Turkish, and Irish wikis to keep lang-specific lowercasing & enable empty-token filtering (Greek) from needs triage to Language Stuff on the Discovery-Search board.
Mar 6 2019, 11:02 PM · Discovery-Search (Current work), Turkish-Sites
TJones edited projects for T217806: Reindex Greek, Turkish, and Irish wikis to keep lang-specific lowercasing & enable empty-token filtering (Greek), added: Discovery-Search; removed Discovery-Search (Current work).
Mar 6 2019, 11:02 PM · Discovery-Search (Current work), Turkish-Sites
TJones created T217806: Reindex Greek, Turkish, and Irish wikis to keep lang-specific lowercasing & enable empty-token filtering (Greek).
Mar 6 2019, 11:01 PM · Discovery-Search (Current work), Turkish-Sites
TJones moved T203117: Greek language analysis generates unexpected empty tokens from in progress to Needs review on the Discovery-Search (Current work) board.
Mar 6 2019, 11:00 PM · Patch-For-Review, Discovery-Search (Current work)
TJones moved T217602: Properly handle language-specific lowercasing in language analyzers from in progress to Needs review on the Discovery-Search (Current work) board.
Mar 6 2019, 11:00 PM · MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Patch-For-Review, Discovery-Search (Current work)

Mar 4 2019

TJones created T217602: Properly handle language-specific lowercasing in language analyzers.
Mar 4 2019, 8:49 PM · MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Patch-For-Review, Discovery-Search (Current work)

Feb 26 2019

TJones claimed T203117: Greek language analysis generates unexpected empty tokens.
Feb 26 2019, 4:49 PM · Patch-For-Review, Discovery-Search (Current work)
TJones moved T203117: Greek language analysis generates unexpected empty tokens from Language Stuff to Current work on the Discovery-Search board.
Feb 26 2019, 4:48 PM · Patch-For-Review, Discovery-Search (Current work)

Feb 21 2019

TJones moved T216740: Advanced search syntax for newbies from Backlog to Trainings / Skill sharing on the Wikimedia-Hackathon-2019 board.
Feb 21 2019, 5:00 PM · Wikimedia-Hackathon-2019
TJones created T216740: Advanced search syntax for newbies.
Feb 21 2019, 5:00 PM · Wikimedia-Hackathon-2019
TJones renamed T216738: Reindex Korean-language wikis to enable Nori analyzer from Reindex Korean-language wikis to Reindex Korean-language wikis to enable Nori analyzer.
Feb 21 2019, 4:54 PM · Discovery-Search
TJones updated the task description for T147505: [Recurring task] CirrusSearch: what is updated during re-indexing.
Feb 21 2019, 4:54 PM · Discovery-Search (Current work), Discovery
TJones moved T216738: Reindex Korean-language wikis to enable Nori analyzer from needs triage to Language Stuff on the Discovery-Search board.
Feb 21 2019, 4:52 PM · Discovery-Search
TJones created T216738: Reindex Korean-language wikis to enable Nori analyzer.
Feb 21 2019, 4:52 PM · Discovery-Search
TJones moved T206874: Add Nori (Korean) configuration to AnalysisConfigBuilder from in progress to Done on the Discovery-Search (Current work) board.

We need to reindex, but not until after the ES6 upgrade is complete, and LTR has been disabled.

Feb 21 2019, 4:47 PM · Patch-For-Review, Discovery-Search (Current work), Discovery

Feb 20 2019

TJones added a comment to T215969: Measure mutation latency across the newly split elasticsearch clusters.

@EBernhardson, thanks for the explanation!

Feb 20 2019, 10:36 PM · Patch-For-Review, Discovery-Search (Current work)
TJones added a comment to T215969: Measure mutation latency across the newly split elasticsearch clusters.

The spikes on create_index are pretty extreme, with 194s for chi-eqiad-with-archive and 291s for omega-eqiad-with-archive. Is that just bad luck, or is something going on with the archives that makes this sometimes take much longer?

Feb 20 2019, 9:52 PM · Patch-For-Review, Discovery-Search (Current work)
TJones awarded T215969: Measure mutation latency across the newly split elasticsearch clusters a Pterodactyl token.
Feb 20 2019, 9:50 PM · Patch-For-Review, Discovery-Search (Current work)

Feb 14 2019

TJones added a comment to T63080: CirrusSearch: intitle:¢ returns no results despite there being a redirect at [[¢]].

Bleh. It looks like that symbol is turned into a text boundary by the standard analyzer which isn't nice.

Feb 14 2019, 9:56 PM · Discovery-Search, good first bug, Discovery, CirrusSearch

Feb 13 2019

TJones moved T216083: Update required version of TextCat in CirrusSearch from needs triage to Language Stuff on the Discovery-Search board.
Feb 13 2019, 10:38 PM · MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Patch-For-Review, Discovery-Search (Current work), Discovery
TJones renamed T216083: Update required version of TextCat in CirrusSearch from Update required version of TextCat in Mediawiki to Update required version of TextCat in CirrusSearch.
Feb 13 2019, 10:38 PM · MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Patch-For-Review, Discovery-Search (Current work), Discovery
TJones triaged T216083: Update required version of TextCat in CirrusSearch as Normal priority.
Feb 13 2019, 10:36 PM · MW-1.33-notes (1.33.0-wmf.22; 2019-03-19), Patch-For-Review, Discovery-Search (Current work), Discovery
TJones moved T213936: Deploy new version of TextCat from in progress to Done on the Discovery-Search (Current work) board.
Feb 13 2019, 10:34 PM · Discovery-Search (Current work), Discovery
TJones assigned T213936: Deploy new version of TextCat to Smalyshev.

Cool! Thanks, @Smalyshev!

Feb 13 2019, 10:34 PM · Discovery-Search (Current work), Discovery
TJones added a comment to T215966: Requesting access to Production Shell for julia.glen.

Woo hoo!

Feb 13 2019, 9:18 PM · Patch-For-Review, Operations, SRE-Access-Requests
TJones added a comment to T215966: Requesting access to Production Shell for julia.glen.

Change 490412 had a related patch set uploaded (by Gehel; owner: Gehel):
[operations/puppet@production] admin: reset Julia SSH key

https://gerrit.wikimedia.org/r/490412

Feb 13 2019, 9:10 PM · Patch-For-Review, Operations, SRE-Access-Requests
TJones moved T206874: Add Nori (Korean) configuration to AnalysisConfigBuilder from Language Stuff to Current work on the Discovery-Search board.
Feb 13 2019, 7:00 PM · Patch-For-Review, Discovery-Search (Current work), Discovery
TJones moved T138958: Detect "wrong keyboard" queries for Russian/American keyboards on EN/RU Wikipedias from Tech Debt/Misc to Language Stuff on the Discovery-Search board.

Removing this from current work and moving it to the "Language Stuff" backlog. I'm the only one who could work on this this quarter, and I'm a bit out of my depth with the integration. We'll reprioritize this for future work when we can assign a slightly larger team (≥2 people) to work on it.

Feb 13 2019, 6:59 PM · Discovery-Search, Russian-Sites, Discovery
TJones edited projects for T138958: Detect "wrong keyboard" queries for Russian/American keyboards on EN/RU Wikipedias, added: Discovery-Search; removed Discovery-Search (Current work).
Feb 13 2019, 6:58 PM · Discovery-Search, Russian-Sites, Discovery

Feb 12 2019

TJones added a comment to T215966: Requesting access to Production Shell for julia.glen.

@Julia.glen, I think this patch should give you an account, but as user juliaglen. You may need to add User juliaglen to your ssh config.

Feb 12 2019, 10:05 PM · Patch-For-Review, Operations, SRE-Access-Requests
TJones added a comment to T215916: ElasticSearch 6 migration plan checklist (search cluster).

Hmm—what about Nori (the Korean analyzer) and LTR? I believe we have to disable LTR for Korean, enable Nori, gather more data, then rebuild the LTR model. Sounds like maybe all of that should wait until after the ES upgrade, even though it means re-indexing Korean wikis at a later date.

Feb 12 2019, 4:49 PM · Discovery-Search
TJones added a comment to T215916: ElasticSearch 6 migration plan checklist (search cluster).

Looks good, and all the detail is much appreciated.

Feb 12 2019, 3:51 PM · Discovery-Search

Feb 11 2019

TJones added a comment to T212889: [EPIC-ish][Milestone 1] Implement NLP Search Suggestion Method 1 for 10 languages.

Sounds good to me! If it turns out that the smallest volume languages have trouble, we can fall back to larger languages on the list.

Feb 11 2019, 8:43 PM · Discovery-Search, Epic