
TJones (Trey Jones)
Staff Computational Linguist, Search Platform Team

User Details

User Since
Jul 8 2015, 3:02 PM (386 w, 3 d)
Availability
Available
IRC Nick
Trey314159
LDAP User
Tjones
MediaWiki User
TJones (WMF) [ Global Accounts ]

I would have written a shorter comment, but I did not have the time.

I'm part of the Search Platform team and I spend my time working on search & relevance, trying to better support search in various languages, analyzing queries, and doing random mathy things. I tend to write long, detailed notes about my investigations (so as to improve the bus number of my work).

When I have to work on _GitHub,_ /‍‍/Phab,/‍‍/ and ''MediaWiki'' all on the same day, I sometimes suffer Severe Markup Incongruence Fatigue.

I � Unicode.

Recent Activity

Mon, Nov 28

TJones created T323945: Reindex Russian-language wikis to enable ICU Folding.
Mon, Nov 28, 7:02 PM · Discovery-Search
TJones updated the task description for T272606: [EPIC] Unpack all Elasticsearch analyzers.
Mon, Nov 28, 6:57 PM · Epic, Discovery-Search (Current work)
TJones updated the task description for T147505: [tracking] CirrusSearch: what is updated during re-indexing.
Mon, Nov 28, 4:36 PM · Tracking-Neverending, Epic, Discovery-Search (Current work), Discovery-ARCHIVED
TJones created T323927: Reindex Ukrainian-language wikis to enable unpacked analysis.
Mon, Nov 28, 4:32 PM · Discovery-Search (Current work)
TJones updated the task description for T147505: [tracking] CirrusSearch: what is updated during re-indexing.
Mon, Nov 28, 4:19 PM · Tracking-Neverending, Epic, Discovery-Search (Current work), Discovery-ARCHIVED

Tue, Nov 22

TJones moved T318264: Investigate Unpacking Ukrainian Analyzer from Needs review to To Be Deployed on the Discovery-Search (Current work) board.
Tue, Nov 22, 4:53 PM · MW-1.40-notes (1.40.0-wmf.12; 2022-11-28), Discovery-Search (Current work)

Mon, Nov 21

TJones updated the task description for T323508: The preparation job should discover what index to write to.
Mon, Nov 21, 4:56 PM · MW-1.40-notes (1.40.0-wmf.12; 2022-11-28), Patch-For-Review, Discovery-Search (Current work), CirrusSearch

Fri, Nov 18

TJones moved T318264: Investigate Unpacking Ukrainian Analyzer from In Progress to Needs review on the Discovery-Search (Current work) board.
Fri, Nov 18, 8:32 PM · MW-1.40-notes (1.40.0-wmf.12; 2022-11-28), Discovery-Search (Current work)

Thu, Nov 17

TJones added a comment to T318264: Investigate Unpacking Ukrainian Analyzer.

Full write-up on MediaWiki.

Thu, Nov 17, 10:28 PM · MW-1.40-notes (1.40.0-wmf.12; 2022-11-28), Discovery-Search (Current work)
TJones moved T322044: Reindex Egyptian Arabic and Moroccan Arabic wikis to enable Arabic language analysis from In Progress to Needs review on the Discovery-Search (Current work) board.

Full notes on MediaWiki.

Thu, Nov 17, 5:41 PM · Discovery-Search (Current work)

Tue, Nov 15

TJones claimed T322044: Reindex Egyptian Arabic and Moroccan Arabic wikis to enable Arabic language analysis.
Tue, Nov 15, 7:42 PM · Discovery-Search (Current work)

Mon, Nov 14

TJones renamed T322905: [EPIC] Upgrade Search Platform spark jobs to spark 3 from Upgrade Search Platform spark jobs to spark 3 to [EPIC] Upgrade Search Platform spark jobs to spark 3.
Mon, Nov 14, 4:28 PM · Epic, Discovery-Search

Wed, Nov 9

TJones added a project to T322776: Deploy Ukrainian Analyzer Plugin: Discovery-Search.
Wed, Nov 9, 6:14 PM · Patch-For-Review, Discovery-Search (Current work)
TJones created T322776: Deploy Ukrainian Analyzer Plugin.
Wed, Nov 9, 6:14 PM · Patch-For-Review, Discovery-Search (Current work)

Oct 31 2022

TJones updated the task description for T147505: [tracking] CirrusSearch: what is updated during re-indexing.
Oct 31 2022, 4:26 PM · Tracking-Neverending, Epic, Discovery-Search (Current work), Discovery-ARCHIVED
TJones claimed T318264: Investigate Unpacking Ukrainian Analyzer.
Oct 31 2022, 4:25 PM · MW-1.40-notes (1.40.0-wmf.12; 2022-11-28), Discovery-Search (Current work)
TJones created T322044: Reindex Egyptian Arabic and Moroccan Arabic wikis to enable Arabic language analysis.
Oct 31 2022, 4:24 PM · Discovery-Search (Current work)
TJones moved T319420: Reindex Arabic & Thai wikis to enable unpacked versions from Needs review to Needs Reporting on the Discovery-Search (Current work) board.
Oct 31 2022, 4:19 PM · Discovery-Search (Current work)
TJones moved T316817: Explore Using Arabic Analysis Chain for Egyptian Arabic and Moroccan Arabic from Needs review to Needs Reporting on the Discovery-Search (Current work) board.
Oct 31 2022, 4:18 PM · MW-1.40-notes (1.40.0-wmf.8; 2022-10-31), Discovery-Search (Current work)

Oct 26 2022

TJones added a comment to T321680: Problems with autocompletion in Search bar.

This looks like a repeat of T317381: Reduction in helpfulness and quantity of autocomplete search results. It even uses the same examples as the linked Village Pump discussion. This was a transient error caused by the Elasticsearch 7 update, and has been resolved for a while. If there are no fresh reports of problems, I suggest closing this ticket.

Oct 26 2022, 3:26 PM · CirrusSearch, Discovery-Search
TJones added a comment to T321680: Problems with autocompletion in Search bar.

Random additional thought: It's possible that there was a transient problem with the generation of the autocomplete data, but it is regenerated daily and it seems to be working fine today. My best explanation is that this was a problem for a day, but now it is fixed. I'll bring it up in today's meeting and see if there's any evidence of a problem in the last few days.

Oct 26 2022, 2:40 PM · CirrusSearch, Discovery-Search
TJones added a comment to T321680: Problems with autocompletion in Search bar.

screenshots don't show an image preview, why?

Oct 26 2022, 2:36 PM · CirrusSearch, Discovery-Search

Oct 25 2022

TJones changed the point value for T318264: Investigate Unpacking Ukrainian Analyzer from 5 to 8.

So... the components of the analyzer are all defined together in one object, and the elements are all clear in the code: standard tokenizer, lowercase, stopwords, and stemmer, along with a pre-tokenization char filter on line 50. The stopwords are available as a plaintext file, and the dictionary used for the stemmer has been extracted out into its own separate artifact.
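
As a rough illustration of the order of those components (char filter, then tokenizer, then lowercase, stopwords, and stemmer), here is a minimal pure-Python sketch. It is not the plugin's code: the character map, stopword list, and stem dictionary are tiny hypothetical English stand-ins, and the regex tokenizer only approximates a standard tokenizer.

```python
import re

# Hypothetical toy data; the real stopword file and stemmer dictionary
# are not reproduced here.
CHAR_MAP = {"\u2019": "'"}              # pre-tokenization character mapping
STOPWORDS = {"the", "of"}               # stand-in stopword list
STEM_DICT = {"analyzers": "analyzer"}   # stand-in dictionary-based stemmer


def char_filter(text):
    """Apply character replacements before any tokenization happens."""
    for src, dst in CHAR_MAP.items():
        text = text.replace(src, dst)
    return text


def tokenize(text):
    """Very rough word tokenizer: runs of word characters and apostrophes."""
    return re.findall(r"[\w']+", text)


def analyze(text):
    """Run the chain: char filter -> tokenizer -> lowercase -> stopwords -> stemmer."""
    tokens = tokenize(char_filter(text))
    tokens = [t.lower() for t in tokens]
    tokens = [t for t in tokens if t not in STOPWORDS]
    return [STEM_DICT.get(t, t) for t in tokens]


print(analyze("The Unpacking of Analyzers"))
```

Keeping each stage as a separate function mirrors the point of unpacking: any one stage can be swapped or extended without touching the others.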

Oct 25 2022, 7:18 PM · MW-1.40-notes (1.40.0-wmf.12; 2022-11-28), Discovery-Search (Current work)
TJones updated the task description for T272606: [EPIC] Unpack all Elasticsearch analyzers.
Oct 25 2022, 2:41 PM · Epic, Discovery-Search (Current work)
TJones updated the task description for T272606: [EPIC] Unpack all Elasticsearch analyzers.
Oct 25 2022, 2:41 PM · Epic, Discovery-Search (Current work)
TJones updated the task description for T272606: [EPIC] Unpack all Elasticsearch analyzers.
Oct 25 2022, 2:37 PM · Epic, Discovery-Search (Current work)

Oct 24 2022

TJones moved T318264: Investigate Unpacking Ukrainian Analyzer from Ready for Dev -- SWE to In Progress on the Discovery-Search (Current work) board.
Oct 24 2022, 4:37 PM · MW-1.40-notes (1.40.0-wmf.12; 2022-11-28), Discovery-Search (Current work)

Oct 21 2022

TJones moved T316817: Explore Using Arabic Analysis Chain for Egyptian Arabic and Moroccan Arabic from In Progress to Needs review on the Discovery-Search (Current work) board.
Oct 21 2022, 6:59 PM · MW-1.40-notes (1.40.0-wmf.8; 2022-10-31), Discovery-Search (Current work)
TJones added a comment to T316817: Explore Using Arabic Analysis Chain for Egyptian Arabic and Moroccan Arabic.

Full write-up on MediaWiki.

Oct 21 2022, 6:58 PM · MW-1.40-notes (1.40.0-wmf.8; 2022-10-31), Discovery-Search (Current work)

Oct 20 2022

TJones changed the point value for T319420: Reindex Arabic & Thai wikis to enable unpacked versions from 2 to 3.

Changing story points because this was more involved than usual due to the changes to Thai tokenization.

Oct 20 2022, 8:09 PM · Discovery-Search (Current work)
TJones moved T319420: Reindex Arabic & Thai wikis to enable unpacked versions from In Progress to Needs review on the Discovery-Search (Current work) board.
Oct 20 2022, 8:07 PM · Discovery-Search (Current work)
TJones added a comment to T319420: Reindex Arabic & Thai wikis to enable unpacked versions.

Full write-up on MediaWiki.

Oct 20 2022, 8:07 PM · Discovery-Search (Current work)

Oct 17 2022

TJones claimed T319420: Reindex Arabic & Thai wikis to enable unpacked versions.
Oct 17 2022, 3:51 PM · Discovery-Search (Current work)

Oct 6 2022

TJones moved T294147: Unpack Arabic & Thai Elasticsearch Analyzers from To Be Deployed to Needs Reporting on the Discovery-Search (Current work) board.
Oct 6 2022, 6:03 PM · MW-1.40-notes (1.40.0-wmf.5; 2022-10-10), Discovery-Search (Current work)

Oct 5 2022

TJones created T319420: Reindex Arabic & Thai wikis to enable unpacked versions.
Oct 5 2022, 1:57 PM · Discovery-Search (Current work)
TJones moved T294147: Unpack Arabic & Thai Elasticsearch Analyzers from Needs review to To Be Deployed on the Discovery-Search (Current work) board.
Oct 5 2022, 1:48 PM · MW-1.40-notes (1.40.0-wmf.5; 2022-10-10), Discovery-Search (Current work)

Oct 4 2022

TJones claimed T316817: Explore Using Arabic Analysis Chain for Egyptian Arabic and Moroccan Arabic.
Oct 4 2022, 7:51 PM · MW-1.40-notes (1.40.0-wmf.8; 2022-10-31), Discovery-Search (Current work)
TJones moved T316817: Explore Using Arabic Analysis Chain for Egyptian Arabic and Moroccan Arabic from Ready for Dev -- SWE to In Progress on the Discovery-Search (Current work) board.
Oct 4 2022, 7:51 PM · MW-1.40-notes (1.40.0-wmf.8; 2022-10-31), Discovery-Search (Current work)

Sep 27 2022

TJones moved T294147: Unpack Arabic & Thai Elasticsearch Analyzers from In Progress to Needs review on the Discovery-Search (Current work) board.
Sep 27 2022, 8:08 PM · MW-1.40-notes (1.40.0-wmf.5; 2022-10-10), Discovery-Search (Current work)
TJones added a comment to T294147: Unpack Arabic & Thai Elasticsearch Analyzers.

Full writeup is on MediaWiki.

Sep 27 2022, 7:27 PM · MW-1.40-notes (1.40.0-wmf.5; 2022-10-10), Discovery-Search (Current work)

Sep 26 2022

TJones set the point value for T318269: Test and analyze Kuromoji Japanese language analyzer to 13.

I'm on the fence between 8 & 13 story points (can I say 10?), so I'm going with the bigger number until we talk about it at a later meeting.

Sep 26 2022, 4:01 PM · Discovery-Search (Current work)
TJones set the point value for T318264: Investigate Unpacking Ukrainian Analyzer to 5.
Sep 26 2022, 4:00 PM · MW-1.40-notes (1.40.0-wmf.12; 2022-11-28), Discovery-Search (Current work)
TJones set the point value for T316817: Explore Using Arabic Analysis Chain for Egyptian Arabic and Moroccan Arabic to 5.
Sep 26 2022, 3:59 PM · MW-1.40-notes (1.40.0-wmf.8; 2022-10-31), Discovery-Search (Current work)
TJones updated the task description for T317023: Investigate moving incoming_links computation to a batch job.
Sep 26 2022, 3:28 PM · MW-1.40-notes (1.40.0-wmf.12; 2022-11-28), Patch-For-Review, Discovery-Search (Current work), CirrusSearch
TJones assigned T317283: Coordinate with ServiceOps Team about a rework of the Search Update Pipeline to Gehel.
Sep 26 2022, 3:16 PM · Discovery-Search (Current work), serviceops
TJones assigned T317046: Coordinate with Platform Engineering / Data Value Stream Team about a rework of the Search Update Pipeline to Gehel.
Sep 26 2022, 3:16 PM · Discovery-Search (Current work), Event-Platform Value Stream, Data-Engineering-Planning, Platform Engineering
TJones updated the task description for T272606: [EPIC] Unpack all Elasticsearch analyzers.
Sep 26 2022, 2:27 PM · Epic, Discovery-Search (Current work)
TJones added a comment to T303013: Indicate when search results are from redirects (sometimes).

@Jdlrobson @TJones — what is needed in order to make this change? Is there Search API stuff that needs to be changed, or is it mostly client/front-end work? cc @ovasileva @ldelench_wmf

Sep 26 2022, 1:33 PM · Readers-Web-Backlog, Design-Systems-Team, Codex, Desktop Improvements (Vector 2022)

Sep 21 2022

TJones edited projects for T316817: Explore Using Arabic Analysis Chain for Egyptian Arabic and Moroccan Arabic, added: Discovery-Search (Current work); removed Discovery-Search.
Sep 21 2022, 7:37 PM · MW-1.40-notes (1.40.0-wmf.8; 2022-10-31), Discovery-Search (Current work)
TJones created T318269: Test and analyze Kuromoji Japanese language analyzer.
Sep 21 2022, 7:37 PM · Discovery-Search (Current work)
TJones updated the task description for T272606: [EPIC] Unpack all Elasticsearch analyzers.
Sep 21 2022, 6:47 PM · Epic, Discovery-Search (Current work)
TJones updated the task description for T272606: [EPIC] Unpack all Elasticsearch analyzers.
Sep 21 2022, 6:47 PM · Epic, Discovery-Search (Current work)
TJones created T318264: Investigate Unpacking Ukrainian Analyzer.
Sep 21 2022, 6:46 PM · MW-1.40-notes (1.40.0-wmf.12; 2022-11-28), Discovery-Search (Current work)

Sep 19 2022

TJones changed the point value for T294147: Unpack Arabic & Thai Elasticsearch Analyzers from 5 to 8.

Updating story points. Arabic was easy. Thai is more complicated than default unpacking.

Sep 19 2022, 3:57 PM · MW-1.40-notes (1.40.0-wmf.5; 2022-10-10), Discovery-Search (Current work)

Sep 12 2022

TJones added a comment to T317476: Filter and sort search results of Japanese kana search queries in accordance with how much of the query appears as a consecutive substring.

TL;DR: Sorry not to have better news. Processing of Japanese text on non-Japanese wikis is inconsistent and weird and complicated and hard to change or improve. Searching with the judicious use of double quotes and spaces may help improve accuracy of search results for Japanese kana queries on English-language wikis.


Sep 12 2022, 6:38 PM · Discovery-Search, CirrusSearch

Sep 7 2022

TJones updated the task description for T147505: [tracking] CirrusSearch: what is updated during re-indexing.
Sep 7 2022, 2:33 PM · Tracking-Neverending, Epic, Discovery-Search (Current work), Discovery-ARCHIVED
TJones created T317200: Reindex all wikis to fix nnbsp regression.
Sep 7 2022, 2:31 PM · MW-1.40-notes (1.40.0-wmf.6; 2022-10-17), Discovery-Search (Current work)

Aug 31 2022

TJones created T316817: Explore Using Arabic Analysis Chain for Egyptian Arabic and Moroccan Arabic.
Aug 31 2022, 8:28 PM · MW-1.40-notes (1.40.0-wmf.8; 2022-10-31), Discovery-Search (Current work)

Aug 29 2022

TJones renamed T316087: [tracking] Peter's Onboarding from Peter's Onboarding to [tracking] Peter's Onboarding.
Aug 29 2022, 3:30 PM · Epic, Discovery-Search (Current work)
TJones updated the task description for T315118: Handle variation in apostrophe-like characters better.
Aug 29 2022, 2:40 PM · CirrusSearch, Discovery-Search
TJones added a comment to T311654: Apostrophes do not work well in search on nia.wikipedia.

Follow up task to look at this more broadly: T315118: Handle variation in apostrophe-like characters better

Aug 29 2022, 2:38 PM · MW-1.39-notes (1.39.0-wmf.25; 2022-08-15), Discovery-Search (Current work), CirrusSearch

Aug 24 2022

TJones moved T301131: Test Elastic 7.10 language analyzers from In Progress to Needs review on the Discovery-Search (Current work) board.
Aug 24 2022, 7:14 PM · Discovery-Search (Current work)

Aug 22 2022

TJones added a comment to T301131: Test Elastic 7.10 language analyzers.

Summary:

  • There are no changes to most analyzers between 6.8 and 7.10.
  • The most impactful (and most debatable) changes to the Nori (Korean) tokenizer made between 6.5 and 6.8 have been reverted (keeping the smaller, better changes).
  • The Thai tokenizer now allows some less commonly used Unicode characters through, where before it would delete/ignore them.
  • The problem of narrow non-breaking spaces (NNBSP) that existed in the 6.5 ICU tokenizer and that was introduced in the 6.8 standard tokenizer persists, so I'm going to patch it.
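
The NNBSP issue in the last bullet can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `naive_tokenize` is a hypothetical stand-in for a tokenizer that does not break on U+202F, and mapping NNBSP to a regular space before tokenizing is one plausible shape for a patch, not necessarily the one that was deployed.

```python
NNBSP = "\u202f"  # NARROW NO-BREAK SPACE


def naive_tokenize(text):
    """Split on ASCII space only, so NNBSP does not act as a word boundary."""
    return [t for t in text.split(" ") if t]


def patched_tokenize(text):
    """Assumed fix: map NNBSP to a regular space before tokenizing."""
    return naive_tokenize(text.replace(NNBSP, " "))


sample = "10" + NNBSP + "km away"
print(naive_tokenize(sample))    # "10" and "km" stay glued together
print(patched_tokenize(sample))  # "10" and "km" become separate tokens
```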
Aug 22 2022, 9:34 PM · Discovery-Search (Current work)
TJones added a comment to T311654: Apostrophes do not work well in search on nia.wikipedia.

FYI: The reindexing to enable the new apostrophe handling is complete. I checked and searching for Hili'adulo, Hili’adulo, or Hili‘adulo all return the same 66 results.
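
A minimal Python sketch of the idea, assuming the handling amounts to folding apostrophe-like characters to the ASCII apostrophe before matching. The two-entry map below covers only the curly quotes from the example above; the actual configuration is not reproduced here.

```python
# Hypothetical, deliberately short map of apostrophe-like characters.
APOSTROPHE_LIKE = {
    "\u2018": "'",  # LEFT SINGLE QUOTATION MARK
    "\u2019": "'",  # RIGHT SINGLE QUOTATION MARK
}
_TABLE = str.maketrans(APOSTROPHE_LIKE)


def normalize_apostrophes(text):
    """Fold apostrophe-like characters to the ASCII apostrophe."""
    return text.translate(_TABLE)


variants = ["Hili'adulo", "Hili\u2019adulo", "Hili\u2018adulo"]
# All three spellings collapse to a single normalized form.
print({normalize_apostrophes(v) for v in variants})
```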

Aug 22 2022, 5:31 PM · MW-1.39-notes (1.39.0-wmf.25; 2022-08-15), Discovery-Search (Current work), CirrusSearch
TJones added a comment to T315907: Reindex Nias Wikis to enable better apostrophe handling.

There are about 10K pages between Nias Wikipedia and Wiktionary, so reindexing only took a few minutes. All done!

Aug 22 2022, 5:25 PM · Discovery-Search (Current work), CirrusSearch
TJones moved T315907: Reindex Nias Wikis to enable better apostrophe handling from In Progress to Needs Reporting on the Discovery-Search (Current work) board.
Aug 22 2022, 5:25 PM · Discovery-Search (Current work), CirrusSearch
TJones updated the task description for T147505: [tracking] CirrusSearch: what is updated during re-indexing.
Aug 22 2022, 5:24 PM · Tracking-Neverending, Epic, Discovery-Search (Current work), Discovery-ARCHIVED
TJones updated the task description for T147505: [tracking] CirrusSearch: what is updated during re-indexing.
Aug 22 2022, 5:20 PM · Tracking-Neverending, Epic, Discovery-Search (Current work), Discovery-ARCHIVED
TJones moved T315907: Reindex Nias Wikis to enable better apostrophe handling from Incoming to In Progress on the Discovery-Search (Current work) board.
Aug 22 2022, 5:17 PM · Discovery-Search (Current work), CirrusSearch
TJones updated the task description for T315907: Reindex Nias Wikis to enable better apostrophe handling.
Aug 22 2022, 5:17 PM · Discovery-Search (Current work), CirrusSearch
TJones created T315907: Reindex Nias Wikis to enable better apostrophe handling.
Aug 22 2022, 5:14 PM · Discovery-Search (Current work), CirrusSearch
TJones updated the task description for T147505: [tracking] CirrusSearch: what is updated during re-indexing.
Aug 22 2022, 3:14 PM · Tracking-Neverending, Epic, Discovery-Search (Current work), Discovery-ARCHIVED
TJones closed T287718: Look into improving Dagbani Search as Declined.

The initial enthusiasm died down without any activity in a year, so I'm going to close this ticket.

Aug 22 2022, 2:29 PM · Dagbani-Sites, Discovery-Search
TJones closed T287719: Look into improving Igbo Search as Declined.

The initial enthusiasm died down without any activity in a year, so I'm going to close this ticket.

Aug 22 2022, 2:28 PM · Igbo-Wikimedians-User-Group, Discovery-Search
TJones placed T258094: Improve Breton language analysis up for grabs.
Aug 22 2022, 2:24 PM · Discovery-Search

Aug 18 2022

TJones moved T315265: Reindex Bengali wikis to enable new analyzer from In Progress to Needs Reporting on the Discovery-Search (Current work) board.
Aug 18 2022, 5:40 PM · Discovery-Search (Current work)
TJones added a comment to T315265: Reindex Bengali wikis to enable new analyzer.

Reindexing is done. Write-up on MediaWiki.

Aug 18 2022, 5:40 PM · Discovery-Search (Current work)

Aug 17 2022

TJones archived P32456 Trey's ES 7 error.
Aug 17 2022, 6:13 PM
TJones created P32456 Trey's ES 7 error.
Aug 17 2022, 3:31 PM

Aug 16 2022

TJones claimed T301131: Test Elastic 7.10 language analyzers.
Aug 16 2022, 4:33 PM · Discovery-Search (Current work)
TJones claimed T294147: Unpack Arabic & Thai Elasticsearch Analyzers.
Aug 16 2022, 4:32 PM · MW-1.40-notes (1.40.0-wmf.5; 2022-10-10), Discovery-Search (Current work)
TJones moved T301131: Test Elastic 7.10 language analyzers from Ready for Dev -- SWE to In Progress on the Discovery-Search (Current work) board.
Aug 16 2022, 4:32 PM · Discovery-Search (Current work)
TJones moved T294147: Unpack Arabic & Thai Elasticsearch Analyzers from Ready for Dev -- SWE to In Progress on the Discovery-Search (Current work) board.
Aug 16 2022, 4:32 PM · MW-1.40-notes (1.40.0-wmf.5; 2022-10-10), Discovery-Search (Current work)

Aug 15 2022

TJones set the point value for T315265: Reindex Bengali wikis to enable new analyzer to 2.
Aug 15 2022, 10:43 PM · Discovery-Search (Current work)
TJones claimed T315265: Reindex Bengali wikis to enable new analyzer.
Aug 15 2022, 10:42 PM · Discovery-Search (Current work)
TJones updated the task description for T147505: [tracking] CirrusSearch: what is updated during re-indexing.
Aug 15 2022, 10:40 PM · Tracking-Neverending, Epic, Discovery-Search (Current work), Discovery-ARCHIVED
TJones moved T294067: Install and unpack Bengali analyzer from In Progress to Needs Reporting on the Discovery-Search (Current work) board.

Write up is complete (though the code has been merged for a while): Bengali enabling/unpacking notes.

Aug 15 2022, 10:39 PM · Discovery-Search (Current work)
TJones created T315265: Reindex Bengali wikis to enable new analyzer.
Aug 15 2022, 10:39 PM · Discovery-Search (Current work)

Aug 12 2022

TJones created T315118: Handle variation in apostrophe-like characters better.
Aug 12 2022, 9:11 PM · CirrusSearch, Discovery-Search
TJones moved T311654: Apostrophes do not work well in search on nia.wikipedia from Ready for Dev -- SWE to Needs review on the Discovery-Search (Current work) board.
Aug 12 2022, 7:54 PM · MW-1.39-notes (1.39.0-wmf.25; 2022-08-15), Discovery-Search (Current work), CirrusSearch
TJones assigned T311654: Apostrophes do not work well in search on nia.wikipedia to EJoseph.

@EJoseph and I looked at this over the last couple of days. In addition to the curly quotes, we found some additional characters being used as apostrophes. The config Emmanuel submitted a patch for converts all of the following letters to apostrophes:

Aug 12 2022, 7:53 PM · MW-1.39-notes (1.39.0-wmf.25; 2022-08-15), Discovery-Search (Current work), CirrusSearch

Aug 11 2022

TJones added a comment to T311654: Apostrophes do not work well in search on nia.wikipedia.

(Updated title to reflect focus on apostrophes after discussion here.)

Aug 11 2022, 2:49 AM · MW-1.39-notes (1.39.0-wmf.25; 2022-08-15), Discovery-Search (Current work), CirrusSearch
TJones renamed T311654: Apostrophes do not work well in search on nia.wikipedia from Diacritics and apostrophes do not work well in search on nia.wikipedia to Apostrophes do not work well in search on nia.wikipedia.
Aug 11 2022, 2:49 AM · MW-1.39-notes (1.39.0-wmf.25; 2022-08-15), Discovery-Search (Current work), CirrusSearch

Aug 8 2022

TJones renamed T170625: Smarter handling of acronyms for word_break_helper in language analyzers from Investigate disabling or modifying word_break_helper in language analyzers. to Smarter handling of acronyms for word_break_helper in language analyzers.
Aug 8 2022, 8:33 PM · Discovery-Search
TJones edited projects for T313973: GrowthExperiments\NewcomerTasks\AddImage\ServiceImageRecommendationProvider::get Unable to decode JSON response for page {title} upstream connect error or disconnect/reset before headers. reset reason: connection termination, added: Discovery-Search; removed Discovery-Search (Current work).
Aug 8 2022, 3:29 PM · Structured-Data-Backlog, Structured Data Engineering, Patch-For-Review, serviceops, API Platform, MW-1.39-notes (1.39.0-wmf.25; 2022-08-15), Platform Engineering, Growth-Team (Current Sprint), Image-Suggestions, Growth-Structured-Tasks, Wikimedia-production-error
TJones updated the task description for T272606: [EPIC] Unpack all Elasticsearch analyzers.
Aug 8 2022, 2:11 PM · Epic, Discovery-Search (Current work)
TJones renamed T294147: Unpack Arabic & Thai Elasticsearch Analyzers from Unpack Arabic, Latvian, Thai Elasticsearch Analyzers to Unpack Arabic & Thai Elasticsearch Analyzers.
Aug 8 2022, 2:08 PM · MW-1.40-notes (1.40.0-wmf.5; 2022-10-10), Discovery-Search (Current work)

Jul 28 2022

TJones raised the priority of T72899: Search box needs some normalization for Arabic Family languages from Low to Medium.
Jul 28 2022, 3:26 PM · Discovery-Search, Discovery-ARCHIVED, CirrusSearch, I18n, MediaWiki-Search

Jul 25 2022

TJones updated the task description for T272606: [EPIC] Unpack all Elasticsearch analyzers.
Jul 25 2022, 4:04 PM · Epic, Discovery-Search (Current work)
TJones closed T108500: Searching for "kentai kessen" (a typo of "kantai kessen"), you get results for "kendal kessel" on en.wp as Invalid.

As @Aklapper points out, the completion suggester (drop-down suggestions) gives the correct result, which is the best way to deal with typos.

Jul 25 2022, 3:37 PM · Discovery-Search, CirrusSearch, Discovery-ARCHIVED