TJones (Trey Jones)
Sr. Computational Linguist, Search Platform Team

User Details

User Since
Jul 8 2015, 3:02 PM (294 w, 6 d)
Availability
Available
IRC Nick
Trey314159
LDAP User
Tjones
MediaWiki User
TJones (WMF) [ Global Accounts ]

I would have written a shorter comment, but I did not have the time.

I'm part of the Search Platform team and I spend my time working on search & relevance, trying to better support search in various languages, analyzing queries, and doing random mathy things. I tend to write long, detailed notes about my investigations (so as to improve the bus number of my work).

When I have to work on _GitHub,_ /‍‍/Phab,/‍‍/ and ''MediaWiki'' all on the same day, I sometimes suffer Severe Markup Incongruence Fatigue.

I � Unicode.

Recent Activity

Mon, Mar 1

TJones added a comment to T274204: Deploy new version of Extra Plugin (with Khmer filter) to Elasticsearch cluster.

@RKemper, I meant to bring this up in today's meeting, but it slipped my mind. Anything I can do to help?

Mon, Mar 1, 9:58 PM · Discovery-Search (Current work)
TJones updated the task description for T276169: Don't make DYM suggestions with negation in them (Glent).
Mon, Mar 1, 8:34 PM · Discovery-Search
TJones created T276169: Don't make DYM suggestions with negation in them (Glent).
Mon, Mar 1, 8:33 PM · Discovery-Search
TJones added a comment to T274908: How to exclude or lower the priority of subpages in search results?.

A possible partial workaround is to use the intitle: regex feature to exclude titles with a slash in them.
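As a rough illustration of that workaround, here is a minimal sketch that appends a negated title regex to a query and builds a search API URL. It assumes the CirrusSearch `intitle:/regex/` syntax can be negated with a leading `-` and that a literal slash is escaped as `\/`. The endpoint URL and the helper names `build_query` and `build_search_url` are hypothetical choices for this example, not anything from the task.

```python
from urllib.parse import urlencode

# Assumption: a standard MediaWiki API endpoint; swap in the wiki you care about.
API_URL = "https://en.wikipedia.org/w/api.php"


def build_query(terms: str) -> str:
    """Append a negated title regex so titles containing a slash are excluded."""
    # -intitle:/\// = "title does NOT match a regex for a literal slash"
    return f"{terms} -intitle:/\\//"


def build_search_url(terms: str, api_url: str = API_URL) -> str:
    """Build a list=search API URL for the slash-excluding query."""
    params = {
        "action": "query",
        "list": "search",
        "srsearch": build_query(terms),
        "format": "json",
    }
    return f"{api_url}?{urlencode(params)}"


if __name__ == "__main__":
    print(build_query("template documentation"))
    print(build_search_url("template documentation"))
```

This only excludes subpages outright; it does not lower their ranking, which is why it is a partial workaround at best.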

Mon, Mar 1, 7:35 PM · Editing-team (Tracking), WMDE-Templates-FocusArea, VisualEditor, WMDE-TechWish, VisualEditor-MediaWiki-Templates, Discovery-Search, CirrusSearch
TJones renamed T275782: "Execute Query" button on WDQS not visible for long text query from "Execute Query" button on WDQS not vissible for long text query to "Execute Query" button on WDQS not visible for long text query.
Mon, Mar 1, 4:25 PM · Wikidata-Query-Service, Wikidata

Wed, Feb 24

TJones claimed T265081: Fix Chinese Analysis Chain for Glent M2.
Wed, Feb 24, 10:10 PM · Discovery-Search (Current work), Chinese-Sites
TJones moved T267971: Analyze Speaker-Reviewed M2 Data for Chinese from In Progress to Needs review on the Discovery-Search (Current work) board.
Wed, Feb 24, 10:06 PM · Discovery-Search (Current work), Chinese-Sites
TJones added a comment to T267971: Analyze Speaker-Reviewed M2 Data for Chinese.

Summary: The stats for Glent M2 suggestions for Chinese are roughly similar to those for Korean and Japanese. A big difference is that a fair number of suggestions are traditional-to-simplified conversions. These were rated as good suggestions, but they probably don't make much difference in search results, since we already do traditional-to-simplified conversion for indexing and searching behind the scenes. It is possible, though, that Glent's traditional-to-simplified conversion is better for these queries than our rule-based one for searching and indexing.

Wed, Feb 24, 10:05 PM · Discovery-Search (Current work), Chinese-Sites

Wed, Feb 17

TJones added a comment to T8373: Provide a list of zero-result searches.

In T8373#2508574, @MarkAHershberger wrote:

debt writes:

I'd like to close this ticket out - but wanted to get thoughts on it first.

Please leave it open. This isn't something that WMF is interested in, but other users of MediaWiki are very much interested in this and don't have the same long tail problem.

Wed, Feb 17, 10:01 PM · MediaWiki-Search

Thu, Feb 11

TJones claimed T267971: Analyze Speaker-Reviewed M2 Data for Chinese.
Thu, Feb 11, 6:15 PM · Discovery-Search (Current work), Chinese-Sites
TJones moved T267971: Analyze Speaker-Reviewed M2 Data for Chinese from Ready for Development to In Progress on the Discovery-Search (Current work) board.
Thu, Feb 11, 6:14 PM · Discovery-Search (Current work), Chinese-Sites

Tue, Feb 9

TJones moved T271249: Update TextCat / Language ID documentation from In Progress to Needs review on the Discovery-Search (Current work) board.
Tue, Feb 9, 10:31 PM · Discovery-Search (Current work), CirrusSearch
TJones added a comment to T271249: Update TextCat / Language ID documentation.

The updated docs are here: https://www.mediawiki.org/wiki/TextCat

Tue, Feb 9, 10:31 PM · Discovery-Search (Current work), CirrusSearch
TJones claimed T271249: Update TextCat / Language ID documentation.
Tue, Feb 9, 4:57 PM · Discovery-Search (Current work), CirrusSearch
TJones moved T271249: Update TextCat / Language ID documentation from Ready for Development to In Progress on the Discovery-Search (Current work) board.
Tue, Feb 9, 4:56 PM · Discovery-Search (Current work), CirrusSearch

Mon, Feb 8

TJones updated the task description for T274205: Reindex Khmer wikis to enable Khmer syllable reordering.
Mon, Feb 8, 10:13 PM · Discovery-Search (Current work)
TJones renamed T274205: Reindex Khmer wikis to enable Khmer syllable reordering from Reindex Khmer wikis to enable to Reindex Khmer wikis to enable Khmer syllable reordering.
Mon, Feb 8, 10:13 PM · Discovery-Search (Current work)
TJones updated the task description for T147505: [tracking] CirrusSearch: what is updated during re-indexing.
Mon, Feb 8, 10:12 PM · Tracking-Neverending, Epic, Discovery-Search (Current work), Discovery
TJones created T274205: Reindex Khmer wikis to enable Khmer syllable reordering.
Mon, Feb 8, 10:10 PM · Discovery-Search (Current work)
TJones updated the task description for T274204: Deploy new version of Extra Plugin (with Khmer filter) to Elasticsearch cluster.
Mon, Feb 8, 10:07 PM · Discovery-Search (Current work)
TJones created T274204: Deploy new version of Extra Plugin (with Khmer filter) to Elasticsearch cluster.
Mon, Feb 8, 10:06 PM · Discovery-Search (Current work)
TJones created T274203: Build Extra Plugin with extra-analysis-khmer and deploy to Maven Central.
Mon, Feb 8, 10:03 PM · Discovery-Search (Current work)
TJones edited projects for T274200: Reindex English and Italian wikis to enable homoglyph plugin, added: Discovery-Search; removed Discovery-Search (Current work), CirrusSearch.
Mon, Feb 8, 9:48 PM · Discovery-Search (Current work)
TJones updated the task description for T147505: [tracking] CirrusSearch: what is updated during re-indexing.
Mon, Feb 8, 9:48 PM · Tracking-Neverending, Epic, Discovery-Search (Current work), Discovery
TJones created T274200: Reindex English and Italian wikis to enable homoglyph plugin.
Mon, Feb 8, 9:46 PM · Discovery-Search (Current work)
TJones edited projects for T265081: Fix Chinese Analysis Chain for Glent M2, added: Discovery-Search (Current work); removed Discovery-Search.
Mon, Feb 8, 4:30 PM · Discovery-Search (Current work), Chinese-Sites
TJones edited projects for T267971: Analyze Speaker-Reviewed M2 Data for Chinese, added: Discovery-Search (Current work); removed Discovery-Search.
Mon, Feb 8, 4:30 PM · Discovery-Search (Current work), Chinese-Sites

Fri, Feb 5

TJones edited projects for T258094: Improve Breton language analysis, added: Discovery-Search (Current work); removed Discovery-Search.
Fri, Feb 5, 3:28 PM · Discovery-Search (Current work)
TJones added a comment to T258094: Improve Breton language analysis.

Sorry this has been languishing for so long. It doesn't look like we're making a lot of progress with the stop word list, so I'm going to work on the elision and ICU folding.

Fri, Feb 5, 3:27 PM · Discovery-Search (Current work)
TJones moved T185721: Null or inconsistent search results using Khmer script from Needs review to To Be Deployed on the Discovery-Search (Current work) board.
Fri, Feb 5, 2:12 AM · MW-1.36-notes (1.36.0-wmf.30; 2021-02-09), Discovery-Search (Current work), CirrusSearch, Discovery
TJones moved T268730: startOffset must be non-negative, and endOffset must be >= startOffset, and offsets must not go backwards from Needs review to To Be Deployed on the Discovery-Search (Current work) board.
Fri, Feb 5, 2:12 AM · MW-1.36-notes (1.36.0-wmf.30; 2021-02-09), Discovery-Search (Current work), CirrusSearch

Thu, Feb 4

TJones moved T268730: startOffset must be non-negative, and endOffset must be >= startOffset, and offsets must not go backwards from In Progress to Needs review on the Discovery-Search (Current work) board.
Thu, Feb 4, 9:22 PM · MW-1.36-notes (1.36.0-wmf.30; 2021-02-09), Discovery-Search (Current work), CirrusSearch

Tue, Feb 2

TJones claimed T268730: startOffset must be non-negative, and endOffset must be >= startOffset, and offsets must not go backwards.
Tue, Feb 2, 8:27 PM · MW-1.36-notes (1.36.0-wmf.30; 2021-02-09), Discovery-Search (Current work), CirrusSearch
TJones added a comment to T185721: Null or inconsistent search results using Khmer script .

Full write up is on MediaWiki: Khmer Reordering Analysis Analysis.

Tue, Feb 2, 6:30 PM · MW-1.36-notes (1.36.0-wmf.30; 2021-02-09), Discovery-Search (Current work), CirrusSearch, Discovery

Jan 28 2021

TJones moved T185721: Null or inconsistent search results using Khmer script from In Progress to Needs review on the Discovery-Search (Current work) board.
Jan 28 2021, 6:47 PM · MW-1.36-notes (1.36.0-wmf.30; 2021-02-09), Discovery-Search (Current work), CirrusSearch, Discovery

Jan 25 2021

TJones added a comment to T268186: Document how to replace or update "text", "plain" and other analyzers.

Since David found a way to hook into the system and change or replace analyzers, and that seems to meet the spirit of the original ticket, I've updated this to a documentation task.

Jan 25 2021, 5:46 PM · Discovery-Search, CirrusSearch
TJones renamed T268186: Document how to replace or update "text", "plain" and other analyzers from Investigate adding a simple hook to replace "text" and "plain" analyzers to Document how to replace or update "text", "plain" and other analyzers.
Jan 25 2021, 5:45 PM · Discovery-Search, CirrusSearch

Jan 22 2021

TJones added a comment to P13907 Customize CirrusSearch for czech.

Added a comment about ICU folding, too.

Jan 22 2021, 7:46 PM
TJones edited P13907 Customize CirrusSearch for czech.
Jan 22 2021, 7:45 PM
TJones added a comment to P13907 Customize CirrusSearch for czech.

I edited one comment, added a comment to the text_search config, and removed a duplicate line adding asciifolding_preserve to plain. Looks good!

Jan 22 2021, 7:35 PM
TJones edited P13907 Customize CirrusSearch for czech.
Jan 22 2021, 7:33 PM
TJones edited P13907 Customize CirrusSearch for czech.
Jan 22 2021, 7:28 PM

Jan 19 2021

TJones added a comment to T266027: Test perfield_builder on spaceless languages.

A total of 6,046,684 events have been collected between 2012-12-09T00:00:00Z and 2012-12-16T00:00:00Z,

Jan 19 2021, 9:04 PM · Patch-For-Review, MW-1.36-notes (1.36.0-wmf.21; 2020-12-08), Chinese-Sites, Discovery-Search (Current work), CirrusSearch

Jan 14 2021

TJones added a comment to T258055: [L] [SPIKE] Investigate traversing entities tree to include more entities with more detail.

After some additional experimentation, it's worth expanding on my earlier summary with some more findings.

Jan 14 2021, 4:42 PM · Patch-For-Review, SDAW-MediaSearch (MediaSearch-ReleaseCandidate), Structured-Data-Backlog (Current Work)

Jan 13 2021

TJones added a comment to T260957: Show correct create page/page exists message on wikis with multiple writing systems.

I checked the Serbian, Inuktitut, and Crimean Tatar examples and they all work now, too! Stick a fork in it; it's done! Thanks, @dcausse!!

Jan 13 2021, 7:20 PM · MW-1.36-notes (1.36.0-wmf.22; 2020-12-15), Serbian-Sites, Discovery-Search (Current work), Chinese-Sites

Jan 12 2021

TJones moved T185721: Null or inconsistent search results using Khmer script from Needs review to In Progress on the Discovery-Search (Current work) board.
Jan 12 2021, 6:55 PM · MW-1.36-notes (1.36.0-wmf.30; 2021-02-09), Discovery-Search (Current work), CirrusSearch, Discovery

Jan 8 2021

TJones added a comment to T262566: Enable DWIM support for Vue.js search.

This was likely based on the Arabic gadget https://ar.wikipedia.org/wiki/%D9%85%D9%8A%D8%AF%D9%8A%D8%A7%D9%88%D9%8A%D9%83%D9%8A:Gadget-Dwim.js so there's probably an issue there.

Jan 8 2021, 9:56 PM · Tech-Product API Roadmap, MW-1.36-notes (1.36.0-wmf.25; 2021-01-05), Patch-For-Review, Readers-Web-Backlog (Kanbanana-FY-2020-21), Vue.js (Vue.js-Search)
TJones added a comment to T262566: Enable DWIM support for Vue.js search.

@Jdlrobson & @ovasileva—sorry for the late reply; there was end-of-year holiday-making and then this week has been busy and … distracting.

Jan 8 2021, 9:17 PM · Tech-Product API Roadmap, MW-1.36-notes (1.36.0-wmf.25; 2021-01-05), Patch-For-Review, Readers-Web-Backlog (Kanbanana-FY-2020-21), Vue.js (Vue.js-Search)

Jan 5 2021

TJones added a comment to T270848: Investigate ways to make language identification case-insensitive.

Making a note here for now regarding: "regardless of the case of the query, because that usually doesn't matter to me".
There are instances in English where cased searching could matter a lot, e.g., cat ~ CAT (scan).

Jan 5 2021, 10:03 PM · CirrusSearch, Discovery-Search
TJones renamed T270848: Investigate ways to make language identification case-insensitive from Cross-wiki searching shows results from Russian with lowercase letters, but not with uppercase letters to Investigate ways to make language identification case-insensitive.
Jan 5 2021, 8:46 PM · CirrusSearch, Discovery-Search
TJones renamed T271249: Update TextCat / Language ID documentation from Update TextCat documentation to Update TextCat / Language ID documentation.
Jan 5 2021, 8:28 PM · Discovery-Search (Current work), CirrusSearch
TJones created T271249: Update TextCat / Language ID documentation.
Jan 5 2021, 8:27 PM · Discovery-Search (Current work), CirrusSearch
TJones renamed T270847: Investigate ways of including or indicating results in more than one language when showing cross-language results from Ukrainian Wikipedia is only sometimes shown in cross-wiki search results even if a relevant result is available to Investigate ways of including or indicating results in more than one language when showing cross-language results.
Jan 5 2021, 6:55 PM · CirrusSearch, Discovery-Search
TJones added a comment to T270847: Investigate ways of including or indicating results in more than one language when showing cross-language results.
  • It doesn't show that the Ukrainian Wikipedia exists, and that there is such an article there. Same for Bulgarian, Kazakh and the other languages I mentioned.

... they'll think that Wikipedia is a thing that only exists in English and Russian, but not in their language. It may sound weird, but I absolutely met people who are sure that there is no Wikipedia in their language, even though there is...

Jan 5 2021, 6:15 PM · CirrusSearch, Discovery-Search
TJones added a comment to T270848: Investigate ways to make language identification case-insensitive.

@Amire80, is language detection being case-sensitive enough of an explanation, and can we close this ticket? Overall the case-sensitivity is helpful, but there will always be edge cases where the case difference puts different words on different sides of a meaningful threshold. If not, that's fine and I'll update this ticket as a research/feature request.

Jan 5 2021, 5:50 PM · CirrusSearch, Discovery-Search

Jan 4 2021

TJones added a comment to T270848: Investigate ways to make language identification case-insensitive.

I'm also going to update the TextCat documentation to include more of this information (though a little less technically, probably) for future reference.

Jan 4 2021, 9:22 PM · CirrusSearch, Discovery-Search
TJones added a comment to T270847: Investigate ways of including or indicating results in more than one language when showing cross-language results.

I'm also going to update the TextCat documentation to include more of this information (though a little less technically, probably) for future reference.

Jan 4 2021, 9:22 PM · CirrusSearch, Discovery-Search
TJones added a comment to T270848: Investigate ways to make language identification case-insensitive.

I've addressed this in more detail over in T270847 since a lot of the background and context is the same—see my previous comment there.

Jan 4 2021, 9:20 PM · CirrusSearch, Discovery-Search
TJones added a comment to T270847: Investigate ways of including or indicating results in more than one language when showing cross-language results.

I'm going to address T270848 here, too, since a lot of the background and context is the same. I'll put a summary over there and a link back to here.

Jan 4 2021, 9:18 PM · CirrusSearch, Discovery-Search
TJones added a comment to T270847: Investigate ways of including or indicating results in more than one language when showing cross-language results.

This is working as designed, though that may not be obvious from the behavior you are seeing.

Jan 4 2021, 9:17 PM · CirrusSearch, Discovery-Search
TJones updated the task description for T270614: Automatically depool wdqs servers that are "lagged".
Jan 4 2021, 4:27 PM · Wikidata, Wikidata-Query-Service

Dec 14 2020

TJones renamed T269819: Undesirable results when searching English words on Persian Wikipedia from Unexpected search result to Undesirable results when searching English words on Persian Wikipedia.
Dec 14 2020, 3:56 PM · Discovery-Search, CirrusSearch
TJones edited projects for T269819: Undesirable results when searching English words on Persian Wikipedia, added: Discovery-Search; removed Discovery-Search (Current work).
Dec 14 2020, 3:55 PM · Discovery-Search, CirrusSearch
TJones moved T185721: Null or inconsistent search results using Khmer script from In Progress to Needs review on the Discovery-Search (Current work) board.
Dec 14 2020, 3:52 PM · MW-1.36-notes (1.36.0-wmf.30; 2021-02-09), Discovery-Search (Current work), CirrusSearch, Discovery

Dec 3 2020

TJones added a comment to P13502 blazegraph collation ternary vs identical.

I sorted the results, which makes them a little easier to read through, since similar character classes are grouped together.

Dec 3 2020, 6:53 PM
TJones edited P13502 blazegraph collation ternary vs identical.
Dec 3 2020, 6:52 PM

Dec 2 2020

TJones added a comment to T265931: Set up dashboard to track resource usage for Commons and Wikidata Elasticsearch indexes.

This is cool, Erik. Thanks for all the details in the write up. I'm not the target audience, but I always appreciate graphs and big piles of numbers! Gathering page cache data every ~30 minutes also seems way better than none at all.

Dec 2 2020, 11:47 PM · Discovery-Search (Current work)

Nov 30 2020

TJones renamed T268648: [EPIC] MediaSearch should use a dedicated service/query for doing its concept-lookup instead of the wikidata search API from MediaSearch should use a dedicated service/query for doing its concept-lookup instead of the wikidata search API to [EPIC] MediaSearch should use a dedicated service/query for doing its concept-lookup instead of the wikidata search API.
Nov 30 2020, 4:28 PM · Epic, Structured-Data-Backlog, SDAW-MediaSearch (MediaSearch-ReleaseCandidate2), Discovery-Search, CirrusSearch

Nov 25 2020

TJones updated the task description for T268730: startOffset must be non-negative, and endOffset must be >= startOffset, and offsets must not go backwards.
Nov 25 2020, 8:23 PM · MW-1.36-notes (1.36.0-wmf.30; 2021-02-09), Discovery-Search (Current work), CirrusSearch
TJones updated the task description for T268788: Create Elasticsearch filter so we can do aggressive_splitting without causing an invalid token order.
Nov 25 2020, 8:22 PM · Discovery-Search, CirrusSearch
TJones updated the task description for T268730: startOffset must be non-negative, and endOffset must be >= startOffset, and offsets must not go backwards.
Nov 25 2020, 8:18 PM · MW-1.36-notes (1.36.0-wmf.30; 2021-02-09), Discovery-Search (Current work), CirrusSearch
TJones created T268788: Create Elasticsearch filter so we can do aggressive_splitting without causing an invalid token order.
Nov 25 2020, 8:16 PM · Discovery-Search, CirrusSearch
TJones added a comment to T268730: startOffset must be non-negative, and endOffset must be >= startOffset, and offsets must not go backwards.

I've uploaded a patch to just disable homoglyphs for English- and Italian-language wikis. That gives us time to think about and test a more permanent solution that allows the homoglyph plugin to be enabled without breaking things.

Nov 25 2020, 4:39 PM · MW-1.36-notes (1.36.0-wmf.30; 2021-02-09), Discovery-Search (Current work), CirrusSearch

Nov 24 2020

TJones added a comment to T222669: Normalize homoglyphs in mixed-script tokens when possible.

TL;DR:

  • Inconsistencies between codfw and eqiad all come down to "failed to process cluster event"
  • There are real inconsistencies between cloudelastic and codfw/eqiad
    • How out of date is cloudelastic? It seems odd that so many would get mixed homoglyph text all at once.
  • All the offset failures are in wikis with either en or it as their language, so we need to fix the English and Italian analysis chains and reindex all wikis with those languages.
  • We need a better way to capture failures when we reindex everything.
Nov 24 2020, 6:37 PM · MW-1.36-notes (1.36.0-wmf.16; 2020-11-03), Discovery-Search (Current work), Wikimedia-Hackathon-2019, I18n

Nov 23 2020

TJones updated subscribers of T222669: Normalize homoglyphs in mixed-script tokens when possible.

I have a hypothesis: for English, the interaction between homoglyph_norm and aggressive_splitting is causing the problem, creating an invalid token graph.
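To make the failure mode concrete, here is a small sketch of the three conditions named in the T268730 error message, applied to a token stream. It is not the Lucene or Elasticsearch implementation. The function name, the (text, startOffset, endOffset) tuple layout, and the reading of "go backwards" as "start offset smaller than an earlier token's start offset" are all assumptions made for illustration.

```python
from typing import Iterable, List, Tuple

Token = Tuple[str, int, int]  # (text, startOffset, endOffset); hypothetical layout


def find_offset_violations(tokens: Iterable[Token]) -> List[str]:
    """Report tokens that break the offset rules quoted in the error message."""
    problems: List[str] = []
    last_start = 0  # largest start offset seen so far
    for text, start, end in tokens:
        if start < 0:
            problems.append(f"{text!r}: startOffset {start} is negative")
        elif end < start:
            problems.append(f"{text!r}: endOffset {end} < startOffset {start}")
        elif start < last_start:
            problems.append(
                f"{text!r}: startOffset {start} goes backwards from {last_start}"
            )
        last_start = max(last_start, start)
    return problems


if __name__ == "__main__":
    # Made-up token stream in which a later token starts before an earlier one.
    print(find_offset_violations([("ab", 3, 5), ("a", 0, 1)]))
```

A filter that emits extra tokens after a splitting filter has already advanced through the text could plausibly produce the third kind of violation, which matches the hypothesis above.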

Nov 23 2020, 10:36 PM · MW-1.36-notes (1.36.0-wmf.16; 2020-11-03), Discovery-Search (Current work), Wikimedia-Hackathon-2019, I18n
TJones added a comment to T222669: Normalize homoglyphs in mixed-script tokens when possible.

It looks like it didn't take for some of the wikis. You can check the Cirrus settings dump (e.g., for enwiki) and search the page for "text" (with quotes). The first item in the "filter" array should be "homoglyph_norm", but isn't.
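The manual check described above could be scripted along these lines. This is only a sketch: it assumes the settings dump has already been parsed into a dict and that the "text" analyzer sits under `analysis` > `analyzer` > `text` with a `filter` list. That nesting and the function name are assumptions, not confirmed by the comment.

```python
from typing import Any, Dict


def text_analyzer_starts_with(
    settings: Dict[str, Any], filter_name: str = "homoglyph_norm"
) -> bool:
    """Return True if the "text" analyzer's first filter is `filter_name`."""
    # Assumed layout: settings["analysis"]["analyzer"]["text"]["filter"] is a list.
    analyzer = settings.get("analysis", {}).get("analyzer", {}).get("text", {})
    filters = analyzer.get("filter", [])
    return bool(filters) and filters[0] == filter_name


if __name__ == "__main__":
    # Hypothetical, heavily trimmed settings for demonstration only.
    sample = {
        "analysis": {
            "analyzer": {"text": {"filter": ["homoglyph_norm", "lowercase"]}}
        }
    }
    print(text_analyzer_starts_with(sample))
```

Running a check like this across every wiki's settings would show which ones the change "didn't take" for, without opening each dump by hand.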

Nov 23 2020, 8:02 PM · MW-1.36-notes (1.36.0-wmf.16; 2020-11-03), Discovery-Search (Current work), Wikimedia-Hackathon-2019, I18n

Nov 18 2020

TJones created T268186: Document how to replace or update "text", "plain" and other analyzers.
Nov 18 2020, 10:13 PM · Discovery-Search, CirrusSearch
TJones created T268180: Bug: Airflow reports Mjolnir connection to Hive is already open.
Nov 18 2020, 9:40 PM · Discovery-Search (Current work)

Nov 16 2020

TJones updated the task description for T267971: Analyze Speaker-Reviewed M2 Data for Chinese.
Nov 16 2020, 8:50 PM · Discovery-Search (Current work), Chinese-Sites
TJones renamed T267971: Analyze Speaker-Reviewed M2 Data for Chinese from Review Chinese M2 Data to Analyze Speaker-Reviewed M2 Data for Chinese.
Nov 16 2020, 8:50 PM · Discovery-Search (Current work), Chinese-Sites
TJones edited projects for T267971: Analyze Speaker-Reviewed M2 Data for Chinese, added: Discovery-Search; removed Discovery-Search (Current work).
Nov 16 2020, 8:49 PM · Discovery-Search (Current work), Chinese-Sites
TJones created T267971: Analyze Speaker-Reviewed M2 Data for Chinese.
Nov 16 2020, 8:49 PM · Discovery-Search (Current work), Chinese-Sites
TJones moved T244800: Analysis of Method 2 Suggestion results from Needs review to Needs Reporting on the Discovery-Search (Current work) board.
Nov 16 2020, 6:07 PM · Discovery-Search (Current work), Chinese-Sites

Nov 9 2020

TJones added a comment to T244800: Analysis of Method 2 Suggestion results.

Under Korean Stats (and Japanese stats). Should identical be unique?

Nov 9 2020, 4:11 AM · Discovery-Search (Current work), Chinese-Sites

Nov 4 2020

TJones added a comment to T266995: Give Trey jones access necessary to support Search Platform Airflow jobs.

Thanks, @RobH!

Nov 4 2020, 5:43 PM · SRE-Access-Requests, Discovery-Search (Current work), SRE

Nov 2 2020

TJones moved T265081: Fix Chinese Analysis Chain for Glent M2 from needs triage to Language Stuff on the Discovery-Search board.
Nov 2 2020, 6:20 PM · Discovery-Search (Current work), Chinese-Sites

Oct 29 2020

TJones moved T244800: Analysis of Method 2 Suggestion results from Waiting to Needs review on the Discovery-Search (Current work) board.
Oct 29 2020, 3:09 PM · Discovery-Search (Current work), Chinese-Sites

Oct 28 2020

TJones added a comment to T244800: Analysis of Method 2 Suggestion results.

Completed analysis of Japanese and Korean suggestions, reviewed by speakers—thanks, Jerry & Lisa!

Oct 28 2020, 9:02 PM · Discovery-Search (Current work), Chinese-Sites

Oct 26 2020

TJones renamed T265290: Rediscover, review, and update the federation input process for WDQS from Review the federation input process for WDQS to Rediscover, review, and update the federation input process for WDQS.
Oct 26 2020, 6:34 PM · Discovery-Search (Current work), Wikidata, Wikidata-Query-Service
TJones added a comment to T212888: Implement NLP Search Suggestion Method 0 for English.

All child tasks are either complete or in "Needs Reporting", so we're mostly done except for the talking about it!

Oct 26 2020, 6:16 PM · Discovery-Search (Current work), Patch-For-Review
TJones moved T212888: Implement NLP Search Suggestion Method 0 for English from In Progress to Needs Reporting on the Discovery-Search (Current work) board.
Oct 26 2020, 6:14 PM · Discovery-Search (Current work), Patch-For-Review
TJones closed T238247: Run Null A/B test for DYM suggestions, a subtask of T237364: Write Glent M0 A/B test report, as Resolved.
Oct 26 2020, 6:14 PM · Discovery-Search (Current work), CirrusSearch
TJones closed T238247: Run Null A/B test for DYM suggestions as Resolved.

Closing this because we have already done some reporting and dealt with some of the difficulties there. We don't really need the null A/B report anymore.

Oct 26 2020, 6:14 PM · Discovery-Search, CirrusSearch
TJones moved T258094: Improve Breton language analysis from needs triage to Language Stuff on the Discovery-Search board.

Moving this off the current workboard because I don't work on it regularly. I'll move it back when we get to code to deploy and it needs review.

Oct 26 2020, 6:09 PM · Discovery-Search (Current work)
TJones edited projects for T258094: Improve Breton language analysis, added: Discovery-Search; removed Discovery-Search (Current work).
Oct 26 2020, 6:09 PM · Discovery-Search (Current work)

Oct 19 2020

TJones moved T238151: Tune Glent Method 1 algorithm from To Be Deployed to Needs Reporting on the Discovery-Search (Current work) board.

It looks like we are currently running glent 0.2.3 which includes the patches referenced above. Checked the attached patches and it looks like everything is shipped. Should we close this and move on to figuring out how we want to put it in front of users?

Oct 19 2020, 7:38 PM · Discovery-Search (Current work)
TJones created T265931: Set up dashboard to track resource usage for Commons and Wikidata Elasticsearch indexes.
Oct 19 2020, 5:50 PM · Discovery-Search (Current work)
TJones updated the task description for T265914: Investigate Resource Needs for Commons and Wikidata Elasticsearch indices .
Oct 19 2020, 5:42 PM · Discovery-Search (Current work)
TJones renamed T265914: Investigate Resource Needs for Commons and Wikidata Elasticsearch indices from Design the solution for Commons and Wikidata Elasticsearch indices to Investigate Resource Needs for Commons and Wikidata Elasticsearch indices .
Oct 19 2020, 5:35 PM · Discovery-Search (Current work)
TJones renamed T265246: Make search-related phabricator tags less confusing from Make search related phabricator tags less confusing to Make search-related phabricator tags less confusing.
Oct 19 2020, 5:21 PM · Discovery-Search, PM