
TJones (Trey Jones)
Sr. Software Engineer, Search Platform Team

User Details

User Since
Jul 8 2015, 3:02 PM (223 w, 3 d)
Availability
Available
IRC Nick
Trey314159
LDAP User
Tjones
MediaWiki User
TJones (WMF)

I would have written a shorter comment, but I did not have the time.

I'm part of the Search Platform team and I spend my time working on search & relevance, trying to better support search in various languages, analyzing queries, and doing random mathy things. I tend to write long, detailed notes about my investigations (so as to improve the bus number of my work).

When I have to work on _GitHub,_ /‍‍/Phab,/‍‍/ and ''MediaWiki'' all on the same day, I sometimes suffer Severe Markup Incongruence Fatigue.

I � Unicode.

Recent Activity

Thu, Oct 17

TJones added a comment to T235778: dewiki: "Search results from Polish Wikipedia" lists search results from dewiki.

Another odd side effect is that if there are no results with matching titles on the local wiki, no results are displayed, though the message "Showing Results from <other> Wikipedia" still shows up. For example here.

Thu, Oct 17, 8:28 PM · Discovery-Search (Current work)

Wed, Oct 16

TJones updated the task description for T147505: [Recurring task] CirrusSearch: what is updated during re-indexing.
Wed, Oct 16, 1:54 PM · Discovery-Search (Current work), Discovery
TJones created T235654: Re-index Slovak Wikis to enable folding of Slovak diacritics after stemming.
Wed, Oct 16, 1:51 PM · Discovery-Search (Current work)
TJones moved T235561: Implement folding of Slovak diacritics for Slovak-language wikis from Needs review to Done on the Discovery-Search (Current work) board.
Wed, Oct 16, 1:46 PM · Discovery-Search (Current work), CirrusSearch, MW-1.35-notes (1.35.0-wmf.3; 2019-10-22)
TJones committed rECIR0119c4a579b8: Fold Slovak diacritics (after stemming) (authored by TJones).
Fold Slovak diacritics (after stemming)
Wed, Oct 16, 1:35 PM

Tue, Oct 15

TJones moved T235561: Implement folding of Slovak diacritics for Slovak-language wikis from in progress to Needs review on the Discovery-Search (Current work) board.
Tue, Oct 15, 10:14 PM · Discovery-Search (Current work), CirrusSearch, MW-1.35-notes (1.35.0-wmf.3; 2019-10-22)
TJones added a comment to T185721: Null or inconsistent search results using Khmer script.

I've moved this to "Waiting" while I wrap up some work on other open tasks.

Tue, Oct 15, 9:06 PM · Discovery-Search (Current work), CirrusSearch, Discovery
TJones moved T185721: Null or inconsistent search results using Khmer script from in progress to Waiting on the Discovery-Search (Current work) board.
Tue, Oct 15, 9:05 PM · Discovery-Search (Current work), CirrusSearch, Discovery
TJones moved T223787: Investigate impact of folding diacritics in Slovak from Waiting to Done on the Discovery-Search (Current work) board.

I've updated my notes on MediaWiki with the final round of review. We're now ready to create a patch to implement the folding: T235561: Implement folding of Slovak diacritics for Slovak-language wikis

Tue, Oct 15, 9:03 PM · Discovery-Search (Current work)
TJones created T235561: Implement folding of Slovak diacritics for Slovak-language wikis.
Tue, Oct 15, 9:01 PM · Discovery-Search (Current work), CirrusSearch, MW-1.35-notes (1.35.0-wmf.3; 2019-10-22)

Fri, Oct 11

TJones moved T232760: Analysis of Method 1 Suggestion results from in progress to Needs review on the Discovery-Search (Current work) board.

I completed my analysis of Method 1, and it performs significantly worse than the current production DYM. I think we should improve Method 1 before considering an A/B test. Full details on MediaWiki.

Fri, Oct 11, 8:24 PM · Discovery-Search (Current work)
TJones claimed T232760: Analysis of Method 1 Suggestion results.
Fri, Oct 11, 7:24 PM · Discovery-Search (Current work)

Wed, Oct 9

TJones closed T35242: Search suggestion highlighting does not respect grapheme clusters causing wrong rendering for Arabic and Indic scripts as Resolved.

Following combining marks are included with highlighted text to prevent broken-looking text when highlighting search suggestions and in search result snippets.

Wed, Oct 9, 8:54 PM · MW-1.34-notes (1.34.0-wmf.25; 2019-10-01), OOUI (OOUI-0.34.0), Patch-For-Review, Discovery-Search, Utilities-UnicodeJS, MediaWiki-General, JavaScript, Hindi-Sites, Malayalam-Sites, Tamil-Sites, I18n
TJones closed T35242: Search suggestion highlighting does not respect grapheme clusters causing wrong rendering for Arabic and Indic scripts, a subtask of T218613: {EPIC} Search: Local Impact. Making bigger improvements for smaller (or underrepresented) communities, as Resolved.
Wed, Oct 9, 8:54 PM · Discovery-Search
TJones removed projects from T234170: Special:Search's search bar improperly responding to highlighting words on Google Chrome 77: Discovery-Search, MediaWiki-Search.
Wed, Oct 9, 7:53 PM · OOUI, MediaWiki-Interface, Browser-Support-Android-Google-Chrome, Browser-Support-Google-Chrome
TJones added a comment to T185721: Null or inconsistent search results using Khmer script.

Thanks, @Eltimbalino. That is a lot more phonetic than the residue of the queries I looked at, so I don't think many people are using the transliterated Khmer on-wiki.

Wed, Oct 9, 7:38 PM · Discovery-Search (Current work), CirrusSearch, Discovery

Tue, Oct 8

TJones added a comment to T185721: Null or inconsistent search results using Khmer script.

I've finished my analysis of the effects of pre-tokenization (harder) vs post-tokenization (less hard) re-ordering using my command line re-ordering tool. The difference is pretty big, so I guess I'm going to have to do it the hard(er) way! More details on MediaWiki.

Tue, Oct 8, 7:25 PM · Discovery-Search (Current work), CirrusSearch, Discovery

Mon, Oct 7

TJones added a project to T234170: Special:Search's search bar improperly responding to highlighting words on Google Chrome 77: OOUI.

I wasn't having this problem when I first saw this ticket, but now I am. I'm also now at Version 77.0.3865.90 of Chrome.

Mon, Oct 7, 2:20 PM · OOUI, MediaWiki-Interface, Browser-Support-Android-Google-Chrome, Browser-Support-Google-Chrome

Thu, Oct 3

TJones added a comment to T185721: Null or inconsistent search results using Khmer script.

@Eltimbalino, can you give me a sample of the transliteration? Is it somewhat phonetic, or is it more like the JMua examples above?

Thu, Oct 3, 3:42 PM · Discovery-Search (Current work), CirrusSearch, Discovery
TJones added a comment to T185721: Null or inconsistent search results using Khmer script.

I did a review of Khmer Wikipedia queries for comparison. There's a higher rate of syllables to re-order (1.3% vs <0.2%) and a similarly higher rate of syllable boundary errors, though it is still very low (<0.005% of all syllables detected). There's also the usual collection of queries in miscellaneous scripts, junk queries, and porn queries. One unexpected result is that only about half of all queries have Khmer characters in them, and about half have predominantly Latin characters in them.

Thu, Oct 3, 2:20 PM · Discovery-Search (Current work), CirrusSearch, Discovery

Wed, Oct 2

TJones added a comment to T234170: Special:Search's search bar improperly responding to highlighting words on Google Chrome 77.

Thanks for checking on other pages. It may be specific to the search page, but it's not really part of the main search stack, so I'm hoping some UI folks will have ideas.

Wed, Oct 2, 2:22 PM · OOUI, MediaWiki-Interface, Browser-Support-Android-Google-Chrome, Browser-Support-Google-Chrome

Tue, Oct 1

TJones added a project to T234170: Special:Search's search bar improperly responding to highlighting words on Google Chrome 77: MediaWiki-Interface.
Tue, Oct 1, 5:44 PM · OOUI, MediaWiki-Interface, Browser-Support-Android-Google-Chrome, Browser-Support-Google-Chrome
TJones added a comment to T234170: Special:Search's search bar improperly responding to highlighting words on Google Chrome 77.

@gh87 this sounds like a general UI problem rather than a search-specific problem. Can you try it out in other form elements, like the edit summary or the subject line when emailing a user?

Tue, Oct 1, 5:43 PM · OOUI, MediaWiki-Interface, Browser-Support-Android-Google-Chrome, Browser-Support-Google-Chrome

Thu, Sep 26

TJones added a comment to T185721: Null or inconsistent search results using Khmer script.

Further good news: based on my samples, less than 0.2% of syllables need to be re-ordered in Wikipedia and Wiktionary article text, so the problem is important to fix, but not as widespread as it could be. (I should check on queries, too, though.)

Thu, Sep 26, 7:23 PM · Discovery-Search (Current work), CirrusSearch, Discovery
TJones added a comment to T185721: Null or inconsistent search results using Khmer script.

I've added a preliminary write up of what's been going on so far, including a high-level version of my re-ordering algorithm, on MediaWiki.

Thu, Sep 26, 2:50 PM · Discovery-Search (Current work), CirrusSearch, Discovery

Wed, Sep 25

TJones added a project to T233840: Fix unclickable links for combining characters: MediaWiki-Interface.
Wed, Sep 25, 8:23 PM · MediaWiki-Interface
TJones added a comment to T233840: Fix unclickable links for combining characters.

Which software is this task about? CirrusSearch?

Wed, Sep 25, 8:21 PM · MediaWiki-Interface
TJones updated the task description for T233840: Fix unclickable links for combining characters.
Wed, Sep 25, 7:20 PM · MediaWiki-Interface
TJones added a comment to T233840: Fix unclickable links for combining characters.

I just checked other browsers, and a link on combining ` is clickable in Safari (12.1.12) but a link with   ҉   is not clickable.

Wed, Sep 25, 7:18 PM · MediaWiki-Interface
TJones created T233840: Fix unclickable links for combining characters.
Wed, Sep 25, 7:14 PM · MediaWiki-Interface

Fri, Sep 20

TJones added a comment to T185721: Null or inconsistent search results using Khmer script.

I've got some automatically re-ordered syllables up for review on MediaWiki. I particularly need help on the first three groups, "???", "Questionably Reordered Syllables", and "Visible Duplicates". Any advice on the others would be great, but those are the ones I am most unsure about.

Fri, Sep 20, 7:38 PM · Discovery-Search (Current work), CirrusSearch, Discovery

Sep 12 2019

TJones renamed T232760: Analysis of Method 1 Suggestion results from Analysis of M1 Suggestion results to Analysis of Method 1 Suggestion results.
Sep 12 2019, 4:51 PM · Discovery-Search (Current work)
TJones moved T232760: Analysis of Method 1 Suggestion results from needs triage to elastic / cirrus on the Discovery-Search board.
Sep 12 2019, 4:50 PM · Discovery-Search (Current work)
TJones renamed T232760: Analysis of Method 1 Suggestion results from Analysis of M1 results to Analysis of M1 Suggestion results.
Sep 12 2019, 4:50 PM · Discovery-Search (Current work)
TJones created T232760: Analysis of Method 1 Suggestion results.
Sep 12 2019, 4:50 PM · Discovery-Search (Current work)

Sep 5 2019

TJones added a comment to T185721: Null or inconsistent search results using Khmer script.

Anyone here familiar with Khmer (maybe @Eltimbalino?) who can help me with some of the harder corner cases I'm encountering while trying to normalize Khmer syllables?

Sep 5 2019, 2:16 AM · Discovery-Search (Current work), CirrusSearch, Discovery

Sep 3 2019

TJones added a comment to T35242: Search suggestion highlighting does not respect grapheme clusters causing wrong rendering for Arabic and Indic scripts.

If I understand correctly, https://gerrit.wikimedia.org/r/c/oojs/ui/+/530960 adds the combining marks that follow the search match to the highlight range, so that we don't put a highlight boundary before a combining mark.
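
A minimal sketch of that idea in Python, for illustration only. It is not the code from the Gerrit patch (which lives in the OOUI JavaScript repo); the function names `extend_over_marks` and `highlight` and the `<b>` tags are assumptions made up for this example. The match range is grown to the right while the next character has a Unicode general category starting with "M" (Mn, Mc, Me).

```python
import unicodedata
from typing import Tuple


def extend_over_marks(text: str, start: int, end: int) -> Tuple[int, int]:
    """Grow the half-open range [start, end) so it never ends right before a combining mark."""
    while end < len(text) and unicodedata.category(text[end]).startswith("M"):
        end += 1
    return start, end


def highlight(text: str, query: str, open_tag: str = "<b>", close_tag: str = "</b>") -> str:
    """Wrap the first occurrence of query in tags, keeping trailing marks inside the highlight."""
    idx = text.find(query)
    if idx < 0:
        return text  # no match, nothing to highlight
    start, end = extend_over_marks(text, idx, idx + len(query))
    return text[:start] + open_tag + text[start:end] + close_tag + text[end:]
```

For example, highlighting "cafe" in "cafe\u0301 au lait" keeps the combining acute accent inside the tags instead of splitting it from its base letter.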

Sep 3 2019, 5:12 PM · MW-1.34-notes (1.34.0-wmf.25; 2019-10-01), OOUI (OOUI-0.34.0), Patch-For-Review, Discovery-Search, Utilities-UnicodeJS, MediaWiki-General, JavaScript, Hindi-Sites, Malayalam-Sites, Tamil-Sites, I18n
TJones added a comment to T35242: Search suggestion highlighting does not respect grapheme clusters causing wrong rendering for Arabic and Indic scripts.

Comment deleted. [Ugh... there is some keyboard sequence that I keep fat-fingering and submitting my comment before it is complete.]

Sep 3 2019, 4:09 PM · MW-1.34-notes (1.34.0-wmf.25; 2019-10-01), OOUI (OOUI-0.34.0), Patch-For-Review, Discovery-Search, Utilities-UnicodeJS, MediaWiki-General, JavaScript, Hindi-Sites, Malayalam-Sites, Tamil-Sites, I18n

Aug 30 2019

TJones updated the task description for T231593: Improve Basque language processing for search.
Aug 30 2019, 6:56 PM · Discovery-Search
TJones added a comment to T35242: Search suggestion highlighting does not respect grapheme clusters causing wrong rendering for Arabic and Indic scripts.

@TJones For future reference, can you share how you created the regexp you used instead of \p{Mark}? I tried replicating it using this tool: https://mothereff.in/regexpu#input=%2F%5Cp%7BM%7D%2Fu&dotAllFlag=1&unicodePropertyEscape=1 and got a different result. (Perhaps this tool and whatever you used generate the regexps from different Unicode versions.)
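
One way such a regexp could be generated, sketched in Python. This is an assumption for illustration, not the method actually used for the patch. It walks every code point, collects those whose general category starts with "M", and prints them as a character class. The helper names `mark_ranges` and `to_char_class` are hypothetical. The result depends on the Unicode version bundled with the Python build, which is consistent with the guess above that two tools can disagree.

```python
import sys
import unicodedata


def mark_ranges():
    """Return inclusive (first, last) code point ranges whose general category is M*."""
    ranges = []
    start = None
    for cp in range(sys.maxunicode + 1):
        is_mark = unicodedata.category(chr(cp)).startswith("M")
        if is_mark and start is None:
            start = cp  # a new run of marks begins here
        elif not is_mark and start is not None:
            ranges.append((start, cp - 1))  # the run ended on the previous code point
            start = None
    if start is not None:
        ranges.append((start, sys.maxunicode))
    return ranges


def to_char_class(ranges):
    """Format ranges as a regex character class using \\uXXXX or \\u{XXXXX} escapes."""
    def esc(cp):
        return "\\u%04X" % cp if cp <= 0xFFFF else "\\u{%X}" % cp

    parts = [esc(a) if a == b else esc(a) + "-" + esc(b) for a, b in ranges]
    return "[" + "".join(parts) + "]"
```

`unicodedata.unidata_version` reports which Unicode version the ranges come from. The `\u{...}` escapes for code points above U+FFFF need the `u` flag in JavaScript.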

Aug 30 2019, 4:03 PM · MW-1.34-notes (1.34.0-wmf.25; 2019-10-01), OOUI (OOUI-0.34.0), Patch-For-Review, Discovery-Search, Utilities-UnicodeJS, MediaWiki-General, JavaScript, Hindi-Sites, Malayalam-Sites, Tamil-Sites, I18n

Aug 29 2019

TJones moved T231593: Improve Basque language processing for search from needs triage to Language Stuff on the Discovery-Search board.
Aug 29 2019, 5:31 PM · Discovery-Search
TJones created T231593: Improve Basque language processing for search.
Aug 29 2019, 5:30 PM · Discovery-Search

Aug 28 2019

TJones added a comment to P8995 Khmer samples.

BTW, Erik asked today how other search engines handle this kind of variation. Looks like some of the big U.S. ones don't.

Aug 28 2019, 7:45 PM · Discovery-Search
TJones added a comment to P8995 Khmer samples.

Thanks, gang! These are helpful!

Aug 28 2019, 7:16 PM · Discovery-Search
TJones added a comment to P8995 Khmer samples.

From Browsershots—and thus a bit lo-res—Windows 2008, Chrome 73

Aug 28 2019, 2:46 PM · Discovery-Search
TJones added a comment to P8995 Khmer samples.

My screenshot (OSX 10.14 / Chrome 76)

Aug 28 2019, 2:35 PM · Discovery-Search
TJones edited P8995 Khmer samples.
Aug 28 2019, 2:33 PM · Discovery-Search
TJones created P8995 Khmer samples.
Aug 28 2019, 2:32 PM · Discovery-Search

Aug 21 2019

TJones closed T195042: insource fails to return highlights on some queries as Invalid.

Closing after talking to @EBernhardson a bit more.

Aug 21 2019, 3:49 PM · CirrusSearch, Discovery, Discovery-Search
TJones updated the task description for T195042: insource fails to return highlights on some queries.
Aug 21 2019, 3:41 PM · CirrusSearch, Discovery, Discovery-Search
TJones updated the task description for T195042: insource fails to return highlights on some queries.
Aug 21 2019, 3:40 PM · CirrusSearch, Discovery, Discovery-Search

Aug 20 2019

TJones moved T223787: Investigate impact of folding diacritics in Slovak from Needs review to Waiting on the Discovery-Search (Current work) board.
Aug 20 2019, 5:24 PM · Discovery-Search (Current work)
TJones claimed T185721: Null or inconsistent search results using Khmer script.
Aug 20 2019, 3:24 PM · Discovery-Search (Current work), CirrusSearch, Discovery

Aug 19 2019

TJones added a comment to T35242: Search suggestion highlighting does not respect grapheme clusters causing wrong rendering for Arabic and Indic scripts.

Patches and Projects

Aug 19 2019, 8:58 PM · MW-1.34-notes (1.34.0-wmf.25; 2019-10-01), OOUI (OOUI-0.34.0), Patch-For-Review, Discovery-Search, Utilities-UnicodeJS, MediaWiki-General, JavaScript, Hindi-Sites, Malayalam-Sites, Tamil-Sites, I18n
TJones edited projects for T35242: Search suggestion highlighting does not respect grapheme clusters causing wrong rendering for Arabic and Indic scripts, added: OOjs, OOUI, JavaScript, MediaWiki-General; removed Discovery, MediaWiki-Search.
Aug 19 2019, 8:50 PM · MW-1.34-notes (1.34.0-wmf.25; 2019-10-01), OOUI (OOUI-0.34.0), Patch-For-Review, Discovery-Search, Utilities-UnicodeJS, MediaWiki-General, JavaScript, Hindi-Sites, Malayalam-Sites, Tamil-Sites, I18n
TJones added a comment to T195042: insource fails to return highlights on some queries.

@EBernhardson, I stumbled across this one again while looking at the phab board. Are you sure it is the problem you think it is? I don't think you have the right regex for what you intended. Cirrus doesn't support \w, so \w+ requires one or more literal w's. One hit is highlighted, and it looks like this: "[[freehosting|zdarma hostující]] [[wiki]]weby". It doesn't look like .+? is being respected either. I wonder if the other matches are actually super long or something so there's nothing to highlight in the snippet.
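
A small illustration of the difference, using Python's `re` module as a stand-in. It does not run the Cirrus regex engine. It only contrasts the two readings of `\w+` described above: as an escaped literal "w" (so the pattern behaves like `w+`) and as the word-character class that Perl-style engines use.

```python
import re

# The one highlighted hit quoted in the comment above.
hit = "[[freehosting|zdarma hostující]] [[wiki]]weby"

# Reading described above: the backslash only escapes the next character,
# so "\w+" means "one or more literal w's".
literal_reading = re.compile(r"w+")

# Perl-style reading: \w is the word-character class.
class_reading = re.compile(r"\w+")

literal_matches = literal_reading.findall(hit)
class_matches = class_reading.findall(hit)

print(literal_matches)
print(class_matches)
```

Under the literal reading, only the w's in "wiki" and "weby" match, which fits a page being a hit without the pattern matching what was intended.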

Aug 19 2019, 6:20 PM · CirrusSearch, Discovery, Discovery-Search
TJones updated the task description for T195042: insource fails to return highlights on some queries.
Aug 19 2019, 6:13 PM · CirrusSearch, Discovery, Discovery-Search
TJones updated the task description for T195042: insource fails to return highlights on some queries.
Aug 19 2019, 6:08 PM · CirrusSearch, Discovery, Discovery-Search
TJones added a comment to T204089: CirrusSearch: Add filter for exclusion of redirects or finding only them.

@halfeatenscone, thanks for the API point of view.

Aug 19 2019, 6:00 PM · Advanced-Search, Discovery-Search, CirrusSearch, TCB-Team
TJones added a comment to T204089: CirrusSearch: Add filter for exclusion of redirects or finding only them.

I think it could be a keyword, preferably one that allows people both to exclude and include redirects from the search (inredirect:true / inredirect:false then?).

Aug 19 2019, 4:03 PM · Advanced-Search, Discovery-Search, CirrusSearch, TCB-Team
TJones added a comment to T35242: Search suggestion highlighting does not respect grapheme clusters causing wrong rendering for Arabic and Indic scripts.

@debt (or others in Search), please don't close this ticket, just remove our tag. There is still work to be done, I think on the OOUI/OOjs side.

Aug 19 2019, 2:15 PM · MW-1.34-notes (1.34.0-wmf.25; 2019-10-01), OOUI (OOUI-0.34.0), Patch-For-Review, Discovery-Search, Utilities-UnicodeJS, MediaWiki-General, JavaScript, Hindi-Sites, Malayalam-Sites, Tamil-Sites, I18n
TJones added a comment to T35242: Search suggestion highlighting does not respect grapheme clusters causing wrong rendering for Arabic and Indic scripts.
Aug 19 2019, 1:55 PM · MW-1.34-notes (1.34.0-wmf.25; 2019-10-01), OOUI (OOUI-0.34.0), Patch-For-Review, Discovery-Search, Utilities-UnicodeJS, MediaWiki-General, JavaScript, Hindi-Sites, Malayalam-Sites, Tamil-Sites, I18n
TJones moved T35242: Search suggestion highlighting does not respect grapheme clusters causing wrong rendering for Arabic and Indic scripts from Needs review to Done on the Discovery-Search (Current work) board.
Aug 19 2019, 1:20 PM · MW-1.34-notes (1.34.0-wmf.25; 2019-10-01), OOUI (OOUI-0.34.0), Patch-For-Review, Discovery-Search, Utilities-UnicodeJS, MediaWiki-General, JavaScript, Hindi-Sites, Malayalam-Sites, Tamil-Sites, I18n
TJones moved T35242: Search suggestion highlighting does not respect grapheme clusters causing wrong rendering for Arabic and Indic scripts from in progress to Needs review on the Discovery-Search (Current work) board.
Aug 19 2019, 1:20 PM · MW-1.34-notes (1.34.0-wmf.25; 2019-10-01), OOUI (OOUI-0.34.0), Patch-For-Review, Discovery-Search, Utilities-UnicodeJS, MediaWiki-General, JavaScript, Hindi-Sites, Malayalam-Sites, Tamil-Sites, I18n
TJones added a comment to T204089: CirrusSearch: Add filter for exclusion of redirects or finding only them.

@Speravir, @stjn, @halfeatenscone—quick question for you about this feature. Do you have any thoughts on how it should be exposed? Would you want keywords like intitleonly: and inredirect: (exact names open for discussion) or a checkbox? If a checkbox, should the checkbox only exist on the "Advanced" tab to keep the UI simple for less sophisticated searchers?

Aug 19 2019, 1:17 PM · Advanced-Search, Discovery-Search, CirrusSearch, TCB-Team
TJones committed rECIR7782449945c8: Rename NormL2 to NormLog2 to avoid confusion with L2 Norm (authored by TJones).
Rename NormL2 to NormLog2 to avoid confusion with L2 Norm
Aug 19 2019, 7:35 AM

Aug 16 2019

TJones added a comment to T35242: Search suggestion highlighting does not respect grapheme clusters causing wrong rendering for Arabic and Indic scripts.
  • Patch up to fix the problem for search snippets, even though that's not the main problem.
  • I have a working fix for upper corner search box, but it may not be up to code standards since it is in the JavaScript repo.
  • Main box on Search page is a separate thing; still working on that.
  • Ligatures are more complicated.
    • options.highlightInput can probably be used to turn off highlighting for suggestions (though I'd still like to do the right thing with combining characters in the general case).
    • Cirrus doesn't have a way to disable highlighting, but one could probably be added.
Aug 16 2019, 5:01 PM · MW-1.34-notes (1.34.0-wmf.25; 2019-10-01), OOUI (OOUI-0.34.0), Patch-For-Review, Discovery-Search, Utilities-UnicodeJS, MediaWiki-General, JavaScript, Hindi-Sites, Malayalam-Sites, Tamil-Sites, I18n

Aug 14 2019

TJones moved T223787: Investigate impact of folding diacritics in Slovak from in progress to Needs review on the Discovery-Search (Current work) board.
Aug 14 2019, 2:32 PM · Discovery-Search (Current work)

Aug 8 2019

TJones added a comment to T223787: Investigate impact of folding diacritics in Slovak.

I got some great feedback from @Jetam2, and everything is looking good. I just need a few clarifications. If everything is still good, then we'll be ready to deploy the new analysis chain and then re-index.

Aug 8 2019, 9:13 PM · Discovery-Search (Current work)
TJones moved T223787: Investigate impact of folding diacritics in Slovak from Waiting to in progress on the Discovery-Search (Current work) board.
Aug 8 2019, 9:06 PM · Discovery-Search (Current work)
TJones added a comment to T227924: Improve Slovak Stemmer.

Another note from working on T223787: Investigate impact of folding diacritics in Slovak: Consider adding (probably hard-coding at first) a short exceptions list to prevent unwanted collisions. The only item to add to the list at the moment would be kedy, to keep it from being stemmed as ked (folded keď).
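
A rough sketch of what such an exceptions list could look like, in Python. Everything here is hypothetical: `toy_stem` is a placeholder that only strips a final "y" and is not the real Slovak stemmer, and `PROTECTED`, `fold`, and `analyze` are names invented for this example. The point is only that a protected word skips stemming, so it no longer collides with the folded form of keď.

```python
import unicodedata

# Hypothetical hard-coded exceptions list; "kedy" is the only entry suggested so far.
PROTECTED = {"kedy"}


def toy_stem(word: str) -> str:
    """Placeholder stemmer: strips a final 'y' from longer words. NOT the real Slovak stemmer."""
    return word[:-1] if len(word) > 3 and word.endswith("y") else word


def fold(word: str) -> str:
    """Remove diacritics by decomposing (NFD) and dropping combining marks."""
    decomposed = unicodedata.normalize("NFD", word)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def analyze(word: str) -> str:
    """Stem unless the word is protected, then fold diacritics."""
    stemmed = word if word in PROTECTED else toy_stem(word)
    return fold(stemmed)
```

With the list in place, `analyze("kedy")` stays "kedy" while `analyze("ke\u010f")` gives "ked"; without it, both would end up as "ked".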

Aug 8 2019, 8:23 PM · CirrusSearch, Discovery-Search

Aug 7 2019

TJones moved T228925: Fix documentation of boolean operators from in progress to Done on the Discovery-Search (Current work) board.

I've moved the draft to Help:CirrusSearch/Logical_operators and updated Help:CirrusSearch to link to it. Help:CirrusSearch doesn't mention parens, so nothing to fix there. (I made some other incidental edits, too, particularly changing "AND" and "OR" to "and" and "or" when used for emphasis rather than as operators.)

Aug 7 2019, 2:22 PM · Discovery-Search (Current work)
TJones moved T228925: Fix documentation of boolean operators from Needs review to in progress on the Discovery-Search (Current work) board.
Aug 7 2019, 1:36 PM · Discovery-Search (Current work)

Aug 6 2019

TJones claimed T35242: Search suggestion highlighting does not respect grapheme clusters causing wrong rendering for Arabic and Indic scripts.
Aug 6 2019, 5:49 PM · MW-1.34-notes (1.34.0-wmf.25; 2019-10-01), OOUI (OOUI-0.34.0), Patch-For-Review, Discovery-Search, Utilities-UnicodeJS, MediaWiki-General, JavaScript, Hindi-Sites, Malayalam-Sites, Tamil-Sites, I18n
TJones added a comment to T223787: Investigate impact of folding diacritics in Slovak.

Moving this to "Waiting" for a while, to see if we get any feedback from Slovak speakers. If not, we will consider options for next steps, which include pushing forward with this with non-native speaker review, maintaining the status quo, and rolling back the Slovak stemmer.

Aug 6 2019, 4:07 PM · Discovery-Search (Current work)
TJones moved T223787: Investigate impact of folding diacritics in Slovak from Needs review to Waiting on the Discovery-Search (Current work) board.
Aug 6 2019, 4:05 PM · Discovery-Search (Current work)
TJones added a comment to T228925: Fix documentation of boolean operators.

Thanks for taking a look, Erik! I've tried to make it a tiny bit scarier by simplifying, bolding, and embiggening the pre-TOC text.

Aug 6 2019, 1:55 PM · Discovery-Search (Current work)

Jul 31 2019

TJones added a comment to T228925: Fix documentation of boolean operators.

Isn't the canonical English documentation at https://www.mediawiki.org/wiki/Help:CirrusSearch, and translatable there? Since I don't see you suddenly maintaining 101+ help pages, I'd propose defining one canonical place (usually mediawiki.org, meta, or wikitech) and maintaining and updating only that one place...

Jul 31 2019, 9:00 PM · Discovery-Search (Current work)
TJones added a comment to T228925: Fix documentation of boolean operators.

@TJones: As that covers 1 random Wikimedia site, what's Discovery-Search's plan for updating the approx. 100 other Help:Searching pages on other Wikimedia sites, like https://ru.wikipedia.org/wiki/Википедия:Поиск or https://sd.wikipedia.org/wiki/مدد:وڪيپيڊيا_۾_ڳولها or https://ka.wikipedia.org/wiki/ვიკიპედია:ძიება ?

Jul 31 2019, 8:00 PM · Discovery-Search (Current work)

Jul 30 2019

TJones updated subscribers of T228925: Fix documentation of boolean operators.
Jul 30 2019, 8:07 PM · Discovery-Search (Current work)
TJones added a comment to T228925: Fix documentation of boolean operators.

I added a note on the Talk page for Help:Searching to try to get more feedback.

Jul 30 2019, 8:06 PM · Discovery-Search (Current work)

Jul 29 2019

TJones added a comment to T228925: Fix documentation of boolean operators.

I've also sent a message to the Discovery mailing list to encourage additional feedback.

Jul 29 2019, 10:10 PM · Discovery-Search (Current work)
TJones added a comment to T228925: Fix documentation of boolean operators.

I've written a draft of the longer explanation (~1200 words) of the use of Logical operators in on-wiki search. Comments and suggestions are welcome!

Jul 29 2019, 10:02 PM · Discovery-Search (Current work)
TJones updated the task description for T228925: Fix documentation of boolean operators.
Jul 29 2019, 5:07 PM · Discovery-Search (Current work)
TJones claimed T228925: Fix documentation of boolean operators.
Jul 29 2019, 4:37 PM · Discovery-Search (Current work)
TJones moved T223787: Investigate impact of folding diacritics in Slovak from in progress to Needs review on the Discovery-Search (Current work) board.
Jul 29 2019, 4:35 PM · Discovery-Search (Current work)

Jul 26 2019

TJones added a comment to T223787: Investigate impact of folding diacritics in Slovak.

Okay, I've made the first pass at writing speaker review documentation that can be transcluded into the notes for a particular language.

Jul 26 2019, 10:34 PM · Discovery-Search (Current work)

Jul 24 2019

TJones moved T223787: Investigate impact of folding diacritics in Slovak from Needs review to in progress on the Discovery-Search (Current work) board.
Jul 24 2019, 7:04 PM · Discovery-Search (Current work)
TJones moved T228925: Fix documentation of boolean operators from needs triage to elastic / cirrus on the Discovery-Search board.
Jul 24 2019, 7:03 PM · Discovery-Search (Current work)
TJones updated subscribers of T228925: Fix documentation of boolean operators.

Subscribing @Cpiral and @The_Transhumanist since they edit Help:Searching and it would be great if they could review the documentation we want to add before it goes live. Please subscribe any other contributors to Help:Searching who might be able to provide feedback, too. Thanks!

Jul 24 2019, 7:03 PM · Discovery-Search (Current work)
TJones created T228925: Fix documentation of boolean operators.
Jul 24 2019, 7:02 PM · Discovery-Search (Current work)

Jul 23 2019

TJones added a comment to T223787: Investigate impact of folding diacritics in Slovak.

Having some trouble with the speaker review being unclear, so I'm working on better generic documentation I can use for this task and in the future.

Jul 23 2019, 5:24 PM · Discovery-Search (Current work)

Jul 18 2019

TJones moved T223787: Investigate impact of folding diacritics in Slovak from in progress to Needs review on the Discovery-Search (Current work) board.
Jul 18 2019, 11:29 PM · Discovery-Search (Current work)
TJones added a comment to T223787: Investigate impact of folding diacritics in Slovak.

I've completed my analysis for stemming before folding, and it definitely looks much better. The new, mostly desirable merges are roughly the same, without preventing the stemmer from doing its job. The stemmer still needs an update, though. (T227924: Improve Slovak Stemmer)
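
A toy illustration, in Python, of why the order can matter. The stemmer here is a made-up placeholder that knows a single suffix ("ých") and is not the real Slovak stemmer; `toy_stem`, `fold`, `fold_then_stem`, and `stem_then_fold` are names assumed for this sketch. If folding runs first, the suffix loses its diacritic and the toy stemmer no longer recognizes it.

```python
import unicodedata

# Hypothetical single-suffix rule; a real stemmer has many more.
SUFFIX = "\u00fdch"  # "ých"


def toy_stem(word: str) -> str:
    """Placeholder stemmer: strips the one suffix it knows. NOT the real Slovak stemmer."""
    return word[: -len(SUFFIX)] if word.endswith(SUFFIX) else word


def fold(word: str) -> str:
    """Remove diacritics by decomposing (NFD) and dropping combining marks."""
    decomposed = unicodedata.normalize("NFD", word)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def fold_then_stem(word: str) -> str:
    return toy_stem(fold(word))


def stem_then_fold(word: str) -> str:
    return fold(toy_stem(word))


word = "dobr\u00fdch"  # "dobrých"
print(fold_then_stem(word))  # suffix was folded to "ych", so the toy stemmer misses it
print(stem_then_fold(word))  # suffix is stripped first, then the stem is folded
```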

Jul 18 2019, 11:09 PM · Discovery-Search (Current work)

Jul 17 2019

TJones added a comment to T228226: Evaluate DYM metrics available in current search satisfaction logging.

Looks good.

Jul 17 2019, 2:10 PM · Discovery-Search (Current work), Discovery
TJones updated the task description for T228226: Evaluate DYM metrics available in current search satisfaction logging.
Jul 17 2019, 1:59 PM · Discovery-Search (Current work), Discovery

Jul 15 2019

TJones added a comment to T227781: Make Search Platform metrics available in Druid.

Things that show up as important in our metric-rating spreadsheet that do not seem to be covered by the current list include:

Jul 15 2019, 5:12 PM · Product-Analytics, Discovery-Search

Jul 12 2019

TJones moved T227924: Improve Slovak Stemmer from needs triage to Language Stuff on the Discovery-Search board.
Jul 12 2019, 10:11 PM · CirrusSearch, Discovery-Search
TJones created T227924: Improve Slovak Stemmer.
Jul 12 2019, 10:09 PM · CirrusSearch, Discovery-Search

Jul 10 2019

TJones added a comment to T223787: Investigate impact of folding diacritics in Slovak.

Sorry for the delay getting back to this. In addition to my planned two weeks away from the office I had another unexpected week away. I've been catching up on everything this week, and I'm back to working on this now.

Jul 10 2019, 7:57 PM · Discovery-Search (Current work)