Page MenuHomePhabricator

Sasha1024
User

Projects

User does not belong to any projects.

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Sunday

  • Clear sailing ahead.

User Details

User Since
Sep 22 2016, 9:07 AM (156 w, 1 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
Sasha1024 [ Global Accounts ]

Recent Activity

Apr 1 2017

Sasha1024 added a comment to T160106: Test and analyze new Ukrainian language analyzers.
  • стели іs: (1) singular genitive of сте́ла ("stele"), (2) imperative mood of стели́ти ("to lay down");
    • стелю is: (1) first person singular of стели́ти ("to lay down"), (2) singular dative of сте́ля ("ceiling");
    • слати is: (1) "to send", (2) alternative (archaic?) form for стели́ти ("to lay down");
    • шлем is: (1) alternative of шлемо́, which is first person plural of сла́ти ("to send"); (2) colloquial for шоло́м ("helmet").
Apr 1 2017, 4:36 PM · MW-1.29-release-notes, Epic, Discovery-Search (Current work), Discovery

Oct 12 2016

Sasha1024 added a comment to T146358: Improve processing of the apostrophe by the search engine in Ukrainian.

Well Russian, Ukrainian and Belorussian are quite close. (Polish is close too, some claim that it's even closer to Ukrainian than Russian — but as Polish uses different alphabet, it seems to be out of scope.) (BTW, what is about Belorussian? Does it use different lexical analyzer? Theoretically, it may fit Ukrainian even better than Russian.)

Oct 12 2016, 11:06 PM · MW-1.28-release (WMF-deploy-2016-10-25_(1.28.0-wmf.23)), Patch-For-Review, Discovery-Search (Current work), CirrusSearch, Discovery

Oct 7 2016

Sasha1024 added a comment to T146358: Improve processing of the apostrophe by the search engine in Ukrainian.

though I do worry about languages that might use the apostrophe as a proper letter

Sorry for stupid question, but what is difference between apostrophe-as-proper-letter and apostrophe-as-part-of-word-but-still-not-considered-a-letter? I.e. I understand difference between apostrophe-as-part-of-word and apostrophe-as-punctuation-mark: in first case "abc'def" gives one lexeme, and in second it gives two lexemes ("abc" and "def"). But I don't understand what is practical difference for search between something considered a letter and something considered a part of a word (but still not letter).

Oct 7 2016, 9:59 PM · MW-1.28-release (WMF-deploy-2016-10-25_(1.28.0-wmf.23)), Patch-For-Review, Discovery-Search (Current work), CirrusSearch, Discovery
Sasha1024 added a comment to T146358: Improve processing of the apostrophe by the search engine in Ukrainian.

@TJones, IMHO treating all three apostrophes (U+0027, U+2019, U+02BC) as one sign in search shouldn't harm Russian or any other languages using Russian-language analyzer.

Oct 7 2016, 8:51 PM · MW-1.28-release (WMF-deploy-2016-10-25_(1.28.0-wmf.23)), Patch-For-Review, Discovery-Search (Current work), CirrusSearch, Discovery

Sep 22 2016

Sasha1024 updated the task description for T146358: Improve processing of the apostrophe by the search engine in Ukrainian.
Sep 22 2016, 10:00 AM · MW-1.28-release (WMF-deploy-2016-10-25_(1.28.0-wmf.23)), Patch-For-Review, Discovery-Search (Current work), CirrusSearch, Discovery
Sasha1024 updated the task description for T146358: Improve processing of the apostrophe by the search engine in Ukrainian.
Sep 22 2016, 10:00 AM · MW-1.28-release (WMF-deploy-2016-10-25_(1.28.0-wmf.23)), Patch-For-Review, Discovery-Search (Current work), CirrusSearch, Discovery
Sasha1024 created T146358: Improve processing of the apostrophe by the search engine in Ukrainian.
Sep 22 2016, 9:56 AM · MW-1.28-release (WMF-deploy-2016-10-25_(1.28.0-wmf.23)), Patch-For-Review, Discovery-Search (Current work), CirrusSearch, Discovery