Page MenuHomePhabricator

Hovercard text extract is broken for academic titles before and after names of person
Closed, DuplicatePublic


Steps to reproduce

  1. Create a page with first sentence like this: [[Armádní generál|Arm. gen.]] [[Inženýr|Ing.]] '''Petr Pavel''', [[Master of Arts|M.A.]], (* [[1. listopad|1. listopadu]] [[1961]] [[Planá]]) je ... (or copy this page)
  2. Link to it from another article
  3. See its hovercard

Expected behavior
The whole first sentence should be shown in hovercard.

Current behavior
Only Arm. gen. Ing. (academical prefixes) is shown in a hovercard.

Snímek z 2017-08-20 00-32-02.png (741×1 px, 268 KB)

Event Timeline

Jdlrobson subscribed.

Thanks for the bug report! We're working on a new API to improve our summaries (T113094) and I've added a test case to show this is fixed by it!:

Dvorapa updated the task description. (Show Details)

I doubt these two are duplicates. This is about academic name prefixes or suffixes, or numbers written with dot (.) in some European languages instead of English st nd th, or abbreviations written with dot too

I see what you mean. However, they are somewhat related....

The current way we obtain extracts is fundamentally flawed in that it uses the . or . character (or any end of sentence character we know about) to mean "end of sentence".
Code here:
We've seen numerous issues with this approach and this example is just one of them. Basically the conclusion is that exsentences has too many issues and should be considered broken.

As a result, we won't be using that in the new endpoint.

The API request currently being generated to use that text extract is:

.Also notice the broken HTML (which is T168329 so why I say they are related).
doesn't have the same problem, but causes other issues elsewhere.

The way we plan to solve this, is to abandon the use of this API in favor of T168848 for the purpose of page previews

I'm not sure whether to decline this task or merge it into T168848. What would you prefer I do @Dvorapa ?

@Jdlrobson: I am a little bit confused. Task T168848 is about Mobile(Apps) Content Service. I thought that new API would serve all requests, not only requests from mobile devices?

@Vachovec1 confusingly we're using the Mobile(Apps) content service for page previews :) The "Mobile" is in the name for historic reasons and just the service where this service lives. Now it is being used by page previews on desktop, clearly it's scope has changed!

@Jdlrobson thank you for your detailed explanation