Page MenuHomePhabricator

Instead of returning html from the highlighter, use unicode reserved characters as the pre/post highlight signals.
Open, Needs TriagePublic

Description

This would allow us to push sanitization out to the edge where it is supposed to be. It was suggested that this is how bing is internally handling highlighting.

Event Timeline

EBernhardson raised the priority of this task from to Needs Triage.
EBernhardson updated the task description. (Show Details)
EBernhardson added a project: CirrusSearch.
EBernhardson added a subscriber: EBernhardson.
Restricted Application added a project: Discovery. · View Herald TranscriptFeb 11 2016, 12:37 AM
Restricted Application added subscribers: StudiesWorld, Aklapper. · View Herald Transcript

16:37 < JustinO> ebernhardson: bing uses '\uE000' & '\uE001' to highlight

Nice idea. This also has advantages for clients that cannot render HTML easily, such as mobile apps.

jayvdb renamed this task from Instead of returning html from the higlighter, use unicode reserved characters as the pre/post highlight signals. to Instead of returning html from the highlighter, use unicode reserved characters as the pre/post highlight signals..Feb 11 2016, 12:47 AM
jayvdb set Security to None.
EBernhardson moved this task from Needs triage to Search on the Discovery board.Feb 11 2016, 11:16 PM

16:37 < JustinO> ebernhardson: bing uses '\uE000' & '\uE001' to highlight

Specifically, see "Hit Highlighting / Bing": https://msdn.microsoft.com/en-us/library/gg447081.aspx