Test performance impacts of highlighting brackets for a large section of content
Closed, Resolved | Public | 2 Estimated Story Points | Spike
Description

Use a test instance to understand the performance impact of highlighting brackets when the cursor is in the middle, at the beginning, or at the end of a large section (including a large section with nested brackets). The initial investigation, T254976: Investigation: Bracket matching, estimated that this feature may be too slow on large chunks of text.

Details

	Subject	Repo	Branch	Lines +/-
	[POC] Always highlight brackets when cursor is inside a pair	mediawiki/extensions/CodeMirror	master	+91 -1

Related Objects

Status	Subtype	Assigned	Task
Resolved		None	T302857 Deploy first template focus-area improvements to enwiki
Resolved		WMDE-Fisch	T280023 Enable bracket matching on all wikis (except enwiki)
Resolved		Lena_WMDE	T273591 Enable bracket matching on more wikis
Resolved		Lena_WMDE	T270238 Enable bracket matching on the first wikis
Resolved		Lena_WMDE	T243835 Investigation: Check CodeMirror's capabilities for more "distinct" syntax highlighting
Resolved		Tobi_WMDE_SW	T270240 Try to write a browser test for the bracket matching feature
Resolved		thiemowmde	T270317 Optimization and limits for bracket matching
Resolved	Feature	None	T15302 Implement brace matching in page editing
Resolved		lilients_WMDE	T261857 Implement bracket matching in CodeMirror behind a feature flag
Resolved	Spike	thiemowmde	T254976 Investigation: Bracket matching
Resolved	Spike	awight	T259700 Investigation: scope bracket matching and tag matching add-ons
Resolved	Spike	thiemowmde	T259701 Test performance impacts of highlighting brackets for a large section of content
Resolved	Spike	awight	T260249 Test performance impacts of highlighting tags for a large section of content

Event Timeline

ECohen_WMDE created this task.Aug 5 2020, 12:16 PM

ECohen_WMDE moved this task from Backlog to Ready for pickup on the WMDE-Templates-FocusArea board.

Lena_WMDE renamed this task from Test performance impacts of highlighting brackets from center of a section to Test performance impacts of highlighting brackets for a large section of content.Aug 12 2020, 2:37 PM

Lena_WMDE added a project: WMDE-QWERTY-Sprint-2020-08-12.

Lena_WMDE updated the task description.

Lena_WMDE set the point value for this task to 2.

Note: We need a follow-up ticket to investigate the same impacts for tag matching. This has the potential to be even slower.

Here is what I have seen so far when reviewing the code of the existing matchbrackets add-on:

It starts by looking at the cursor position. Is the cursor next to a bracket? If not, nothing happens.
Is it an opening or a closing bracket? Depending on that, the search direction is either forward or backward. Never both.
Search starts, character by character.
Search might encounter a bogus bracket, e.g. (…}. It stops and highlights this error.
Search might encounter nested brackets, e.g. (…{…}…). It uses a stack to keep track of these and skips them.
Search stops when the matching bracket on the same nesting level is found, or reaches one of the rate-limits (maxScanLines and maxScanLineLength).
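
The steps above can be sketched roughly as follows. This is a simplified illustration on a plain string, not the add-on's actual code: the function name matchBracketAt and the single maxScan character budget are assumptions that stand in for the real document model and for the maxScanLines and maxScanLineLength limits.

```javascript
// Simplified sketch, NOT the matchbrackets add-on itself: it works on a
// plain string instead of a CodeMirror document. All names are hypothetical.
const PAIRS = { "(": ")", "[": "]", "{": "}" };
const OPEN = "([{";
const CLOSE = ")]}";

// maxScan is an assumed stand-in for maxScanLines / maxScanLineLength.
function matchBracketAt(text, pos, maxScan = 10000) {
  const start = text[pos];
  const forward = OPEN.includes(start);
  if (!forward && !CLOSE.includes(start)) return null; // not on a bracket

  const dir = forward ? 1 : -1; // one direction only, never both
  const stack = [];             // brackets of nested pairs still open
  let scanned = 0;

  for (let i = pos + dir; i >= 0 && i < text.length; i += dir) {
    if (++scanned > maxScan) return null; // limit reached, give up
    const ch = text[i];
    const isOpen = OPEN.includes(ch);
    if (!isOpen && !CLOSE.includes(ch)) continue; // ordinary character

    if (isOpen === forward) {
      stack.push(ch);           // a nested pair starts here
    } else if (stack.length > 0) {
      stack.pop();              // a nested pair ends here, skip it
    } else {
      // First bracket on the starting nesting level: a match or an error.
      const match = forward ? PAIRS[start] === ch : PAIRS[ch] === start;
      return { from: pos, to: i, match };
    }
  }
  return null; // ran off the text without finding a partner
}
```

For example, matchBracketAt("(a{b}c)", 0) skips the nested {…} and reports the closing ) at index 6, while matchBracketAt("(a}", 0) reports index 2 with match set to false.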

Note that while most of the code looks like it works with single characters only, there are also calls to cm.getTokenTypeAt(). This does, as far as I can see, work on larger chunks, i.e. "tokens". And because our "mediawiki" mode is already able to handle {{, {{{ and such as single tokens, there are situations where the character-based matchbrackets add-on gets confused.

Option A

My idea for "in the middle" support includes reusing all the steps above exactly as they are. All we do is to add a few more steps before the existing ones:

When the cursor is not next to a bracket, we start searching for one. Let's use this example, where _ is the cursor: ( { } _ { } ).
It might be a good idea to search in both directions at the same time for performance reasons. But that's optional. Let's assume we search forward first.
Going forward, we find {. That's a nesting level we need to skip. As before, create a stack and skip these. This will make sure the } is skipped as well.
The moment we find the ) we found one of the two brackets we need. Now we use the existing algorithm from above to find the other one.

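The complexity of this algorithm is, on average, 150% of the original one, assuming the cursor is exactly in the middle of the pair on average: the new search covers half the distance to reach one bracket, and the existing algorithm then scans the full distance to the other one.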
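
A minimal sketch of Option A on a plain string, under the same simplifications as the description: the names scanFromBracket, findEnclosingCloser and findSurroundingPairA are hypothetical, scanFromBracket is only a stand-in for the existing algorithm, and the scan limits are left out for brevity. The cursor is an index between characters, so cursor 3 sits in front of the character at index 3.

```javascript
// Sketch of Option A on a plain string. All names are hypothetical and the
// rate limits are omitted to keep the example short.
const OPEN = "([{";
const CLOSE = ")]}";

// Stand-in for the existing algorithm: start on a bracket, scan in one
// direction, skip nested pairs, return the partner's index or -1.
function scanFromBracket(text, pos) {
  const forward = OPEN.includes(text[pos]);
  const dir = forward ? 1 : -1;
  let depth = 0;
  for (let i = pos + dir; i >= 0 && i < text.length; i += dir) {
    const ch = text[i];
    if (!OPEN.includes(ch) && !CLOSE.includes(ch)) continue;
    if (OPEN.includes(ch) === forward) depth++;
    else if (depth > 0) depth--;
    else return i;
  }
  return -1;
}

// New pre-step: search forward from the cursor for the first closing
// bracket that does not belong to a nested pair.
function findEnclosingCloser(text, cursor) {
  let depth = 0;
  for (let i = cursor; i < text.length; i++) {
    const ch = text[i];
    if (OPEN.includes(ch)) {
      depth++;                      // nested pair starts, must be skipped
    } else if (CLOSE.includes(ch)) {
      if (depth === 0) return i;    // this one encloses the cursor
      depth--;                      // nested pair ends
    }
  }
  return -1;
}

function findSurroundingPairA(text, cursor) {
  const close = findEnclosingCloser(text, cursor);
  if (close === -1) return null;
  const open = scanFromBracket(text, close); // reuse the existing search
  return open === -1 ? null : { open, close };
}
```

With the example from above written without spaces, findSurroundingPairA("({}{})", 3) skips the second {} going forward, finds ) at index 5, and then scans back to ( at index 0.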

Option B

Similar to A, but we search in both directions (it doesn't matter whether this is done at the same time or one after the other). As above, nested brackets are skipped.
Once we have found a bracket in each direction, we highlight both.
When they don't match (e.g. in { _ )), we additionally highlight them as errors.
We never call the original algorithm. However, we must make sure we respect the same rate limits.

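The complexity of this algorithm should be the same as the original one: the two outward searches together cover the same distance between the two brackets that the original algorithm scans in one direction.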
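
Option B could look roughly like the sketch below, again on a plain string. The names scanOutward and findSurroundingPairB are hypothetical, and maxScan is an assumed per-direction character budget standing in for the add-on's real limits.

```javascript
// Sketch of Option B on a plain string. All names are hypothetical.
const PAIRS = { "(": ")", "[": "]", "{": "}" };
const OPEN = "([{";
const CLOSE = ")]}";

// Scan away from the cursor (dir = 1 forward, dir = -1 backward) for the
// first bracket facing the cursor that is not part of a nested pair.
function scanOutward(text, cursor, dir, maxScan) {
  const wanted = dir === 1 ? CLOSE : OPEN;
  const nested = dir === 1 ? OPEN : CLOSE;
  let depth = 0;
  let i = dir === 1 ? cursor : cursor - 1;
  for (let scanned = 0;
       i >= 0 && i < text.length && scanned < maxScan;
       i += dir, scanned++) {
    const ch = text[i];
    if (nested.includes(ch)) {
      depth++;                      // entering a nested pair
    } else if (wanted.includes(ch)) {
      if (depth === 0) return i;    // bracket that encloses the cursor
      depth--;                      // leaving a nested pair
    }
  }
  return -1; // nothing found within the text or the scan budget
}

function findSurroundingPairB(text, cursor, maxScan = 10000) {
  const open = scanOutward(text, cursor, -1, maxScan);
  const close = scanOutward(text, cursor, 1, maxScan);
  if (open === -1 || close === -1) return null;
  // Both brackets get highlighted; match === false marks them as errors.
  return { open, close, match: PAIRS[text[open]] === text[close] };
}
```

For the mismatched example, findSurroundingPairB("{ )", 1) returns both positions with match set to false, so the caller can highlight them as errors.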

However, in both cases the new search algorithm is executed on all cursor positions, not only when the cursor is next to a bracket. This might significantly impact how responsive it feels when navigating the wikitext. It will probably feel the same as navigating a page that is entirely made of brackets.

There are many ways to improve this:

Start the search only once the cursor has stayed at the same position for, say, 500 ms. This makes sure rapidly changing the cursor position is not affected. But it might be a visible delay, which might create the impression that the editor is slow, the opposite of what we want.
Make sure the search is interrupted and restarts the moment the cursor position changes.
We could cache the structure of the text somehow and scan this tree instead of scanning the text character by character over and over again. The cache is only updated when the text changes.
The syntax highlighter already marks pairs of brackets and everything they contain with increasingly opaque color shades. This might have been a super-cheap way to find the pair of brackets we want to highlight. However, it doesn't look like the necessary information is there: there is only a stream of colored tokens, but no nesting.

thiemowmde mentioned this in T260249: Test performance impacts of highlighting tags for a large section of content.Aug 14 2020, 11:39 AM

thiemowmde moved this task from Sprint Backlog to Doing on the WMDE-QWERTY-Sprint-2020-08-12 board.Aug 17 2020, 7:32 AM

awight assigned this task to A.Wiki1.Aug 17 2020, 7:32 AM

thiemowmde claimed this task.Aug 17 2020, 7:33 AM

thiemowmde added a subscriber: A.Wiki1.

awight removed a subscriber: A.Wiki1.Aug 17 2020, 7:56 AM

Change 620653 had a related patch set uploaded (by Thiemo Kreuz (WMDE); owner: Thiemo Kreuz (WMDE)):
[mediawiki/extensions/CodeMirror@master] [POC] Always highlight brackets when cursor is inside a pair

https://gerrit.wikimedia.org/r/620653

gerritbot added a project: Patch-For-Review.Aug 17 2020, 8:48 AM

To be honest, I'm not sure how to reliably benchmark this. I believe this needs to be done on a slower machine. On my current dev machine, navigating the text feels fine with the POC patch https://gerrit.wikimedia.org/r/620653 in place. I did a quick benchmark with the Chromium dev tools, and my new method findSurroundingBrackets() shows up with 0.4% "self time". That's not nothing, but very reasonable. I believe we can improve this a lot if we need to. I already marked some ideas in the patch.

In T259701#6388134, @thiemowmde wrote:

to be honest I'm not sure how to reliably benchmark this.

IMHO, perception is the most important measure and you've already checked this. In T260249#6388060 I measured microseconds elapsed for iterations through the add-on hooks, but it's not a very useful number other than to show the magnitude and give an upper bound on some specific machine.

thiemowmde moved this task from Doing to Demo on the WMDE-QWERTY-Sprint-2020-08-12 board.Aug 17 2020, 1:04 PM

Lena_WMDE moved this task from Ready for pickup to In sprint on the WMDE-Templates-FocusArea board.Aug 17 2020, 2:33 PM

ECohen_WMDE moved this task from Demo to Done on the WMDE-QWERTY-Sprint-2020-08-12 board.Aug 25 2020, 12:04 PM

Lena_WMDE closed this task as Resolved.Aug 26 2020, 8:22 AM

Lena_WMDE closed subtask T260249: Test performance impacts of highlighting tags for a large section of content as Resolved.

ECohen_WMDE added a parent task: T261857: Implement bracket matching in CodeMirror behind a feature flag.Sep 25 2020, 9:55 AM

Change 620653 abandoned by Thiemo Kreuz (WMDE):
[mediawiki/extensions/CodeMirror@master] [POC] Always highlight brackets when cursor is inside a pair

Reason:
Now part of Ib01d991.