[RFC] Introduce notion of DOM scopes in wikitext
Closed, DeclinedPublic
Actions

Assigned To

Authored By

	ssastry
	Oct 1 2015, 10:23 PM

Description

Introduce the notion of DOM scopes in wikitext (https://www.mediawiki.org/wiki/User:SSastry_%28WMF%29/Notes/Wikitext#DOM_scopes). The idea is that when enforced, HTML produced by the wikitext construct will be balanced in isolation.

The first and easiest application of this would be for top level sections (T114072) where we can experiment and prototype this idea (without expliciting calling it such). We would have to understand the implications for editing, and for the parsing implementations (ease of supporting this and performance impacts). This can inform the other contexts where this can be extended to.In the longer run, this can be applied to template output, extension output, tables, image captions -- as part of the gradual move towards evolving wikitext towards a newer Wikitext 2.0 (T112996).

Doing this can help with:

editability: individual dom scopes can be edited independently and in isolation. This can help VE as well as other wikitext editing tools.
performance:you can parse and process DOM scopes somewhat in isolation -- a step towards supporting incremental parsing
ability to reason about the markup: you don't have to look at rest of the page to make sense of what this piece of code does (I am deliberately exaggerating this to highlight that when enforced, this property is not dependent on ability to not have wikitext markup errors).

The name and notion is up for discussion, but the idea is to come up with an understandable and enforceable concept that can be applied consistently.

Related Objects

Mentioned In: T149282: Improved editability, tooling, reasoning, and performance by adopting DOM-based semantics for wikitext
T125865: Assign RFCs to ArchCom shepherds
T122472: Whole sections put into one line
T118110: DOM scopes / constraints inconsistently defined for transcluded lists and templates
T116461: End table added at an obviously incorrect place
T114072: <section> tags for MediaWiki sections
T114445: [RFC] Balanced templates
Mentioned Here: E146: RFC Meeting: triage! (2016-03-02, #wikimedia-office)
T114445: [RFC] Balanced templates
T112996: A vision for templates / wikitext 2.0
T114072: <section> tags for MediaWiki sections

Event Timeline

ssastry created this task.Oct 1 2015, 10:23 PM

ssastry raised the priority of this task from to Medium.

ssastry updated the task description. (Show Details)

ssastry added projects: Parsoid, Parsing-Team--ARCHIVED, VisualEditor.

ssastry subscribed.

Restricted Application added a subscriber: Aklapper. · View Herald TranscriptOct 1 2015, 10:23 PM

ssastry updated the task description. (Show Details)Oct 1 2015, 10:27 PM

ssastry set Security to None.

ssastry added subscribers: • GWicke, • brooke, tstarling, Halfak.

ssastry mentioned this in T114445: [RFC] Balanced templates.Oct 1 2015, 10:47 PM

Whoops, I think this is at least a partial dup of T114445: [RFC] Balanced templates, which I think I was writing at the same time you were writing this.

In T114444#1695624, @cscott wrote:

Whoops, I think this is at least a partial dup of T114445: [RFC] Balanced templates, which I think I was writing at the same time you were writing this.

Yes, there is overlap, but, not a duplicate. This one is concerned about scoping more generally and the other one is concerned specifically about templates and also brings in all the discussion related to it (including the Q2 goal we have about the prototype of a opt-in / opt-out solution) that is somewhat about, but not entirely only about scoping.

Works for me. Let's let T114445 be all the "controversial" questions that require broader discussion; this task can be narrowly focused on implementation.

ssastry mentioned this in T114072: <section> tags for MediaWiki sections.Oct 6 2015, 5:47 PM

Jdforrester-WMF moved this task from To Triage to Freezer on the VisualEditor board.Oct 6 2015, 7:14 PM

• GWicke added a project: TechCom-RFC.Oct 7 2015, 8:40 PM

tstarling moved this task from P1: Define to Under discussion on the TechCom-RFC board.Oct 14 2015, 8:50 PM

ssastry mentioned this in T116461: End table added at an obviously incorrect place.Oct 25 2015, 5:14 PM

• GWicke mentioned this in T118110: DOM scopes / constraints inconsistently defined for transcluded lists and templates.Nov 8 2015, 6:25 AM

• Elitre subscribed.Nov 19 2015, 4:07 PM

ssastry mentioned this in T122472: Whole sections put into one line.Dec 28 2015, 4:50 PM

• RobLa-WMF mentioned this in T125865: Assign RFCs to ArchCom shepherds.Feb 10 2016, 8:15 PM

• RobLa-WMF moved this task from Under discussion to (unused) on the TechCom-RFC board.Feb 12 2016, 1:57 AM

Per E146

• DStrine moved this task from (unused) to Under discussion on the TechCom-RFC board.Mar 2 2016, 10:32 PM

In T114444#1695661, @cscott wrote:

Works for me. Let's let T114445 be all the "controversial" questions that require broader discussion; this task can be narrowly focused on implementation.

This RFC is about the general notion of DOM scopes that is showing up in different guises in the <section> tags proposal, balanced templates proposal, and potentially others in the future. Those other RFCs are grappling with backward-compability concerns in specific problem areas (sections, templates) which is somewhat orthogonal to the semantics that will result from those proposals.

This RFC is about:

does it make sense to generalize the scoping semantics more broadly and apply in other areas?
identify the implications (pros / cons) of doing so.
identify the implementation challenges and propose concrete implementation strategies that are applicable in all those scenarios.

A fully reversible wikitext2json and json2wikitext would be nice. Each object of wikitext (section, paragraph, magicword, transclusion, wilklink, exernallink, ref-tags, pre/nowiki/syntaxhighlight ....) could be representated as a k-v-pair where the value is a object or as array of objects. The main goal of this functionality is to make tooling of wikitext easier.
(one usecase could be: replace of misspelling in section and paragraphs but not in transclutions, reftags, external linktext, links, ...)

Boshomi unsubscribed.Mar 2 2016, 11:03 PM

@Boshomi: We already have the functionality you are looking for. Parsoid's HTML can be converted back to wikitext. It can also be represented as JSON without too much trouble. See https://www.mediawiki.org/wiki/Parsoid/MediaWiki_DOM_spec for background.

Is there a API for parasoid?

Boshomi: See https://en.wikipedia.org/api/rest_v1/?doc, or more generally https://{domain}/api/rest_v1/?doc

• MZMcBride subscribed.Apr 1 2016, 12:21 AM

RandomDSdevel awarded a token.Apr 12 2016, 8:35 PM

RandomDSdevel subscribed.

Danny_B added a project: Proposal.May 2 2016, 10:39 PM

• RobLa-WMF added a project: TechCom-Has-shepherd.Jul 13 2016, 5:09 AM

• RobLa-WMF moved this task from Backlog to tstarling on the TechCom-Has-shepherd board.Jul 13 2016, 5:12 AM

ssastry mentioned this in T149282: Improved editability, tooling, reasoning, and performance by adopting DOM-based semantics for wikitext.Nov 23 2016, 6:58 PM

Arlolra subscribed.Nov 24 2016, 12:59 AM

Krinkle removed projects: TechCom-Has-shepherd, Proposal.Dec 21 2017, 11:52 PM

The meeting minutes suggest the action item was for @ssastry to reword this proposal so as to make it more concrete. @ssastry have you followed up on this? Any interest in pursuing it further?

There are multiple related proposals that have evolved slightly differently -- I want to consolidate them into a single proposal.

If we want to restrict structure to just templates, there is the balanced templates proposal (T114445)
If we want to make structure more generic in wikitext, dom scopes is one way to do it.
If we want to formalize the notion of structure (dom scopes or balanced templates) into the notion of a type (which has additional benefits), we have typed wikitext.
- If we generalize the notion of typed wikitext when applied to templates to capture additional information and meta data about templates in the "type structure", we have typed templates. https://www.mediawiki.org/wiki/Parsing/Notes/Wikitext_2.0/Typed_Templates

The broader context for this is to support document composition from fragments. There are many different ways of achieving document composition but I think typed wikitext is my proposed pathway to that goal.

So, I think dom scopes in and of itself is an early notion / proposal. https://www.mediawiki.org/wiki/Parsing/Notes/Wikitext_2.0 is the typed wikitext proposal that I discussed in 2017 devsummit . I think I wish to pursue the more full fledged proposal of a typing layer on top of wikitext which implements dom scoping semantics on various constructs. If necessary, we could close this one and open a new one for it, or repurpose this one for it, as appropriate. But, I can flesh out more details in the typed wikitext 2.0 proposals, as required.

I agree that the overall goal is to get to Wikitext 2.0, I just wasn't sure if you plan on tackling this particular issue as milestone in that path. Declining this RfC for now then and waiting on the Wikitext 2.0 one :)

[RFC] Introduce notion of DOM scopes in wikitextClosed, DeclinedPublicActions

Description

Related Objects

Event Timeline

[RFC] Introduce notion of DOM scopes in wikitext
Closed, DeclinedPublic
Actions