Maniphest T119088

Parsing team: Q3 2015-16 goals planning dependency tracker task
Closed, ResolvedPublic
Actions

Description

Reading, Visual Editor, Flow, Language, Services and possibly Community Tech undertake work that requires support from the Parsing Team. To help with our team's Q3 planning process, it would be helpful to know about projects that you would like support from us.

This is a tracker task to identify Q3 dependencies on the Parsing team. Feel tree to either add this as a blocker on your team's tasks that you depend on parsing team work in Q3 2015-16 (Jan - Mar 2016) or add a comment here. We will use this information to figure out what we can reasonably prioritize in Q3. Those that we cannot get done in Q3 will have the blocker removed.

Related Objects

Mentioned Here: T78676: Store & load data-mw separately
T39902: RFC: Implement rendering of redlinks in Parsoid HTML as post-processor
T117519: Mark red links
T118306: File: pages of images stored on commons result in 404s
T118882: Honor Accept-Language header to set language variant
T119265: More metadata in Parsoid output
T105845: RFC: Page components / content widgets
T114072: <section> tags for MediaWiki sections
T116350: Design and implement an algorithm to provide stable element ids

Event Timeline

ssastry created this task.Nov 19 2015, 4:22 PM

ssastry raised the priority of this task from to High.

ssastry updated the task description. (Show Details)

ssastry added projects: Parsing-Team--ARCHIVED, Services, Reading-Admin, Language-Team, Community-Tech, StructuredDiscussions, VisualEditor.

ssastry subscribed.

Restricted Application added a project: Collaboration-Team-Triage. · View Herald TranscriptNov 19 2015, 4:22 PM

Restricted Application added a subscriber: Aklapper. · View Herald Transcript

Arrbee set Security to None.Nov 19 2015, 4:25 PM

Arrbee added subscribers: Amire80, • santhosh.

Arrbee subscribed.

• bearND subscribed.Nov 19 2015, 4:49 PM

Some things that I'd love to see prioritized on your end are:

Improve how Parsoid marks up semantic content elements like navboxes, infoboxes, table of content (T105845), sections (T114072) to make it easier and more efficient to work with / post-process the content.
Provide component metadata (T105845), to aid in composition of content & solving generic issues like dependency tracking, cache invalidation, redirects.
Multimedia support, even if it's "just" <video> tags.
Not a hard blocker, but I think we should seriously tackle this to avoid driving projects like CX into implementing their own: T116350: Design and implement an algorithm to provide stable element ids

In T119088#1818145, @GWicke wrote:

Multimedia support, even if it's "just" <video> tags.

This is one of our Q2 goals but we suspect some or all of it will spill over to Q3.

From Language-Team we would like to see some priority for T116350: Design and implement an algorithm to provide stable element ids

dr0ptp4kt moved this task from Backlog to Doing on the Reading-Admin board.Nov 21 2015, 12:12 AM

From Mobile-Content-Service (Reading) we would like to see the following improvements in preference order:

T118306 [RB] Make image description pages on commons work
T39902 (dup T117519) Red links
T119265 More metadata
T118882 [RB] Language variant handling (zh-hans vs zh-hant)

[RB]: probably something that needs to be changed on the RESTBase level.

Skyllfully moved this task from New & TBD Tickets to Archive on the Community-Tech board.Nov 22 2015, 8:18 PM

Skyllfully moved this task from Archive to New & TBD Tickets on the Community-Tech board.

Here's some of the wishlist of reading web:

Stable shared heading id's with mediawiki parser.
First class sections
Extremely lean html endpoint.
- Instead of the extremely verbose 2-way html-wikitext html, let's have a standard extremely lean html with all the transforms that Android content service is doing. (Minimal payload, fastest rendering)

I'll add more if I think of them.

In T119088#1824560, @Jhernandez wrote:

Extremely lean html endpoint.

Instead of the extremely verbose 2-way html-wikitext html, let's have a standard extremely lean html with all the transforms that Android content service is doing. (Minimal payload, fastest rendering)

Note that this is basically T78676: Store & load data-mw separately and requires the VisualEditor (cc @Jdforrester-WMF, @Esanders) and Content Translation (cc @santhosh) to have functionality in place to load data-mw separately. So, us implementing this is blocked on VE and CX acquiring that functionality.

Jdforrester-WMF edited projects, added Contributors-Team; removed VisualEditor, StructuredDiscussions.Nov 24 2015, 8:25 PM

ssastry updated the task description. (Show Details)Nov 24 2015, 8:54 PM

Note that this is basically T78676: Store & load data-mw separately and requires the VisualEditor (cc @Jdforrester-WMF, @Esanders) and Content Translation (cc @santhosh) to have functionality in place to load data-mw separately.

To make this more concrete, a basic implementation of this would

make another API request to /data-mw/{title}{/revision} to fetch the data-mw metadata, and
iterate through each id in the returned object, and add a data-mw attribute to the corresponding DOM node returned by getElementById.

Pseudo code:

// Assuming dataMw holding the parsed data-mw response, and 
// doc holding the DOM
Object.keys(dataMw).forEach(function(id) {
  var node = doc.getElementById(id);
  if (node) {
    node.dataset.mw = JSON.stringify(dataMw[id]);
  } else {
    throw new Error("Node corresponding to id " + id + " not found!");
  }
});

In T119088#1824977, @ssastry wrote:

In T119088#1824560, @Jhernandez wrote:

Extremely lean html endpoint.

Instead of the extremely verbose 2-way html-wikitext html, let's have a standard extremely lean html with all the transforms that Android content service is doing. (Minimal payload, fastest rendering)

Note that this is basically T78676: Store & load data-mw separately and requires the VisualEditor (cc @Jdforrester-WMF, @Esanders) and Content Translation (cc @santhosh) to have functionality in place to load data-mw separately. So, us implementing this is blocked on VE and CX acquiring that functionality.

@Esanders says they can get this functionality in place. @santhosh how about CX?

@ssastry, CX can also do this. The method @GWicke mentioned should work. We(Parsing, VE and CX) just need to coordinate on timeline of implementation and deployment.

I am assuming that the HTML->WIkitext conversion path is unaffected and it continues to accept HTML with data-mw in it.

• Mholloway subscribed.Dec 1 2015, 4:49 PM

In T119088#1824977, @ssastry wrote:

In T119088#1824560, @Jhernandez wrote:

Extremely lean html endpoint.

Instead of the extremely verbose 2-way html-wikitext html, let's have a standard extremely lean html with all the transforms that Android content service is doing. (Minimal payload, fastest rendering)

Note that this is basically T78676: Store & load data-mw separately and requires the VisualEditor (cc @Jdforrester-WMF, @Esanders) and Content Translation (cc @santhosh) to have functionality in place to load data-mw separately. So, us implementing this is blocked on VE and CX acquiring that functionality.

yup. It would be useful if we could at least invoke turning this off via an optional API parameter in the interim.

Here are our draft goals for this quarter:

Leaner HTML by stripping data-mw and storing in separate bucket in RESTBase -- requires co-ordinating deploy with Services, CX, VE, OCG after code on their end is implemented to repopulate data-mw in the DOM

Improved multimedia support in Parsoid -- RFCs go through ArchCOM; DOMSpec updated; updates to PHP parser + Parsoid

Majority of the blockers for replacing Tidy identified and resolved -- mass visual diff testing infra in place; processes in place for fixing templates and pages affected by switch; PHP parser / Parsoid changes in place where necessary

Anything else we get done will be bonuses like pieces of T119265: More metadata in Parsoid output perhaps.

In T119088#1844973, @Jdlrobson wrote:

In T119088#1824977, @ssastry wrote:

In T119088#1824560, @Jhernandez wrote:

Extremely lean html endpoint.

Instead of the extremely verbose 2-way html-wikitext html, let's have a standard extremely lean html with all the transforms that Android content service is doing. (Minimal payload, fastest rendering)

Note that this is basically T78676: Store & load data-mw separately and requires the VisualEditor (cc @Jdforrester-WMF, @Esanders) and Content Translation (cc @santhosh) to have functionality in place to load data-mw separately. So, us implementing this is blocked on VE and CX acquiring that functionality.

yup. It would be useful if we could at least invoke turning this off via an optional API parameter in the interim.

Leaner HTML is one our goals for next quarter, so maybe better to get that done. But, separate from that, if you want an interim flag, this would have to be something that RESTBase provides.

dr0ptp4kt subscribed.Dec 11 2015, 9:59 AM

dr0ptp4kt moved this task from Doing to Next Quarter Candidates on the Reading-Admin board.Dec 11 2015, 10:43 AM

• DannyH moved this task from New & TBD Tickets to Product backlog on the Community-Tech board.Dec 11 2015, 6:40 PM

dr0ptp4kt moved this task from Next Quarter Candidates to Current Quarter on the Reading-Admin board.Mar 14 2016, 7:42 PM

ssastry closed this task as Resolved.Jun 13 2016, 4:00 PM

ssastry claimed this task.

• DannyH moved this task from Product backlog to Archive on the Community-Tech board.Jul 5 2016, 8:05 PM