Page MenuHomePhabricator

Paragraph wrapper introduces <p></p> in output HTML -- investigate and kill them where they are a result of edge case diffs between PHP parser and Parsoid code
Closed, ResolvedPublic

Description

See discussion @ https://gerrit.wikimedia.org/r/c/mediawiki/services/parsoid/+/476053/3/tests/parserTests.txt#31161

Most of the empty p-tags are likely edge case bugs.

Event Timeline

ssastry renamed this task from Paragraph wrapper introduces <p></p> in o utput HTML -- investigate and kill them where they are a result of edge case diffs between PHP parser and Parsoid code to Paragraph wrapper introduces <p></p> in output HTML -- investigate and kill them where they are a result of edge case diffs between PHP parser and Parsoid code.Nov 28 2018, 7:27 PM
ssastry triaged this task as Medium priority.

This test "Empty paragraphs (marked with mw-empty-elt) found in source should not be normalized away" should be removed after this is fixed.

Change 476352 had a related patch set uploaded (by Arlolra; owner: Arlolra):
[mediawiki/services/parsoid@master] Stop matching non-breaking spaces in TokenUtils.isSolTransparent

https://gerrit.wikimedia.org/r/476352

Change 476352 merged by jenkins-bot:
[mediawiki/services/parsoid@master] Stop matching non-breaking spaces in TokenUtils.isSolTransparent

https://gerrit.wikimedia.org/r/476352

Change 477686 had a related patch set uploaded (by Arlolra; owner: Arlolra):
[mediawiki/services/parsoid@master] Don't migrate out trailing non-breaking spaces

https://gerrit.wikimedia.org/r/477686

Change 477686 merged by jenkins-bot:
[mediawiki/services/parsoid@master] Don't migrate out trailing non-breaking spaces

https://gerrit.wikimedia.org/r/477686

Change 596761 had a related patch set uploaded (by Subramanya Sastry; owner: Subramanya Sastry):
[mediawiki/services/parsoid@master] WIP: Quick fix for incorrect p-wrapping of table captions

https://gerrit.wikimedia.org/r/596761

ssastry lowered the priority of this task from Medium to Low.May 15 2020, 9:26 PM
ssastry moved this task from Backlog to Known Differences on the Parsoid board.

Change 597319 had a related patch set uploaded (by Arlolra; owner: Arlolra):
[mediawiki/core@master] Add caption to always suppressing

https://gerrit.wikimedia.org/r/597319

Change 599076 had a related patch set uploaded (by Arlolra; owner: Arlolra):
[mediawiki/services/parsoid@master] Add caption to always suppressing

https://gerrit.wikimedia.org/r/599076

Change 597319 merged by jenkins-bot:
[mediawiki/core@master] Add caption to always suppressing

https://gerrit.wikimedia.org/r/597319

Change 599076 merged by jenkins-bot:
[mediawiki/services/parsoid@master] Add caption to always suppressing

https://gerrit.wikimedia.org/r/599076

It seems like quite a number of cases have been covered in the above patches. I wonder if now's the time to remove the papering over and see what shakes out?

It seems like quite a number of cases have been covered in the above patches. I wonder if now's the time to remove the papering over and see what shakes out?

A low-priority tech debt task sure.

Change 599123 had a related patch set uploaded (by Arlolra; owner: Arlolra):
[mediawiki/services/parsoid@master] Revert the rest of 18a98af

https://gerrit.wikimedia.org/r/599123

Change 596761 abandoned by Arlolra:
WIP: Quick fix for incorrect p-wrapping of table captions

Reason:
In favour of https://gerrit.wikimedia.org/r/c/mediawiki/services/parsoid/ /599076

https://gerrit.wikimedia.org/r/596761

Change 601428 had a related patch set uploaded (by Subramanya Sastry; owner: Subramanya Sastry):
[mediawiki/vendor@master] Bump Parsoid to v0.12.0-a15

https://gerrit.wikimedia.org/r/601428

Change 601428 merged by jenkins-bot:
[mediawiki/vendor@master] Bump Parsoid to v0.12.0-a15

https://gerrit.wikimedia.org/r/601428

Change 599123 merged by jenkins-bot:
[mediawiki/services/parsoid@master] Revert the rest of 18a98af

https://gerrit.wikimedia.org/r/599123

ssastry assigned this task to Arlolra.

Change 605678 had a related patch set uploaded (by Subramanya Sastry; owner: Subramanya Sastry):
[mediawiki/vendor@master] Bump Parsoid to 0.12.0-a17

https://gerrit.wikimedia.org/r/605678

Change 605678 merged by jenkins-bot:
[mediawiki/vendor@master] Bump Parsoid to 0.12.0-a17

https://gerrit.wikimedia.org/r/605678