Page MenuHomePhabricator

CX2: Should not output span tags with VE attributes
Closed, DuplicatePublic

Description

CX2 should clean up its output and remove span tags who are internal to VE, rather than putting them in the resulting article

Example on frwiki "Tommaso Piroli" : original translation with the following content

  • <span data-ve-ignore="true"></span>

Event Timeline

Pginer-WMF subscribed.

We may want to investigate if this is a specific issue of ContentTranslation, VisualEditor, or Parsoid

Nikerabbit moved this task from Bugs to Content handling on the ContentTranslation board.

This is still happening, six years later. See https://en.wikipedia.org/w/index.php?title=Abdourahamane_Soli&oldid=1303063596

There are nonexistent template pages imported:

<templatestyles src="Vorlage:IconExternal/styles.css" />

There are extraneous span and cite tags:

<cite class="citation book cs1" data-ve-ignore="true">''Abdourahmane Idrissa, Samuel Decalo: Historical Dictionary of Niger'' (4th&nbsp;ed.). Scarecrow. 2012. p.&nbsp;414. [[ISBN (identifier)|ISBN]]&nbsp;[[Special:BookSources/978-0-8108-6094-0|<bdi>978-0-8108-6094-0</bdi>]].</cite><span title="ctx_ver=Z39.88-2004&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=book&rft.btitle=Abdourahmane+Idrissa%2C+Samuel+Decalo%3A+Historical+Dictionary+of+Niger.&rft.pages=414&rft.edition=4th&rft.pub=Scarecrow&rft.date=2012&rft.isbn=978-0-8108-6094-0&rfr_id=info%3Asid%2Fen.wikipedia.org%3AAbdourahamane+Soli" class="Z3988" data-ve-ignore="true"></span>

Why can't the translator filter this stuff out? There are multiple open bugs on issues similar to this, like T218420 and T287936, some of which have been open for four to six years.