Here is a sample segmented HTML from the cxserver, produced out of https://en.wikipedia.org/wiki/Central_Bank_of_the_Republic_of_Turkey#cite_ref-5
<li about="#cite_note-5" data-seqid="1830" id="cite_note-5"> <span class="cx-segment" data-segmentid="1831"> <a class="cx-link" data-linkid="1832" href="#cite_ref-5" rel="mw:referencedBy"><span class="mw-linkback-text">↑</span></a> <span class="mw-reference-text" id="mw-reference-text-cite_note-5"> <a class="cx-link" data-linkid="1833" href="http://www.tcmb.gov.tr/yeni/eng/" id="mwaw" rel="mw:ExtLink">Banco central de la Repúbl5ica de Turquía.</a> </span> </span> <span class="mw-reference-text" id="mw-reference-text-cite_note-5"> <span class="cx-segment" data-segmentid="1834">Museo de billete: 7. </span> </span> <span class="mw-reference-text" id="mw-reference-text-cite_note-5"> <span class="cx-segment" data-segmentid="1835">Grupo de emisión - Veinte mil turco Lira - <a class="cx-link" data-linkid="1836" href="http://www.tcmb.gov.tr/yeni/banknote/E7/294.htm" id="mwbA" rel="mw:ExtLink">yo. Serie</a> & II. </span> </span> <span class="mw-reference-text" id="mw-reference-text-cite_note-5"> <span class="cx-segment" data-segmentid="1838"> <a class="cx-link" data-linkid="1837" href="http://www.tcmb.gov.tr/yeni/banknote/E7/296.htm" id="mwbQ" rel="mw:ExtLink">Serie. </a> </span> </span> <span class="mw-reference-text" id="mw-reference-text-cite_note-5"> <span class="cx-segment" data-segmentid="1839">@– Recuperó el 20 de abril de 2009.</span> </span> </li>
This corresponds to the rendering
When published the first segment only captured in output
The multiple spans with same id mw-reference-text-cite_note-5 is problematic here.