Page MenuHomePhabricator

Implement a comparison algorithm which can compare different MathML outputs for scientific results
Open, Needs TriagePublic

Description

Implement a comparison algorithm which can compare different MathML outputs for scientific results (possibily use mathpipe as before), does this contain some type of similarity score etc?

From the dialogue:

Earlier this was done based on images. Maybe this is too complicated. We can start with tree-edit distance as a similarity measure. But maybe we just want to figure out if same or different for at the moment?

Ok, i guess either Text-Based (i.e. tree-edit distance) or Image comparison can be done, let's see what the compared outputs look like. If there are many differing artifacts within the tool specific MathML notations, image comparison might make more sense.

Tentative Tasks here:

  • Check most suitable comparison method
  • Implement the comparison method ( to be clarified: will this go into automated testsuites or rather extra research test or repo ? )
  • Conduct evaluation (to be clarified, might make sense to have this extra task)
  • Fine tune comparison method: Check that artifacts in MathML are correctly compared (when using text-based approach) etc

> Shifted from https://phabricator.wikimedia.org/maniphest/task/edit/327393/