During the HTML enrichment pipeline, we need to also enrich the event with the parent revision HTML content. To do that, we are doing the following:
- Use the new schema designed for it
- If the page change has a parent_rev_id, call the HTML endpoint to get the content.
- Given the main HTML (html_canonical) and the parent HTML, compute a unified_diff and store it in the event.
- Make sure the unified_diff can be used to build the parent HTML completely.