The "who are moderators" SDS 1.2.3 (T371865) project requires html diff data, i.e. the html of a revision and the html of the parent revision. See more details in T378617.
As html datasets are not currently available in the data lake, this task tracks two initiatives:
- T380871: Create a one-off HTML diff dataset to unblock work on "who are moderators" SDS 1.2.3 for Q2.
- T380874: Request Data Engineering to prioritize adding a HTML dataset to the data lake.
Task 1. is to avoid being blocked on making progress on SDS 1.2.3, and task 2. is needed to accomplish the goal of this project.
