I just discovered the edit data in the wmf_raw database on the data lake. Could we add a raw copy of the sites table as well? It doesn't matter which wiki it comes from, since the table should be the same on all of them.
The main benefit would be that we could easily join to it and filter data in the data lake down to "only Wikipedias" or "only Wikivoyages" and so on.
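
A rough sketch of the kind of join I have in mind, using an in-memory SQLite database as a stand-in for the data lake. The `revision` table, its `wiki_db` column, and the sample rows are made up for illustration and may not match the real wmf_raw schema. `site_global_key` and `site_group` are meant to be the corresponding columns of the MediaWiki sites table.

```python
import sqlite3


def build_demo_db():
    """Create a tiny in-memory stand-in for the data lake (illustrative only)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        -- Trimmed-down copy of the sites table: one row per wiki.
        CREATE TABLE sites (
            site_global_key TEXT PRIMARY KEY,  -- e.g. 'enwiki'
            site_group      TEXT               -- e.g. 'wikipedia'
        );
        -- Hypothetical edit data keyed by wiki database name.
        CREATE TABLE revision (
            wiki_db TEXT,
            rev_id  INTEGER
        );
        INSERT INTO sites VALUES
            ('enwiki', 'wikipedia'),
            ('dewiki', 'wikipedia'),
            ('enwikivoyage', 'wikivoyage');
        INSERT INTO revision VALUES
            ('enwiki', 1),
            ('enwikivoyage', 2),
            ('dewiki', 3),
            ('enwiki', 4);
        """
    )
    return conn


def revisions_for_group(conn, site_group):
    """Return (wiki_db, rev_id) rows for wikis belonging to one site group."""
    query = """
        SELECT r.wiki_db, r.rev_id
        FROM revision AS r
        JOIN sites AS s ON s.site_global_key = r.wiki_db
        WHERE s.site_group = ?
        ORDER BY r.rev_id
    """
    return conn.execute(query, (site_group,)).fetchall()


if __name__ == "__main__":
    conn = build_demo_db()
    print(revisions_for_group(conn, "wikipedia"))
    print(revisions_for_group(conn, "wikivoyage"))
```

On the data lake the same join would presumably be written in Hive or Spark SQL against the real tables, with only the filter value changing per project family.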