Both of these projects are cool and they have produced some really valuable datasets. With @ellery moving out of the WMF, these data items are unlikely to get updates in the near future. We should turn them into regular jobs and host them somewhere for download.
Clickstream dataset was generated manually by @Ellery.
We should productionize its regular generation and publication.