Checking in. I wish that I hadn't turned on verbose mode (which is way too verbose) for our monthly article quality extraction process. I'd be able to look at INFO log lines to see how we're progressing on processing dump files.
Right now, i can only say that we've got 291M article quality assessments. We might end up with 360M if my conservative estimate is about right.
All datasets are here: https://datasets.wikimedia.org/public-datasets/all/wp10/20160801/
I'm traveling so it's hard to upload to figshare. I'll do that upload when I'm on a better connection.