(In the context of T160430)
Wikimedia has some "GitHub-only" code repositories.
Find out how to differentiate those repositories from the ones that are only mirrors, so that the mirrors can be excluded (maybe there is no trivial way).
Also, an open question: what about repositories that Wikimedia forked? Exclude them or not? (This is similar to the problem of measuring activity in pulled upstream repos in Gerrit.)
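One possible starting point, as a rough sketch only: the GitHub REST API returns a `fork` flag and a `mirror_url` field for each repository, so records from the organization's repository listing could be sorted into forks, mirrors, and "GitHub-only" repos. The `mirror_url` field is only set for mirrors that GitHub itself maintains, so it will probably miss mirrors that are pushed from Gerrit. The check for the word "mirror" in the description is an assumption about how those mirrors are labelled and would need verifying. The function names and the sample records below are hypothetical.

```python
def classify_repo(repo):
    """Return 'fork', 'mirror', or 'github-only' for one repository record.

    `repo` is a dict shaped like one entry of the GitHub REST API
    repository listing (fields used: fork, mirror_url, description).
    """
    if repo.get("fork"):
        return "fork"
    description = (repo.get("description") or "").lower()
    # Assumption: mirrors either have mirror_url set or say "mirror"
    # in their description. This heuristic is unverified.
    if repo.get("mirror_url") or "mirror" in description:
        return "mirror"
    return "github-only"


def github_only(repos, include_forks=False):
    """List the names of repos that are not mirrors (forks are optional)."""
    wanted = {"github-only", "fork"} if include_forks else {"github-only"}
    return [r["name"] for r in repos if classify_repo(r) in wanted]


# Hypothetical sample records, not real Wikimedia repositories.
sample = [
    {"name": "example-mirror", "fork": False, "mirror_url": None,
     "description": "Github mirror of example - our actual code is hosted with Gerrit"},
    {"name": "example-fork", "fork": True, "mirror_url": None,
     "description": "Fork of an upstream library"},
    {"name": "example-tool", "fork": False, "mirror_url": None,
     "description": None},
]

if __name__ == "__main__":
    print(github_only(sample))
    print(github_only(sample, include_forks=True))
```

The `include_forks` switch leaves the fork question open: both variants can be computed and compared before deciding.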
Random links