WikiApiary has collected information on 25,000 public, active MediaWiki (MW) sites and 3,000 inactive MW sites. We will query WikiApiary and sort this information into meaningful categories.
To start, we will look at the active sites to identify the third-party (non-WMF) users and refine their classifications beyond broad labels such as "commercial", "university", and "government". We will also identify the release versions they are running and anonymize the data.
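As a rough illustration of the steps above, the following Python sketch filters out WMF sites, replaces each site URL with a salted hash, and counts the remaining sites by category and release version. Everything in it is an assumption made for illustration: the record fields (`url`, `category`, `version`, `wmf`), the sample values, and the helper names `anonymize` and `summarize` are hypothetical and do not reflect WikiApiary's actual schema or API. A salted hash is only one possible anonymization approach, and on its own it may not be sufficient.

```python
import hashlib
from collections import Counter

# Hypothetical records: field names and values are illustrative only,
# not WikiApiary's real data model.
SAMPLE_SITES = [
    {"url": "https://wiki.example.edu", "category": "university",
     "version": "1.23.5", "wmf": False},
    {"url": "https://wmf-wiki.example.org", "category": "wmf",
     "version": "1.25wmf10", "wmf": True},
    {"url": "https://docs.example.com", "category": "commercial",
     "version": "1.23.5", "wmf": False},
    {"url": "https://kb.example.net", "category": "commercial",
     "version": "1.19.2", "wmf": False},
]


def anonymize(url, salt):
    """Return a short salted SHA-256 digest that stands in for the URL."""
    digest = hashlib.sha256((salt + url).encode("utf-8")).hexdigest()
    return digest[:12]


def summarize(sites, salt="replace-with-a-secret-salt"):
    """Drop WMF sites, anonymize the rest, and count by version and category."""
    third_party = [s for s in sites if not s["wmf"]]
    rows = [
        {
            "site_id": anonymize(s["url"], salt),
            "category": s["category"],
            "version": s["version"],
        }
        for s in third_party
    ]
    by_version = Counter(r["version"] for r in rows)
    by_category = Counter(r["category"] for r in rows)
    return rows, by_version, by_category


if __name__ == "__main__":
    rows, by_version, by_category = summarize(SAMPLE_SITES)
    for version, count in by_version.most_common():
        print(version, count)
    for category, count in by_category.most_common():
        print(category, count)
```

In practice the records would come from WikiApiary queries rather than a hard-coded list, and the coarse `category` values would be replaced by the refined classifications this project aims to produce.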
For the inactive sites, we want to find out why they are inactive. See T1246: Mentor Google Code-in 2014 Student(s) Who Will Research MediaWiki Sites Classified as Defunct on WikiApiary.