Based on the attributes collected in segmentation phase 1 and potentially with additional ones added, we want to cluster the wikis into meaningful groups which we can name, describe with personas, and make standard tools for understanding the Wikimedia landscape.
|Open||Neil_P._Quinn_WMF||T188391 Develop strategies and tools for segmenting wikis [2018-19 AP output 4.3]|
|Open||Neil_P._Quinn_WMF||T203033 Construct and personify wiki clusters [segmentation phase 3]|
I've put together the results of the much, much clustering that I did into https://github.com/wikimedia-research/wiki-segmentation/tree/master/clustering-initial/deliverable
We have a meeting scheduled to discuss these results and then it'll be up to the rest of the folks to figure out which clustering they want to use until the data is iterated on and ready to be re-clustered (by me! :D)
@Neil_P._Quinn_WMF fair to call this task resolved?
@kzimmerman and I talked and came up with a tentative plan for me to take the lead on this again in the new year.
I want to do some simple tweaks to Mikhail's clustering (removing inactive wikis and fixing some broken data) and then organize that meeting to review the current clusters.