Culture.Linguistics contains many articles that are disambiguations of names, like this one. Perhaps we should handle disambiguation pages differently, or exclude them from the results we give to newcomers.
The Culture.Linguistics connections to names is because of WikiProject Anthroponymy (yaml). Many of those name pages aren't actually disambiguation pages in English -- e.g., the Robert example. I prefer the route of filtering out disambiguation pages after the fact and leaving Anthroponymy in because it legitimately does belong in Linguistics. I'm willing to be convinced otherwise though. Interestingly too, many of the pages it covers are actually redirects in English -- e.g., the page Churchill, which redirects to Winston Churchill.
We should implement a strategy for filtering out disambiguation pages as part of our modeling pipeline.
Note that we'll need to check if a specific page on a specific wiki is a disambiguation page in order to exclude it from the set.