We want to integrate page view information into the scoring algorithms we use for both the completion suggestions and our regular search results.
Our initial idea is we only update this page view information when doing normal document updates after a page edit. We need to analyze if this page view data will be "good enough" or if we need to do something more. Maybe any page not edited in the last 30 days needs its page view data updated, maybe not.
I'm guessing we could perhaps look at the distribution of how often pages are edited and compare that against how much page view data tends to change over that time. The page view information is all available in hive from stat1002.