Count the daily requests with 'printable=yes' URLs (normally reached by clicking "Printable version" in the left sidebar on desktop)
|Resolved||Fomafix||T93746 Style for print is not applied at printable version|
|Declined||None||T51722 ResourceLoader does not process media queries for printable view with JS-added CSS|
|Declined||None||T134248 printableversion link in toolbox does not take viewed revision into account|
|Resolved||ovasileva||T167956 Deprecate printable version mode|
|Resolved||None||T167237 Determine number of "Printable version" clicks per day|
Answer: 398k/day during the week of May 22-28, excluding spiders. Very roughly, that corresponds to about 0.08% of our total pageviews (although of course a printed-out Wikipedia article is likely to be associated with vastly more reader attention / content consumption than one pageview in a web browser).
SELECT COUNT(*) AS printviews
FROM wmf.webrequest
WHERE agent_type = 'user'
  AND uri_query LIKE '%printable=yes%'
  AND year = 2017 AND month = 5
  AND day >= 22 AND day <= 28;

printviews
2787554
1 row selected (794.559 seconds)
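For reference, the arithmetic behind the 398k/day and 0.08% figures can be sketched as follows. The weekly total comes from the query output above; the denominator of roughly 530 million daily non-spider pageviews is the approximate figure quoted later in this thread, used here as an assumption.

```python
# Weekly total from the query output above (May 22-28, 2017, agent_type = 'user').
total_print_requests = 2_787_554
days = 7

# Assumption: approximate daily non-spider pageviews, as quoted later in the thread.
daily_nonspider_pageviews = 530_000_000

per_day = total_print_requests / days
share_percent = 100 * per_day / daily_nonspider_pageviews

print(f"{per_day:,.0f} print requests/day")
print(f"{share_percent:.2f}% of daily pageviews")
```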
PS: Out of curiosity, I also looked at the numbers per country for that timespan. The differences are intriguing (e.g. Kenya, as the only African country besides South Africa in this top 50 list, had by far the highest print version click ratio, and China and Cambodia are also on the high end); but one would need a more thorough examination to determine whether there are really persistent geographical differences in how often readers use this feature (one could start by comparing Global South with Global North, and looking at a longer timespan).
SELECT country,
  SUM(partprintrequests) AS printrequests,
  ROUND(100*SUM(partprintrequests)/SUM(partpvs),2) AS printpercentage
FROM (
  SELECT geocoded_data['country'] AS country,
    SUM(IF(uri_query LIKE '%printable=yes%',1,0)) AS partprintrequests,
    SUM(IF(is_pageview,1,0)) AS partpvs
  FROM wmf.webrequest
  WHERE year = 2017 AND month = 5
    AND day >= 22 AND day <= 28
    AND agent_type = 'user'
  GROUP BY geocoded_data) AS gpbygeod
GROUP BY country
ORDER BY printrequests DESC
LIMIT 50;
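To make the logic of the per-country query easier to follow, here is a minimal Python sketch of the same aggregation on in-memory rows. The function name `print_ratio_by_country` and the sample rows are hypothetical and made up for illustration; only the column names (`uri_query`, `is_pageview`, country) mirror the query. As in the query, print requests are counted whether or not the request is also flagged as a pageview.

```python
from collections import defaultdict

def print_ratio_by_country(rows, limit=50):
    """Aggregate print requests and pageviews per country.

    rows: iterable of dicts with keys 'country', 'uri_query', 'is_pageview'
    (a hypothetical in-memory stand-in for webrequest rows).
    """
    prints = defaultdict(int)
    pageviews = defaultdict(int)
    for row in rows:
        country = row["country"]
        # Equivalent of: uri_query LIKE '%printable=yes%'
        if "printable=yes" in row["uri_query"]:
            prints[country] += 1
        if row["is_pageview"]:
            pageviews[country] += 1

    result = []
    for country in set(prints) | set(pageviews):
        pv = pageviews[country]
        # Percentage of pageviews; None if the country had no pageviews at all.
        pct = round(100 * prints[country] / pv, 2) if pv else None
        result.append((country, prints[country], pct))

    # ORDER BY printrequests DESC LIMIT 50
    result.sort(key=lambda r: r[1], reverse=True)
    return result[:limit]

# Made-up sample rows, not real data.
sample = [
    {"country": "A", "uri_query": "?title=X&printable=yes", "is_pageview": True},
    {"country": "A", "uri_query": "", "is_pageview": True},
    {"country": "A", "uri_query": "", "is_pageview": True},
    {"country": "A", "uri_query": "", "is_pageview": True},
    {"country": "B", "uri_query": "", "is_pageview": True},
]
ranking = print_ratio_by_country(sample)
print(ranking)
```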
|Republic of Lithuania||5568||0.12|
|United Arab Emirates||3561||0.04|
|Republic of Korea||3437||0.01|
50 rows selected (12161.217 seconds)
@Tbayer partly on the 23rd and 24th for the smaller stuff, and on the big Wikipedias on June 1st, it seems: https://www.mediawiki.org/wiki/MediaWiki_1.30/Roadmap (1.30.0-wmf.2)
Not really a significant influence so far, it seems. But there is a lot of variance in that graph, a lot more than I would have anticipated (maybe it correlates with the overall traffic on those days). Anyway, it's anywhere between 0.01% and 0.15% of pageviews, based on a daily average of 650 million pageviews (incl. robots, spiders etc., all projects).
OK, here is an extension of the previous graph, now covering April 4-July 2:
Looking at this fuller picture, it seems like there is a baseline of about 400-500k/day (which, again, would correspond to about 0.08 to 0.09% of the ca. 530 million daily non-spider pageviews), with various spikes on top of it.
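Two different denominators appear in this thread (ca. 530 million daily non-spider pageviews, and a 650 million daily average that includes robots and spiders), so the percentage depends on which one is used. A small sketch comparing the 400-500k/day baseline against both; the helper name `share_percent` is hypothetical, and all inputs are the approximate figures quoted in the comments.

```python
def share_percent(daily_print_requests, daily_pageviews):
    """Print requests as a percentage of daily pageviews."""
    return 100 * daily_print_requests / daily_pageviews

# Approximate figures quoted in this thread.
baseline = (400_000, 500_000)
denominators = {
    "non-spider (ca. 530M)": 530_000_000,
    "incl. robots/spiders (650M)": 650_000_000,
}

shares = {
    label: tuple(round(share_percent(b, total), 3) for b in baseline)
    for label, total in denominators.items()
}

for label, (low, high) in shares.items():
    print(f"{label}: {low}% to {high}%")
```

Against the non-spider total, the baseline comes out at roughly 0.075% to 0.094%, consistent with the rounded 0.08 to 0.09% above; against the larger total it is somewhat lower.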
BTW I also checked that all those requests are actually human (more precisely, 100.0% had agent_type = 'user' on each day from June 25 to July 2).
Data sources: see previously mentioned SWAP notebook
Closing this now; feel free to reopen in case there are further questions.