One of the major questions in interpreting the first round of demographics surveys has been: which readers are the results reflective of? That is, do the results best represent the population that reads daily? The population that reads weekly? Monthly?
Why might a longer survey give different results?
Reasons why a longer survey (one month) might change the results, especially with respect to gender:
- External surveys have shown that the gender distribution of Wikipedia readers can depend on whether you ask something like "Do you read daily?" or "Do you read weekly?"
- In several languages in the first round of surveys, the proportion of men and women among respondents became more balanced the further into the week the survey ran.
- The debiasing analyses showed that women were consistently less likely than men to respond to the surveys.
- It is known that, especially on desktop, the survey widget is easy to miss. For readers who visit Wikipedia once a week or less, a short survey window makes it very unlikely that they ever see the widget at all.
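The last point can be quantified with a toy model. The sketch below (the model and all numbers are my illustrative assumptions, not from the source) treats a reader's visits as a Poisson process and asks how likely readers of different frequencies are to visit at all while the survey is running, which is a precondition for being sampled:

```python
# Illustrative sketch (assumptions mine): how the pool of readers who even
# have a chance to see the survey shifts with survey length. Visits are
# modeled as Poisson with the reader's average daily rate; a reader can only
# be sampled if they visit at least once while the survey is live.
import math

def p_visits_at_least_once(visits_per_day, survey_days):
    """Probability of at least one visit during the survey window."""
    return 1 - math.exp(-visits_per_day * survey_days)

for label, rate in [("daily", 1.0), ("weekly", 1 / 7), ("monthly", 1 / 30)]:
    short = p_visits_at_least_once(rate, survey_days=3)
    long_ = p_visits_at_least_once(rate, survey_days=28)
    print(f"{label:7s} reader: 3-day survey {short:5.1%}, 28-day survey {long_:5.1%}")
```

Under these assumptions a monthly reader has roughly a 1-in-10 chance of even being present during a 3-day survey but better than even odds during a 28-day one, so a longer window should shift the respondent pool toward less frequent readers.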
We are always hesitant to survey readers unnecessarily: however small the survey widget, it still has the potential to disrupt reading. One benefit of running a survey for longer is that we can drastically lower the sampling rate (e.g., to around 1:100). Only a small proportion of readers will then see the survey, so even readers who use many browsers or clear their cookies are unlikely to be resampled. Repeatedly resampling the same user is problematic mainly from the perspective of respecting readers rather than of statistics or results.
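To illustrate the resampling point, here is a rough binomial sketch (the model and the choice of three identities are my assumptions, not from the source) of how often one person would be shown the survey more than once when they appear as several independent identities, at a high versus a low sampling rate:

```python
# Rough sketch (assumptions mine): chance that one person is shown the survey
# at least twice because they appear as several independent identities
# (multiple browsers, cleared cookies), each sampled independently.
from math import comb

def p_sampled_at_least_twice(n_identities, sampling_rate):
    """Binomial model: 1 minus P(zero or one identity sampled)."""
    p_le_one = sum(
        comb(n_identities, k)
        * sampling_rate**k
        * (1 - sampling_rate) ** (n_identities - k)
        for k in (0, 1)
    )
    return 1 - p_le_one

for rate, label in [(1 / 20, "1:20"), (1 / 100, "1:100")]:
    print(f"{label}: 3 identities -> {p_sampled_at_least_twice(3, rate):.4%}")
```

Dropping the rate from 1:20 to 1:100 cuts the double-exposure probability by far more than 5x in this model, since the probability of two hits scales roughly with the square of the rate.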
We also will not run these surveys in all 13 languages, but will limit them to new languages and a few languages whose sampling rate in the first round was lower than 1 in 20 (to minimize overlap between surveys). This way we do not overburden every language community, while still getting some sense of whether the patterns we see are language-specific. We hope the results will indicate how much the results for the other languages might shift with a longer survey period.
The exact same survey will be used to maximize comparability between the June round and this round, though seasonality will likely have a large enough impact to make direct comparison impossible.
- English was selected for its wide coverage and the availability of external survey data in America.
- Polish was not part of the first round, so it is being included this round.
- Russian is included because it had a low sampling rate in the first round (so readers are less likely to have already seen the survey) and displayed the narrowing gender gap over the course of the week.
Sampling rates are still more an art than a science. In this case, the survey is expected to run 4x longer than the June survey, and with more time to see and respond to it, we expect the response rate to increase; this argues for at least a 6-8x reduction in sampling rate. For both Russian and English, we also do not need as many responses as were gathered in the first round (ideally more like 1,000-2,000), so we can reduce further, to 12-16x.
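The arithmetic above can be made explicit. In this sketch the 4x duration and the halved response target are from the text, while the 1.5-2x response-rate factor is my reading of "at least a 6-8x reduction"; the helper function itself is mine:

```python
# Back-of-the-envelope check of the sampling-rate reasoning above.

def reduction_factor(duration_x, response_rate_x, target_responses_x):
    """How much the sampling rate can be divided down: the survey runs
    duration_x times longer, responses per impression rise by
    response_rate_x, and we need target_responses_x as many answers."""
    return duration_x * response_rate_x / target_responses_x

# 4x duration, assumed ~1.5-2x response rate, ~half as many responses needed:
low = reduction_factor(4, 1.5, 0.5)   # 12x
high = reduction_factor(4, 2.0, 0.5)  # 16x
print(low, high)

# The rates in the table are consistent with roughly the low end:
print(round(1200 / 98, 1), round(720 / 59, 1))  # en and ru, both ~12x
```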
| Language | | | Task | Start date | June sampling rate | June responses* | New sampling rate |
|---|---|---|---|---|---|---|---|
| en -- English (world) | Yes | Yes | T232525#5507366 | 23-Sept-19 | 1 of 98 | 6181 | 1 of 1200 |
| pl -- Polish | Yes | Yes | T232525#5516283 | 23-Sept-19 | N/A | N/A | 1 of 200 |
| ru -- Russian | Yes | Yes | T232525#5512050 | 23-Sept-19 | 1 of 59 | 4565 | 1 of 720 |
* This is the total number of responses after removing respondents under 18 and responses that could not be linked to EventLogging, so it is lower than other reported totals. It is, however, the number that matters when analyzing the gender of the reading population.
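The filtering described in the footnote might be sketched as below; the record structure and field names (`age`, `event_id`) are placeholders of mine, not the actual survey schema:

```python
# Hypothetical sketch of the response filtering described above: keep only
# adult respondents whose answers can be joined to an EventLogging record.

def usable_responses(responses, eventlogging_ids):
    """Filter to responses that count for the demographic analysis."""
    return [
        r for r in responses
        if r["age"] >= 18 and r["event_id"] in eventlogging_ids
    ]

survey = [
    {"age": 25, "event_id": "a"},
    {"age": 17, "event_id": "b"},    # dropped: under 18
    {"age": 40, "event_id": "zzz"},  # dropped: no EventLogging match
]
print(len(usable_responses(survey, {"a", "b"})))  # 1
```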