Page MenuHomePhabricator

[Epic] Text to speech support on the Android Wikipedia app
Open, MediumPublic

Description

KR 3.1: Release two curated, accessible, and community-driven browsing and learning experiences to representative wikis, with the goal of increasing the logged-out reader retention of experienced users by 5%.

Hypothesis

If we create a feature to read articles out loud (using text-to-speech functionality built into modern mobile devices), we will make our content more accessible to a broader range of users, which would improve retention and increase interaction with our content for these audiences.

Must Haves
  • Navigates to an article in the app.
  • User is provided some affordance to initiate text to speech.
  • User has the ability to control playback either in the app itself or in the media playback control.
  • User must be able to navigate to a different article or flow within the application and be able to persist the listening experience.
  • User must be able to have basic control of the listening experience from outside the application.
  • When the user is on a different article to the one currently playing, they have the ability to switch to the new article.
Nice to Haves
  • Indicate or provide the ability to use spoken or AI version.
  • Be able to select system level voices.
  • Entry points in other features

How will we know we were successful
Validation
  • 1% higher app retention rate for logged-out users that engaged with the feature compared to those who did not engage
  • 25% increase in average time spent on articles for users who engage with the feature compared to those who did not engage.
  • 10% of unique users that engage with the feature use it more than once in a 30 day period.
Curiosities
  • What are the overall average/median session lengths for text-to-speech playback?
  • How many users engage with the system-level playback controls, instead of just the in-app playback controls?
  • Assuming we offer an option to change the "voice" of the text-to-speech speaker, which voice is most commonly selected? Most commonly selected playback speed?

Resources

  • Initial comparative review on Figma
  • Original task (T126889) with logs of user requests for this type of feature, as well as initial design explorations.
  • The WikiSpeech extension provides insights into massaging article content (i.e. removing elements that should be excluded from text-to-speech narration) that we can likely reuse.

Event Timeline

Restricted Application added a subscriber: Aklapper. · View Herald Transcript