Background
This research emerges from T396029: [Epic] Guidance for Article and Section creation which is intended to improve the effectiveness of article and section creation activities.
WMF research related to Cx have used the following core metrics for "quality" when assessing articles created using Cx: whether or not an article is deleted, and article-quality-model features.
My intention is to make recommendations for how to extend our definition of quality so that it can be used for better assessing Cx-created articles. Improvements that we will explore may include:
- Longitudinality - how does content progress (or not) over time?
- Takes into account the localization over time. (E.g., are new, local-language sources incorporated into the translation or are the original sources largely retained?)
- Takes into account the article’s ‘freshness’, or how up-to-date the article’s content is. (E.g., does translated content remain fresh as new information is published about the subject of the article or does it remain static and slowly fall out-of-date as the source article continues to evolve?
- Thicker features - what aspects tend to be missing?
- Can assess the presence or absence of important features, like infoboxes
- Can distinguish between media types, like images versus icons.
- Can determine language of sources
- Can measure (in tandem with the article-country model) degree of localization of content, sources
The goal of this improved metric is to address the following questions: What does Cx make easy? Where are there gaps? And how do we capture those gaps through measurement?
Scope
- Wikipedia
- Main namespace (article pages)
- Articles created using Cx
- Articles created from scratch, for baselines
Methods
Methods will be adapted from T371934: [medium] Analyze localization and maintenance of translated content.
Outputs
An exploratory analysis in PAWS with results presented on Meta.
This research can inform article and section creation guidance (T396029) as well as product work related to content deficits in translated articles (T397282).