Page MenuHomePhabricator

Understand the usage of template filters
Closed, DeclinedPublicThu, Jun 6

Description

Please refer to https://www.mediawiki.org/wiki/Product_Analytics#How_to_get_help_with_data_or_analysis for help answering these prompts

What team/program is this request for?
CommTech

What are you requesting?
The full request is here. The question is whether we should needsexplore filtering by "template category" to help users find the right template. We are concerned that each wiki have their own conventions for template categorization, let alone usage, and it'd be challenging to develop a menu of "categories" for finding the right templates.

Correspondingly, we want to learn if there are any indicators that we could leverage to help people find the right template to insert on a given page.

I've written this brief outlining the question: https://docs.google.com/document/d/15mhPFRIYeB34QHQEY8T-J0VjHQ7Y-2Y5Xk5UEjFbsnc/edit?usp=sharing

What is the problem you're trying to solve?
Help us know where to invest product, tech, and design energy.

What decision will you make or action will you take with the deliverable?

  • how varied are template categories across wikis (ie, does EN-Wiki have 20 categories, whereas de-Wiki have 42?)
  • how well organized are templates across wikis
  • how we might find specific data to help wikimedians insert the right templates

Additional details
https://docs.google.com/document/d/15mhPFRIYeB34QHQEY8T-J0VjHQ7Y-2Y5Xk5UEjFbsnc/edit?usp=sharing

Previous related analysis
T357442: [REQUEST] Understand how templates are used

Details

Due Date
Thu, Jun 6, 5:00 AM

Event Timeline

KCVelaga_WMF removed a subscriber: KCVelaga.

Additional context from meeting with @JWheeler-WMF

  • if the categeries are consistent, the team can think of suggesting templates based on categories
  • are there any specific kind of templates more used by newcomers vs. experienced users.
  • templates present on most wikis and transclusion count by wiki (for example, cite web is on 287 language wikis; Wikidata items can be used)
    • helps to identify the most commonly used templates across wiki
KCVelaga_WMF added subscribers: mpopov, KCVelaga_WMF.

Given my current bandwidth, I am not sure if I can get to this before the last week of June (especially as it needs investigation on data availability). I am unassigning myself for now.

cc @mpopov to triage.

mpopov raised the priority of this task from Medium to Needs Triage.Thu, May 30, 7:42 PM

@JWheeler-WMF Is this for FY24–25 PES 1.2.3?

Yes, it is a feature we're actively designing and plan to start eng work in july.

First, I want to acknowledge that I appreciate wanting to incorporate data into the design stage of product development cycle. Unfortunately we do not have capacity to take this request (or any request, really) for data analysis for the remainder of Q4.

I discussed this request with Kate Z and she suggested that looking at some examples from wikis with extensive differences in categorization – such as https://en.wikipedia.org/wiki/Wikipedia:Template_index and https://fr.wikipedia.org/wiki/Aide:Liste_de_mod%C3%A8les – would help you address the question. You could do this yourself with some support from Product Analytics via our consultation hours, rather than request extensive evidence.

Hi @mpopov . Is there a way for me to DIY an answer to this question?

How varied are template categories across wikis (ie, does EN-Wiki have 20 categories, whereas de-Wiki have 42?)

Kind of? Categories are…a giant unstructured, manually-maintained mess and inconsistent across languages. (In general, not just specific to templates.) Categories can also have layers upon layers of sub-categories nested within each other, and there's not an easy way to get them all.

Consider https://en.wikipedia.org/wiki/Category:Wikipedia_template_categories which is supposed to be "a category that contains all categories of templates, listed alphabetically" – it is linked to 58 languages and German isn't one of them. There may be a category like it, but not linked.
33,040 total on enwiki, 2,987 total on ptwiki

Then there's https://en.wikipedia.org/wiki/Category:Wikipedia_templates which "groups templates according to use, type, etc." and that's linked to 244 languages, one of which IS German https://de.wikipedia.org/wiki/Kategorie:Vorlage:
You can expand each sub-category and see how many sub-categories there are and how many templates (pages) there are in each sub-category.

A DIY approach would be to explore these pages across languages, ideally with someone who reads those languages.

Unfortunately there's no ready-to-use self-service dataset for totally valid questions about the content on Wikimedia projects. Even querying would be difficult because category names & IDs are inconsistent across languages. That second "Category:Wikipedia_templates" example if you had a query that retrieved all the sub-categories (assuming you didn't want to also get sub-categories of each sub-categories, etc.), you'd still need to know the name or ID associated with that "parent" category for each language you're interested in.

Thanks @mpopov I think this answer sufficiently answers our question. Please correct me if my assumptions are wrong:

Given the extremely high variance in category usage across wikis (33,040 on enwiki, 2,987 on ptwiki, and a whole list of subcategories and even category structures across wikis, we cannot build a scalable solution by category or subcategory. While we could build a model that's useful for a single wiki, we would not be able to scale this to many.