Goal
To enable the ranking of section topics.
Definition
General section topics relevance is a score that measures to what extent a given section topic helps summarize and understand the content of a given piece of text in a Wikipedia article.
Proposal
The baseline score can be computed through a custom TF-IDF weight, where:
- the term frequency (TF) component is computed across wikis, using section alignment.
  This is already implemented as the initial score, see T311750: Combine section extraction with blue links relevance score
  - UPDATE: the initial score is at the article level, as it doesn't take sections into account
- the inverse document frequency (IDF) component is instead computed within the same wiki
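The proposal above can be illustrated with a minimal sketch. It is not the implementation tracked in T311750: the function names, the input shapes, and the smoothed IDF formula are all assumptions made for illustration. TF is taken here as the share of topic occurrences across sections aligned to the same section in several wikis, and IDF as a smoothed log ratio over the sections of a single wiki.

```python
import math
from collections import Counter


def term_frequency(aligned_sections, topic):
    """Cross-wiki TF (assumed definition): share of all topic occurrences
    in the aligned sections that belong to `topic`.

    `aligned_sections` maps a wiki name to the list of topics found in the
    section of that wiki aligned to the target section (hypothetical shape).
    """
    counts = Counter()
    for topics in aligned_sections.values():
        counts.update(topics)
    total = sum(counts.values())
    return counts[topic] / total if total else 0.0


def inverse_document_frequency(wiki_sections, topic):
    """Same-wiki IDF (assumed smoothed variant): each section of the wiki
    counts as one document.

    `wiki_sections` is a list of topic sets, one per section (hypothetical shape).
    """
    n_sections = len(wiki_sections)
    n_with_topic = sum(1 for topics in wiki_sections if topic in topics)
    return math.log((1 + n_sections) / (1 + n_with_topic)) + 1.0


def relevance(aligned_sections, wiki_sections, topic):
    """Baseline relevance score: cross-wiki TF times same-wiki IDF."""
    return term_frequency(aligned_sections, topic) * inverse_document_frequency(
        wiki_sections, topic
    )


# Made-up example data, for illustration only.
aligned = {
    "enwiki": ["Rome", "Italy"],
    "itwiki": ["Rome"],
    "frwiki": ["Rome", "Paris"],
}
sections = [{"Rome", "Italy"}, {"Italy"}, {"Italy", "Paris"}, {"Berlin"}]

ranking = sorted(
    {"Rome", "Italy", "Paris"},
    key=lambda t: relevance(aligned, sections, t),
    reverse=True,
)
```

In this sketch a topic ranks higher when it recurs in the aligned sections of many wikis and is rare across the other sections of the same wiki, which matches the intent of splitting TF and IDF between the cross-wiki and same-wiki scopes.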