User Story
As a newcomer completing Revise Tone suggestions, I want the tool to avoid highlighting tone issues in reference sections, external links, captions, and other non-prose areas so I can focus on improving the main text of the article.
Description
Suggestions should only appear in main article prose. The current extraction pipeline sometimes surfaces suggestions inside tables, reference lists, external links, and other structural elements that are not intended for tone evaluation. These suggestions are generally misleading and can confuse newcomers.
The Growth team can provide examples of current errors across multiple wikis.
We should use practical heuristics that target the most common markup patterns without requiring complete coverage of all variations.
Related previous work: T304150: Allow communities to configure which sections are excluded from link suggestion generation. Can we consider reusing the "List of excluded sections from the Add link task" from Special:CommunityConfiguration/GrowthSuggestedEdits?
Acceptance Criteria:
Exclude content found in commonly structured non-prose sections, including:
- Reference lists generated by <references> or similar templates
- External links sections and lists formatted with standard headings
If possible, we should also exclude suggestions within:
- Tables
- Infoboxes and data tables
- Image captions
Filtering does not need to be exhaustive, but should remove the majority of false positives in these areas.