Page MenuHomePhabricator

Exclude templates from breaking news algo
Open, MediumPublic

Description

The frequency of false positives delivered by our breaking news algorithm is high. Based on a quick analysis of the logs being delivered weekly, we deliver ~200 results/week. Only 10% of those are accurate, in my opinion. We're taking the backwards approach to cutting down.

ToDo

  • Exclude all results from simplewiki
  • Create an exclude list of templates. In other words -- if the article has the following templates, it cannot be breaking news
    • if any category/template in the article includes a death year we are not in. See here for an example. Please include all other language versions.

Acceptance criteria

  • create ignore/un-include list of templates in code to be malleable - can easily add and remove templates as we iterate
  • Update internal documentation to reflect the change
Test Strategy
  • Francisco will keep monitoring and requesting changes as the logs inform results

Related Objects

StatusSubtypeAssignedTask
OpenNone
OpenNone

Event Timeline

FNavas-foundation renamed this task from [Stub] changes to breaking news algorithm to Un-include these templates from breaking news.Dec 21 2023, 3:54 PM
FNavas-foundation updated the task description. (Show Details)
FNavas-foundation renamed this task from Un-include these templates from breaking news to Un-include templates from breaking news algo.Dec 21 2023, 3:58 PM
FNavas-foundation updated the task description. (Show Details)
FNavas-foundation renamed this task from Un-include templates from breaking news algo to Exclude templates from breaking news algo.Dec 21 2023, 4:05 PM
FNavas-foundation triaged this task as Medium priority.