Page MenuHomePhabricator

OnThisDay for French is missing some events.
Closed, ResolvedPublic3 Estimated Story Points

Description

The onthisday scraping logic for frwiki seems to miss certain types of events.
Compare the output of onthisday:

https://fr.wikipedia.org/api/rest_v1/feed/onthisday/events/12/26

...with the wiki page itself for that day:

https://fr.wikipedia.org/wiki/26_d%C3%A9cembre

The wiki page has noticeably more events. It's likely because some of the events are in doubly-indented bullet points. Also, in addition to generic Events ("Événements"), the page has other headings like "Art"/"Science" etc. that are not getting parsed.

Event Timeline

Dbrant triaged this task as Medium priority.
Dbrant set the point value for this task to 3.

Change #1131035 had a related patch set uploaded (by Dbrant; author: Dbrant):

[mediawiki/services/wikifeeds@master] Improve scraping of OnThisDay events for frwiki.

https://gerrit.wikimedia.org/r/1131035

Change #1131035 merged by Cooltey:

[mediawiki/services/wikifeeds@master] Improve scraping of OnThisDay events for fr,ar,zh,svwiki.

https://gerrit.wikimedia.org/r/1131035