
Refactor Wikilink data collection from EventStream
Closed, Resolved (Public)

Description

Background
An investigation in task T250084 into why the EventStream connection kept disconnecting led the team to conclude that the data collection script needs to be refactored. The current script uses the Python SSEClient to read from the stream and saves every changed page link relevant to the Wikipedia Library project to the Wikilink database for further analysis and consumption.
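For orientation, the read-and-filter flow described above could look roughly like the sketch below. It is not the project's actual script. The stream URL, the event field names (`added_links`, `removed_links`, `link`, `external`, `page_title`, `meta.domain`), the `TRACKED_PATTERNS` list, and all function names are assumptions made for illustration. Writing to the Wikilink database is left out, and matched records are simply returned.

```python
import json
from typing import Iterable, Iterator

# Hypothetical URL patterns of interest; the real project keeps its own list.
TRACKED_PATTERNS = ("example-publisher.org",)

# Assumed stream endpoint, used only by stream_links() below.
STREAM_URL = "https://stream.wikimedia.org/v2/stream/page-links-change"


def matching_links(event: dict, patterns: Iterable[str]) -> Iterator[dict]:
    """Yield one record per added or removed external link matching a pattern."""
    patterns = tuple(patterns)
    for action, key in (("added", "added_links"), ("removed", "removed_links")):
        for entry in event.get(key) or []:
            link = entry.get("link", "")
            if entry.get("external") and any(p in link for p in patterns):
                yield {
                    "action": action,
                    "link": link,
                    "domain": (event.get("meta") or {}).get("domain"),
                    "page_title": event.get("page_title"),
                }


def handle_message(data: str, patterns: Iterable[str] = TRACKED_PATTERNS) -> list:
    """Parse one SSE data payload; return matching link records (possibly empty)."""
    if not data:
        return []
    try:
        event = json.loads(data)
    except json.JSONDecodeError:
        # Skip malformed or partial payloads instead of crashing the reader.
        return []
    if not isinstance(event, dict):
        return []
    return list(matching_links(event, patterns))


def stream_links(url: str = STREAM_URL) -> Iterator[dict]:
    """Read the stream with the third-party sseclient package and yield matches."""
    from sseclient import SSEClient  # imported lazily; only needed for live reads

    for message in SSEClient(url):
        if message.event == "message":
            yield from handle_message(message.data)
```

Keeping the parsing in `handle_message` separate from the network loop in `stream_links` means the filtering logic can be exercised without a live connection, which may help when the stream itself is the unreliable part.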

The refactoring process has been divided into three tasks
