Page MenuHomePhabricator

[EPIC] Mapping available data sources [up to milestone 2]
Closed, DeclinedPublic

Description

What is this task?
This task is used for planning and organizing only. To comment on the project or discuss it, please use one of the linked tasks instead.

Description of main objective
The main objective is to understand the data sources available to us (see PRD).

Next milestones

Milestone 1: Basic user and content data (snapshots)

  • First task using that data source
  • Understand private vs. non-private data
  • Basic mapping of WMF data lake
    • User data (user attributes and actions)
    • Content data (Item/ Property/ Lexeme/ Wikipages attributes and revisions)

Milestone 2: Basic traffic data

  • First task using that data source
  • Basic mapping of Webrequest-related data sources

Milestone 3: Basic user and content data (live data)

  • First task using that data source
  • Basic mapping live data sources
    • User data (user attributes and actions)
    • Content data (Item/Property/Lexeme/Wikipage attributes and revisions)

Later

  • Supplement user and content data sources
  • Traffic data
  • Event data

Archived milestones: