User Story
As a Data User, I want to be able to count unique editors across different geographic regions regardless of the project they edited.
Scope
Build a data pipeline that modifies the existing insert_editors_daily_data job and aggregates editors across geographies while de-duplicating editors across wiki projects . The output should include a data table that de-duplicates active editors across wiki projects and aggregates them by geography.
Pipeline Description : Here
Success Criteria
- A data pipeline deployed to the Data Engineering instance of Airflow that performs the functions described in the pipeline description
- Outputs a queryable hive table that includes monthly counts of editors aggregate by geography.
{fad75b8e3a1322e824d6c8cd5fcd0116d2064c37}