Maniphest T347998

Commons Impact Metrics - Implement prototype
Closed, Resolved · Public · 11 Estimated Story Points
Assigned To

	mforns

Authored By

	mforns
	Oct 3 2023, 1:54 PM

Referenced Files

None

Description

Now that we have defined an implementation plan, we can go ahead and implement the prototype.
This would include:

Sqooping the source data (if necessary)
Finalizing the generated table schemas and creating the tables (see implementation plan)
Computing the (denormalized?) generated datasets (Spark SQL? spark-submit?)
Finalizing tested example queries against the generated data to extract Commons metrics (see implementation plan)

Related Objects

		Status	Subtype	Assigned	Task
		Resolved		mforns	T347998 Commons Impact Metrics - Implement prototype
		Resolved		xcollazo	T350038 Spike: Understand the source data category and categorylinks
		Resolved		xcollazo	T350274 Implement dataset for glam_commons_edits

Event Timeline

mforns created this task. Oct 3 2023, 1:54 PM

Restricted Application added a subscriber: Aklapper. Oct 3 2023, 1:54 PM

VirginiaPoundstone triaged this task as High priority. Oct 6 2023, 5:58 PM

xcollazo subscribed. Oct 12 2023, 1:17 PM

WDoranWMF set the point value for this task to 11. Oct 17 2023, 11:11 AM

WDoranWMF edited projects, added Data Products; removed Data Products (Sprint 02). Oct 17 2023, 12:24 PM

VirginiaPoundstone moved this task from Incoming to To be discussed/To be estimated on the Data Products board. Oct 17 2023, 2:51 PM

VirginiaPoundstone moved this task from To be discussed/To be estimated to Estimated/Discussed on the Data Products board. Oct 23 2023, 2:16 PM

VirginiaPoundstone moved this task from Estimated/Discussed to Data Products (Sprint 03) on the Data Products board. Oct 25 2023, 4:17 PM

VirginiaPoundstone edited projects, added Data Products (Data Products (Sprint 03)); removed Data Products.

xcollazo mentioned this in T350038: Spike: Understand the source data category and categorylinks. Oct 30 2023, 2:17 PM

xcollazo changed the status of subtask T350274: Implement dataset for glam_commons_edits from Open to In Progress. Nov 2 2023, 4:28 PM

mforns claimed this task. Nov 6 2023, 5:23 PM

mforns moved this task from Sprint Backlog to In Process on the Data Products (Data Products (Sprint 03)) board.

WDoranWMF moved this task from In Process to Paused on the Data Products (Data Products (Sprint 03)) board. Nov 16 2023, 12:10 PM

WDoranWMF moved this task from Paused to Wormhole to Sprint 04 on the Data Products (Data Products (Sprint 03)) board.

WDoranWMF edited projects, added Data Products (Data Products Sprint 04); removed Data Products (Data Products (Sprint 03)).

WDoranWMF moved this task from Sprint Backlog to In Process on the Data Products (Data Products Sprint 04) board.

Milimetric moved this task from In Process to Done on the Data Products (Data Products Sprint 04) board. Nov 20 2023, 5:12 PM

Milimetric moved this task from Done to Sign Off on the Data Products (Data Products Sprint 04) board.

WDoranWMF closed subtask T350274: Implement dataset for glam_commons_edits as Resolved. Nov 20 2023, 5:43 PM

WDoranWMF closed subtask T350038: Spike: Understand the source data category and categorylinks as Resolved.

VirginiaPoundstone moved this task from Sign Off to Done on the Data Products (Data Products Sprint 04) board. Nov 22 2023, 5:15 PM

VirginiaPoundstone mentioned this in T351836: [User Story] Commons Impact Metrics Prototype Documentation. Nov 22 2023, 6:07 PM

WDoranWMF closed this task as Resolved. Jan 24 2024, 4:45 PM