At the moment it's only registering a small number of file types, so the file classification is not great.
Description
Details
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Resolved | None | T210313 Statistics for views of individual Wikimedia images | |||
Resolved | None | T207208 Add mediacounts data to AQS and, from there, Restbase | |||
Resolved | • fdans | T228149 Load media requests data into cassandra | |||
Resolved | • fdans | T225911 Add new mediatypes to media classification refinery code |
Event Timeline
Change 517641 had a related patch set uploaded (by Fdans; owner: Fdans):
[analytics/refinery/source@master] Add media formats to file url parser regex
@fdans let's please file tasks for mediarequest api as child tasks of https://phabricator.wikimedia.org/T207208 so we keep track of all the work we are doing on this regard.
Change 522390 had a related patch set uploaded (by Fdans; owner: Fdans):
[analytics/refinery@master] [wip] Add file extension and media classification to mediacounts job
Change 517641 merged by Fdans:
[analytics/refinery/source@master] Add file extension and media type classification to media files UDF
Change 522390 abandoned by Fdans:
[wip]Add file extension and media classification to mediacounts job
Reason:
Since we'll be using a new dataset for the mediarequests API, I'm going to abandon this and open a new change instead