| Subject | Repo | Branch | Lines +/- |
|---|---|---|---|
| Initial popularity score calculator | wikimedia/discovery/analytics | master | +623 -0 |
Details
Related Changes in Gerrit:
Event Timeline
I started building this by putting together a Hive query, but after playing with it a bit I don't think it's going to be flexible enough. Oozie can kick off Spark jobs, so I'm working that out now. Spark jobs can be written with the Java, Scala, or Python bindings. I'm not sure which to use, but the docs seem to lean towards Scala, so I will work with that initially.
Actually, since Stas started his part in Python, it makes the most sense to continue and do all our Spark work in Python unless a specific need arises to diverge.
Change 256167 had a related patch set uploaded (by EBernhardson):
Initial popularity score calculator