We should expand the wmfdata package to include utilities for PySpark, particularly tools for creating Spark contexts/sessions without relying on the opaque functionality of the preconfigured PySpark kernel.
Description
Description
Status | Subtype | Assigned | Task | ||
---|---|---|---|---|---|
Resolved | nshahquinn-wmf | T236291 Better tooling and training for analysis on SWAP | |||
Resolved | nshahquinn-wmf | T236315 Add PySpark utilities to wmfdata package |