Superset has a CSV upload capability, which could be very useful for building dashboards from static external datasets. We have an analytics MySQL 'staging' db which is used for custom user tables. We should allow superset users to upload smallish CSVs to the MySQL staging db, resulting in new tables created there.
Description
Details
Project | Branch | Lines +/- | Subject | |
---|---|---|---|---|
operations/puppet | production | +9 -0 | Set Superset UPLOAD_FOLDER to /tmp/superset_uploads/ |
Event Timeline
Change 573393 had a related patch set uploaded (by Ottomata; owner: Ottomata):
[operations/puppet@production] Set Superset UPLOAD_FOLDER to /tmp/superset_uploads/
We just need https://gerrit.wikimedia.org/r/c/operations/puppet/+/573393. That + enabling this for the mysql_staging database in superset (which I already did) works!
Is there a reason not to do this?
Past me filed a similar code change in the past: https://gerrit.wikimedia.org/r/#/c/operations/puppet/+/479408/
Moritz suggested to modify Superset's systemd unit to add PrivateTmp=true. I am +1 as long as the staging db is the one on dbstore (not db1108/eventlogging etc..).
Change 573393 merged by Ottomata:
[operations/puppet@production] Set Superset UPLOAD_FOLDER to /tmp/superset_uploads/
Tested that this works well, pretty easy to create cc @cchen so she knows this is a possible option, tables have to be created on the mysql_staging database cc @kzimmerman cause uploading cvs can aide with use cases like the ones we have seen for dashboards that dana's group was making from multiple data sources.
pinging also @EYener in FR so she knows this is an easy way to prototype dashboards from ad hoc data sources, just a csv file is needed
Hi @Nuria and all, we're ready to try a 'mock' data set as well. Can someone point me toward instructions on accessing and utilizing the staging environment so that I can get started with the upload? Thank you!
@EYener CVS uploads are enabled on http://superset.wikimedia.org so no special access needed