Currently, Wmfdata has a hive.load_csv function, which is most notably used to upload the canonical datasets into the Data Lake.
However, it has some limitations (T327983, T355847), and in any case Hive is deprecated and should be removed from Wmfdata in the future (T384541).
We can build a better replacement using Spark's saveAsTable function. Rather than focusing on delimited files, we should simply build a function that uploads a Pandas data frame. Users can use Pandas directly to read in local files in any of the many formats it supports.