We decided to extract features hourly and label data as "automated" using the last 24 hours of data. This data is calculated for computation only so it should probably be on wmf_raw rather than wmf.
Features to be computed:
sessionId session_start session_end session_length_secs number_of_pageviews pageview_ratio_per_min nocookies user_agent_length