Wed, Oct 31
Sat, Oct 27
Fri, Oct 26
Just want to make a note that as Android team has started including Echo notifications as app notifications (see https://www.mediawiki.org/wiki/Android_editing_features#Q1_-_July-September_2018), results of this analysis are of interest to that team.
Thu, Oct 25
Mon, Oct 22
(Updated the funnel analysis diagram because I had a brain toot that made me write "users" in place of "uploads")
I've thought about this and I think the current event-per-interaction approach should be scrapped in favor of a more forward-thinking solution. Analytics Engineering has some guidelines in place for creating EventLogging schemas in a way that the events are easily ingested into Druid, which makes them easy to visualize in Turnilo/Superset, which is usable by non-analysts which means @Ramsey-WMF et al. wouldn't be blocked by, say, the unavailability of a data analyst ;)
Currently Shiny Server is available (via its developer, RStudio) as a package only for Ubunty Trusty. This task is about packaging it up ourselves to make it available on VMs running Debian…I guess Stretch at this point. (I'll update the task title & description.)
Thu, Oct 18
Wed, Oct 17
I've put together the results of the much, much clustering that I did into https://github.com/wikimedia-research/wiki-segmentation/tree/master/clustering-initial/deliverable
Oct 11 2018
That's fair :)
Oct 10 2018
Oct 3 2018
All good now :)
Oct 2 2018
Sep 28 2018
Alright, I wiped all the request counts starting with August 10th (after making a backup) so Golden/Reportupdater is going to start a re-count using the webrequests in the 'text' partition. WDQS stats re-count should be done by Monday. Thanks for your patience, folks!
Sep 27 2018
For example usage of Hive with Reportupdater, see: https://github.com/wikimedia/wikimedia-discovery-golden/tree/master/modules/metrics/wdqs
Sep 25 2018
Logging out and back in worked.
Sep 24 2018
@Ottomata @Gehel: I tried editing stat1005:/srv/published-datasets/discovery/metrics/wdqs/basic_usage.tsv but couldn't because the file belongs to group analytics-search, not analytics-search-users (which I belong to) and that sort of makes sense because of how we have it configured right now in statistics::discovery:
For archive happiness, this was done in T164603 :)
Oooh, exciting!!! :D
Sep 22 2018
Sep 20 2018
Updating to better reflect its actual priority in the grander scheme of things.
Sep 17 2018
Sep 14 2018
Sep 12 2018
Sep 11 2018
Sep 7 2018
Sorry, I haven't checked my Phabricator emails in a while! Thanks so much @Ottomata! The upgrade has fixed the chart that wasn't working and has revealed that there's an issue with the data:
Whoops, realized I was missing a digit in the version.
Sep 6 2018
Sep 5 2018
Dmitry and I have gone over the schemas I proposed and he gave them a thumbs up for instrumentation:
I updated the existing ToC interactions schema for the redesigned ToC: https://meta.wikimedia.org/wiki/Schema:MobileWikiAppToCInteraction
Sep 4 2018
Aug 28 2018
We'll discuss with Josh and Charlotte (once she's back from vacation)
Aug 20 2018
Aug 9 2018
Aug 8 2018
Aug 7 2018
Motivation (beyond that it's just nice to have the latest and greatest): I'm trying to add a filter to a slice (which usually works) but when the filter is added, the slice goes from working totally fine to unorderable types: str() < int():
Jul 19 2018
@Charlotte: is this still relevant or can we close it?
Proof of concept dashboard up using the new test_gsc_all datasource in Druid: https://superset.wikimedia.org/superset/dashboard/wikipediagoogledemo/
Jul 17 2018
We did this.
Jul 12 2018
Jul 10 2018