
Milimetric (Dan Andreescu)
Staff Engineer (Data Engineering)

Projects (12)

User Details

User Since
Oct 8 2014, 5:48 PM (348 w, 5 d)
Availability
Available
IRC Nick
Milimetric
LDAP User
Milimetric
MediaWiki User
Milimetric (WMF) [ Global Accounts ]

Recent Activity

Thu, Jun 10

Milimetric moved T284623: Top edited pages list on enwiktionary contains nonexistent pages with titles made up of question marks from Next Up to In Progress on the Analytics-Kanban board.
Thu, Jun 10, 4:47 PM · Analytics-Kanban, Analytics, Analytics-Wikistats
Milimetric added a project to T284623: Top edited pages list on enwiktionary contains nonexistent pages with titles made up of question marks: Analytics-Kanban.
Thu, Jun 10, 4:47 PM · Analytics-Kanban, Analytics, Analytics-Wikistats
Milimetric claimed T230915: Import of MediaWiki tables into the Data Lakes mangles usernames.

seems related to T230915, so I'm going to look at both.

Thu, Jun 10, 4:45 PM · Analytics-Kanban, Analytics-Data-Quality, Analytics
Milimetric updated the task description for T284610: General Usage statistics for AQS.
Thu, Jun 10, 2:32 AM · Analytics

Wed, Jun 9

Milimetric closed T274322: Clean up issues with jobs after Hadoop Upgrade, a subtask of T273711: Upgrade the Analytics Hadoop cluster to Apache Bigtop, as Resolved.
Wed, Jun 9, 5:23 PM · Patch-For-Review, Analytics
Milimetric closed T274322: Clean up issues with jobs after Hadoop Upgrade as Resolved.
Wed, Jun 9, 5:23 PM · Patch-For-Review, Analytics-Kanban, Analytics
Milimetric added a comment to T251376: Support right-to-left languages in Wikistats.

:) not to toot my own horn, but for planning purposes, I do happen to be very fast at stuff like that. I could probably move off Semantic and to something like Vuetify in a week or two at most. Or a WMF design system, if it has a good chunk of the components we need.

Wed, Jun 9, 4:13 PM · I18n, RTL, Analytics

Tue, Jun 8

Milimetric updated subscribers of T284610: General Usage statistics for AQS.
Tue, Jun 8, 9:06 PM · Analytics
Milimetric added a comment to T284610: General Usage statistics for AQS.

If you run this query on presto:

Tue, Jun 8, 9:04 PM · Analytics
Milimetric created T284610: General Usage statistics for AQS.
Tue, Jun 8, 9:03 PM · Analytics

Mon, Jun 7

Milimetric added a comment to T215001: Revisions missing from mediawiki_revision_create.

Just a quick note to say that I ran the query for May 17th, and still found mismatches on both sides. I will find a way to do a better analysis that we can easily re-run every time we make improvements.

Mon, Jun 7, 5:44 PM · MW-1.37-notes (1.37.0-wmf.5; 2021-05-11), Event-Platform, Growth-Team-Filtering, Analytics-Kanban, Growth-Team, Product-Analytics, Analytics

Thu, Jun 3

Milimetric added a comment to T257071: "Page views by edition of Wikipedia" for each country.

Should I create another feature request for that? Or is this idea too far-fetched?

Thu, Jun 3, 3:47 PM · Analytics, Analytics-Wikistats
Milimetric added a comment to T251376: Support right-to-left languages in Wikistats.

Thanks for the work, this is great. I think the static nature of the site isn't too much of a problem, we've solved similar problems with bundles and, worst case, some Apache config magic.

Thu, Jun 3, 3:42 PM · I18n, RTL, Analytics
Milimetric awarded T159584: Secure Hue/Superset/Turnilo/Yarn/Piwik with CAS (and possibly 2FA) a Love token.
Thu, Jun 3, 3:37 PM · Patch-For-Review, CAS-SSO, User-Elukey, Analytics
Milimetric moved T221890: Add wikidata ids to data lake tables from Smart Tools for Better Data to Deprioritized on the Analytics board.
Thu, Jun 3, 3:37 PM · Epic, Analytics, Product-Analytics
Milimetric added a comment to T280678: Crunch and delete many old dumps logs.

Would we need to ask for a security review before exporting aggregated data out of Hadoop?

Thu, Jun 3, 3:17 PM · Analytics-Kanban, Analytics

Tue, Jun 1

Milimetric closed T222180: Point Extension:Sentry to EventGate (beta) as Declined.

As far as I understand, our experiments with Extension:Sentry were replaced by the Client errors data pipeline.

Tue, Jun 1, 3:16 PM · observability, Product-Infrastructure-Team-Backlog, Wikimedia-Logstash
Milimetric closed T222180: Point Extension:Sentry to EventGate (beta), a subtask of T217142: [Proposal] Use the Kafka-Logstash logging infrastructure to log client-side errors, as Declined.
Tue, Jun 1, 3:16 PM · observability, User-fgiunchedi, Better Use Of Data, MW-1.34-notes (1.34.0-wmf.15; 2019-07-23), Patch-For-Review, User-herron, Product-Infrastructure-Team-Backlog, Wikimedia-Logstash
Milimetric added a comment to T283980: Phacility (Maintainer of Phabricator) is winding down. Upstream support ending..

Much love to @epriestley for starting this project, helping us move to it, and incorporating our feedback and contributions. In my opinion, moving to Phab was the best decision we've made in the almost 9 years I've been here. Thank you, @epriestley, for your part in that.

Tue, Jun 1, 2:53 PM · Release-Engineering-Team (Seen), User-Matthewrbowker, Phabricator

Thu, May 20

Milimetric claimed T275233: wmfdata-python's Hive query output includes logspam.

I think Andrew has some ideas, we'll get to the bottom of this one way or another. Then, once Neil's issue is resolved I'd like to reframe this or add a subtask to go over logging on the cluster in general. Lots of background noise like SLF4J warnings clutter the already cluttered logs, and make maintenance harder.

Thu, May 20, 5:30 PM · Analytics-Kanban, Analytics, Product-Analytics, wmfdata-python
Milimetric moved T275233: wmfdata-python's Hive query output includes logspam from Incoming to Operational Excellence on the Analytics board.
Thu, May 20, 5:27 PM · Analytics-Kanban, Analytics, Product-Analytics, wmfdata-python

Wed, May 19

Milimetric added a comment to T282632: Superset Presto LIMIT >10000 error.

@SNowick_WMF, Reportupdater is fine; it's what's available right now. We don't want to slow you down waiting for Airflow.

Wed, May 19, 2:32 PM · Analytics-Kanban, Analytics

Tue, May 18

Milimetric moved T279380: Add Traffic's notion of "from public cloud" to Analytics webrequest data from In Code Review to Ready to Deploy on the Analytics-Kanban board.
Tue, May 18, 1:50 PM · Analytics-Kanban, SRE, Analytics, Traffic

Mon, May 17

Milimetric added a comment to T257071: "Page views by edition of Wikipedia" for each country.

Thanks very much for following through with that. Seeing your prototype makes it very clear what you need and why. I think ideally we would create a better pipeline from community-requested statistics to on-wiki infographics. This is something that's been hard for WMF to prioritize, but something I care about, and will continue to think about.

Mon, May 17, 9:59 PM · Analytics, Analytics-Wikistats
Milimetric added a project to T275233: wmfdata-python's Hive query output includes logspam: Analytics.

Ok, weird, I can't reproduce this... maybe it's some weird access problem? We'll triage and look into it

Mon, May 17, 9:47 PM · Analytics-Kanban, Analytics, Product-Analytics, wmfdata-python
Milimetric added a comment to T280311: Temp files left around in wikistats_1/ ?.

It takes up about 15G so honestly it's not that big a deal to keep around, even if there are only a few downloaders. I can't tell about our mirrors of course, but even from our own web server there are a few downloaders that aren't bots. So, meh. Keep?

Mon, May 17, 9:45 PM · Analytics-Radar, Dumps-Generation
Milimetric added a comment to T280678: Crunch and delete many old dumps logs.

There's no need for a fancy tool; this would be a few lines of Spark to read the data and save to, probably, a Hive table with an explicit schema. It should take a day to set up and some time after that to run some analysis. We just don't have the capacity; there's a lot of stuff going on that's higher priority right now. But it's relatively easy for anyone to play with. The only concern here for me that's a bit time-sensitive is that there are a bunch of IPs in the logs.

Mon, May 17, 9:42 PM · Analytics-Kanban, Analytics
Milimetric added a comment to T274880: Deployment access request for some analytics repos.

Thanks, the wmde-qwerty group would happily self-merge reportupdater-queries while just CC'ing WMF Analytics.

Mon, May 17, 9:36 PM · Analytics-Kanban, Analytics, WMDE-TechWish
Milimetric added a comment to T281605: Stop Refining mediawiki_job events in Hive.

The description says we're keeping the raw JSON import, just not the rest of the pipeline. I'm fine with deleting any of it, since unused data is just confusing. I'm just making sure everyone expects the same thing.

Mon, May 17, 9:26 PM · Analytics-Kanban, Analytics
Milimetric reassigned T282632: Superset Presto LIMIT >10000 error from Milimetric to JAllemandou.
Mon, May 17, 9:21 PM · Analytics-Kanban, Analytics
Milimetric lowered the priority of T282887: Avoid accepting Kafka messages with whacky timestamps from High to Medium.
Mon, May 17, 9:20 PM · Platform Team Workboards (Clinic Duty Team), Event-Platform, Product-Data-Infrastructure, Discovery, SRE, Analytics
Milimetric moved T182804: Remove request for font.googleapis.com from analytics.wikimedia.org from In Code Review to Done on the Analytics-Kanban board.
Mon, May 17, 3:19 PM · Analytics-Kanban, Analytics
Milimetric added a comment to T282842: Early adoption signup for WMF GitLab.

https://stats.wikimedia.org/ runs https://gerrit.wikimedia.org/r/admin/repos/analytics/wikistats2 and we'd be happy to migrate to GitLab. We merge translation commits from translatewiki.net, and have Jenkins build set up, so not sure if that's tricky for the migration, but we're happy to do it together.

Mon, May 17, 2:48 PM · Release-Engineering-Team (Doing), User-brennen, GitLab (Initialization)

May 15 2021

Milimetric added a comment to T282632: Superset Presto LIMIT >10000 error.

I'm just as lost as you are so far... it's expected behavior and not a bug, but I can't figure out what configuration triggers it and why "https://github.com/apache/superset/blob/9773aba522e957ed9423045ca153219638a85d2f/superset/translations/en/LC_MESSAGES/messages.json#L1017"

May 15 2021, 1:52 AM · Analytics-Kanban, Analytics

May 13 2021

Milimetric moved T282710: Missing data in virtualpageview_hourly table since April 15, 2021 from Incoming to Operational Excellence on the Analytics board.
May 13 2021, 4:59 PM · Analytics-Kanban, Analytics
Milimetric updated subscribers of T282710: Missing data in virtualpageview_hourly table since April 15, 2021.
May 13 2021, 4:59 PM · Analytics-Kanban, Analytics
Milimetric assigned T282710: Missing data in virtualpageview_hourly table since April 15, 2021 to mforns.

Culprit is uppercase mismatch, so druid jobs weren't finding the data: https://github.com/wikimedia/analytics-refinery/blob/master/oozie/virtualpageview/druid/daily/coordinator.properties#L48

May 13 2021, 4:58 PM · Analytics-Kanban, Analytics
Milimetric assigned T282589: Please grant CRS access to Superset/Turnilo (deadline EOD Monday 17) to elukey.
May 13 2021, 4:57 PM · Analytics, CommRel-Specialists-Support (Apr-Jun-2021), SRE, LDAP-Access-Requests
Milimetric triaged T282657: Adding data from centralauth to the lake and the mediawiki_history dataset as Medium priority.

This would have to happen after data governance, so any help before that is appreciated (I can review patches to the pipeline anytime)

May 13 2021, 4:54 PM · Research, Analytics
Milimetric moved T282632: Superset Presto LIMIT >10000 error from Next Up to In Progress on the Analytics-Kanban board.
May 13 2021, 4:52 PM · Analytics-Kanban, Analytics
Milimetric claimed T282632: Superset Presto LIMIT >10000 error.
May 13 2021, 4:52 PM · Analytics-Kanban, Analytics
Milimetric updated subscribers of T282618: Superset query timeouts for charts using Druid table.

cc-ing @JAllemandou, who said he wanted to look at it. We'll triage with him Monday when he's back from vacation.

May 13 2021, 4:49 PM · Analytics
Milimetric triaged T282584: Clean up EventLogging Schema: pages on meta as High priority.
May 13 2021, 4:47 PM · Analytics-Kanban, Analytics-EventLogging, Documentation, Wikimedia-Developer-Portal, Analytics
Milimetric triaged T282562: WMDEBanner* Event Platform Migration as High priority.
May 13 2021, 4:43 PM · Patch-For-Review, Analytics-Kanban, Analytics, Event-Platform
Milimetric edited projects for T282550: EventLogging revision popup gets hidden behind content in Vector, added: Analytics-Radar, Product-Data-Infrastructure, Metrics-Platform; removed Analytics.

@jlinehan and/or @Mholloway want to take a look? If not, anyone can feel free to add me to a code review and ping me to make sure I get to it.

May 13 2021, 4:42 PM · Metrics-Platform, Product-Data-Infrastructure, Analytics-Radar, Vector (Vector (Tracking)), Analytics-EventLogging
Milimetric assigned T282491: [Newpyter] Conda stacked environment overwrites TAR environment variable to Ottomata.
May 13 2021, 4:40 PM · Analytics-Kanban, Analytics
Milimetric assigned T282262: [Newpyter] Can't install 'haven' package with conda R but can with system R to Ottomata.
May 13 2021, 4:39 PM · Analytics-Kanban, Analytics
Milimetric edited projects for T273700: eventgate_validation_error for NewcomerTask, HomepageTask, and HomepageVisit schemas, added: Analytics-Radar; removed Analytics.
May 13 2021, 4:36 PM · Analytics-Radar, MW-1.36-notes (1.36.0-wmf.30; 2021-02-09), Growth-Team (Current Sprint), GrowthExperiments

May 12 2021

Milimetric merged task T282684: Include Global blocks in mediawiki_history into T282657: Adding data from centralauth to the lake and the mediawiki_history dataset.
May 12 2021, 1:57 PM · Analytics
Milimetric merged T282684: Include Global blocks in mediawiki_history into T282657: Adding data from centralauth to the lake and the mediawiki_history dataset.
May 12 2021, 1:57 PM · Research, Analytics
Milimetric created T282684: Include Global blocks in mediawiki_history.
May 12 2021, 1:53 PM · Analytics

May 10 2021

Milimetric triaged T263489: AQS 2.0 as Medium priority.
May 10 2021, 7:13 PM · Code-Health-Objective, Platform Engineering Roadmap, Platform Team Initiatives (API Gateway), Analytics, Epic
Milimetric triaged T262205: Need for new event-type - `user_create` and `user_rename` as High priority.

This is still high priority for us as we look to make some of our datasets incremental. We're not focusing on it right now.

May 10 2021, 4:32 PM · Platform Team Workboards (S&F Workboard), Platform Engineering Roadmap Decision Making, Analytics, Event-Platform
Milimetric triaged T252617: Use types in Analytics Puppet classes/profiles/etc.. as Medium priority.

Might be a good task for Ben (starting soon).

May 10 2021, 4:32 PM · Patch-For-Review, Analytics
Milimetric triaged T248964: Implement inequality metrics for WikiStats as Low priority.
May 10 2021, 4:29 PM · Analytics, Analytics-Wikistats
Milimetric edited projects for T275786: Remove all debian python-* and other user requested packages installed for analytics clients, use conda instead, added: Analytics-Clusters; removed Analytics.
May 10 2021, 4:28 PM · Patch-For-Review, Analytics-Kanban, Analytics-Clusters
Milimetric triaged T276791: Configure the Hadoop cluster to use the GPUs available on some workers as High priority.
May 10 2021, 4:27 PM · Analytics, Machine-Learning-Team
Milimetric moved T277609: Generate dump of scored-revisions from 2018-2020 for English Wikipedia from Next Up to Done on the Analytics-Kanban board.
May 10 2021, 4:27 PM · Analytics-Kanban, Data-Services, artificial-intelligence, editquality-modeling, ORES, Analytics, Machine-Learning-Team
Milimetric closed T277609: Generate dump of scored-revisions from 2018-2020 for English Wikipedia as Resolved.
May 10 2021, 4:27 PM · Analytics-Kanban, Data-Services, artificial-intelligence, editquality-modeling, ORES, Analytics, Machine-Learning-Team
Milimetric triaged T274986: Purge deprecated reportupdater outputs as High priority.
May 10 2021, 4:26 PM · Analytics
Milimetric triaged T280107: Generate dump of scored-revisions from 2018-2020 for Wikis except English Wikipedia as Medium priority.
May 10 2021, 4:25 PM · Analytics-Kanban, artificial-intelligence, editquality-modeling, ORES, Analytics, Machine-Learning-Team
Milimetric placed T208230: Update pageview_hourly to include timestamp for better druid indexation up for grabs.

To be done in concert with move to Apache Iceberg and overhaul of how we handle the time dimension more generally

May 10 2021, 4:23 PM · Analytics
Milimetric assigned T182804: Remove request for font.googleapis.com from analytics.wikimedia.org to razzi.
May 10 2021, 4:22 PM · Analytics-Kanban, Analytics
Milimetric raised the priority of T182804: Remove request for font.googleapis.com from analytics.wikimedia.org from Medium to High.

making high for privacy reasons, anyone should feel free to grab it

May 10 2021, 4:21 PM · Analytics-Kanban, Analytics
Milimetric triaged T182804: Remove request for font.googleapis.com from analytics.wikimedia.org as Medium priority.
May 10 2021, 4:20 PM · Analytics-Kanban, Analytics
Milimetric triaged T264791: Rework how mediawiki-history differentiates fake page-create from real ones as Medium priority.
May 10 2021, 4:18 PM · Analytics
Milimetric triaged T277012: Create a debian package for Apache Airflow as High priority.
May 10 2021, 4:17 PM · Packaging, Analytics-Kanban, Analytics
Milimetric triaged T247510: Refine + EventLoggingSchemaLoader should use api.svc instead of meta.wikimedia.org directly. as High priority.
May 10 2021, 4:17 PM · Analytics-Kanban, Analytics
Milimetric triaged T282033: Airflow collaborations as High priority.
May 10 2021, 4:16 PM · Platform Team Workboards (Image Suggestion API), Analytics
Milimetric triaged T282035: Catalog, Categorize, and Templetize existing scheduled workflows as High priority.
May 10 2021, 4:16 PM · Platform Engineering, Analytics
Milimetric triaged T276472: Odd behavior in unique device counts as Medium priority.

ping Product-Analytics, any interest in taking a look at this?

May 10 2021, 4:15 PM · Analytics
Milimetric triaged T267355: Traffic anomaly alarms as Medium priority.
May 10 2021, 4:14 PM · Analytics-Kanban, Analytics
Milimetric triaged T263030: Make data quality stats alert only if anomalous metrics change as High priority.

We'll look at possible ways to improve this as we move data quality jobs to Airflow.

May 10 2021, 4:13 PM · Analytics
Milimetric triaged T280844: Too many views to Skathi (moon) on enwiki as Medium priority.
May 10 2021, 4:09 PM · Analytics, Product-Analytics, Pageviews-Anomaly
Milimetric triaged T281300: Drop old WMDEBanner events from Hive as High priority.
May 10 2021, 4:05 PM · Analytics-Kanban, WMDE-New-Editors-Banner-Campaigns, WMDE-Analytics-Engineering, Analytics
Milimetric triaged T276955: Develop comprehensive process, guidelines, and roles for Event Platform stream sanitization as Medium priority.

This feels to me like it will be part of the data governance effort, so definitely something I care about

May 10 2021, 4:05 PM · Product-Analytics, Event-Platform, Better Use Of Data, Analytics
Milimetric triaged T272058: Address jackson version security vulnerabilities in refinery-source as High priority.
May 10 2021, 4:00 PM · Analytics
Milimetric claimed T274880: Deployment access request for some analytics repos.

@awight so it seems like you're good with the secondary event schemas repo, and you (WMDE technical wishes team) just need access to reportupdater-queries? I'm happy to add this, what gerrit group/list of folks should I use?

May 10 2021, 3:56 PM · Analytics-Kanban, Analytics, WMDE-TechWish
Milimetric triaged T278451: NullPointerException at beginning of spark job as Medium priority.
May 10 2021, 3:52 PM · Analytics
Milimetric added a comment to T278451: NullPointerException at beginning of spark job.

ping @fkaelin

May 10 2021, 3:52 PM · Analytics
Milimetric triaged T280262: Decommission analytics-tool1001 and all the CDH leftovers as High priority.
May 10 2021, 3:52 PM · Analytics-Kanban, Patch-For-Review, Analytics
Milimetric triaged T278701: Store AQS schema and grants in git as High priority.
May 10 2021, 3:51 PM · Analytics-Kanban, Cassandra, Analytics
Milimetric lowered the priority of T280905: Analytics coordinator failover improvements from High to Medium.
May 10 2021, 3:50 PM · Analytics
Milimetric triaged T280905: Analytics coordinator failover improvements as High priority.
May 10 2021, 3:50 PM · Analytics
Milimetric closed T282414: Wikistats New Feature - Enhanced user stats as Declined.

Sorry, this is not really possible for privacy reasons. Even if you were logged in, we throw away most of the data that would be needed to compile these stats.

May 10 2021, 3:37 PM · Analytics, Analytics-Wikistats
Milimetric lowered the priority of T282195: ApacheBeam prototype for DP noise addition with pageview privacy units on top of Spark from Medium to Low.
May 10 2021, 3:35 PM · Analytics, Research, Privacy Engineering, Privacy, Data-release
Milimetric triaged T282195: ApacheBeam prototype for DP noise addition with pageview privacy units on top of Spark as Medium priority.

ping us if you need any support, @Nuria

May 10 2021, 3:34 PM · Analytics, Research, Privacy Engineering, Privacy, Data-release
Milimetric triaged T282185: Add password reset to kerberos manage_principals.py as High priority.
May 10 2021, 3:31 PM · Analytics-Kanban, Analytics
Milimetric moved T282185: Add password reset to kerberos manage_principals.py from Incoming to Operational Excellence on the Analytics board.
May 10 2021, 3:30 PM · Analytics-Kanban, Analytics
Milimetric assigned T282178: Article missing from the Clickstream dataset to JAllemandou.
May 10 2021, 3:29 PM · Analytics-Kanban, Analytics
Milimetric edited projects for T282454: Switch kafka/Hadoop away from java::security, added: Analytics-Clusters; removed Analytics.
May 10 2021, 3:27 PM · Analytics-Kanban, Analytics-Clusters, SRE
Milimetric moved T274322: Clean up issues with jobs after Hadoop Upgrade from In Progress to Done on the Analytics-Kanban board.
May 10 2021, 3:12 PM · Patch-For-Review, Analytics-Kanban, Analytics
Milimetric triaged T274322: Clean up issues with jobs after Hadoop Upgrade as Medium priority.
May 10 2021, 3:12 PM · Patch-For-Review, Analytics-Kanban, Analytics
Milimetric moved T270431: Switch off skipTrash for some data purging from In Code Review to In Progress on the Analytics-Kanban board.
May 10 2021, 3:10 PM · Analytics-Kanban, Analytics
Milimetric moved T270431: Switch off skipTrash for some data purging from In Progress to In Code Review on the Analytics-Kanban board.
May 10 2021, 3:10 PM · Analytics-Kanban, Analytics
Milimetric updated subscribers of T256463: QuickSurveys should show an error when response is blocked.

I can find the handling code in the eventgate server implementation, but it seems there's no way to send a "guaranteed" event from the eventlogging client yet? Would it make sense to expose this in the client API, or does that belong in a new / different client implementation? In other words, should "Event Logging" always be sent hastily, and we introduce a new abstraction for sending to the same endpoint but synchronously?

May 10 2021, 1:43 PM · Readers-Web-Backlog (Tracking), Analytics-EventLogging, Analytics, WMDE-TechWish, QuickSurveys

May 7 2021

Milimetric added a comment to T256463: QuickSurveys should show an error when response is blocked.

So I'm not sure if you're talking about other problems but I hear two:

May 7 2021, 2:08 PM · Readers-Web-Backlog (Tracking), Analytics-EventLogging, Analytics, WMDE-TechWish, QuickSurveys

May 6 2021

Milimetric added a comment to T280311: Temp files left around in wikistats_1/ ?.

So it looks like the https://dumps.wikimedia.org/other/wikistats_1.0/ folder is empty, so that can be deleted.

May 6 2021, 5:02 PM · Analytics-Radar, Dumps-Generation
Milimetric added a comment to T280678: Crunch and delete many old dumps logs.

The format looks like Common Log Format with two additional fields, "full URI requested" and "user agent"

May 6 2021, 3:10 PM · Analytics-Kanban, Analytics
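
For readers who want to experiment with the log format described in the T280678 comment above, here is a minimal Python sketch of a line parser. It is not taken from the task: it assumes the two extra fields ("full URI requested" and "user agent") are appended as double-quoted strings after the standard Common Log Format fields, and the names `LOG_LINE`, `parse_line`, and the sample line are hypothetical.

```python
import re
from typing import Optional

# Common Log Format: host ident authuser [date] "request" status bytes
# ASSUMPTION: the two additional fields are appended as quoted strings,
# first the full URI requested, then the user agent.
LOG_LINE = re.compile(
    r'^(?P<host>\S+) (?P<ident>\S+) (?P<user>\S+) '
    r'\[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" '
    r'(?P<status>\d{3}) (?P<size>\d+|-) '
    r'"(?P<full_uri>[^"]*)" '
    r'"(?P<user_agent>[^"]*)"$'
)


def parse_line(line: str) -> Optional[dict]:
    """Return the fields of one log line, or None if it does not match."""
    match = LOG_LINE.match(line.strip())
    if match is None:
        return None
    record = match.groupdict()
    record["status"] = int(record["status"])
    # CLF writes "-" when no bytes were sent; treat that as zero.
    record["size"] = 0 if record["size"] == "-" else int(record["size"])
    return record


# Hypothetical sample line, using a documentation-only IP address.
sample = (
    '192.0.2.1 - - [06/May/2021:15:10:00 +0000] '
    '"GET /other/example.gz HTTP/1.1" 200 1234 '
    '"https://dumps.example.org/other/example.gz" "ExampleAgent/1.0"'
)

if __name__ == "__main__":
    print(parse_line(sample))
```

Since the same task notes that the logs contain IP addresses, any real analysis would probably want to drop or aggregate the `host` field before saving results.
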
Milimetric added a comment to T278982: npm install gives Verification failed while extracting mediawiki-storage@https://github.com/wikimedia/analytics-mediawiki-storage/archive/master.tar.gz.

Thanks, good point, I added a note at the beginning. It's not quite deprecated yet, we may decide it's a good idea and refresh it, wouldn't be terribly hard. But for now, the note will help nice folks like you not waste time.

May 6 2021, 2:47 PM · Analytics-Kanban, Analytics, Analytics-Dashiki