Nuria (Nuria)
User

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Monday

  • Clear sailing ahead.

User Details

User Since
Nov 26 2014, 3:04 AM (164 w, 3 d)
Availability
Available
LDAP User
Nuria
MediaWiki User
Unknown

Recent Activity

Today

Nuria added a comment to T185350: Vet reliability of the response_size field for data analysis purposes.

Both 18550 and 15722 are divisors of 253193 (13 and 16) .

Sat, Jan 20, 6:55 AM · Operations, Traffic, Analytics-Data-Quality
Nuria added a comment to T179530: Wikistats Bug: Menu to select projects doesn't work (sometimes?).

I do not think we can both both offer all options to scroll plus offer localized search and have it be performant.

Sat, Jan 20, 6:31 AM · Patch-For-Review, Analytics-Kanban, Analytics-Wikistats
Nuria moved T185334: Wikiselector Perf issues on Chrome from Next Up to In Code Review on the Analytics-Kanban board.
Sat, Jan 20, 5:45 AM · Patch-For-Review, Analytics-Kanban

Yesterday

Nuria added a comment to T185334: Wikiselector Perf issues on Chrome .

Adding a lookup for subtitle, that is " {{r{subtitle]}}" brings rendering time up to 7 secs.

Fri, Jan 19, 11:54 PM · Patch-For-Review, Analytics-Kanban
Nuria added a comment to T184627: Estimate how long a new Dashiki Layout for Qualtrics Survey data would take.

Ok, thank you. Those are quite specific for this survey rather than being a generic take on how to display survey data. On our end we really to think about survey data rather than data for this specific survey thus if we take this project is pretty clear we would need to contract a designer for our work to have value beyond this specific use case.

Fri, Jan 19, 11:52 PM · Analytics
Nuria added a comment to T185334: Wikiselector Perf issues on Chrome .

Fri, Jan 19, 11:49 PM · Patch-For-Review, Analytics-Kanban
Nuria added a comment to T185334: Wikiselector Perf issues on Chrome .

After looking at this for a while the only obvious issue that i can find has to do with the amount of nodes we are adding to the DOM at once. We have available 500 options each of which has two spans, so at least we will be rendering 1000 nodes (with styling). Seems like a lot but actually I think a factor is also property access to vue binded objects.

Fri, Jan 19, 11:48 PM · Patch-For-Review, Analytics-Kanban
Nuria set the point value for T185334: Wikiselector Perf issues on Chrome to 5.
Fri, Jan 19, 11:43 PM · Patch-For-Review, Analytics-Kanban
Nuria added a comment to T185334: Wikiselector Perf issues on Chrome .
  1. click on menu
Fri, Jan 19, 11:43 PM · Patch-For-Review, Analytics-Kanban
Nuria added a project to T185350: Vet reliability of the response_size field for data analysis purposes: Traffic.
Fri, Jan 19, 10:38 PM · Operations, Traffic, Analytics-Data-Quality
Nuria added a comment to T185350: Vet reliability of the response_size field for data analysis purposes.

Can we have an example that is not a media type? it is likely that for media downloaded in chunks the field doesn't reflect file size, it should however reflect file size in files that are not downloaded in chuncks (everything but media pretty much)

Fri, Jan 19, 10:28 PM · Operations, Traffic, Analytics-Data-Quality
Nuria created T185334: Wikiselector Perf issues on Chrome .
Fri, Jan 19, 5:13 PM · Patch-For-Review, Analytics-Kanban
Nuria set the point value for T184911: Use country ISO codes instead of country names in top by country pageviews to 5.
Fri, Jan 19, 5:01 PM · Patch-For-Review, Pageviews-API, Analytics-Kanban

Thu, Jan 18

Nuria moved T131782: Put data needed for edits metrics through Event Bus into HDFS from Q3 (january 2018) to Q1 (July 2018) on the Analytics board.
Thu, Jan 18, 6:37 PM · Analytics
Nuria moved T155014: Import 2001 wikipedia data from Q3 (january 2018) to Q4 (April 2018) on the Analytics board.
Thu, Jan 18, 6:37 PM · Analytics
Nuria moved T159046: Track page views by page ID rather than title (handles moved pages) from Q3 (january 2018) to Q1 (July 2018) on the Analytics board.
Thu, Jan 18, 6:31 PM · Pageviews-API, Analytics
Nuria added a comment to T176426: Implement purging scheme for eventlogging data on top of eventlogging refine.

One idea is to refine incoming data so we have, say,

Thu, Jan 18, 6:15 PM · Analytics, Analytics-EventLogging
Nuria edited projects for T185237: Lookout for duplicates in EL refine , added: Analytics; removed Analytics-Kanban.
Thu, Jan 18, 6:11 PM · Analytics, Analytics-EventLogging
Nuria triaged T185237: Lookout for duplicates in EL refine as Normal priority.
Thu, Jan 18, 6:11 PM · Analytics, Analytics-EventLogging
Nuria renamed T185229: Investigate why data was missing from mediawiki events around January 3rd from Investigate why data was missing to Investigate why data was missing from mediawiki events around January 3rd.
Thu, Jan 18, 5:30 PM · Analytics-Kanban
Nuria closed T182688: Make superset more scalable as Resolved.
Thu, Jan 18, 5:00 PM · Analytics-Kanban, Patch-For-Review
Nuria closed T182944: Read the python code and design the Hadoop version as Resolved.
Thu, Jan 18, 4:59 PM · Analytics-Kanban
Nuria closed T182944: Read the python code and design the Hadoop version, a subtask of T176996: Private geo wiki data in new analytics stack , as Resolved.
Thu, Jan 18, 4:59 PM · Analytics
Nuria closed T168414: Purge all old data from EventLogging master as Resolved.
Thu, Jan 18, 4:59 PM · Analytics-Kanban, DBA
Nuria closed T168414: Purge all old data from EventLogging master, a subtask of T108850: Set up auto-purging after 90 days {tick}, as Resolved.
Thu, Jan 18, 4:59 PM · User-Elukey, Analytics, Patch-For-Review, DBA
Nuria moved T184768: Bug behavior of QTree[Long] for quantileBounds from Next Up to In Progress on the Analytics-Kanban board.
Thu, Jan 18, 4:47 PM · Analytics-Kanban, MobileApp, Wikipedia-Android-App-Backlog, Discovery-Analysis
Nuria claimed T184768: Bug behavior of QTree[Long] for quantileBounds.
Thu, Jan 18, 4:46 PM · Analytics-Kanban, MobileApp, Wikipedia-Android-App-Backlog, Discovery-Analysis

Tue, Jan 16

Nuria moved T177965: Beta Release: Resiliency, Rollback and Deployment of Data from Q1 (July 2018) to To Task on the Analytics board.
Tue, Jan 16, 8:44 PM · Analytics, Analytics-Wikistats
Nuria edited projects for T177965: Beta Release: Resiliency, Rollback and Deployment of Data, added: Analytics; removed Analytics-Kanban.
Tue, Jan 16, 8:44 PM · Analytics, Analytics-Wikistats
Nuria added a comment to T177965: Beta Release: Resiliency, Rollback and Deployment of Data.

Let's move this task to tasking i think there is quite a bit to talk about.

Tue, Jan 16, 8:44 PM · Analytics, Analytics-Wikistats
Nuria added a comment to T177965: Beta Release: Resiliency, Rollback and Deployment of Data.

Given we probably want to use Druid as a query engine to check numbers between old and new, cache warming would actually be a side-effect of checking data consistency.

mmm... wait , the data cannot be surfaced outside when we do not know yet whether it is any good. Thus are we talking about requests that are internal to aqs itself? They will warm up the druid cache but in any case should they touch the web cache. Correct?

Tue, Jan 16, 8:43 PM · Analytics, Analytics-Wikistats
Nuria moved T180412: Replace any debouncing with Vue.nextTick from Done to Ready to Deploy on the Analytics-Kanban board.
Tue, Jan 16, 4:05 PM · Patch-For-Review, Analytics-Wikistats, Analytics-Kanban
Nuria moved T183192: Please add download option 'as csv file' to Wikistats 2 from Done to Ready to Deploy on the Analytics-Kanban board.
Tue, Jan 16, 4:05 PM · Patch-For-Review, Analytics-Kanban, Analytics-Wikistats
Nuria moved T184138: Wrong y-axis labels on wikistats graph from Done to Ready to Deploy on the Analytics-Kanban board.
Tue, Jan 16, 4:05 PM · Patch-For-Review, Analytics-Kanban, Analytics-Wikistats
Nuria added a comment to T119772: Create dashboard showing MediaWiki tarball download statistics.

My advice , rather than using hadoop for this would be to instrument with piwik, releases.wikimedia.org. Combing terabytes of data for this few requests doesn't seem the most expedient approach. Our piwik instance is piwik.wikimedia.org. We use it for similarly-low volume metrics.

Tue, Jan 16, 2:57 AM · Analytics, MediaWiki-Releasing

Mon, Jan 15

Nuria moved T119772: Create dashboard showing MediaWiki tarball download statistics from Bots to Incoming on the Analytics board.
Mon, Jan 15, 4:32 PM · Analytics, MediaWiki-Releasing
Nuria moved T119772: Create dashboard showing MediaWiki tarball download statistics from Radar to Bots on the Analytics board.
Mon, Jan 15, 4:32 PM · Analytics, MediaWiki-Releasing
Nuria added a comment to T119772: Create dashboard showing MediaWiki tarball download statistics.

If the server that fronts mediawiki downloads is backed up by varnish (is it?) this data exists in hadoop most likely. Can @Legoktm answer this question?

Mon, Jan 15, 4:01 PM · Analytics, MediaWiki-Releasing

Fri, Jan 12

Nuria added a comment to T177965: Beta Release: Resiliency, Rollback and Deployment of Data.

+1 to @mforns comment

Fri, Jan 12, 5:58 PM · Analytics, Analytics-Wikistats
Nuria moved T182718: SEO-friendly HTML titles for Wikistats 2.0 from In Progress to Paused on the Analytics-Kanban board.
Fri, Jan 12, 5:08 PM · Analytics-Kanban, Analytics-Wikistats
Nuria set the point value for T184759: Sqoop cu_changes table for geowiki to 5.
Fri, Jan 12, 5:07 PM · Analytics-Kanban
faidon awarded T167907: Incorporate data from the GeoIP2 ISP database to webrequest a Love token.
Fri, Jan 12, 3:59 PM · Patch-For-Review, Analytics-Kanban

Thu, Jan 11

Nuria triaged T184759: Sqoop cu_changes table for geowiki as High priority.
Thu, Jan 11, 6:48 PM · Analytics-Kanban
Nuria added a comment to T176996: Private geo wiki data in new analytics stack .

We can get started in scooping the cu_changes_table

Thu, Jan 11, 6:45 PM · Analytics
Nuria added a comment to T176996: Private geo wiki data in new analytics stack .

Data will be updated monthly (?) (maybe data needs to be updated more frequently?)

Thu, Jan 11, 6:43 PM · Analytics
Nuria moved T159584: Secure hue and other private data access sites with 2FA from Incoming to Q3 (january 2018) on the Analytics board.
Thu, Jan 11, 6:07 PM · User-Elukey, Analytics
Nuria added a project to T159584: Secure hue and other private data access sites with 2FA: User-Elukey.
Thu, Jan 11, 6:07 PM · User-Elukey, Analytics
Nuria edited projects for T163933: Investigate oozie suspended workflows, added: Analytics-Kanban; removed Analytics.
Thu, Jan 11, 6:04 PM · Analytics-Kanban
Nuria added a comment to T163933: Investigate oozie suspended workflows.

Can @JAllemandou please document this on our oncall docs?

Thu, Jan 11, 6:04 PM · Analytics-Kanban
Nuria assigned T163933: Investigate oozie suspended workflows to JAllemandou.
Thu, Jan 11, 6:04 PM · Analytics-Kanban
Nuria edited projects for T184713: EventStreams doesnt find any messages anymore, added: Analytics-Kanban; removed Analytics.
Thu, Jan 11, 6:01 PM · Analytics-Kanban, EventBus, Pywikibot-core
Nuria moved T184698: EventBus rejecting events because of malformed characters in the comment from Incoming to Radar on the Analytics board.
Thu, Jan 11, 6:01 PM · EventBus, Analytics, Services (next)
Nuria moved T184627: Estimate how long a new Dashiki Layout for Qualtrics Survey data would take from Incoming to Q1 (July 2018) on the Analytics board.
Thu, Jan 11, 6:00 PM · Analytics
Nuria added a comment to T184626: Transform and Import Qualtrics Survey data.

Maybe put data into mysql on labs once transformed?

Thu, Jan 11, 5:59 PM · Analytics
Nuria moved T184626: Transform and Import Qualtrics Survey data from Incoming to Q4 (April 2018) on the Analytics board.
Thu, Jan 11, 5:59 PM · Analytics
Nuria added a comment to T184626: Transform and Import Qualtrics Survey data.

ideas: put data into mysql labs and have a superset labs instance ?

Thu, Jan 11, 5:59 PM · Analytics
Nuria moved T184551: EQIAD: (1) hardware request for eventlog1001 replacement - eventlog1002. from Incoming to Radar on the Analytics board.
Thu, Jan 11, 5:55 PM · Analytics, hardware-requests, Operations
Nuria edited projects for T184541: Update AQS pageview-top definition, added: Analytics-Kanban; removed Analytics.
Thu, Jan 11, 5:55 PM · Services (done), Patch-For-Review, Analytics-Kanban, RESTBase-API
Nuria removed projects from T184501: What to do with deployment-sca03?: Analytics, Services, EventBus.
Thu, Jan 11, 5:54 PM · Release-Engineering-Team, Recommendation-API, Beta-Cluster-Infrastructure, Scoring-platform-team (Current)
Nuria edited projects for T184482: analytics VPS project puppet errors, added: Analytics-Kanban; removed Analytics.
Thu, Jan 11, 5:54 PM · Analytics-Kanban, User-Elukey, Puppet
Nuria added a comment to T184482: analytics VPS project puppet errors.

We will be killing that instance

Thu, Jan 11, 5:53 PM · Analytics-Kanban, User-Elukey, Puppet
Nuria added a project to T184482: analytics VPS project puppet errors: User-Elukey.
Thu, Jan 11, 5:53 PM · Analytics-Kanban, User-Elukey, Puppet
Nuria moved T184479: Discuss Wikistats integration for ORES from Incoming to Radar on the Analytics board.
Thu, Jan 11, 5:52 PM · Analytics, Scoring-platform-team, ORES, Analytics-Wikistats
Nuria moved T170826: Enable base::firewall on stat boxes after restricting Spark REPL ports. from Incoming to Deprioritized on the Analytics board.
Thu, Jan 11, 5:48 PM · Analytics-Cluster, Analytics
Nuria removed a project from T184139: When displaying a graph include metric total not only average : Analytics.
Thu, Jan 11, 5:41 PM · Analytics-Kanban, Analytics-Wikistats
Nuria moved T148461: Bot Identification: Inconsistent data in #all-sites-by-os-and-browser for IE7 from Q3 (january 2018) to Bots on the Analytics board.
Thu, Jan 11, 5:40 PM · Analytics
Nuria moved T121912: Better redirect handling for pageview API from Q3 (january 2018) to Q1 (July 2018) on the Analytics board.
Thu, Jan 11, 5:39 PM · Analytics
Nuria moved T88775: Add mediacounts to pageview API from Q3 (january 2018) to Q1 (July 2018) on the Analytics board.
Thu, Jan 11, 5:39 PM · Multimedia, Analytics
Nuria moved T159584: Secure hue and other private data access sites with 2FA from Q1 (July 2018) to Incoming on the Analytics board.
Thu, Jan 11, 5:38 PM · User-Elukey, Analytics
Nuria moved T163725: Enable nested on-wiki config pages in mediawiki-storage from Q1 (July 2018) to Radar on the Analytics board.
Thu, Jan 11, 5:38 PM · Analytics-Dashiki, Analytics
Nuria added a project to T163725: Enable nested on-wiki config pages in mediawiki-storage: Analytics-Dashiki.
Thu, Jan 11, 5:38 PM · Analytics-Dashiki, Analytics
Nuria moved T163933: Investigate oozie suspended workflows from Q1 (July 2018) to Incoming on the Analytics board.
Thu, Jan 11, 5:38 PM · Analytics-Kanban
Nuria moved T164500: Add jobs for druid compaction for pageviews data set from Q1 (July 2018) to Q4 (April 2018) on the Analytics board.
Thu, Jan 11, 5:37 PM · Analytics
Nuria moved T164259: Add VSL error counters to Varnishkafka stats from Q1 (July 2018) to Q4 (April 2018) on the Analytics board.
Thu, Jan 11, 5:37 PM · User-Elukey, Traffic, Analytics, Operations
Nuria moved T154381: Yearly endpoint for the /pageviews/top API from Q1 (July 2018) to Deprioritized on the Analytics board.
Thu, Jan 11, 5:37 PM · Pageviews-API, Analytics
Nuria moved T115634: --- Items above are triaged ----------------------- from Q1 (July 2018) to Q4 (April 2018) on the Analytics board.
Thu, Jan 11, 5:36 PM · Analytics, Trash
Nuria moved T112284: Create new table for 'referer' aggregated data from To Task to Q4 (April 2018) on the Analytics board.
Thu, Jan 11, 5:35 PM · Analytics
Nuria moved T150483: Set up a fake Pageview API endpoint for the beta cluster from Q1 (July 2018) to Deprioritized on the Analytics board.
Thu, Jan 11, 5:35 PM · Beta-Cluster-Infrastructure, Analytics
Nuria moved T118839: Productionize Pageview_sanitization hive code with Oozie job and refinery inclusion {hawk} from Q1 (July 2018) to Q4 (April 2018) on the Analytics board.
Thu, Jan 11, 5:34 PM · Analytics
Nuria moved T123442: Pageview API: Better filtering of bot traffic on top enpoints from Q1 (July 2018) to Bots on the Analytics board.
Thu, Jan 11, 5:33 PM · Analytics, Pageviews-API
Nuria edited projects for T136732: Puppetize job that saves old versions of geoIP database, added: Analytics-Kanban; removed Analytics.
Thu, Jan 11, 5:33 PM · Analytics-Kanban
Nuria moved T136732: Puppetize job that saves old versions of geoIP database from Q1 (July 2018) to Q3 (january 2018) on the Analytics board.
Thu, Jan 11, 5:32 PM · Analytics-Kanban
Nuria moved T118842: Backfill pageview_hourly sanitization - 1 month - {hawk} - DUPLICATE THIS TASK FOR EACH MONTH TO BACKFILL from Q1 (July 2018) to Q4 (April 2018) on the Analytics board.
Thu, Jan 11, 5:32 PM · Analytics
Nuria moved T98831: Honor DNT header for access logs & varnish logs from Q1 (July 2018) to Q4 (April 2018) on the Analytics board.
Thu, Jan 11, 5:32 PM · WMF-Legal, Analytics, Operations, Privacy
Nuria moved T156965: Remove user_agent_map from pageview_hourly long term from Q1 (July 2018) to Q4 (April 2018) on the Analytics board.
Thu, Jan 11, 5:32 PM · Analytics
Nuria added a comment to T184748: Add ISO code to AQS data per country .

This involves:

Thu, Jan 11, 5:30 PM · Analytics-Kanban
Nuria moved T181520: Add "Pageviews by Country" AQS endpoint from Done to Ready to Deploy on the Analytics-Kanban board.
Thu, Jan 11, 5:29 PM · Services (done), RESTBase-API, Analytics-Kanban, Analytics-Wikistats, Analytics-Cluster
Nuria added a subtask for T181520: Add "Pageviews by Country" AQS endpoint: T184748: Add ISO code to AQS data per country .
Thu, Jan 11, 5:28 PM · Services (done), RESTBase-API, Analytics-Kanban, Analytics-Wikistats, Analytics-Cluster
Nuria added a parent task for T184748: Add ISO code to AQS data per country : T181520: Add "Pageviews by Country" AQS endpoint.
Thu, Jan 11, 5:28 PM · Analytics-Kanban
Nuria created T184748: Add ISO code to AQS data per country .
Thu, Jan 11, 5:28 PM · Analytics-Kanban

Wed, Jan 10

Nuria added a comment to T184627: Estimate how long a new Dashiki Layout for Qualtrics Survey data would take.

Ping @egalvezwmf can you add mocks and also the audiences for this tool?

Wed, Jan 10, 7:57 PM · Analytics
Nuria added a comment to T176996: Private geo wiki data in new analytics stack .

Moving notes here:

Wed, Jan 10, 6:42 PM · Analytics
Nuria added a comment to T176996: Private geo wiki data in new analytics stack .

See notes:
https://etherpad.wikimedia.org/p/analytics-geowiki

Wed, Jan 10, 6:22 PM · Analytics
Nuria added a comment to T181520: Add "Pageviews by Country" AQS endpoint.

Let's add a link to docs to this ticket

Wed, Jan 10, 5:14 PM · Services (done), RESTBase-API, Analytics-Kanban, Analytics-Wikistats, Analytics-Cluster
Nuria added a comment to T138505: Split opera mini in proxy or turbo mode .

We use UA parser in both python and java

Wed, Jan 10, 5:04 PM · Easy, New-Readers, Analytics
Nuria added a comment to T179976: Create scala-spark job to ingest simple data sets from Hive-EventLogging to Druid to Pivot.

Is this the changeset: https://gerrit.wikimedia.org/r/#/c/386882/?

Wed, Jan 10, 12:51 AM · Analytics-Kanban
Nuria added a comment to T182718: SEO-friendly HTML titles for Wikistats 2.0.

The titles of pages shoudl be pushed to piwik so we can see traffic paths through the site

Wed, Jan 10, 12:28 AM · Analytics-Kanban, Analytics-Wikistats

Tue, Jan 9

Nuria updated subscribers of T172581: Set up mechanism for archiving Google Search Console data.

There are couple initiatives of us meeting with google, pinging @DFoy in this ticket in case it is of interest.

Tue, Jan 9, 11:40 PM · Discovery-Analysis (Current work), Discovery, SEO, Reading-analysis
Nuria added a comment to T172581: Set up mechanism for archiving Google Search Console data.

Again, call me crazy but i bet this data could be made public by google 100% such you do not need authentication to query it , we woudl be able to do it and so will be any interested party. seems that it would require a few conversations but little actual hands-on work

Tue, Jan 9, 11:39 PM · Discovery-Analysis (Current work), Discovery, SEO, Reading-analysis
Nuria added a comment to T172581: Set up mechanism for archiving Google Search Console data.

Call me crazy but i bet if we ask google for this data they will be happy to give it to us w/o having to setup web scraping/downloads

Tue, Jan 9, 11:32 PM · Discovery-Analysis (Current work), Discovery, SEO, Reading-analysis