
Nettrom (PERSONAL)
User

User Details

User Since
Jun 19 2015, 6:00 PM (415 w, 4 d)
Availability
Available
LDAP User
Unknown
MediaWiki User
Nettrom [ Global Accounts ]

Recent Activity

Feb 3 2021

Nettrom updated subscribers of T263663: [S] Instrument file name and wikitext snippet copy in MediaSearch quickview.
Feb 3 2021, 3:42 AM · MW-1.36-notes (1.36.0-wmf.32; 2021-02-23), Patch-For-Review, SDAW-MediaSearch (MediaSearch-ReleaseCandidate), Structured-Data-Backlog (Current Work)

Aug 25 2020

Nettrom committed R1821:dd76437fffc3: Update package names to store shard mapping correctly (authored by Nettrom).
Update package names to store shard mapping correctly
Aug 25 2020, 1:46 PM

May 30 2020

Nettrom closed T116830: Address cause of irrelevant outputs for SuggestBot as Declined.

I'd like to close this as "declined" for now, as we haven't really seen any interest in this since the last comment. If there's interest, and I'm able to get more focused time to work on this part of SuggestBot, then we can reopen this. It could also be a potential candidate for a Hackathon project, but I don't know much about the criteria for those.

May 30 2020, 6:22 PM · WikiProject-X
Nettrom placed T167362: Draft specification of article importance API up for grabs.

I don't have the bandwidth to work on this, so I'm removing myself as the assignee.

May 30 2020, 6:20 PM · Machine-Learning-Team, artificial-intelligence
Nettrom placed T155541: [Epic] Article importance prediction model up for grabs.

I've updated the project page on meta so it marks the research project as completed, and links to the GitHub repository that contains the code I wrote during the project.

May 30 2020, 6:18 PM · Research, Machine-Learning-Team, artificial-intelligence

Aug 26 2019

Nettrom added a comment to T224850: Offer alternate views of the comment and actor tables which only check for supression in a single table in the Wiki Replicas.

@Bstorm : looks like https://gerrit.wikimedia.org/r/513943 did not include a definition of comment_archive, was that intentional? I would expect there to be a comment_archive table to allow for joining with the archive table when querying archived edit comments (similar to how there's an actor_archive table), but that table doesn't exist on the replicated databases on Toolforge.

Aug 26 2019, 12:24 AM · cloud-services-team, Documentation, Data-Services

Jan 29 2019

Nettrom committed rODQ0102204553ce: Add NLTK Wordnet requirement to readme Testing the draft quality model resulted… (authored by Nettrom).
Add NLTK Wordnet requirement to readme Testing the draft quality model resulted…
Jan 29 2019, 6:48 PM

Dec 15 2018

Nettrom closed T204693: cloudvps: suggestbot project trusty deprecation as Resolved.

The suggestbot-prod instance has been shut down and deleted (ref the suggestbot log). SuggestBot is now running from suggestbot-01. Move completed, task resolved.

Dec 15 2018, 5:13 PM · Cloud-VPS (Ubuntu Trusty Deprecation)

Oct 27 2018

Nettrom added a comment to T204693: cloudvps: suggestbot project trusty deprecation.

The migration process has been started by creating a new instance suggestbot-01. Had a bit of a start-stop-start situation as the process of moving instances to the new eqiad region is coinciding with this work, so the new instance had a short life in the old region, but is now present in eqiad1-r.

Oct 27 2018, 6:09 PM · Cloud-VPS (Ubuntu Trusty Deprecation)

Oct 10 2018

Nettrom updated subscribers of T206700: Create a method for 'Avg. daily views to pages that have uploaded files'.
Oct 10 2018, 11:29 PM · Community-Tech-Sprint, Grant-Metrics, Community-Tech, Event Metrics

Aug 21 2018

Nettrom reassigned T201658: Growth: power calculations for experiments from Nettrom to nettrom_WMF.
Aug 21 2018, 9:27 PM · Growth-Team (Current Sprint), Product-Analytics
Nettrom closed T198361: Investigation: benchmark Google and Turnitin for copyvio as Resolved.

The report is now posted as a sub-page of the AfC Process Improvement page on enwiki. Marking this as resolved and reassigning it so I can track it there in case it gets reopened.

Aug 21 2018, 8:46 PM · Product-Analytics, Growth-Team (Current Sprint), English-Wikipedia-New-Pages-Patrol
Nettrom closed T198361: Investigation: benchmark Google and Turnitin for copyvio, a subtask of T193809: [Timebox 12hr] Investigation: applying copyvio for page review prioritization, as Resolved.
Aug 21 2018, 8:46 PM · Growth-Team (Current Sprint), English-Wikipedia-New-Pages-Patrol
Nettrom closed T201960: Phabricator account with existing email address as Resolved.

Fixed it by getting an email alias set up, so I'm marking this as resolved.

Aug 21 2018, 8:29 PM · Phabricator

Aug 20 2018

Nettrom committed rOWC0244e6cad128: Update classification examples to revscoring 2.0 (authored by Nettrom).
Update classification examples to revscoring 2.0
Aug 20 2018, 6:30 PM
Nettrom committed rOWC4e3d026d7da0: Remove unnecessary mwparserfromhell import (authored by Nettrom).
Remove unnecessary mwparserfromhell import
Aug 20 2018, 6:30 PM
Nettrom committed rOWC2477b633ce7a: Merge from upstream master (authored by Nettrom).
Merge from upstream master
Aug 20 2018, 6:30 PM
Nettrom committed rOWC301bd5cc70fb: Add example that should break, but needs testing (authored by Nettrom).
Add example that should break, but needs testing
Aug 20 2018, 6:30 PM
Nettrom committed rOWCa253239ed15e: Let mwparserfromhell strip HTML comments (authored by Nettrom).
Let mwparserfromhell strip HTML comments
Aug 20 2018, 6:29 PM
Nettrom committed rOWCbaa857771c20: Merge remote-tracking branch 'upstream/ruwiki' into ruwiki (authored by Nettrom).
Merge remote-tracking branch 'upstream/ruwiki' into ruwiki
Aug 20 2018, 6:29 PM
Nettrom committed rOWCfb748f74751c: Bugfix, wrong variable name (authored by Nettrom).
Bugfix, wrong variable name
Aug 20 2018, 6:29 PM
Nettrom committed rOWC1fffaf1ee42d: Add HTML comment filtering to Russian, add testcase (authored by Nettrom).
Add HTML comment filtering to Russian, add testcase
Aug 20 2018, 6:29 PM
Nettrom committed rOWC57e42cbc262b: Add Russian assessment extractor, fix typo in French (authored by Nettrom).
Add Russian assessment extractor, fix typo in French
Aug 20 2018, 6:29 PM
Nettrom committed rOWC3ea167a6c3b9: fix missing mwp import (authored by Nettrom).
fix missing mwp import
Aug 20 2018, 6:29 PM
Nettrom committed rOWC567220619cb4: Add HTML comment detection and removal in WikiProject template paramters (authored by Nettrom).
Add HTML comment detection and removal in WikiProject template paramters
Aug 20 2018, 6:29 PM
Nettrom committed rOWCb803456946f3: Update README.rst (authored by Nettrom).
Update README.rst
Aug 20 2018, 6:28 PM
Nettrom committed rODQ3caf95bd5919: Add NLTK Wordnet requirement to readme (authored by Nettrom).
Add NLTK Wordnet requirement to readme
Aug 20 2018, 6:20 PM
Nettrom closed T202303: Superset access request as Resolved.

Can confirm I have access and everything seems to be working. Thanks for taking care of this, and so quickly as well, awesome work!

Aug 20 2018, 5:05 PM · Analytics-Kanban, Analytics
Nettrom created T202303: Superset access request.
Aug 20 2018, 3:50 PM · Analytics-Kanban, Analytics
Nettrom added a comment to T201960: Phabricator account with existing email address.

Is this something that can be resolved on the Phabricator end, or should I look for a workaround? Either way is fine with me, as long as I can get a second account set up.

Aug 20 2018, 3:46 PM · Phabricator

Aug 16 2018

Nettrom moved T198361: Investigation: benchmark Google and Turnitin for copyvio from Triage to Doing on the Product-Analytics board.
Aug 16 2018, 8:18 PM · Product-Analytics, Growth-Team (Current Sprint), English-Wikipedia-New-Pages-Patrol
Nettrom moved T192515: Measurement for AfC improvement (April 2018) from Triage to Next Up on the Product-Analytics board.
Aug 16 2018, 8:17 PM · Growth-Team-Filtering, Analytics-Radar, Product-Analytics, Growth-Team
Nettrom closed T201420: Page creation data no longer updates as Resolved.
Aug 16 2018, 2:28 PM · Analytics-Kanban, Analytics

Aug 14 2018

Nettrom created T201960: Phabricator account with existing email address.
Aug 14 2018, 8:21 PM · Phabricator

Aug 13 2018

Nettrom added a comment to T201420: Page creation data no longer updates.

@Milimetric Thanks for taking care of the SQL queries! I don't see a need for backfilling the data at the moment, there's not a benefit warranting that cost. As mentioned I can help the NPP folks out with getting their data together. In other words, as far as I can tell, this ticket can be closed now.

Aug 13 2018, 3:54 PM · Analytics-Kanban, Analytics

Aug 10 2018

Nettrom added a comment to T192574: Produce graph of AfC submissions for all of ACTRIAL.

@Niharika Yes, I'd like to keep this open and try to wrap it up in the near future, if that's okay with you?

Aug 10 2018, 11:36 PM · Product-Analytics, Community-Tech
Nettrom added a comment to T201420: Page creation data no longer updates.

I don't think backfilling all the data is very important. The only ones that appear to be affected are the NPP reviewers, and I should be able to run some queries on the Data Lake to either fill the missing data, or get reasonable estimates they can use.

Aug 10 2018, 9:45 PM · Analytics-Kanban, Analytics
Nettrom updated the task description for T194336: Get onboarded to the Product Analytics team.
Aug 10 2018, 7:40 PM · Contributors-Analysis, Product-Analytics

Aug 8 2018

Nettrom updated the task description for T194336: Get onboarded to the Product Analytics team.
Aug 8 2018, 4:04 PM · Contributors-Analysis, Product-Analytics

Aug 7 2018

Nettrom updated the task description for T194336: Get onboarded to the Product Analytics team.
Aug 7 2018, 5:41 PM · Contributors-Analysis, Product-Analytics
Nettrom created T201420: Page creation data no longer updates.
Aug 7 2018, 3:02 PM · Analytics-Kanban, Analytics

Aug 2 2018

Nettrom updated the task description for T194336: Get onboarded to the Product Analytics team.
Aug 2 2018, 7:41 PM · Contributors-Analysis, Product-Analytics

Aug 1 2018

Nettrom updated the task description for T194336: Get onboarded to the Product Analytics team.
Aug 1 2018, 10:15 PM · Contributors-Analysis, Product-Analytics

Jul 31 2018

Nettrom updated the task description for T194336: Get onboarded to the Product Analytics team.
Jul 31 2018, 12:10 AM · Contributors-Analysis, Product-Analytics

Jul 30 2018

Nettrom updated the task description for T194336: Get onboarded to the Product Analytics team.
Jul 30 2018, 7:47 PM · Contributors-Analysis, Product-Analytics

Apr 30 2018

Nettrom added a comment to T192515: Measurement for AfC improvement (April 2018).

I see you're running into some of the same challenges that I had with getting good data on this for ACTRIAL, and that you've found some of the code and data that I have. Since I'm currently working on T192574, there's also some newer code and data available.

Apr 30 2018, 9:14 PM · Growth-Team-Filtering, Analytics-Radar, Product-Analytics, Growth-Team

Apr 23 2018

Nettrom added a comment to T192574: Produce graph of AfC submissions for all of ACTRIAL.

The data gathering for this is now running, and I expect it'll take a day or two to complete. I also updated the database schema to have a column for the timestamp when a submission was withdrawn so that we can use that to better estimate the contribution to the AfC backlog from pages created in the Draft namespace (hypothesis 17).

Apr 23 2018, 5:32 PM · Product-Analytics, Community-Tech

Apr 19 2018

Nettrom created T192574: Produce graph of AfC submissions for all of ACTRIAL.
Apr 19 2018, 5:13 PM · Product-Analytics, Community-Tech

Mar 27 2018

Nettrom added a comment to T190434: Issues with page deleted dates on data lake.

I've spent a bit of time looking at this, and as far as I can find, the revision_deleted_timestamp is consistently incorrect. Using a sample dataset of creations from four different months, I've found that 15% of the time the deletion timestamp is missing. For pages that have it set, the vast majority of entries (almost 90%) do not match against the logging table. Lastly, of those that match against the logging table, it's almost always not a page deletion event.

Mar 27 2018, 6:25 PM · Patch-For-Review, Analytics, Analytics-Kanban
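
The three rates described in the comment above (missing timestamps, timestamps with no match in the logging table, and matches that are actual page deletions) could be computed along these lines. This is only a minimal sketch: the `Creation` record, the `audit` function, and the shape of `log_events` are hypothetical names chosen for illustration, not the code or schema used in the original analysis.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Creation:
    page_id: int
    # Deletion timestamp as recorded in the sampled dataset (None if missing).
    deleted_ts: Optional[str]


def audit(creations, log_events):
    """Summarise how well recorded deletion timestamps agree with a log.

    log_events is assumed to map (page_id, timestamp) -> log action,
    for example "delete" or "move".
    """
    total = len(creations)
    with_ts = [c for c in creations if c.deleted_ts is not None]
    matched = [c for c in with_ts if (c.page_id, c.deleted_ts) in log_events]
    deletions = [
        c for c in matched if log_events[(c.page_id, c.deleted_ts)] == "delete"
    ]
    return {
        # Share of sampled creations with no deletion timestamp at all.
        "missing_share": (total - len(with_ts)) / total if total else 0.0,
        # Share of set timestamps that have no matching log entry.
        "unmatched_share": (
            (len(with_ts) - len(matched)) / len(with_ts) if with_ts else 0.0
        ),
        # Share of matching log entries that are page deletions.
        "deletion_share_of_matches": (
            len(deletions) / len(matched) if matched else 0.0
        ),
    }
```

Each rate uses a different denominator (all sampled creations, creations with a timestamp, and matched creations), mirroring how the comment narrows the sample step by step.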

Mar 22 2018

Nettrom added a comment to T190434: Issues with page deleted dates on data lake.

As mentioned on IRC earlier today, I never filed a ticket because I didn't have the time to sit down and make sure I had data that allowed me to understand exactly what the problem is. Picked it up again today because I now have some time to dig in.

Mar 22 2018, 10:55 PM · Patch-For-Review, Analytics, Analytics-Kanban

Feb 23 2018

Nettrom awarded T173720: Add pagecounts by article and top pagecounts to AQS a Love token.
Feb 23 2018, 12:24 AM · Data-Engineering-Icebox, Analytics

Jan 17 2018

Nettrom closed T185019: Data missing in page creation datasets as Resolved.

I checked the dashboard for enwiki and spot-checked a dataset, and the data appears to be in working order. Thanks for helping take care of this @Milimetric, and great to learn there's a way to easily fix this next time!

Jan 17 2018, 7:07 PM · Analytics, Community-Tech

Jan 16 2018

Nettrom created T185019: Data missing in page creation datasets.
Jan 16 2018, 6:45 PM · Analytics, Community-Tech

Nov 29 2017

Nettrom committed rODQa62f7cc90283: Add NLTK Wordnet requirement to readme Testing the draft quality model resulted… (authored by Nettrom).
Add NLTK Wordnet requirement to readme Testing the draft quality model resulted…
Nov 29 2017, 11:22 PM

Nov 21 2017

Nettrom added a comment to T156844: Decommission old dbstore hosts (db1046, db1047).

Never mind, turns out @mforns has already updated that configuration; I should've checked that first. Thanks again for taking care of it!

Nov 21 2017, 3:47 PM · Analytics-Kanban, Patch-For-Review, User-Elukey, SRE, DBA
Nettrom added a comment to T156844: Decommission old dbstore hosts (db1046, db1047).

The data behind Page Creation Dashboard is configured to read data from the log database on dbstore1002. Can I at this point submit a patch to the ReportUpdater configuration that updates it to use db1108.eqiad.wmnet, as that now has the updated log database?

Nov 21 2017, 3:40 PM · Analytics-Kanban, Patch-For-Review, User-Elukey, SRE, DBA

Oct 26 2017

Nettrom closed T178602: Add option to not truncate Y-axis as Resolved.

Looks good to me, thanks again!

Oct 26 2017, 9:26 PM · Patch-For-Review, Analytics-Kanban, Data-Engineering-Dashiki

Oct 19 2017

Nettrom created T178602: Add option to not truncate Y-axis.
Oct 19 2017, 6:01 PM · Patch-For-Review, Analytics-Kanban, Data-Engineering-Dashiki

Sep 27 2017

Nettrom closed T176375: Add draft namespace creations to page creation dashboard as Resolved.
  1. Verified that the dataset of number of pages created is available in the correct dataset directory.
  2. Added metric for number of pages created in the Draft namespace to Dashiki:CategorizedMetrics in this edit.
  3. Added metric to Config:Dashiki:PageCreations in this edit.
  4. Verified that the metric is now available; it can be viewed here.
Sep 27 2017, 9:59 PM · Analytics-Radar, Patch-For-Review, Data-Engineering-Dashiki, Community-Tech

Sep 21 2017

Nettrom awarded T176372: Request to be added to the "wmf" LDAP group a Yellow Medal token.
Sep 21 2017, 4:20 PM · Community-Tech, LDAP-Access-Requests

Sep 20 2017

MusikAnimal awarded T176375: Add draft namespace creations to page creation dashboard a Yellow Medal token.
Sep 20 2017, 11:25 PM · Analytics-Radar, Patch-For-Review, Data-Engineering-Dashiki, Community-Tech
Nettrom created T176375: Add draft namespace creations to page creation dashboard.
Sep 20 2017, 11:24 PM · Analytics-Radar, Patch-For-Review, Data-Engineering-Dashiki, Community-Tech
Nettrom created T176372: Request to be added to the "wmf" LDAP group.
Sep 20 2017, 10:30 PM · Community-Tech, LDAP-Access-Requests

Sep 12 2017

Nettrom added a comment to T170850: Visualize page create events for all wikis.

@kaldari : The three last metrics are only defined for English Wikipedia, partly because I saw them as ACTRIAL-specific. When it comes to the autopatrol right, those are also defined for different user groups depending on what wiki we're looking at, and I didn't see the benefit of figuring those out for the entire set of wikis.

Sep 12 2017, 11:26 PM · Product-Analytics, Analytics-Radar, Community-Tech, Patch-For-Review, MW-1.30-release-notes (WMF-deploy-2017-06-27_(1.30.0-wmf.7)), Event-Platform Value Stream, Contributors-Analysis, MediaWiki-extensions-EventLogging
Nettrom added a comment to T170850: Visualize page create events for all wikis.

@Nuria : Thanks for taking care of this! Sorry I didn't get around to updating the commit message as you requested, forgot to put that on my todo list.

Sep 12 2017, 10:54 PM · Product-Analytics, Analytics-Radar, Community-Tech, Patch-For-Review, MW-1.30-release-notes (WMF-deploy-2017-06-27_(1.30.0-wmf.7)), Event-Platform Value Stream, Contributors-Analysis, MediaWiki-extensions-EventLogging
Nettrom added a comment to T170434: Improve cleaning of article quality assessment datasets.

New dataset has now been uploaded to figshare. If this direct link does not work, use this DOI link and download the "2017_english_wikipedia_quality_dataset.tar.bz2" file.

Sep 12 2017, 7:47 PM · articlequality-modeling, Machine-Learning-Team (Active Tasks), artificial-intelligence

Sep 11 2017

Nettrom added a comment to T166276: What percentage of new articles are created by auto-patrolled users?.

There's a user group called "autoreviewer" that specifically gets the "autopatrol" user right. That right is also applied to bots and admins. Or at least that's how I read en:Special:ListGroupRights. The help page mentions that it used to be called "autoreviewer", so I guess they just never renamed the user group.

Sep 11 2017, 11:23 PM · Community-Tech
Nettrom committed rODQa54d1586add6: Add NLTK Wordnet requirement to readme (authored by Nettrom).
Add NLTK Wordnet requirement to readme
Sep 11 2017, 10:44 PM
Nettrom added a comment to T166276: What percentage of new articles are created by auto-patrolled users?.

@kaldari : No, I really mean "autoreviewer", ref en:Special:ListGroupRights. I haven't been able to find any documentation that defines the user group in the system as "autopatrolled". And yes, I find that confusing.

Sep 11 2017, 10:18 PM · Community-Tech
Nettrom added a comment to T166276: What percentage of new articles are created by auto-patrolled users?.

@Neil_P._Quinn_WMF : I actually ran a query to get similar data on Friday, because I've been using it to figure out how long it takes for articles to get reviewed. My current best version of the query is in our GitHub repository: non_autopatrolled_creations.hql. It looks for non-autopatrolled creations, but it's trivial to calculate the opposite proportion as I also have data on all article creations.

Sep 11 2017, 9:30 PM · Community-Tech
GitHub <noreply@github.com> committed rOWC194e594c62bc: Merge 48f748bdb7bff2e945f940b0b62df32df5ac44f8 into… (authored by Nettrom).
Merge 48f748bdb7bff2e945f940b0b62df32df5ac44f8 into…
Sep 11 2017, 9:24 PM
Nettrom committed rOWC48f748bdb7bf: Update classification examples to revscoring 2.0 (authored by Nettrom).
Update classification examples to revscoring 2.0
Sep 11 2017, 9:24 PM
Nettrom added a comment to T170434: Improve cleaning of article quality assessment datasets.

@awight : I was working on this yesterday, but didn't get the dataset ready overnight. The process I have goes as follows:

Sep 11 2017, 5:11 PM · articlequality-modeling, Machine-Learning-Team (Active Tasks), artificial-intelligence
Nettrom added a comment to T170850: Visualize page create events for all wikis.

@Nuria : I added a short note to the tutorial about the requirements. Since I don't know npm very well, it's rather non-specific on how to get them installed. I'll make a mental note to look into nvm on a rainy day, as that might allow it to be more specific on how to go about doing this since I'll then know how to do this for both a global npm install as well as for a local one using nvm.

Sep 11 2017, 4:07 PM · Product-Analytics, Analytics-Radar, Community-Tech, Patch-For-Review, MW-1.30-release-notes (WMF-deploy-2017-06-27_(1.30.0-wmf.7)), Event-Platform Value Stream, Contributors-Analysis, MediaWiki-extensions-EventLogging

Sep 8 2017

Nettrom added a comment to T170850: Visualize page create events for all wikis.

@Nuria: I've tested our dashboard locally here and everything seemed to be working just fine. How do we go about getting it deployed? In this specific project, having a VM on Labs isn't really an option.

Sep 8 2017, 3:58 PM · Product-Analytics, Analytics-Radar, Community-Tech, Patch-For-Review, MW-1.30-release-notes (WMF-deploy-2017-06-27_(1.30.0-wmf.7)), Event-Platform Value Stream, Contributors-Analysis, MediaWiki-extensions-EventLogging

Sep 6 2017

Nettrom added a comment to T170850: Visualize page create events for all wikis.

Ah, I see! The tutorial isn't aligned with said documentation then. I'll update the tutorial and move forward.

Sep 6 2017, 10:15 PM · Product-Analytics, Analytics-Radar, Community-Tech, Patch-For-Review, MW-1.30-release-notes (WMF-deploy-2017-06-27_(1.30.0-wmf.7)), Event-Platform Value Stream, Contributors-Analysis, MediaWiki-extensions-EventLogging
Nettrom added a comment to T170850: Visualize page create events for all wikis.

From what I can tell after digging around a bit, the configuration of the Dashiki extension limits the creation of pages in the "Config" namespace to ones with titles starting with "Dashiki:" (refs [1,2]). Thus, I can create "Config:Dashiki:PageCreations", but not "Config:PageCreations". I suspect the latter is instead a pseudo page used by the JsonConfig extension.

Sep 6 2017, 9:04 PM · Product-Analytics, Analytics-Radar, Community-Tech, Patch-For-Review, MW-1.30-release-notes (WMF-deploy-2017-06-27_(1.30.0-wmf.7)), Event-Platform Value Stream, Contributors-Analysis, MediaWiki-extensions-EventLogging

Sep 5 2017

Nettrom added a comment to T170850: Visualize page create events for all wikis.

@Nuria : I'm working on this now, got the metrics added to [[m:Dashiki:CategorizedMetrics]] without breaking anything, or so it seems. I do not have permissions to create [[m:Config:PageCreationDashboard]], but it appears I can edit existing dashboards. Could you (or someone else who has permissions, pinging @kaldari) create the config page for our dashboard so I can edit it? Feel free to create it with a different title if the one I suggested breaks conventions.

Sep 5 2017, 4:52 PM · Product-Analytics, Analytics-Radar, Community-Tech, Patch-For-Review, MW-1.30-release-notes (WMF-deploy-2017-06-27_(1.30.0-wmf.7)), Event-Platform Value Stream, Contributors-Analysis, MediaWiki-extensions-EventLogging

Aug 31 2017

Nettrom added a comment to T170850: Visualize page create events for all wikis.

Ah, I remember being confused by the configuration file path in the examples I looked at, but forgot to ask about what it should be. Thanks for figuring that out and updating it, and also for your help with reviewing the patch, much appreciated!

Aug 31 2017, 8:38 PM · Product-Analytics, Analytics-Radar, Community-Tech, Patch-For-Review, MW-1.30-release-notes (WMF-deploy-2017-06-27_(1.30.0-wmf.7)), Event-Platform Value Stream, Contributors-Analysis, MediaWiki-extensions-EventLogging

Aug 29 2017

Nettrom added a comment to T170434: Improve cleaning of article quality assessment datasets.

I'm a bit pressed for time at the moment, so to prevent this from stalling I'd like to propose that a first priority is that I try to create a dataset that doesn't have any redirects in it. Given the low number of redirects we have in the dataset, I expect this problem to be minimal if I simply sample a few hundred extra articles in the classes where that is possible. I'll also make sure the dataset doesn't contain any disambiguation pages.

Aug 29 2017, 4:25 AM · articlequality-modeling, Machine-Learning-Team (Active Tasks), artificial-intelligence
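
The approach proposed in the comment above, drawing a few hundred extra articles per class and then dropping redirects and disambiguation pages, could look roughly like the following. It is a sketch under assumptions: `sample_class`, its parameters, and the `is_excluded` predicate are hypothetical names, and the actual dataset pipeline may work quite differently.

```python
import random


def sample_class(candidates, n, extra, is_excluded, seed=0):
    """Draw up to n usable pages from one quality class.

    Oversamples by `extra` so that pages rejected by `is_excluded`
    (for example redirects or disambiguation pages) can be dropped
    without the class ending up short. If too many drawn pages are
    excluded, fewer than n pages are returned.
    """
    rng = random.Random(seed)  # fixed seed keeps the draw reproducible
    k = min(len(candidates), n + extra)
    drawn = rng.sample(candidates, k)
    kept = [page for page in drawn if not is_excluded(page)]
    return kept[:n]
```

Oversampling only helps in classes that have more than n candidates to begin with, which matches the caveat in the comment ("in the classes where that is possible").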

Aug 28 2017

Nettrom created T174306: Request creation of suggestbot VPS project.
Aug 28 2017, 4:31 AM · Cloud-VPS (Project-requests)

Aug 25 2017

Nettrom added a comment to T149021: Determine: What percentage of new articles are created by non-autoconfirmed editors.

I adapted this query for use in gathering some statistics for the ACTRIAL project and noticed that it seemed to fail to pick up deleted articles. In my dataset gathered a week ago there are 730 article creations on 2017-01-01, and 729 of those currently exist in the revision table. A key reason appears to be that event_comment is NULL for those deleted articles, so any event_comment NOT REGEXP 'foo' condition evaluates to NULL rather than true and removes that row from the query result.

Aug 25 2017, 11:06 PM · Community-Tech
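
The NULL behaviour described in the comment above can be reproduced in a small, self-contained way. The sketch below uses Python's sqlite3 module, and it substitutes NOT LIKE for NOT REGEXP because SQLite ships without a built-in REGEXP function; the three-valued logic is the same. The table name `log`, the `page_title` column, the sample rows, and the pattern are all hypothetical; only `event_comment` comes from the comment.

```python
import sqlite3


def count_creations(rows, null_safe):
    """Count rows whose comment does not look like a redirect.

    With null_safe=False, rows with a NULL event_comment are silently
    dropped, because NULL NOT LIKE '...' evaluates to NULL, not true.
    With null_safe=True, NULL comments are kept explicitly.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE log (page_title TEXT, event_comment TEXT)")
    conn.executemany("INSERT INTO log VALUES (?, ?)", rows)
    if null_safe:
        where = "(event_comment IS NULL OR event_comment NOT LIKE '%redirect%')"
    else:
        where = "event_comment NOT LIKE '%redirect%'"
    (n,) = conn.execute(f"SELECT COUNT(*) FROM log WHERE {where}").fetchone()
    conn.close()
    return n


# Hypothetical sample: one normal creation, one with a NULL comment
# (as seen for the deleted articles), and one redirect.
ROWS = [("A", "created page"), ("B", None), ("C", "redirect to A")]
```

Calling `count_creations(ROWS, null_safe=False)` counts only page A, while `null_safe=True` also counts page B. Adding an explicit `IS NULL` branch is one way to keep such rows; wrapping the column in `COALESCE(event_comment, '')` would be another.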

Aug 23 2017

Nettrom added a comment to T170850: Visualize page create events for all wikis.

@mforns Patch submitted (linked below), and I added you as a reviewer. First time working with Gerrit, hopefully I got it mostly right! Happy to make changes as need be, fun to learn how to do this. Thanks again!

Aug 23 2017, 9:13 PM · Product-Analytics, Analytics-Radar, Community-Tech, Patch-For-Review, MW-1.30-release-notes (WMF-deploy-2017-06-27_(1.30.0-wmf.7)), Event-Platform Value Stream, Contributors-Analysis, MediaWiki-extensions-EventLogging
Nettrom added a comment to T170850: Visualize page create events for all wikis.

@mforns : Thanks much for your help with this! I've set up the queries so they return two columns, with the second named after the wiki as you recommended. Also, thanks for the link to the tutorial, it's a lot easier to follow than the technical documentation ([[:wikitech:Analytics/Systems/Dashiki]], I'd be happy to add a link to the tutorial from that page if that's useful?).

Aug 23 2017, 6:05 PM · Product-Analytics, Analytics-Radar, Community-Tech, Patch-For-Review, MW-1.30-release-notes (WMF-deploy-2017-06-27_(1.30.0-wmf.7)), Event-Platform Value Stream, Contributors-Analysis, MediaWiki-extensions-EventLogging

Aug 22 2017

Nettrom added a comment to T166556: Impact analysis of New page reviewer userright change.

Just a heads-up that we've rephrased our hypotheses around patroller workload since the start of the ACTRIAL project, and "number of active patrollers" is now one of our measurements together with a few related ones. Ref hypotheses 9–13 on our project page (https://meta.wikimedia.org/wiki/Research:Autoconfirmed_article_creation_trial). I plan to reuse your query for counting number of active patrollers, thanks!

Aug 22 2017, 6:49 PM · Research-Backlog
Nettrom added a comment to T170850: Visualize page create events for all wikis.

I'm working on this and got ReportUpdater working locally. A couple of questions:

Aug 22 2017, 6:34 PM · Product-Analytics, Analytics-Radar, Community-Tech, Patch-For-Review, MW-1.30-release-notes (WMF-deploy-2017-06-27_(1.30.0-wmf.7)), Event-Platform Value Stream, Contributors-Analysis, MediaWiki-extensions-EventLogging

Jul 14 2017

Nettrom added a comment to T170434: Improve cleaning of article quality assessment datasets.

I've gathered revision timestamps for all the revisions in the published dataset, and also checked for redirects. Here are some summaries:

Jul 14 2017, 10:53 PM · articlequality-modeling, Machine-Learning-Team (Active Tasks), artificial-intelligence

Jul 12 2017

Nettrom created T170434: Improve cleaning of article quality assessment datasets.
Jul 12 2017, 4:24 PM · articlequality-modeling, Machine-Learning-Team (Active Tasks), artificial-intelligence

Jun 8 2017

Nettrom added a comment to T162933: Endpoint for average view rate in Pageview API.

Coming back to this I have a bunch of questions, so I'll just ask them and see where we go from there. Apologies if this is counterproductive, feel free to let me know how to improve in future work.

Jun 8 2017, 5:24 PM · Data-Engineering-Planning, Pageviews-API

Jun 7 2017

Nettrom created T167362: Draft specification of article importance API.
Jun 7 2017, 10:17 PM · Machine-Learning-Team, artificial-intelligence
Nettrom added a comment to T164671: Implement wp10 model for trwiki.

@Mavrikant Excellent! The extractor looks good to go as far as I can tell. Also, happy to hear that you don't have HTML comments in your WikiProject templates, that makes life a lot easier :)

Jun 7 2017, 4:22 PM · Turkish-Sites, Machine-Learning-Team (Active Tasks), artificial-intelligence, articlequality-modeling

Jun 6 2017

Nettrom added a comment to T164671: Implement wp10 model for trwiki.

@Mavrikant: thanks for getting code for the trwiki extractor up on https://github.com/Mavrikant/wikiclass/blob/master/wikiclass/extractors/trwiki.py, it makes everything a lot easier!

Jun 6 2017, 11:12 PM · Turkish-Sites, Machine-Learning-Team (Active Tasks), artificial-intelligence, articlequality-modeling
GitHub <noreply@github.com> committed rOWC7a9d6edcd987: Merge 63c5cf6ff3ef3b94dd55f2053a7819d6be045197 into… (authored by Nettrom).
Merge 63c5cf6ff3ef3b94dd55f2053a7819d6be045197 into…
Jun 6 2017, 11:02 PM
Nettrom committed rOWCa07e4d1ad63e: Merge from upstream master (authored by Nettrom).
Merge from upstream master
Jun 6 2017, 11:02 PM
Nettrom committed rOWC63c5cf6ff3ef: Remove unnecessary mwparserfromhell import (authored by Nettrom).
Remove unnecessary mwparserfromhell import
Jun 6 2017, 11:02 PM
Nettrom committed rOWCc793659d1297: Add example that should break, but needs testing (authored by Nettrom).
Add example that should break, but needs testing
Jun 6 2017, 11:02 PM
Nettrom committed rOWCd9687dfdc7a9: Let mwparserfromhell strip HTML comments (authored by Nettrom).
Let mwparserfromhell strip HTML comments
Jun 6 2017, 11:02 PM

Apr 17 2017

Nettrom created T163171: ORES server error.
Apr 17 2017, 11:55 PM · Machine-Learning-Team (Active Tasks), ORES

Jan 4 2017

Nettrom added a comment to T154608: Kernel consistently crashing.

3,746,600 rows. The file I'm importing is 259MiB when unzipped.

Jan 4 2017, 8:17 PM · PAWS
Nettrom renamed T154608: Kernel consistently crashing from Kernel consistently crashing (out of memory issue?) to Kernel consistently crashing.
Jan 4 2017, 8:10 PM · PAWS