Page MenuHomePhabricator

Iflorez (Irene)
User

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Thursday

  • Clear sailing ahead.

User Details

User Since
May 14 2019, 10:44 PM (127 w, 6 d)
Availability
Available
LDAP User
Iflorez
MediaWiki User
IFlorez (WMF) [ Global Accounts ]

Recent Activity

Yesterday

Iflorez moved T292867: FY21-22 Q1 Tuning Session Metrics - Irene from Needs Sign-off to Needs Review on the Product-Analytics (Kanban) board.
Mon, Oct 25, 4:26 PM · Product-Analytics (Kanban)

Fri, Oct 22

Iflorez updated the task description for T291107: August & September 2021 Wikimedia movement metrics.
Fri, Oct 22, 8:00 PM · Product-Analytics (Kanban)

Thu, Oct 21

Iflorez updated the task description for T291107: August & September 2021 Wikimedia movement metrics.
Thu, Oct 21, 9:20 PM · Product-Analytics (Kanban)
Iflorez updated the task description for T291107: August & September 2021 Wikimedia movement metrics.
Thu, Oct 21, 9:18 PM · Product-Analytics (Kanban)
Iflorez moved T292867: FY21-22 Q1 Tuning Session Metrics - Irene from Needs Review to Needs Sign-off on the Product-Analytics (Kanban) board.
Thu, Oct 21, 9:14 PM · Product-Analytics (Kanban)
Iflorez moved T292867: FY21-22 Q1 Tuning Session Metrics - Irene from Doing to Needs Review on the Product-Analytics (Kanban) board.
Thu, Oct 21, 9:14 PM · Product-Analytics (Kanban)
Iflorez updated the task description for T292867: FY21-22 Q1 Tuning Session Metrics - Irene.
Thu, Oct 21, 9:12 PM · Product-Analytics (Kanban)

Fri, Oct 15

Iflorez updated the task description for T291107: August & September 2021 Wikimedia movement metrics.
Fri, Oct 15, 9:32 PM · Product-Analytics (Kanban)
Iflorez updated the task description for T287283: High Level Metrics : Future Improvements.
Fri, Oct 15, 9:27 PM · Product-Analytics
Iflorez updated the task description for T291107: August & September 2021 Wikimedia movement metrics.
Fri, Oct 15, 9:12 PM · Product-Analytics (Kanban)

Wed, Oct 13

Iflorez updated the task description for T291107: August & September 2021 Wikimedia movement metrics.
Wed, Oct 13, 8:46 PM · Product-Analytics (Kanban)
Iflorez updated the task description for T292867: FY21-22 Q1 Tuning Session Metrics - Irene.
Wed, Oct 13, 8:43 PM · Product-Analytics (Kanban)
Iflorez updated the task description for T292867: FY21-22 Q1 Tuning Session Metrics - Irene.
Wed, Oct 13, 8:40 PM · Product-Analytics (Kanban)
Iflorez updated the task description for T292867: FY21-22 Q1 Tuning Session Metrics - Irene.
Wed, Oct 13, 4:23 PM · Product-Analytics (Kanban)

Tue, Oct 12

Iflorez updated the task description for T292867: FY21-22 Q1 Tuning Session Metrics - Irene.
Tue, Oct 12, 11:15 PM · Product-Analytics (Kanban)
Iflorez updated the task description for T292867: FY21-22 Q1 Tuning Session Metrics - Irene.
Tue, Oct 12, 11:15 PM · Product-Analytics (Kanban)
Iflorez updated the task description for T287283: High Level Metrics : Future Improvements.
Tue, Oct 12, 11:10 PM · Product-Analytics
Iflorez closed T293130: Fix editors_daily data issue for Tuning session reporting as Resolved.
Tue, Oct 12, 8:04 PM · Product-Analytics, Data-Engineering
Iflorez added a comment to T293130: Fix editors_daily data issue for Tuning session reporting.

Closing this ticket as the data is not missing. Documentation for this table notes:

Tue, Oct 12, 8:04 PM · Product-Analytics, Data-Engineering
Iflorez updated subscribers of T293130: Fix editors_daily data issue for Tuning session reporting.
Tue, Oct 12, 6:52 PM · Product-Analytics, Data-Engineering
Iflorez triaged T293130: Fix editors_daily data issue for Tuning session reporting as High priority.
Tue, Oct 12, 6:50 PM · Product-Analytics, Data-Engineering
Iflorez created T293130: Fix editors_daily data issue for Tuning session reporting.
Tue, Oct 12, 6:50 PM · Product-Analytics, Data-Engineering
Iflorez added a comment to T292880: Manually update cchen.new_editors.

Thank you!!!

Tue, Oct 12, 4:36 PM · Product-Analytics (Kanban)

Fri, Oct 8

Iflorez triaged T292880: Manually update cchen.new_editors as High priority.
Fri, Oct 8, 10:57 PM · Product-Analytics (Kanban)
Iflorez updated the task description for T292867: FY21-22 Q1 Tuning Session Metrics - Irene.
Fri, Oct 8, 10:42 PM · Product-Analytics (Kanban)
Iflorez created T292880: Manually update cchen.new_editors.
Fri, Oct 8, 10:39 PM · Product-Analytics (Kanban)
Iflorez created T292867: FY21-22 Q1 Tuning Session Metrics - Irene.
Fri, Oct 8, 6:34 PM · Product-Analytics (Kanban)

Wed, Sep 29

Iflorez added a comment to T289799: [REQUEST] Investigate decrease in New Registered Users.

@Tgr Do you know of recent bot detection deployments that should be considered here? Or insights about changes to bot activity that I should review? Another potential component is whether bot-created new registered user accounts are part of the picture. If there have been changes in bot activity/behavior or in bot detection, I will include that in this investigation.

Wed, Sep 29, 9:23 PM · Analytics-Radar, Product-Analytics (Kanban)
Iflorez updated the task description for T292118: Request access to private data group for ifried.
Wed, Sep 29, 9:06 PM · SRE, SRE-Access-Requests
Iflorez added a comment to T291957: Puppet configuration for Product Analytics ETL jobs.

(1) Do any of the movement metrics ETL notebooks currently use Presto to query?
No; A few use spark:
Readers:
2a spark
2b hive, spark
Platform:
1 hive, spark
Editors:
1b hive, spark
2a hive, spark

Wed, Sep 29, 6:22 PM · Product-Analytics (Kanban)
Iflorez added a comment to T291957: Puppet configuration for Product Analytics ETL jobs.

I believe that I didn't install any Python packages for the data wrangling notebooks when I created a new environment and cloned the repos in July. The packages that I installed were R packages to run the three data viz notebooks.
These are the packages imported:
from cycler import cycler
from dateutil.relativedelta import relativedelta
from functools import reduce
from google.oauth2.service_account import Credentials
from numbers import Number
from pathlib import Path
import datetime
import gspread
import matplotlib as mpl
import matplotlib.pyplot as plt
import numpy as np
import os
import pandas as pd
import requests
import requests
import time

Wed, Sep 29, 5:23 PM · Product-Analytics (Kanban)

Tue, Sep 28

Iflorez added a comment to T289799: [REQUEST] Investigate decrease in New Registered Users.

Wikistats numbers for New Registered Users are in line with the numbers seen in the logging table (where log_action = 'create') and SSAC table (where event.isselfmade = true). See how wikistats defines New Registered Users.
These numbers were tested by comparing them to user counts on wikis for user accounts created within the same time period, using the below query:

Tue, Sep 28, 12:27 AM · Analytics-Radar, Product-Analytics (Kanban)

Sep 15 2021

Iflorez added a comment to T289799: [REQUEST] Investigate decrease in New Registered Users.

When comparing logging table records where log_action = 'create' to ssac table records where event.isselfmade = true, I see a very similar number of results in the two tables for the list of wikis queried. Out of 77 wikis: 66 showed the same number of new registrations in July in both tables, 11 showed a .44% to a .02% difference in registration counts between the two tables with the ssac table undercounting by a few registrants each of the 11 times.

Sep 15 2021, 4:56 AM · Analytics-Radar, Product-Analytics (Kanban)

Sep 9 2021

Iflorez updated subscribers of T289799: [REQUEST] Investigate decrease in New Registered Users.

Hi All,
I'm investigating this. Following my recent check-in with @mpopov, my next step is to touch base with @nettrom_WMF when he returns on Monday to review queries/results.

Sep 9 2021, 10:12 PM · Analytics-Radar, Product-Analytics (Kanban)
Iflorez edited projects for T289799: [REQUEST] Investigate decrease in New Registered Users, added: Analytics-Radar; removed Analytics.
Sep 9 2021, 10:07 PM · Analytics-Radar, Product-Analytics (Kanban)
Iflorez edited projects for T289799: [REQUEST] Investigate decrease in New Registered Users, added: Analytics; removed Analytics-Radar.
Sep 9 2021, 10:06 PM · Analytics-Radar, Product-Analytics (Kanban)

Sep 8 2021

Iflorez added a comment to T289799: [REQUEST] Investigate decrease in New Registered Users.

Hi all,
I am looking at counts from ServerSideAccountCreation and comparing those to counts from logging for swiki, bnwiki, idwiki, and enwiki to start.
I will check in with @mpopov tomorrow about this and will post an update here by Thursday.

Sep 8 2021, 4:46 AM · Analytics-Radar, Product-Analytics (Kanban)

Sep 3 2021

Iflorez updated the task description for T290358: [REQUEST] % of wikipedia articles related to Africa.
Sep 3 2021, 11:04 PM · Product-Analytics
Iflorez created T290358: [REQUEST] % of wikipedia articles related to Africa.
Sep 3 2021, 9:36 PM · Product-Analytics

Aug 30 2021

Iflorez updated the task description for T287715: Analysis Request for Editor Growth due to Campaign Activity in Sub-Saharan Africa.
Aug 30 2021, 5:19 PM · Campaign-Tools (Campaign-Tools-Active-Work), Product-Analytics (Kanban)

Aug 18 2021

Iflorez added a comment to T287715: Analysis Request for Editor Growth due to Campaign Activity in Sub-Saharan Africa.

Question: what percentage of editors in SSA Africa participated in campaign activities in the 2019-2020 period?

Aug 18 2021, 11:43 PM · Campaign-Tools (Campaign-Tools-Active-Work), Product-Analytics (Kanban)
Iflorez closed T288284: Metrics Platform Schema testing & feedback: Irene as Resolved.
Aug 18 2021, 11:04 PM · Product-Analytics (Kanban), Metrics-Platform
Iflorez updated the task description for T288140: July 2021 Wikimedia movement metrics.
Aug 18 2021, 11:03 PM · Product-Analytics (Kanban)

Aug 16 2021

Iflorez closed T288864: Change /user/hive/warehouse/neilpquinn.db/editor_month ownership to iflorez, a subtask of T288140: July 2021 Wikimedia movement metrics, as Resolved.
Aug 16 2021, 5:48 PM · Product-Analytics (Kanban)
Iflorez closed T288864: Change /user/hive/warehouse/neilpquinn.db/editor_month ownership to iflorez as Resolved.
Aug 16 2021, 5:48 PM · Product-Analytics, Analytics

Aug 13 2021

Iflorez triaged T288864: Change /user/hive/warehouse/neilpquinn.db/editor_month ownership to iflorez as High priority.
Aug 13 2021, 8:47 PM · Product-Analytics, Analytics
Iflorez created T288864: Change /user/hive/warehouse/neilpquinn.db/editor_month ownership to iflorez.
Aug 13 2021, 8:47 PM · Product-Analytics, Analytics

Aug 11 2021

Iflorez created T288657: Change /user/hive/warehouse/wmf_product.db ownership to iflorez.
Aug 11 2021, 8:08 PM · Analytics-Kanban, Product-Analytics, Analytics

Aug 10 2021

Iflorez closed T239001: create a program out of the various GLOW project HIVE, MariaDB, and API queries and publish code as Resolved.
Aug 10 2021, 8:58 PM · GLOW

Aug 3 2021

Iflorez added a member for Campaign-Tools: Iflorez.
Aug 3 2021, 6:14 PM

Jul 26 2021

Iflorez updated the task description for T285954: Onboarding Checklist for Irene Florez.
Jul 26 2021, 9:41 PM · Product-Analytics (Kanban)
Iflorez created T287418: Request to be added to WMF-NDA phabricator group for Iflorez.
Jul 26 2021, 9:22 PM · WMF-NDA-Requests

Jul 12 2021

Iflorez added a comment to T223496: Requesting access to machines [stat1004, stat1005 (now stat1007), and stat1006] and groups for iflorez.

Hello I've joined the Product Analytics team with @mpopov as my manager. Hooorah!

Jul 12 2021, 7:10 PM · SRE, SRE-Access-Requests

Jul 9 2021

Iflorez updated the task description for T285954: Onboarding Checklist for Irene Florez.
Jul 9 2021, 7:44 PM · Product-Analytics (Kanban)

Jul 8 2021

Iflorez updated the task description for T285954: Onboarding Checklist for Irene Florez.
Jul 8 2021, 7:09 PM · Product-Analytics (Kanban)

May 3 2021

Iflorez updated the task description for T275427: Grants API tooling (Fluxx-Meta).
May 3 2021, 10:32 PM · Community-Resources

Feb 24 2021

Iflorez updated the task description for T275427: Grants API tooling (Fluxx-Meta).
Feb 24 2021, 5:45 PM · Community-Resources
Iflorez updated the task description for T275427: Grants API tooling (Fluxx-Meta).
Feb 24 2021, 5:32 PM · Community-Resources

Feb 22 2021

Iflorez created T275427: Grants API tooling (Fluxx-Meta).
Feb 22 2021, 7:42 PM · Community-Resources

Aug 4 2020

Iflorez added a comment to T255028: Move the stat1004-6-7 hosts to Debian Buster.

Thank you, @elukey !

Aug 4 2020, 4:43 PM · Analytics-Kanban, Analytics-Clusters

Jun 18 2020

Iflorez closed T235889: Pull 2018 Hindi and Malayalam data for Project Tiger and Wiki Asia Month as Resolved.
Jun 18 2020, 10:08 PM · GLOW
Iflorez closed T235894: Survey GLOW project hosts (country level) for total monthly avg devices accessing Wikipedia as Resolved.
Jun 18 2020, 10:07 PM · GLOW
Iflorez closed T235888: Define evaluation plan for GLOW project as Resolved.
Jun 18 2020, 10:07 PM · GLOW
Iflorez closed T233804: Measure Tiger 1.0 for Punjabi against other community campaigns as Resolved.
Jun 18 2020, 10:07 PM · GLOW
Iflorez closed T233808: Discuss the use of Wikidata items in articles tracking as Resolved.
Jun 18 2020, 10:06 PM · GLOW
Iflorez closed T238938: GLOW Team access to superset as Resolved.
Jun 18 2020, 10:05 PM · GLOW
Iflorez closed T233806: Prepare a preliminary metric plan for GLOW , a subtask of T235888: Define evaluation plan for GLOW project, as Resolved.
Jun 18 2020, 9:58 PM · GLOW
Iflorez closed T233806: Prepare a preliminary metric plan for GLOW as Resolved.
Jun 18 2020, 9:58 PM · GLOW
Iflorez closed T237581: external search engine traffic CTR on superset? as Resolved.
Jun 18 2020, 9:57 PM · GLOW
Iflorez closed T249077: Measure number of surviving new GLOW Project Tiger 2 articles also submitted to other contests as Resolved.
Jun 18 2020, 9:57 PM · GLOW
Iflorez closed T245543: Identify GLOW articles created or edited with the translation tool as Resolved.
Jun 18 2020, 9:57 PM · GLOW
Iflorez closed T247568: Measure how many entries were picked from the list of suggestions as Resolved.
Jun 18 2020, 9:57 PM · GLOW
Iflorez closed T248140: Collect content quality metrics for articles submitted in GLOW Project Tiger 2.0 contest as Resolved.
Jun 18 2020, 9:57 PM · GLOW

Jun 2 2020

Iflorez added a comment to T249752: Decomission notebook hosts .

I deleted all files on nb3 and shutdown the server.
I rsynced all files from nb4 and shutdown the server.
Thank you!

Jun 2 2020, 3:02 AM · Analytics-Kanban, Analytics-Clusters, Patch-For-Review

May 21 2020

Iflorez added a comment to T234701: "Content" equivalent of pageviews daily or edits_hourly available to use in Turnilo and Superset.

These were interesting and helpful metrics to review for GLOW India articles:
Namespace (or just main/not main?)
Project
age
num of editors
num of edits
length/size
num of watchers
time since last edit
links

May 21 2020, 6:43 PM · Epic, Product-Analytics

May 14 2020

Iflorez added a comment to T249752: Decomission notebook hosts .

Hi @elukey I'll transfer files and shut down notebooks over the next few days. I'll check in on Tuesday with an update or questions if any.
Thank you!

May 14 2020, 5:19 PM · Analytics-Kanban, Analytics-Clusters, Patch-For-Review

Apr 21 2020

Iflorez added a comment to T247768: Code review: Review rec list results.

thank you @nettrom_WMF! sorry for the delay

Apr 21 2020, 12:39 AM · Product-Analytics (Kanban), GLOW

Apr 1 2020

Iflorez updated the task description for T247768: Code review: Review rec list results.
Apr 1 2020, 5:09 AM · Product-Analytics (Kanban), GLOW
Iflorez added a comment to T249077: Measure number of surviving new GLOW Project Tiger 2 articles also submitted to other contests.

Loading neighboring contest articles:
https://github.com/IreneFlorez/GLOW/blob/article_suggestions/scripts/data_wrangling/1d_load_neighboring_contests.ipynb

Apr 1 2020, 1:48 AM · GLOW
Iflorez created T249077: Measure number of surviving new GLOW Project Tiger 2 articles also submitted to other contests.
Apr 1 2020, 1:45 AM · GLOW

Mar 27 2020

Iflorez added a comment to T248140: Collect content quality metrics for articles submitted in GLOW Project Tiger 2.0 contest.

Data wrangling code to pull the items in this task can be found here: https://github.com/IreneFlorez/GLOW/tree/article_suggestions/scripts/data_wrangling

Mar 27 2020, 10:49 PM · GLOW
Iflorez updated the task description for T248140: Collect content quality metrics for articles submitted in GLOW Project Tiger 2.0 contest.
Mar 27 2020, 10:48 PM · GLOW

Mar 24 2020

Iflorez added a comment to T245543: Identify GLOW articles created or edited with the translation tool .

articles that were edited using a translation tool (by type):
expanded 113 (expanded total 1418)
new 3602. (new total 7445)


Expanded articles edited using a translation tool:
7.96%

Mar 24 2020, 5:05 PM · GLOW

Mar 20 2020

Iflorez created T248140: Collect content quality metrics for articles submitted in GLOW Project Tiger 2.0 contest.
Mar 20 2020, 12:44 AM · GLOW

Mar 18 2020

Iflorez updated the task description for T247768: Code review: Review rec list results.
Mar 18 2020, 4:19 PM · Product-Analytics (Kanban), GLOW
Iflorez added a comment to T247768: Code review: Review rec list results.

@mpopov maybe the faulty link was related to a bug? I'm receiving bug reports related to this ticket. Would it make sense to create a new ticket?

Mar 18 2020, 4:08 PM · Product-Analytics (Kanban), GLOW
Iflorez added a comment to T247768: Code review: Review rec list results.

Sorry about that, I just updated the ticket with a functional task link.

Mar 18 2020, 4:06 PM · Product-Analytics (Kanban), GLOW
Iflorez updated the task description for T247768: Code review: Review rec list results.
Mar 18 2020, 4:04 PM · Product-Analytics (Kanban), GLOW
Iflorez updated the task description for T247768: Code review: Review rec list results.
Mar 18 2020, 3:53 PM · Product-Analytics (Kanban), GLOW

Mar 16 2020

Iflorez added a comment to T245373: Optimization tips and feedback.

In an effort to run these queries from a Python3 notebook without needing to change the notebook type, I've switched these queries to run as spark queries using the wmf data package's spark.run function. I'm now able to run the queries. For example, here's the code for the translation query:

Mar 16 2020, 6:12 PM · Analytics-Radar, GLOW
Iflorez added a comment to T245373: Optimization tips and feedback.

Thank you. Yes, I can confirm that I had run kinit and entered my kerberos credentials in a notebook-terminal.

Mar 16 2020, 4:13 PM · Analytics-Radar, GLOW
Iflorez created T247768: Code review: Review rec list results.
Mar 16 2020, 3:59 PM · Product-Analytics (Kanban), GLOW
Iflorez added a comment to T245373: Optimization tips and feedback.

@JAllemandou I tried running these spark queries over the weekend on a small batch of articles and they timed out.
Might you have tips or insights? I didn't receive any error messages, simply the queries took a very long time and eventually I stopped the kernel.
Given that behavior, I also tried running the queries as hive queries and had similar issues.

Mar 16 2020, 3:53 PM · Analytics-Radar, GLOW
Iflorez added a project to T245373: Optimization tips and feedback: GLOW.
Mar 16 2020, 12:54 AM · Analytics-Radar, GLOW
Iflorez updated the task description for T245543: Identify GLOW articles created or edited with the translation tool .
Mar 16 2020, 12:45 AM · GLOW

Mar 12 2020

Iflorez updated the task description for T247568: Measure how many entries were picked from the list of suggestions.
Mar 12 2020, 9:46 PM · GLOW
Iflorez updated the task description for T247568: Measure how many entries were picked from the list of suggestions.
Mar 12 2020, 9:45 PM · GLOW
Iflorez updated the task description for T247568: Measure how many entries were picked from the list of suggestions.
Mar 12 2020, 9:45 PM · GLOW
Iflorez added a comment to T247568: Measure how many entries were picked from the list of suggestions.

Total values in full rec list: 34295


total recs in translation: 14155
total recs in editing: 20102


Mar 12 2020, 9:44 PM · GLOW
Iflorez created T247568: Measure how many entries were picked from the list of suggestions.
Mar 12 2020, 9:35 PM · GLOW