Page MenuHomePhabricator

Deprecate neilpquinn.editor_month
Closed, ResolvedPublic

Description

As we have completed Editing ETL automation T295332 and have the new wmf_product.editor_month table, we should manually stop inserting data into neilpquinn.editor_month going forward.

  • We are aware that this table was widely popular but have no way of knowing what the dependencies are. We will reach out to DE to know what reports, dashboards and users use(d) this table
  • inform users of the deprecation

Event Timeline

kzimmerman triaged this task as Medium priority.Apr 19 2022, 5:14 PM

I've been trying to find references to this table in the database of Superset. So far I have only found the following datasets that refer to it into their SQL statements.

MariaDB [superset_production]> select table_name,id from tables where `sql` like '%neilpquinn.editor_month%' \G;
*************************** 1. row ***************************
table_name: Neil Quinn-Untitled Query-gbTqyjkck
        id: 37
*************************** 2. row ***************************
table_name: Neil Quinn-Untitled Query-WEfP3MjXV
        id: 38
*************************** 3. row ***************************
table_name: Neil Quinn-Untitled Query-JzDWDsWjM
        id: 39
*************************** 4. row ***************************
table_name: Neil Quinn-Untitled Query-tXA9caYzh
        id: 40
*************************** 5. row ***************************
table_name: Connie Chen-active editors by wiki-2ZRXuhN66
        id: 100
*************************** 6. row ***************************
table_name: Connie Chen-active editors by wiki-ZMqLg4ARd
        id: 101
*************************** 7. row ***************************
table_name: Connie Chen-active editors -7NTvxOOzU
        id: 102
7 rows in set (0.008 sec)

I can carry on looking, but I hope that helps for now.

Mayakp.wiki added a subscriber: Aroraakhil.

Thanks @BTullis ! for the stat hosts I tried
grep -R --no-messages -l "neilpquinn" /home
but that gave me 1 result after running for a few hours. Can you let me know how to optimize this search?

PS: The one user I got is @Aroraakhil, and Im not completely sure if they are WMF staff or if that is an active account?

@Mayakp.wiki I was looking at our Wikimedia Descriptive Stats and noticed that we reference a query here - T261015#6428301 - that uses neilpquinn.editor_month . I've updated the query in T261015#7915950 and the link in the Wikimedia Descriptive Stats.

The last snapshot on neilpquinn.editor_month table is 2022-03-01 . Since then, no user or analyst has reached out to us with questions/concerns regarding the data in this table.

Per @BTullis advice, I searched for any active (or inactive) queries and notebooks on all stat machines that use neilpquinn.editor_month using the query string given below and didnt find anything (i.e. this table may not infact be used as much as I previously thought).
grep -R --no-messages -l "neilpquinn.editor_month" /home/-mindepth 1 2>/dev/null|head|awk -F / '{print $3}'|uniq
find -R --no-messages -l "neilpquinn.editor_month" /home/-mindepth 1 2>/dev/null|head|awk -F / '{print $3}'|uniq

As such, we are ready to close this task and continue inserting data in the new table wmf_product.editor_month using the product-analytics job.