Drop CitationUsage data on MySQL
Closed, Resolved | Public | 3 Estimated Story Points

Description

CitationUsage data is being used from Hadoop, so I think we can delete this dataset from MySQL entirely. Asking @Miriam and @leila to confirm.

Event Timeline

Space that can potentially be recovered:

elukey@db1107:~$ du -hsc /srv/sqldata/_log_CitationUsage_*
32M	/srv/sqldata/_log_CitationUsage_18051472_key_ix_CitationUsage_18051472_dt_1c810f402_3_1d_P_0.tokudb
32M	/srv/sqldata/_log_CitationUsage_18051472_key_ix_CitationUsage_18051472_timestamp_1c810f687_3_1d_P_0.tokudb
32M	/srv/sqldata/_log_CitationUsage_18051472_key_ix_CitationUsage_18051472_uuid_1c810f3fe_3_1d_B_0.tokudb
32M	/srv/sqldata/_log_CitationUsage_18051472_main_1c810f3f9_2_1d.tokudb
64K	/srv/sqldata/_log_CitationUsage_18051472_status_1c810f3f9_1_1d.tokudb
20M	/srv/sqldata/_log_CitationUsage_18219861_key_ix_CitationUsage_18219861_dt_1d340a154_3_1d_P_0.tokudb
20M	/srv/sqldata/_log_CitationUsage_18219861_key_ix_CitationUsage_18219861_timestamp_1d340a14d_3_1d_P_0.tokudb
32M	/srv/sqldata/_log_CitationUsage_18219861_key_ix_CitationUsage_18219861_uuid_1d340a149_3_1d_B_0.tokudb
32M	/srv/sqldata/_log_CitationUsage_18219861_main_1d340a146_2_1d.tokudb
64K	/srv/sqldata/_log_CitationUsage_18219861_status_1d340a146_1_1d.tokudb
32M	/srv/sqldata/_log_CitationUsage_18359729_key_ix_CitationUsage_18359729_dt_1e2d5abc3_3_1d_P_0.tokudb
32M	/srv/sqldata/_log_CitationUsage_18359729_key_ix_CitationUsage_18359729_timestamp_1e2d5abca_3_1d_P_0.tokudb
32M	/srv/sqldata/_log_CitationUsage_18359729_key_ix_CitationUsage_18359729_uuid_1e2d5abbf_3_1d_B_0.tokudb
16G	/srv/sqldata/_log_CitationUsage_18359729_main_1e2d5abbc_2_1d.tokudb
64K	/srv/sqldata/_log_CitationUsage_18359729_status_1e2d5abbc_1_1d.tokudb
40K	/srv/sqldata/_log_CitationUsage_18502709_key_ix_CitationUsage_18502709_dt_1f0f6512b_3_1d_P_0.tokudb
40K	/srv/sqldata/_log_CitationUsage_18502709_key_ix_CitationUsage_18502709_timestamp_1f0f64faa_3_1d_P_0.tokudb
40K	/srv/sqldata/_log_CitationUsage_18502709_key_ix_CitationUsage_18502709_uuid_1f0f65125_3_1d_B_0.tokudb
32K	/srv/sqldata/_log_CitationUsage_18502709_main_1f0f64fa7_2_1d.tokudb
64K	/srv/sqldata/_log_CitationUsage_18502709_status_1f0f64fa7_1_1d.tokudb
32M	/srv/sqldata/_log_CitationUsage_18810892_key_ix_CitationUsage_18810892_dt_215d8107b_3_1d_P_0.tokudb
32M	/srv/sqldata/_log_CitationUsage_18810892_key_ix_CitationUsage_18810892_timestamp_215d81254_3_1d_P_0.tokudb
32M	/srv/sqldata/_log_CitationUsage_18810892_key_ix_CitationUsage_18810892_uuid_215d81075_3_1d_B_0.tokudb
18G	/srv/sqldata/_log_CitationUsage_18810892_main_215d81072_2_1d.tokudb
64K	/srv/sqldata/_log_CitationUsage_18810892_status_215d81072_1_1d.tokudb
34G	total

elukey@db1108:~$ du -hsc /srv/sqldata/_log_CitationUsage_18*
32M	/srv/sqldata/_log_CitationUsage_18051472_key_ix_CitationUsage_18051472_dt_167a6d7c_4_1d.tokudb
32M	/srv/sqldata/_log_CitationUsage_18051472_key_ix_CitationUsage_18051472_timestamp_167a6d7c_5_1d.tokudb
32M	/srv/sqldata/_log_CitationUsage_18051472_key_ix_CitationUsage_18051472_uuid_167a6d7c_3_1d.tokudb
5.2G	/srv/sqldata/_log_CitationUsage_18051472_main_167a6d7c_2_1d.tokudb
64K	/srv/sqldata/_log_CitationUsage_18051472_status_167a6d7c_1_1d.tokudb
16M	/srv/sqldata/_log_CitationUsage_18219861_key_ix_CitationUsage_18219861_dt_19e4cccb_5_1d.tokudb
32M	/srv/sqldata/_log_CitationUsage_18219861_key_ix_CitationUsage_18219861_timestamp_19e4cccb_4_1d.tokudb
32M	/srv/sqldata/_log_CitationUsage_18219861_key_ix_CitationUsage_18219861_uuid_19e4cccb_3_1d.tokudb
32M	/srv/sqldata/_log_CitationUsage_18219861_main_19e4cccb_2_1d.tokudb
64K	/srv/sqldata/_log_CitationUsage_18219861_status_19e4cccb_1_1d.tokudb
32M	/srv/sqldata/_log_CitationUsage_18359729_key_ix_CitationUsage_18359729_dt_1eb5b5f4_4_1d.tokudb
32M	/srv/sqldata/_log_CitationUsage_18359729_key_ix_CitationUsage_18359729_timestamp_1eb5b5f4_5_1d.tokudb
32M	/srv/sqldata/_log_CitationUsage_18359729_key_ix_CitationUsage_18359729_uuid_1eb5b5f4_3_1d.tokudb
19G	/srv/sqldata/_log_CitationUsage_18359729_main_1eb5b5f4_2_1d.tokudb
64K	/srv/sqldata/_log_CitationUsage_18359729_status_1eb5b5f4_1_1d.tokudb
32K	/srv/sqldata/_log_CitationUsage_18502709_key_ix_CitationUsage_18502709_dt_23207cba_5_1d.tokudb
32K	/srv/sqldata/_log_CitationUsage_18502709_key_ix_CitationUsage_18502709_timestamp_23207cba_4_1d.tokudb
32K	/srv/sqldata/_log_CitationUsage_18502709_key_ix_CitationUsage_18502709_uuid_23207cba_3_1d.tokudb
32K	/srv/sqldata/_log_CitationUsage_18502709_main_23207cba_2_1d.tokudb
64K	/srv/sqldata/_log_CitationUsage_18502709_status_23207cba_1_1d.tokudb
32M	/srv/sqldata/_log_CitationUsage_18810892_key_ix_CitationUsage_18810892_dt_2eaf26df_4_1d.tokudb
32M	/srv/sqldata/_log_CitationUsage_18810892_key_ix_CitationUsage_18810892_timestamp_2eaf26df_5_1d.tokudb
32M	/srv/sqldata/_log_CitationUsage_18810892_key_ix_CitationUsage_18810892_uuid_2eaf26df_3_1d.tokudb
18G	/srv/sqldata/_log_CitationUsage_18810892_main_2eaf26df_2_1d.tokudb
64K	/srv/sqldata/_log_CitationUsage_18810892_status_2eaf26df_1_1d.tokudb
42G	total

Hi @Nuria and @elukey, please feel free to drop this dataset from MySQL. @tizianopiccardi, the main user of this data, also confirmed.

db1107

MariaDB [(none)]> show tables from log like 'CitationUsage%';
+--------------------------------+
| Tables_in_log (CitationUsage%) |
+--------------------------------+
| CitationUsage_18051472         |
| CitationUsage_18219861         |
| CitationUsage_18359729         |
| CitationUsage_18502709         |
| CitationUsage_18810892         |
+--------------------------------+
5 rows in set (0.00 sec)

db1108

MariaDB [(none)]> show tables from log like 'CitationUsage%';
+--------------------------------+
| Tables_in_log (CitationUsage%) |
+--------------------------------+
| CitationUsage_18051472         |
| CitationUsage_18219861         |
| CitationUsage_18359729         |
| CitationUsage_18502709         |
| CitationUsage_18810892         |
+--------------------------------+
5 rows in set (0.00 sec)

@Nuria review requested before dropping :)
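For review purposes, the drop could be sketched as a small script that only prints the statements for the five tables listed above, so they can be checked before anything runs. This is an illustrative sketch, not the commands actually executed on db1107/db1108: the helper name `gen_drop_statements` is hypothetical, and the use of `IF EXISTS` is an added assumption.

```shell
#!/bin/sh
# Sketch only: prints DROP statements for review; it does not connect to any database.
# Table names and the `log` database name are copied from the `show tables` output above.
# IF EXISTS is an assumption added for safety, not something stated in the task.

citation_usage_tables="CitationUsage_18051472
CitationUsage_18219861
CitationUsage_18359729
CitationUsage_18502709
CitationUsage_18810892"

# Hypothetical helper: emit one DROP TABLE statement per table name read from stdin.
gen_drop_statements() {
    while IFS= read -r table; do
        # Skip empty lines so a trailing newline does not produce a bogus statement.
        [ -n "$table" ] || continue
        printf 'DROP TABLE IF EXISTS `log`.`%s`;\n' "$table"
    done
}

printf '%s\n' "$citation_usage_tables" | gen_drop_statements
```

Printing the statements rather than piping them straight into a client keeps the review step explicit; the output can be pasted into the task or run by hand once approved.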

fdans moved this task from Incoming to Operational Excellence on the Analytics board.

Mentioned in SAL (#wikimedia-operations) [2019-10-08T05:35:20Z] <elukey> drop CitationUsage tables from the log database on db1107/db1108 (the ones listed in the task) - T233893

Done! Checked with du -hsc /srv/sqldata/_log_CitationUsage_* on both hosts: no data is stored there anymore.
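The same check can be expressed as a count of leftover files, which is easier to script than reading `du` output. A minimal sketch, assuming the file naming shown in the `du` listings above; the function name `count_leftover_files` is hypothetical.

```shell
#!/bin/sh
# Sketch only: counts files in a data directory whose names start with a given prefix.
# The directory and prefix in the example mirror the paths in the du output above.

# Hypothetical helper: print the number of regular files in $1 whose name starts with $2.
count_leftover_files() {
    dir=$1
    prefix=$2
    # find prints one line per matching file; wc -l counts them; tr strips padding.
    find "$dir" -maxdepth 1 -type f -name "${prefix}*" | wc -l | tr -d ' '
}

# Example, to be run on each database host:
#   count_leftover_files /srv/sqldata _log_CitationUsage_
```

A result of 0 on both hosts corresponds to the "no data is stored" outcome reported above.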

elukey lowered the priority of this task from High to Medium.
elukey set the point value for this task to 3.
elukey moved this task from Next Up to Done on the Analytics-Kanban board.