
user_properties table bloat
Open, MediumPublic

Description

On enwiki, the user_properties table has about 60M rows for only 20M users. This is remarkable given that user_properties is meant to store only non-default options, precisely to save DB space. The index length is about 2.2 GB, and the data size is about 3.7 GB.

By sampling, the number of user_properties rows per user can be estimated. The problem is very dependent on user_id, and is mostly confined to user_id values less than 10M, i.e. users created before mid-2009.
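For reference, such sampling could be done with something like the following minimal sketch (using plain PDO for brevity rather than MediaWiki's DB layer; only the user_properties table and up_user column come from core, everything else is illustrative):

<?php
// Sketch only: estimate user_properties rows per user in 1M-wide
// user_id bands by probing 1000 evenly spaced IDs per band.
$db = new PDO( 'mysql:host=db-replica;dbname=enwiki', 'reader', 'secret' );
$stmt = $db->prepare( 'SELECT COUNT(*) FROM user_properties WHERE up_user = ?' );

for ( $base = 0; $base < 15000000; $base += 1000000 ) {
    $total = 0;
    for ( $i = 0; $i < 1000; $i++ ) {
        $stmt->execute( [ $base + $i * 1000 ] );
        $total += (int)$stmt->fetchColumn();
    }
    // Nonexistent user_ids contribute 0 rows here; the table below
    // was produced by sampling existing users instead.
    printf( "%8d  %.4f\n", $base, $total / 1000 );
}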

user_id     props/user
0           8.2615
1000000     5.8696
2000000     4.9534
3000000     4.8038
4000000     4.6013
5000000     4.3775
6000000     4.4137
7000000     5.3833
8000000     5.8919
9000000     6.4356
10000000    0.8789
11000000    1.1052
12000000    1.0005
13000000    0.9774
14000000    1.0987

Sampling 1000 users with user_id<10M, we find that the main culprits are:

searchNs-1: 968 users
skin: 964 users
thumbsize: 912 users

75% of the skin rows have an empty string as their value, which causes Skin::newFromKey() to return the default skin, the same as if the row were missing. The rest are mostly "monobook", presumably set manually via the UsabilityInitiative OptIn extension.

"searchNs-1" is a bug, it relates to searching the special namespace, which is not possible. It is "0" in all sampled rows.

"thumbsize" is "3" in all sampled rows, which is not the default, the default is "4" on all WMF wikis other than svwiki. In addition to bloat of the user_properties table, this causes fragmentation of the parser cache. There's no way 91% of users prior to 2009 manually set this value, it must have been set by a bug.

We should remove unnecessary or incorrectly inserted rows, and ensure that this does not happen again (e.g. as a consequence of the resolution of T38316).

Details

Reference
bz52777


Event Timeline


I wonder if dropping all preferences from closed wikis would be some low hanging fruit?

> I wonder if dropping all preferences from closed wikis would be some low hanging fruit?

I'm not sure it'd help with the original issue ("large prod tables slowing things down"), as the preference tables are per-wiki. It'd clean up the "accounts have preferences that we should ignore when analysing what to kill" issue, but that feels like a subset of the overall issue (and people often only look at a few big wikis for those numbers anyway)…

It makes the dataset of "things that need cleanup" smaller, if we just delete everything from those wikis carte blanche.

(Relatedly, I ran some interesting queries re: subtask of T171643, will be following up there later)

enwiki has about 200 million user_properties rows today (for about 40 million users); a newly registered user has 12 rows:

+------------------------------------------+---------------------------------------------------+
| up_property                              | up_value                                          |
+------------------------------------------+---------------------------------------------------+
| VectorSkinVersion                        | 1                                                 |
| echo-subscriptions-email-article-linked  | 1                                                 |
| echo-subscriptions-email-dt-subscription | 1                                                 |
| echo-subscriptions-email-edit-thank      | 1                                                 |
| echo-subscriptions-email-mention         | 1                                                 |
| echo-subscriptions-email-page-review     | 1                                                 |
| echo-subscriptions-web-article-linked    | 1                                                 |
| echo-subscriptions-web-reverted          | 0                                                 |
| popups                                   | 1                                                 |
| rcenhancedfilters-seen-tour              | 1                                                 |
| welcomesurvey-responses                  | {"_group":"NONE","_render_date":"20220126054124"} |
| wlenhancedfilters-seen-tour              | 1                                                 |
+------------------------------------------+---------------------------------------------------+

There are about a quarter million user registrations globally per month. Most of those mean four local user accounts (loginwiki, metawiki, mediawikiwiki + wherever they signed up, which is enwiki about half the time), so about 7 million new user_properties rows on enwiki per year (a 3% growth rate), 15 million on wikis where accounts get autocreated, and 60 million in total.

I'm gathering requirements for this. Can someone involved in GrowthExperiments, e.g. @kostajh, comment on whether it is a hard requirement to vary new user preferences by autocreation status? The user table doesn't have autocreation status, so it's not easy to determine after creation is complete. If it is a requirement, an approximation might be to check whether the local wiki is the CentralAuth home wiki.

If it's just a workaround for T276720 then that can be fixed in another way. But maybe what GrowthExperiments actually needs is a way to store tour completion globally. So if a user goes from wiki to wiki, they will keep getting prompted for a tour, until they do it on any wiki, then it disappears globally.

> I'm gathering requirements for this. Can someone involved in GrowthExperiments, e.g. @kostajh, comment on whether it is a hard requirement to vary new user preferences by autocreation status? […]

So far in GrowthExperiments we have set non-default preferences for some percentage of newly created users, and we have ignored autocreated users.

That has recently changed because we are now giving Growth features to 100% of newly created accounts (still excluding autocreated) on almost all Wikipedias (T301820), with a plan to eventually reach 100% on the rest of the Wikipedias.

We are discussing how to enable Growth features for some subset of existing user accounts in T296702: Scale: enable Growth features for existing accounts which would likely involve setting a non-default option for preferences.

We haven't determined a new strategy for autocreated accounts; that is being discussed in T292090: Research Spike: Enable Growth features for autocreated accounts (if the user already has Growth features).

> If it's just a workaround for T276720 then that can be fixed in another way. But maybe what GrowthExperiments actually needs is a way to store tour completion globally. […]

We mostly haven't worried about tour completion as a user goes from wiki to wiki, because so far, if you have Growth features enabled on e.g. enwiki, you will not have them automatically enabled when you go to eswiki. But it is something we will need a solution to and will discuss in T292090: Research Spike: Enable Growth features for autocreated accounts (if the user already has Growth features).

I'm thinking about this in terms of the cost of deployment of IP masking. It's proposed to have a user and globaluser row for "temporary" accounts, and there will be a lot of temporary accounts. But probably most extensions will want to treat temporary accounts the same as anonymous users, leaving them with default preferences. Delivering welcome banners and the like is probably best done after the user explicitly creates an account.

So I'm not sure we really need this, but I'd still better do a brain dump in case it's needed now or in the future.

Currently several extensions are setting user options from LocalUserCreated. Here's what they are trying to achieve:

  • The default preference value for new users may be a value different from the default for existing users.
  • The default for new users may change when a new version of the extension is deployed.
  • The default for new users may change when a configuration variable changes.
  • New users may be assigned to a random A/B test bucket and then receive different preferences depending on their bucket.
  • Possibly the default should be set based on global newness rather than local account autocreation.
  • The default for new users may later become the default for everyone, or vice versa.

My idea for efficiently achieving those requirements is to have extensions statically declare new user preferences. Have a new table which holds these declarations, say user_property_default:

CREATE TABLE user_property_default (
    upd_id INT UNSIGNED AUTO_INCREMENT NOT NULL,
    upd_property VARBINARY(255) NOT NULL,  -- preference name
    upd_user_type INT UNSIGNED NOT NULL,   -- e.g. normal / autocreated / temporary
    upd_min_user INT UNSIGNED NOT NULL,    -- first user_id this default applies to
    upd_min_bucket INT UNSIGNED NOT NULL,  -- first A/B bucket this default applies to
    upd_value BLOB,
    PRIMARY KEY (upd_id),
    UNIQUE KEY (upd_property, upd_user_type, upd_min_user, upd_min_bucket)
);

To figure out the default preference value for a given user, you search for the matching user_property_default row with the highest upd_min_user (and upd_min_bucket) not exceeding the given user's ID and bucket:

SELECT upd_value FROM user_property_default 
WHERE upd_property='$prefname' 
   AND upd_user_type='$my_type'
   AND upd_min_user <= '$my_id'
   AND upd_min_bucket <= '$my_bucket'
ORDER BY upd_min_user DESC, upd_min_bucket DESC
LIMIT 1;

When a new user is created, the configured declaration is compared against the current (highest upd_min_user) value in the database for each preference. If the declaration has changed, a new row is inserted into the database with upd_min_user being the user_id of the user being created.
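In code, that check might look roughly like the following sketch (PDO is used for brevity rather than MediaWiki's DB layer; the function name, the $declared shape and everything except the user_property_default table are hypothetical):

// Sketch: on user creation, materialize any changed declared defaults.
// $declared comes from extension registration, e.g.
// [ 'popups' => [ 'type' => 0, 'bucket' => 0, 'value' => '1' ] ].
function syncDeclaredDefaults( PDO $db, array $declared, int $newUserId ): void {
    foreach ( $declared as $property => $decl ) {
        // Find the currently effective declaration (highest upd_min_user).
        $stmt = $db->prepare(
            'SELECT upd_value FROM user_property_default
             WHERE upd_property = ? AND upd_user_type = ? AND upd_min_bucket = ?
             ORDER BY upd_min_user DESC LIMIT 1'
        );
        $stmt->execute( [ $property, $decl['type'], $decl['bucket'] ] );
        $current = $stmt->fetchColumn();

        if ( $current === false || $current !== $decl['value'] ) {
            // Declaration changed: the new default applies from this user onward.
            $ins = $db->prepare(
                'INSERT INTO user_property_default
                 (upd_property, upd_user_type, upd_min_user, upd_min_bucket, upd_value)
                 VALUES (?, ?, ?, ?, ?)'
            );
            $ins->execute( [ $property, $decl['type'], $newUserId,
                $decl['bucket'], $decl['value'] ] );
        }
    }
}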

The user bucket would just be "hash(user_id) mod 1000" or something similar. Most user_property_default rows would have upd_min_bucket=0 and so would catch all buckets. If you insert a row with upd_min_bucket=990 then it will only take effect for 1% of users. There would always be a fallback with upd_min_bucket=0 so that the search doesn't continue back to previous upd_min_user values.
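A bucket function matching that description could be as simple as this hypothetical sketch (the hash choice is arbitrary, as long as it is stable):

// Map a user ID to one of 1000 stable pseudo-random buckets.
function userBucket( int $userId ): int {
    // md5 keeps buckets uncorrelated with the raw ID; any stable hash works.
    return intval( substr( md5( (string)$userId ), 0, 6 ), 16 ) % 1000;
}

// A row with upd_min_bucket = 990 then matches buckets 990-999, i.e. 1%
// of users; the mandatory upd_min_bucket = 0 fallback row catches the rest.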

upd_user_type would be a small integer to allow the default to depend on autocreate flag and "temporary" status.

Sprinkle in some caching and stampede protection and I think it would mostly work. If the declared default changed depending on the request parameters, it would cause a bit of a mess, and preventing that would come down to code review.

I like the idea. While we are here, I suggest normalizing up_property so it could be shared with upd_property. upd_property wouldn't get repeated much, but normalization would make renaming properties in user_properties possible and easy.
Something like:

CREATE TABLE user_property_type (
    upt_id INT UNSIGNED AUTO_INCREMENT NOT NULL,
    upt_property VARBINARY(255) NOT NULL,
    PRIMARY KEY (upt_id),
    UNIQUE KEY (upt_property)
);

> I like the idea. While we are here, I suggest normalizing up_property so it could be shared with upd_property. […]

So this would be something like the NameTableStore system? Makes sense to me, and it can be done separately, before the other refactoring. Perhaps create a separate ticket for that?
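For reference, a rough sketch of how core's NameTableStore could be wired up for such a table (the wiring below is an assumption for illustration, not existing code, and the constructor arguments are abbreviated):

use MediaWiki\Storage\NameTableStore;

// Hypothetical wiring for the proposed user_property_type table.
$store = new NameTableStore(
    $loadBalancer,   // Wikimedia\Rdbms\ILoadBalancer
    $wanCache,       // WANObjectCache
    $logger,         // Psr\Log\LoggerInterface
    'user_property_type', 'upt_id', 'upt_property'
);

$id = $store->acquireId( 'skin' ); // inserts the row if it doesn't exist yet
$name = $store->getName( $id );    // back to 'skin'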

Yes, but I don't have time to do it (currently doing the actor migration and templatelinks normalization). Batching the schema changes would make things easier, but I don't know if @tstarling would want the extra work.

I talked to @Tgr about this today and I actually came up with a counter-proposal to the user_property_default table which I think is a bit more flexible.

We have done a similar thing with per-page A/B testing. You feed the page ID to a hash function (to make sure the bucketing is random) and then feed that to a bucketing system. You can even change the bucketing later without much trouble: say "I want the first bucket" and later "the first and second buckets".

In the page-ID-based A/B testing system that was built, the core is basically this:

	public function getPageRandom( int $pageId ): float {
		$random = intval( substr( md5( (string)$pageId ), 0, 6 ), 16 ) / 16777216;
		return round( $random, 3 );
	}

We can use something similar, but instead of feeding in the pageId, you feed (string)$userId . 'name-of-experiment' to the md5 function. That can easily be computed on the fly (we don't need cryptographically secure randomness here), takes up no storage, and is rather flexible, because you can just write code for whatever condition you need, for example enabling an A/B test only for users who registered after a certain time.
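Concretely, that would be something like the following (adapting the snippet above; getUserRandom is a hypothetical name):

	public function getUserRandom( int $userId, string $experiment ): float {
		// Same construction as getPageRandom(), but salted with the
		// experiment name so different experiments bucket independently.
		$random = intval( substr( md5( $userId . $experiment ), 0, 6 ), 16 ) / 16777216;
		return round( $random, 3 );
	}

	// e.g. put half of all users in the treatment group:
	// $inTreatment = $this->getUserRandom( $userId, 'my-experiment-2022' ) < 0.5;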

There are several use cases which currently contribute to user_properties row proliferation, and for most of them user_property_default would work well (e.g. changing the default for new users without affecting old users).

The problem with A/B tests is that you can have several of them running at the same time, and often it's important that the buckets are independent of each other, or correlated in some specific way. Simple modulo-based bucketing will make them correlated in a way that's probably unhelpful. Passing the user ID + pseudo-property-name through md5 or some other pseudo-random function makes the bucketing properties independent, which is better but still not always the right thing: when testing multiple elements of the same feature in parallel, you can easily have rules like "experiment 1 has buckets A, B, C, experiment 2 has buckets X, Y, Z, users should be uniformly distributed except users in bucket A should always be in bucket X". I guess you can work around those in the code, but it will make it less intuitive.

The other problem with pseudorandom bucketing is that you can't batch select users in some given bucket, which is sometimes needed (e.g. right now we need to export users from a certain experimental bucket into a mass mailer for an experiment with email prompts). I guess you can calculate md5 in SQL, but it will be inefficient for a large wiki.

> My idea for efficiently achieving those requirements is to have extensions statically declare new user preferences. Have a new table which holds these declarations, say user_property_default

I wonder if, instead of the user ID, the registration timestamp could be used. That way, the segment definitions could usually be identical between wikis, so the whole thing could be handled in configuration instead of a database table. That would make it easier to inspect what groups a feature is currently enabled for, and changing the default would be a simple config change, instead of running a maintenance script on every wiki.
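For example, something like this (a purely hypothetical configuration shape, not an existing MediaWiki setting):

// Hypothetical: conditional defaults keyed on registration timestamp.
$wgConditionalUserDefaults = [
	'popups' => [
		// [ value, applies to accounts registered at/after this timestamp ]
		[ '0', null ],              // baseline for all older accounts
		[ '1', '20220101000000' ],  // new default for accounts since 2022
	],
];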

In T54777#7856746, @Tgr wrote:

> There are several use cases which currently contribute to user_properties row proliferation, and for most of them user_property_default would work well (e.g. changing the default for new users without affecting old users).

> The problem with A/B tests is that you can have several of them running at the same time, and often it's important that the buckets are independent of each other, or correlated in some specific way. […]

You can give them a shared hash input, e.g. user_id . name of experiment A . name of experiment B (or simply just experiment A's name), and derive both bucket assignments from it.

> The other problem with pseudorandom bucketing is that you can't batch select users in some given bucket, which is sometimes needed. […]

For this one, the simplest solution would be to iterate over user ID ranges in the code, calculate the md5 value there, and pick the user IDs you want to send the mass email to. The whole point of this idea is to decouple storage from random bucketing.

I do understand you might end up with some edge cases where it would be hard or impossible to use md5 bucketing, but this is a trade-off between storage and computation and I honestly think it's worth it.

> The other problem with pseudorandom bucketing is that you can't batch select users in some given bucket, which is sometimes needed. […]

> For this one, the simplest solution would be to iterate over user ID ranges in the code, calculate the md5 value there, and pick the user IDs you want to send the mass email to. […]

To iterate on that: you can take the highest and lowest user IDs you want to reach, split that range into batches of 1000 (or 10k) and, without querying the DB at all, calculate the bucketing one by one ($userId++), pick the user IDs that need the email, and then run a query with user_id IN (list) for each batch.
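A sketch of that batching (a hypothetical helper, reusing the md5 construction above):

// Walk a user_id range and return the IDs falling into [$low, $high)
// of the experiment's pseudo-random space, without touching the DB.
function bucketedIdsInRange( int $minId, int $maxId, string $experiment,
	float $low, float $high ): array
{
	$ids = [];
	for ( $userId = $minId; $userId <= $maxId; $userId++ ) {
		$r = intval( substr( md5( $userId . $experiment ), 0, 6 ), 16 ) / 16777216;
		if ( $r >= $low && $r < $high ) {
			$ids[] = $userId;
		}
	}
	return $ids; // then query: ... WHERE user_id IN ( these ), batch by batch
}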

> But maybe what GrowthExperiments actually needs is a way to store tour completion globally. So if a user goes from wiki to wiki, they will keep getting prompted for a tour, until they do it on any wiki, then it disappears globally.

We currently have a bunch of tours where the "tour not seen yet" flag (non-default preference value) is set on LocalUserCreated, then deleted when the user sees the tour. We should flip that, so that tour flags are not stored for temporary accounts.
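The flipped version could look roughly like this (a sketch; UserOptionsManager is the real core service, but the option name and the surrounding wiring are illustrative):

use MediaWiki\MediaWikiServices;

// Nothing is written at account creation; the default for the
// hypothetical 'tour-foo-completed' option is 0 ("show the tour").
// Only once the user actually completes the tour is a row stored:
$userOptionsManager = MediaWikiServices::getInstance()->getUserOptionsManager();
$userOptionsManager->setOption( $user, 'tour-foo-completed', 1 );
$userOptionsManager->saveOptions( $user );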

We wanted to look into using GlobalPreferences for tour flags, but it's not a priority. Currently tours just aren't shown on wikis where the user is autocreated (which also means no user properties spam on auto-registration wikis).

@tstarling do you have a vague idea of when the core changes you indicated in T54777#7724456 would happen? Updating extensions which deal with lots of user preferences (like Echo or GrowthExperiments) would be a nontrivial amount of followup work, so I'm trying to figure out when to reserve some time for that.

Copied Tim's idea from T54777#7724456 to T321527: Support conditional defaults for user properties so it's easier to reference. Besides reducing user_property bloat it would also make progressive rollouts of features simpler.

Echo notification user properties have passed 100M rows now on enwiki alone. It's because Hooks::getNewUserPreferenceOverrides() in the Echo extension adds five rows for each new user. Please stop this.

I dropped 4M rows on enwiki belonging to the VectorSkinVersion up_property; the code was removed but not the rows. I haven't looked at other wikis though.

Mentioned in SAL (#wikimedia-operations) [2024-12-12T14:30:08Z] <Amir1> ladsgroup@mwmaint2002:~$ foreachwikiindblist all userOptions.php --delete VectorSkinVersion (T54777)