
RESTBase k-r-v storage overcommit
Closed, Invalid (Public)

Description

The revised k-r-v storage algorithm only evaluates renders for deletion when a new render is stored, and only those that were superseded by a newer render at least one TTL ago are candidates for deletion. Likewise, revisions are only evaluated for deletion when a new revision is stored, and only if the corresponding renders were superseded one TTL or more in the past. This means that, even in the best case, there will always be at least four renders stored (two revisions with two renders each), and in a significant number of cases many more.
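
To make the rule concrete, here is a minimal sketch in Python of the eligibility check described above; the names and data layout are illustrative assumptions, not the RESTBase implementation, and the check only ever runs as a side effect of a new write:

```python
from datetime import datetime, timedelta

TTL = timedelta(seconds=86400)  # 24 hours, matching the example below


def deletion_candidates(renders, now):
    """renders: list of (render_id, written_at) tuples, ordered oldest to newest.

    A render is a candidate only if the render that superseded it was written
    at least one TTL ago. The newest render is never a candidate.
    """
    candidates = []
    for i, (render_id, _written_at) in enumerate(renders[:-1]):
        superseded_at = renders[i + 1][1]  # when the next render replaced this one
        if now - superseded_at >= TTL:
            candidates.append(render_id)
    return candidates
```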

For example: Let's assume a TTL of 24 hours (86400 seconds). Imagine render 0 of revision A is stored for a new title.

| revision | render | timestamp |
| --- | --- | --- |
| A | 0 | 2018-07-01T00:00:00 |

Subsequently, render 1 of revision A is stored. Render 1 supersedes render 0, making render 0 a candidate for deletion TTL seconds after render 1 is written, though the deletion will only be evaluated when another render is stored.

| revision | render | timestamp |
| --- | --- | --- |
| A | 0 | 2018-07-01T00:00:00 |
| A | 1 | 2018-07-03T00:00:00 |

Finally, render 2 is stored, and if TTL seconds or more have elapsed since render 1 was written, render 0 can be deleted.

| revision | render | timestamp |
| --- | --- | --- |
| A | 0 | 2018-07-01T00:00:00 |
| A | 1 | 2018-07-03T00:00:00 |
| A | 2 | 2018-07-05T00:00:00 |
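
Plugging the example timestamps into the sketch above (reusing deletion_candidates and TTL from it) shows why only render 0 becomes deletable at this point:

```python
renders = [
    ("A/0", datetime(2018, 7, 1)),
    ("A/1", datetime(2018, 7, 3)),
    ("A/2", datetime(2018, 7, 5)),
]

# Evaluated when render 2 is written: render 0 was superseded by render 1 on
# 2018-07-03, more than one TTL (24h) before 2018-07-05, so it is a candidate;
# render 1 was only just superseded, so it is retained.
print(deletion_candidates(renders, now=datetime(2018, 7, 5)))  # ['A/0']
```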

Again, this is the best-case scenario. Remember, the same is true of revisions, and when combined with sub-TTL edits and/or re-renders, the number of records persisted at any one time can be significant. Of course, over-stored records of this nature continue to be candidates for deletion, but only upon future writes (imagine a scenario where a flurry of edits to a title occurs within the span of one TTL, followed by a period of relative quiet lasting weeks or months).

This is distinctly different from the "leakage" we experienced in T192689: Unchecked storage growth when Cassandra TTLs on the indices were set too low; this overstorage is a property of the system (even if an undesirable one). We understood this property to exist when working through the design, but what we failed to appreciate at the time is the difficulty of quantifying the adjusted utilization; the amount of overstorage is a function of the storage workload (the distribution of edits, re-renders, and document sizes), which itself isn't well understood at this time.

What we do know at this time is that on-disk utilization (including any savings from compression, etc.) is at least 2x, and since utilization continues to grow linearly, we can assume that once the dataset becomes quiescent, the multiplier will settle somewhere well above that.

[Screenshot: Grafana Cassandra dashboard, last 90 days (eqiad)]

It seems unlikely at this time that it will be worthwhile to equip the cluster with enough storage to accommodate this, so we should begin evaluating ways of bounding retention through alternate means.

Event Timeline

https://github.com/wikimedia/restbase/pull/1039 has been committed to issue a revision delete when a new render is stored. This trades some additional write amplification, in the form of redundantly issued deletes, for more aggressive culling.
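
Roughly, the change folds the revision-culling check into the render write path as well. The following is a hedged sketch of that idea only, using hypothetical storage helpers (and the TTL from the earlier sketch), not the actual patch:

```python
def on_render_write(storage, title, revision, render, now):
    """Hypothetical write path: storing a render also re-evaluates old
    revisions, instead of waiting for the next revision write."""
    storage.put_render(title, revision, render, now)

    # Existing behaviour: cull renders of this revision that were superseded
    # at least one TTL ago.
    storage.delete_renders_superseded_before(title, revision, now - TTL)

    # New behaviour: also issue a revision delete. The delete may be redundant
    # on many writes (write amplification), but stale revisions no longer
    # linger until the next edit arrives.
    storage.delete_revisions_superseded_before(title, now - TTL)
```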

Here is a recent histogram of ages for wikipedia_T_mobile__ng_remaining:

| Weeks | Count |
| --- | --- |
| 0 | 16862224 |
| 1 | 7493038 |
| 2 | 5652134 |
| 3 | 4790434 |
| 4 | 4649020 |
| 5 | 3351847 |
| 6 | 1746823 |
| 7 | 2371460 |
| 8 | 1385656 |
| 9 | 1562058 |
| 10 | 1346352 |
| 11 | 762808 |
| 12 | 1030605 |
| 13 | 809189 |
| 14 | 637257 |
| 15 | 703937 |
| 16 | 906143 |
| 17 | 1386464 |
| 18 | 539699 |
| 19 | 587454 |
| 20 | 396045 |
| 21 | 311160 |
| 22 | 301049 |
| 23 | 355252 |
| 24 | 596923 |
| 25 | 929272 |
| 26 | 770811 |
| 27 | 152369 |
| 28 | 114598 |
| 29 | 154434 |
| 30 | 142607 |
| 31 | 189614 |
| 32 | 57555 |
| 33 | 60892 |
| 34 | 73269 |
| 35 | 89850 |
| 36 | 90977 |
| 37 | 34092 |
| 38 | 38136 |
| 39 | 81419 |
| 40 | 169210 |
| 41 | 93017 |
| 42 | 54875 |
| 43 | 7805 |
| 44 | 2351 |
| 45 | 559 |
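
For reference, a week-bucketed age histogram like the one above can be derived by bucketing each row's write timestamp into whole weeks; a minimal sketch, with the actual table and column names omitted since they are not shown here:

```python
from collections import Counter

SECONDS_PER_WEEK = 7 * 24 * 3600


def age_histogram(write_timestamps, now):
    """Return {weeks_old: row_count}, bucketing row ages into whole weeks."""
    buckets = Counter()
    for ts in write_timestamps:
        age_weeks = int((now - ts).total_seconds()) // SECONDS_PER_WEEK
        buckets[age_weeks] += 1
    return dict(sorted(buckets.items()))
```
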
Pchelolo subscribed.

We don't use k-r-v storage anymore, so this task is invalid.