Tool Labs hosts replicas of the production databases, which are very useful for researching article edit history. Revisions whose text was deleted are marked with rev_deleted = 1 and rev_text_id = NULL in the revision table. However, rev_len is set to NULL as well, even though the size of such a revision is still displayed publicly in the article's history in production (i.e. this information is not considered secret). It would greatly help research into articles' growth over time if rev_len were also available for deleted revisions in the database replicas.
|operations/puppet : production||wiki replicas: unfilter deleted rev_len versions|
|Declined||jcrespo||T150767 Wikireplica service for tools and labs - issues and missing available views (tracking)|
|Resolved||None||T101631 rev_len should be available also for deleted revisions in database replicas|
Mentioned In
- T210466: Wrong counting of the added text in the page history tools
- T219769: [BUG] Edits and Bytes Changed metrics: Page Improved includes deleted revisions; Event Summary and All Edits do not
- T148857: If revisions are revdel'd, articleinfo compares the surrounding edits as if it were one edit
I don't think this information is public: as a regular user, I cannot see the page size of a deleted revision, e.g. https://en.wikipedia.org/w/index.php?title=Special:Log&page=Talk%3AVimuttiguana
Sorry for the misunderstanding. I can confirm that those are not filtered at the source:
But conditionally (unnecessarily?) nulled on view:
if((`enwiki`.`revision`.`rev_deleted` & 1),NULL,`enwiki`.`revision`.`rev_len`) AS `rev_len`
If I can get a thumbs up from @Bawolff, perhaps?
The current logic expressly filters rev_len on deleted revisions: if(rev_deleted&1,null,rev_len) as rev_len. I don't know whether that's just for consistency or whether someone thinks it really should be kept out of the replicas. As stated above, the length does seem to be available online, though I'm not sure whether that holds for every value of the rev_deleted field, since I think it is an integer.
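To make the effect of the `& 1` test concrete, here is a minimal sketch that rebuilds the view expression against an in-memory SQLite table. It is an illustration, not the replica setup itself: the table is cut down to three columns, the view name revision_view and the sample rows are made up, and SQLite's CASE WHEN stands in for MySQL's if(). The flag values are the MediaWiki rev_deleted bit constants as I understand them (1 = text, 2 = comment, 4 = user, 8 = restricted).

```python
import sqlite3

# rev_deleted bit flags as understood from MediaWiki (assumption, see lead-in).
DELETED_TEXT = 1
DELETED_COMMENT = 2
DELETED_USER = 4
DELETED_RESTRICTED = 8


def build_demo():
    """Create a toy revision table and a view that mimics the replica filter."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE revision ("
        "rev_id INTEGER PRIMARY KEY, rev_deleted INTEGER, rev_len INTEGER)"
    )
    # Hypothetical sample rows: (rev_id, rev_deleted, rev_len).
    rows = [
        (1, 0, 100),                              # nothing hidden
        (2, DELETED_TEXT, 200),                   # text hidden
        (3, DELETED_COMMENT, 300),                # only the comment hidden
        (4, DELETED_TEXT | DELETED_USER, 400),    # text and user hidden
        (5, DELETED_COMMENT | DELETED_USER, 500), # comment and user hidden
    ]
    conn.executemany("INSERT INTO revision VALUES (?, ?, ?)", rows)
    # Same logic as if(rev_deleted & 1, NULL, rev_len), written as CASE WHEN.
    conn.execute(
        "CREATE VIEW revision_view AS "
        "SELECT rev_id, rev_deleted, "
        "CASE WHEN (rev_deleted & 1) THEN NULL ELSE rev_len END AS rev_len "
        "FROM revision"
    )
    return conn


def visible_lengths(conn):
    """Return {rev_id: rev_len} as seen through the filtered view."""
    cur = conn.execute("SELECT rev_id, rev_len FROM revision_view ORDER BY rev_id")
    return dict(cur.fetchall())


if __name__ == "__main__":
    print(visible_lengths(build_demo()))
```

In this sketch only the rows with the lowest bit set (rev_deleted values 1 and 5) lose their rev_len; rows where only the comment or user is hidden keep it. So the filter keys on the text-deleted bit alone, not on every non-zero value of the field.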
My question isn't really about archived revisions of deleted pages in that sense either. If we're exposing the length for revdeled non-archived revisions, why should the situation be any different for the length of revdeled archived revisions?