A new keyword textbytes should be added to allow filtering pages based on the value of the text_bytes field.
The text_bytes field is populated from Content::getSize(), which is documented as:
Returns the content's nominal size in "bogo-bytes".
What exactly a bogo-byte is may remain mysterious, but for a wikitext page it is the number of bytes of the wikitext source encoded in UTF-8.
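As a rough illustration of the wikitext case only, the value can be thought of as the length of the UTF-8 encoding of the source, which differs from the character count as soon as non-ASCII characters appear. The helper name below is hypothetical and this Python sketch is not how MediaWiki computes the field:

```python
def utf8_byte_length(wikitext: str) -> int:
    """Return the number of bytes in the UTF-8 encoding of the given source text."""
    return len(wikitext.encode("utf-8"))


# "\u00e9" (e with acute accent) is one character but two bytes in UTF-8.
sample = "== Caf\u00e9 =="
print(len(sample))               # number of characters
print(utf8_byte_length(sample))  # number of UTF-8 bytes
```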
The keyword will be usable in the same way as the other numeric keywords we support (see File measure).
- comparison: textbytes:<1500 or textbytes:>1500 all pages with text_bytes less than or greater than 1500, respectively
- ranges: textbytes:1500,10000 all pages with text_bytes between 1500 and 10000
- exact matches are possible but probably useless, e.g. textbytes:10
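The three forms above could map onto Elasticsearch query clauses roughly as follows. This is a minimal Python sketch, not the CirrusSearch implementation: the function name textbytes_filter is hypothetical, and treating both range bounds as inclusive is an assumption that the task does not state.

```python
import re

FIELD = "text_bytes"  # field name taken from the task description


def textbytes_filter(value: str) -> dict:
    """Translate the value part of a textbytes: keyword into an Elasticsearch clause.

    Hypothetical helper for illustration only.
    """
    value = value.strip()

    # Comparison form: "<1500" or ">1500" (strict bounds).
    match = re.fullmatch(r"([<>])(\d+)", value)
    if match:
        operator = "lt" if match.group(1) == "<" else "gt"
        return {"range": {FIELD: {operator: int(match.group(2))}}}

    # Range form: "1500,10000".
    match = re.fullmatch(r"(\d+),(\d+)", value)
    if match:
        low, high = int(match.group(1)), int(match.group(2))
        # Assumption: both bounds are inclusive.
        return {"range": {FIELD: {"gte": low, "lte": high}}}

    # Exact form: "10".
    if re.fullmatch(r"\d+", value):
        return {"term": {FIELD: int(value)}}

    raise ValueError(f"unsupported textbytes value: {value!r}")


for example in ("<1500", ">1500", "1500,10000", "10"):
    print(example, textbytes_filter(example))
```

Whatever bound semantics the existing numeric keywords use should take precedence over the inclusive choice made here.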
AC: A search query can be issued that filters based on the number of bytes in the source text (the text_bytes field of documents in Elasticsearch).