Searching for Anatolian Hieroglyphs in wikitext in the mainspace in English Wiktionary doesn't yield any results, even though there are pages that contain these characters. These are outside the BMP, so perhaps there is some weird bug. However, not all non-BMP characters cause problems.
Searching for Gothic letters (: insource:/[๐ฐ-๐]/) and Egyptian hieroglyphs (: insource:/[๐-๐ฎ]/), which are also outside the BMP, works. I considered the possibility that it might be a Unicode versioning issue, but I'm not sure why regex would refer to the UCD when searching for ranges of code points. (In any case, the Anatolian Supplements block was added in version 8.0, Egyptian Hieroglyphs in 5.2, and Gothic in 3.1.) So I'm mystified as to why some non-BMP characters could be searched for and others couldn't. I wonder if I'm overlooking something obvious here.
Steps to Reproduce:
Enter : insource:/[๐-๐]/ or : intitle:/[๐-๐]/ in the search box in English Wiktionary and submit.
Actual Results:
No search results.
Expected Results:
Both regexes should match the code points in the range U+14400-U+14646 (ANATOLIAN HIEROGLYPH A001 to ANATOLIAN HIEROGLYPH A530) in English Wiktionary. As of the 2019-11-01 dump, the wikitext of two entries contained them โ ๐บ๐๐ป๐๐ฟ๐๐พ๐
and ๐ผ๐๐ท โ and as of the 2019-10-20 dump three entry titles contained them โ ๐ฑ๐ฌ๐ฌ๐ฐ๐ฑ, ๐ฎ๐๐ต๐ฌ, ๐ช๐ฑ๐ช.
Postscript:
I do get one result for insource:/[๐-๐]/ in all namespaces: Module:scripts/data. That module contains the literal string ๐-๐. It's as if the regex engine fails to parse [๐-๐] correctly and instead searches as if the query were insource:/๐-๐/. That doesn't make sense to me though.