Attempting to reindex jawiki_general fails with:
{
"index": "jawiki_general_1755293360",
"type": "_doc",
"id": "5112606",
"cause": {
"type": "execution_exception",
"reason": "execution_exception: java.lang.IndexOutOfBoundsException: Index -1024 out of bounds for length 87503",
"caused_by": {
"type": "index_out_of_bounds_exception",
"reason": "index_out_of_bounds_exception: Index -1024 out of bounds for length 87503"
}
},
"status": 500
}The referenced page contains essentially the same emoji repeated many times:
https://ja.wikipedia.org/wiki/%E5%88%A9%E7%94%A8%E8%80%85:%E4%B8%87%E6%AD%B3%E5%B8%9D%E5%9B%BD
We need to resolve whatever problem is going on there so that jawiki_general can pick up the updated analysis chain. This also means that content changes on wikis that use the japanese analysis chain may fail the indexing pipeline and be un-updatable.