hdfs://analytics-hadoop/user/analytics-platform-eng/image_placeholders contains a parquet that is used by the image suggestions pipeline to filter suggestions that are in placeholder categories.
That parquet is now empty:
>>> spark.read.parquet('/user/analytics-platform-eng/image_placeholders').show(10, False)
+-------+-----+-------+----------+
|cl_from|cl_to|cl_type|page_title|
+-------+-----+-------+----------+
+-------+-----+-------+----------+

>>> print(spark.read.parquet('/user/analytics-platform-eng/image_placeholders').count())
0
It used to hold data at some point, though. An old copy lives at hdfs://analytics-hadoop/user/mfossati/image_placeholders:
>>> print(spark.read.parquet('image_placeholders').count())
3025
AFAICT, https://commons.wikimedia.org/wiki/Category:Examples_representing_SVG is one of those placeholder categories, and it certainly still has images.
So the parquet being empty looks like a bug rather than an accurate reflection of the placeholder categories.
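Since an empty placeholder dataset silently disables the filtering, a guard along these lines might help catch a regression early. This is only a minimal sketch: `load_non_empty_parquet` is a hypothetical helper name, not something that exists in the pipeline, and it assumes `spark` is an active SparkSession like the one used in the sessions above.

```python
def load_non_empty_parquet(spark, path):
    """Read the parquet at `path` and fail loudly if it has no rows.

    Hypothetical helper, not part of the image suggestions pipeline.
    `spark` is assumed to be an active SparkSession.
    """
    df = spark.read.parquet(path)
    # take(1) returns a list with at most one row, so this avoids a full count()
    if not df.take(1):
        raise ValueError(f"Parquet at {path} is empty; refusing to continue")
    return df
```

For example, the pipeline could call `load_non_empty_parquet(spark, '/user/analytics-platform-eng/image_placeholders')` before filtering, so that an empty dataset raises an error instead of letting every suggestion through.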