Page MenuHomePhabricator

OSM Replication failed at eqiad and codfw
Closed, ResolvedPublic

Description

OSM Replication is failing at eqiad and codfw with the following errors:

Reading in file: -                                                                     
Using XML parser.                                                                      
node cache: stored: 0(-nan%), storage efficiency: -nan% (dense blocks: 0, sparse nodes:
 0), hit rate: -nan%                                                                   
Osm2pgsql failed due to ERROR: XML parsing error at line 1, column 0: no element found 
Error while replicating OSM data

Event Timeline

As per https://grafana.wikimedia.org/d/000000305/maps-performances?orgId=1&var-cluster=maps1 the failure is now 10 days old. An update on the issue and the expected time to fix would be helpful.

As per https://grafana.wikimedia.org/d/000000305/maps-performances?orgId=1&var-cluster=maps1 the failure is now 10 days old. An update on the issue and the expected time to fix would be helpful.

@Arjunaraoc thanks for reaching out. I've been investigating the issue and developing the possible solution for the issue for the past 2 weeks, it didn't have a task before and I just created T238554: [Spike] Consider using imposm3 as the OSM replication system.

That's my main priority and I'll be leaving notes and reports in the spike task.

If you have any other question I would be pleased to help if I can.

@MSantos thanks for your update. I am happy to know that you are working on this as a main priority.

@MSantos , From the linked sub task, I find that the sub task has changed to "to-do". Is there a possibility to reset/reboot the current configuration, so that whatever works (for example, localised place labels continue to get the OSM updates?

MSantos claimed this task.