Page MenuHomePhabricator

simplewd tool is in a crash loop -- please fix
Closed, InvalidPublic

Description

On taking a look at what pods are crashing and restarting in a loop, I noticed 3000+ restarts on the simplewd tool. It seems to be failing. The last logs I see in kubectl logs are:

Dec 21, 2020 9:41:33 PM org.wikidata.wdtk.dumpfiles.wmf.WmfDumpFileManager findDumpsOnline
INFO: Found 7 online dumps of type SITES: [wikidatawiki-sites-20201220, wikidatawiki-sites-20201201, wikidatawiki-sites-20201120, wikidatawiki-sites-20201101, wikidatawiki-sites-20201020, wikidatawiki-sites-20201001, wikidatawiki-sites-20200920]
Dec 21, 2020 9:41:33 PM org.wikidata.wdtk.dumpfiles.MwSitesDumpFileProcessor processDumpFileContents
INFO: Processing sites dump file wikidatawiki-sites-20201201
Exception in thread "main" java.lang.ExceptionInInitializerError
	at org.wikidata.simplewd.jsonld.JsonLdBuilder.<clinit>(JsonLdBuilder.java:39)
	at org.wikidata.simplewd.http.Main.<init>(Main.java:58)
	at org.wikidata.simplewd.http.Main.main(Main.java:64)
Caused by: java.lang.IllegalArgumentException: The provided schema is not valid JSON
	at org.wikidata.simplewd.model.ShaclSchema.readSchema(ShaclSchema.java:71)
	at org.wikidata.simplewd.model.ShaclSchema.<clinit>(ShaclSchema.java:52)
	... 3 more
Caused by: java.io.FileNotFoundException: https://schema.org/version/latest/schema.nt
	at sun.net.www.protocol.http.HttpURLConnection.getInputStream0(HttpURLConnection.java:1896)
	at sun.net.www.protocol.http.HttpURLConnection.getInputStream(HttpURLConnection.java:1498)
	at sun.net.www.protocol.https.HttpsURLConnectionImpl.getInputStream(HttpsURLConnectionImpl.java:268)
	at java.net.URL.openStream(URL.java:1068)
	at org.wikidata.simplewd.model.ShaclSchema.readSchema(ShaclSchema.java:68)

if that helps with troubleshooting

Event Timeline

Bstorm triaged this task as Medium priority.Dec 21 2020, 9:48 PM
Bstorm created this task.

simplewd has been deleted