[WIP Task] figure out why Unrecognized subjects are reported by munger
Closed, Resolved · Public

Description

Log file

org.wikidata.query.rdf.tool.rdf.Munger$BadSubjectException: Unrecognized subjects:  [https://arq20.formulasearchengine.com/entity/statement/Q819-b86a346c-c1ab-44da-a227-4d7d8ab60a6d, https://arq20.formulasearchengine.com/entity/Q819, https://arq20.formulasearchengine.com/entity/statement/Q819-d9e0fb14-add7-49b9-95aa-b7c1b037fb35].  Expected only sitelinks and subjects starting with http://arq20.formulasearchengine.com/wiki/Special:EntityData/ and [http://arq20.formulasearchengine.com/entity/],
07:04:37.105 [update 6] WARN  org.wikidata.query.rdf.tool.Updater - Contained error syncing.  Giving up on Q819,
org.wikidata.query.rdf.tool.rdf.Munger$BadSubjectException: Unrecognized subjects:  [https://arq20.formulasearchengine.com/entity/Q808, https://arq20.formulasearchengine.com/entity/statement/Q808-2ec7364e-eec9-4e28-84ed-195173b84031, https://arq20.formulasearchengine.com/entity/statement/Q808-bbda7df9-a7a9-4f2f-a7b5-4f0473e2b5a6].  Expected only sitelinks and subjects starting with http://arq20.formulasearchengine.com/wiki/Special:EntityData/ and [http://arq20.formulasearchengine.com/entity/],
07:04:37.172 [update 2] WARN  org.wikidata.query.rdf.tool.Updater - Contained error syncing.  Giving up on Q808,
07:04:37.285 [update 1] WARN  org.wikidata.query.rdf.tool.Updater - Contained error syncing.  Giving up on Q815,
org.wikidata.query.rdf.tool.rdf.Munger$BadSubjectException: Unrecognized subjects:  [https://arq20.formulasearchengine.com/entity/statement/Q815-c22bee46-99fd-4055-bcb9-dd1e1139a03e, https://arq20.formulasearchengine.com/entity/Q815, https://arq20.formulasearchengine.com/entity/statement/Q815-0c242afc-5e72-4495-a1c3-f3e605686e30].  Expected only sitelinks and subjects starting with http://arq20.formulasearchengine.com/wiki/Special:EntityData/ and [http://arq20.formulasearchengine.com/entity/],
07:04:37.300 [update 8] WARN  org.wikidata.query.rdf.tool.Updater - Contained error syncing.  Giving up on Q804,
org.wikidata.query.rdf.tool.rdf.Munger$BadSubjectException: Unrecognized subjects:  [https://arq20.formulasearchengine.com/entity/statement/Q804-9ccfc643-a49a-40c1-a2d3-00f05c15a6e8, https://arq20.formulasearchengine.com/entity/statement/Q804-f2894ca5-e865-42c5-aecf-dba1e0f9c03a, https://arq20.formulasearchengine.com/entity/Q804].  Expected only sitelinks and subjects starting with http://arq20.formulasearchengine.com/wiki/Special:EntityData/ and [http://arq20.formulasearchengine.com/entity/],
07:04:37.416 [update 7] WARN  org.wikidata.query.rdf.tool.Updater - Contained error syncing.  Giving up on Q823,
org.wikidata.query.rdf.tool.rdf.Munger$BadSubjectException: Unrecognized subjects:  [https://arq20.formulasearchengine.com/entity/statement/Q823-ea61d6b6-bf15-4497-8bdd-10671373b450, https://arq20.formulasearchengine.com/entity/statement/Q823-77f9a13b-97de-429c-a576-44a8517bbde2, https://arq20.formulasearchengine.com/entity/Q823].  Expected only sitelinks and subjects starting with http://arq20.formulasearchengine.com/wiki/Special:EntityData/ and [http://arq20.formulasearchengine.com/entity/]

I noticed that the URL https://arq20.formulasearchengine.com/entity does not exist, whereas http://arq20.formulasearchengine.com/wiki/Special:EntityData does.

Event Timeline

Even if the rewrite worked as outlined in T250779, the request would still fail with "Bad Request Invalid ID: Q819-b86a346c-c1ab-44da-a227-4d7d8ab60a6d" for https://arq20.formulasearchengine.com/wiki/Special:EntityData/Q819-b86a346c-c1ab-44da-a227-4d7d8ab60a6d

I need to figure out what d9e0fb14-add7-49b9-95aa-b7c1b037fb35 relates to.

Unrecognized subjects: [https://arq20.formulasearchengine.com/entity/statement/Q819-b86a346c-c1ab-44da-a227-4d7d8ab60a6d, …]. Expected only sitelinks and subjects starting with http://arq20.formulasearchengine.com

It looks like your Wikibase RDF export is configured to use HTTPS, but your query service updater is configured to expect HTTP, hence the conflict.
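The mismatch can be illustrated with a small Python sketch. This is not the Munger's actual code; the function name `diagnose_subjects` and its return shape are hypothetical, and it only mimics the prefix check described in the exception message. It flags subjects that would match an expected prefix if the URL scheme were the same.

```python
from urllib.parse import urlsplit


def diagnose_subjects(subjects, expected_prefixes):
    """Hypothetical helper, not part of the query service.

    Returns (unrecognized, scheme_only):
      unrecognized: subjects that start with none of the expected prefixes
      scheme_only:  the unrecognized subjects whose host and path would
                    match an expected prefix if only the scheme agreed
    """
    prefixes = tuple(expected_prefixes)
    unrecognized = [s for s in subjects if not s.startswith(prefixes)]

    scheme_only = []
    for subject in unrecognized:
        s = urlsplit(subject)
        for prefix in expected_prefixes:
            p = urlsplit(prefix)
            # Same host and same path prefix, but http vs. https
            if (s.scheme != p.scheme
                    and s.netloc == p.netloc
                    and s.path.startswith(p.path)):
                scheme_only.append(subject)
                break
    return unrecognized, scheme_only


# Values copied from the log excerpt in the task description
subjects = [
    "https://arq20.formulasearchengine.com/entity/statement/Q819-b86a346c-c1ab-44da-a227-4d7d8ab60a6d",
    "https://arq20.formulasearchengine.com/entity/Q819",
]
expected = [
    "http://arq20.formulasearchengine.com/wiki/Special:EntityData/",
    "http://arq20.formulasearchengine.com/entity/",
]

unrecognized, scheme_only = diagnose_subjects(subjects, expected)
for subject in scheme_only:
    print("scheme mismatch only:", subject)
```

With the values from the log, every unrecognized subject ends up in `scheme_only`, which is consistent with the diagnosis that only the configured scheme differs.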

Physikerwelt closed this task as Resolved. (Edited Apr 22 2020, 10:40 AM)
Physikerwelt reassigned this task from Physikerwelt to Lucas_Werkmeister_WMDE.

@Lucas_Werkmeister_WMDE You are my hero today. I added

- WIKIBASE_SCHEME=https

to the docker-compose config of my wdqs instances, et voilà: problem solved.