Status: Done updated 160 000 Wikidata objects with the Europeana Entity property 7704 same as Europeana agent
see also link
GITHUB: EuropeanaWebscrape
- looks like Europeana doesnt update its data when Wikidata objects are deleted or merged... see T240738
- merged WD objects the last week who are connected to Europeana Entity
- looks like Europeana have just done an one time copy 4 years ago see T241363
- Europeana Entity API agents looks like they have selected a subset of dbpedia:artists as criteria what agents to be created in Europeana and I guess Europeana needs to develop this. I have seen no documentation how Europeana will use semantics telling what a picture depict. Todays approach with agents/locations/concepts is not good enough for this
- query checking instance of connected Europeana items that are not instance of humans in Wikidata with link Google Structured data test tool
- example europeana people/123711-henri-cohen-composer links Wikidata Q5715384 that is wrong (its a duplicate that maybe is not the best to have same as with see test tool
- if we look in the change log then we can see that this Wikidata object has been split and Wikidata Q670238 looks like the new object that has been matched with Europeana agent/base/26496
- --> agent/base/26496 and agent/base/123711 is the same as Q670238 en:WIkipedia Henri_Cohen_(composer)
- if we look in the change log then we can see that this Wikidata object has been split and Wikidata Q670238 looks like the new object that has been matched with Europeana agent/base/26496
- example europeana people/123711-henri-cohen-composer links Wikidata Q5715384 that is wrong (its a duplicate that maybe is not the best to have same as with see test tool
Next step: -
Container for activities regarding Europeana Entity
- New Property 7704 "Europeana Entity"
- the old one was instable etc... see presentation Europeana 2019 by mdammers
- > 160 000 humans matched between Wikidata and Europeana (agent/base)
- not done for Places and Concepts
- places are based on geonames which feels wrong for culture objects....?!?!?! in Sweden we have church parishes that are historical correct and administrative parishes in Wikidata see github.com/salgo60/Svenskaforsamlingar
- not done for Places and Concepts
- other properties on the same objects as Property 7704 Europeana Entity
- > 160 000 humans matched between Wikidata and Europeana (agent/base)
- approve new Europeana Property T239441: New Wikidata "Europeana Entity" property proposal?
- looks like we get no feedback from Europeana --> start populate agents see
- status
- Issue with deleted WD objects see T240738: More than 1200 Europeana Entities reference deleted Wikidata objects - task identify them but also Europeana need action
- Europeana needs some action
- as you have system dependent of each other you need mature traceable change management see example issues T240809: Workflow/version management needed when Europeana reference wrong person in Wikidata and in Europeana
- Europeana needs some action
- Write some consistency checking software Wikidata <-> Europeana
- now possible to
- identify places named after a person in Europeana, same query but labels in Chinese if possible
- identify > 130 people at a cemetery who are connected to Europeana
- list of people who lately passed away connected to Europeana
- last query can timeout (try refresh), this query restricts to people with country of citizenship Denmark, Finland, France, Germany, Italy, Norway, Poland, Slovakia, Spain, Sweden...
- looks like Europeana miss death date for records before 2016 and maybe new text should be extracted for at least people who has passed away
- Wikipedia has API: Extension:TextExtracts --> Marie Fredriksson agent/base/65142 en:Wikipedia Marie_Fredriksson can be retrieved 1000 char in JSONformat hprop=extracts&exchars=1000&titles=Marie_Fredriksson --> "is" is changed to "was"
- looks like Europeana miss death date for records before 2016 and maybe new text should be extracted for at least people who has passed away
- last query can timeout (try refresh), this query restricts to people with country of citizenship Denmark, Finland, France, Germany, Italy, Norway, Poland, Slovakia, Spain, Sweden...
- 1000 objects with most properties in WD
- objects matched in Wikidata for Rijksbureau voor Kunsthistorische Documentatie and Europeana Entity
- objects matched in Wikidata for National Portrait Gallery and Europeana Entity
- objects matched in Wikidata for Swedish National museum and Europeana Entity
- objects matched in Wikidata for Alvin Uppsala University and Europeana Entity
Link Swedish Nationalmuseum -> European using Wikidata
- use Wikidata as a Hub to Europeana
- if we are at the Swedish Nationalmuseum on ID 14655 we can ask Wikidata for the related Europeana object
- Swedish National Museum is Wikidata Property 2538
- Europeana Entity is Property 7704
- https://tools.wmflabs.org/hub/P2538:14655?property=P7704
- if we are at the Swedish Nationalmuseum on ID 14655 we can ask Wikidata for the related Europeana object
Anders Zorn is 4252 --> Europeana: https://tools.wmflabs.org/hub/P2538:4252?property=P7704
Ivar Arosenius is 7813 --> Europeana: https://tools.wmflabs.org/hub/P2538:7813?property=P7704
Peeter Danckers de Rij is 5038 --> Europeana https://tools.wmflabs.org/hub/P2538:5038?property=P7704
We have been running this concept at Family Search one year. We modified one template with 5 lines and got 4 new dynamic links using Wikidata on > 1000 wiki pages at Family Search see T200515#4683374
Europeana -> Wikidata -> Museum
- Europeana Ivar Arosenius agent/base/13223
- en:Wikipedia https://tools.wmflabs.org/hub/P7704:agent/base/13223?lang=en
- fr:Wikipedia https://tools.wmflabs.org/hub/P7704:agent/base/13223?lang=fr
- Musée d'Orsay Property 2268 https://tools.wmflabs.org/hub/P7704:agent/base/13223?property=P2268
- Uppsala University Alvin Property 6821 https://tools.wmflabs.org/hub/P7704:agent/base/13223?property=P6821
See Europena matches per country
Below a search for matched Europeana objects and show what country specific properties it match
my feeling is that volontaires in Wikidata can help identify people in a specific dataset and add those people in that dataset to Wikidata. This can then help Europeana in cleaning the Europeana data and match objects easier to the correct Europeana Agent for a person - "A little Semantics Goes a Long Way"
Entries in wikidata but not in Europeana as an Entity
- Example
- Europeana text string "Jonas Thomasson Ronander" I guess same as
here we need a data roundtrip and a workflow for entities that needs to be added to Europeana. See also T240738: More than 1200 Europeana Entities reference deleted Wikidata objects - task identify them but also Europeana need action were we can see that more than 400 wikidata objects are deleted for 50 000 tested Europeana profiles linking Wikidata...
Plan with Europeana
No feedback received on question sent... see above activity with agent started
-
agree with Europena see email sent-
time schedule -
format to use -
what is needed to be developed at Europena-
implement content negotiation? -
when is the new format 100% supported and stable for Europeana entities at Europeana -
status/ambitions at Europeana to store the Wikidata Qnumber
-
- define a scenario for the information flow and data roundtrips?
- archives/museums <-> national aggregator<-> Europeana <-> Wikidata
- strategies for taking care of crowd sourcing at Wikidata
- update guidelines like "Get your vocabularies in Wikidata..."
- I guess preferred places and topics will have the new Europeana Entity P 7704 and/or those will have Qnumbers in Europeana?
- the challenge I see with all cultural institutions in Sweden except the National Archives Tora project T233275 is that the semantic understanding is very low i.e. everyone understands the concept of a person but when you start speaking about places they have a string and in best case a coordinate never same as a city, parish, building or a region. Speaking with people doing research then it was very important to match a coordinate to the correct place at a specific time. I guess we have a problem matching research and badly curated archives/museum collections (strings vs. linked data) see my visit to Uppsala University T236459 --> I guess if Europeana should have some value for people doing research then you have to be precise with defining locations and use same as Wikidata Qnumber or even better same as a location authority as pleiades.stoa.org
- I also think we should have entities for buildings e.g. the church in Stockholm called Riddarholm Church = Wikidata Q657118 = Open Street map 23841420 that is on this Europeana photo
- the challenge I see with all cultural institutions in Sweden except the National Archives Tora project T233275 is that the semantic understanding is very low i.e. everyone understands the concept of a person but when you start speaking about places they have a string and in best case a coordinate never same as a city, parish, building or a region. Speaking with people doing research then it was very important to match a coordinate to the correct place at a specific time. I guess we have a problem matching research and badly curated archives/museum collections (strings vs. linked data) see my visit to Uppsala University T236459 --> I guess if Europeana should have some value for people doing research then you have to be precise with defining locations and use same as Wikidata Qnumber or even better same as a location authority as pleiades.stoa.org
- a workflow for adding new agents (artists) when those has been matched in Wikidata and is absent in Europeana is needed???
- same for places/topics
- I guess preferred places and topics will have the new Europeana Entity P 7704 and/or those will have Qnumbers in Europeana?
- create show case scenarios?!?!
- in Sweden we have the Nationalmuseum matched in Wikidata and a member of Europeana
- Property 2538 "Nationalmuseum Sweden artist ID" list items > 5680 - related properties - on a map
- Property 2539 "Nationalmuseum Sweden artwork ID" list items > 7860 - related properties
- we have also Uppsala University database Alvin that has Property 6821 see T225522 and is also in Europena see link = Alvin alvin-record:28747 but the people are not identified its just text strings, in the source Alvin they were grouped and had if possible a birth/death year and sometimes a VIAF link. See how Wikidata has identified those people as Linked data i.e. we had some grouping in Alvin.... the data was moved to Europeana as STRINGS... but Wikidata has identified them as THINGS:
- Europeana text string: Royen, Adrianus van, 1704-1779 = Alvin alvin-person:10176 = Wikidata Q367689 -> graf
- search text string in Europeana Adrianus and Royen
- Europeana text string Gerber, Traugott, 1707-1743 = Alvin alvin-person:10178 = Wikidata Q98037 --> graf
- search text string in Europeana Gerber and Traugott
- Europeana string Boerhaave, Herman, 1668-1738 = Alvin alvin-person:7814 = Wikidata Q313093 --> graf
- search text string in Europeana Herman and Boerhaave
- Europeana string Schober, Gottlob, 1675-1739 = Alvin alvin-person:10177 not in Wikidata
- search text string in Europeana Gottlob
- Europeana string Leiden = Alvin alvin-place:604 = Wikidata Q43631 --> en:graf nl:graf zh:graf ar;graf
- search text string in Europeana Leiden
- .....
- Europeana text string: Royen, Adrianus van, 1704-1779 = Alvin alvin-person:10176 = Wikidata Q367689 -> graf
- in Sweden we have the Nationalmuseum matched in Wikidata and a member of Europeana
-
- text string search creator Europeana Adrianus and Royen
- test searching in Europeana
- who:(Royen, Adrianus van) = 29 hits
- who: Adrianus van Royen (1704-1779) = 12 hits
- who: Royen, Adrianus van = 20 hits
- test search Alvin - looks like they have duplicates
- alvin-person:57354 = Royen, Adriaan van, 1705-1779
- alvin-person:10176 = Royen, Adrianus van, 1704-1779
Alvin has in Wikidata today matched records with Wikidata that can be reused by Europeana!! --> STRINGS -> THINGS
- > 5000 people matched, map birth/death location
- when you have people as things you can translate names to more languages using tools like Tabernacle example Alvin people - related properties in Wikidata
- > 6900 places matched + birth/death location for identified people
- map matched places > 1500 places
- places in Alvin is not good defined. In worst case just a name as a birth place
- map matched places > 1500 places
- duplicates found in Alvin > 270 that we guess is not fixed
Why Linked data NOW
Wikimedia commons is now using Wikibase to deliver the SDC project --> you can now search on linked data in the pictures.
Below
- upper part is the result from a search in the title text etc
- lower part is a search in pictures that depicts "love" and love is added as a linked data object in the picture
Tool for adding linked data to a region of a picture
Example region with linked data objects displaying labels in chinese (lang=zh)
- wd-image-positions
- Wikidata Q number
- Q1231009 uselang=en english
- Q1231009 uselang=zh chinese
- Wikicommons file
- Wikidata Q number