Published on March 12, 2014
Europeana and Open Data Robina Clayphan Interoperability Manager, Europeana LDBC TUC meeting, 19 November, 2013
What is Europeana? • Europeana is a service that brings together digital content from across the cultural heritage domain in Europe • It makes the metadata freely available • It is a catalyst for change in the world of cultural heritage. • Our vision: We believe in making cultural heritage openly accessible in a digital way, to promote the exchange of ideas and information.
Europeana.eu, Europe’s cultural heritage portal Museums National Aggregators Regional Aggregators Archives Thematic collections Libraries - A network of participants in development and innovation - Nearly 30 million objects from 2,400 European galleries, museums, archives and libraries
What types of objects does Europeana give access to? Text Image Video Sound 3D
Europeana and open data
What Europeana makes available Metadata Link to digital objects online
Metadata (descriptive object information) Different options: Open – not fully open (but clear) – Not open Two categories of rights CC
The Europeana Data Model
EDM requirements & principles 1. Distinction between “provided objects” (painting, book, movie, etc.) and their digital representations 2. Distinction between objects and metadata records describing an object 3. Allow for multiple records for a same object, containing potentially contradictory statements about it 4. Support for objects that are composed of other objects 5. Support for contextual resources, including concepts from controlled vocabularies Richer metadata with finer granularity
Provide more semantics to the data Build a semantic layer on top of Cultural Heritage objects
ore:Aggregation (Identifier of aggregation) edm:WebResource (Identifier of web resource) edm:ProvidedCHO (Identifier of real object) An aggregation with a provided CHO and a web resource The three core classes edm:aggregatedCHO edm:hasView
The Aggregation with metadata
Properties for the Aggregation Mandatory: edm:aggregatedCHO edm:dataProvider edm:isShownBy or edm:isShownAt edm:provider edm:rights Optional: edm:hasView edm:object dc:rights edm:ugc The aggregation represents the set of related resources about one real object contributed by one provider. It carries the metadata that is about the whole set
Properties for the ProvidedCHO The ProvidedCHO is the cultural heritage object which is the subject of the package of data that has been submitted to Europeana. Properties: dc:contributor, dc:coverage, dc:creator, dc:date, dc:description, dc:format, dc:identifier, dc:language, dc:publisher, dc:relation, dc:rights, dc:source,dc:subject, dc:title, dc:type, dcterms:alternative, dcterms:extent, dcterms:temporal, dcterms:medium, dcterms:created, dcterms:provenance, dcterms:issued, dcterms:conformsTo, dcterms:hasFormat, dcterms:isFormatOf, dcterms:hasVersion, dcterms:isVersionOf, dcterms:hasPart, dcterms:isPartOf, dcterms:isReferencedBy, dcterms:references, dcterms:isReplacedBy, dcterms:replaces dcterms:isRequiredBy, dcterms:requires dcterms:tableOfContents edm:isNextInSequence edm:isDerivativeOf edm:currentLocation…
Properties for the web resource One or more digital representations of the provided cultural heritage object. dc:description dc:format dc:rights dc:source dcterms:conformsTo dcterms:created dcterms:extent dcterms:hasPart dcterms:isFormatOf dcterms:isPartOf dcterms:issued edm:isNextInSequence edm:rights
Contextual classes Representing (real-world) entities related to a provided object as fully fledged resources, not just strings edm:Agent foaf:name skos:altLabel rdaGr2:biographicalInformation rdaGr2:dateOfBirth…. skos:Concept skos:prefLabel skos:altLabel skos:broader skos:definition…. edm:TimeSpan skos:prefLabel dcterms:isPartOf edm:begin edm:end…. edm:Place wgs84_pos:lat wgs84_pos:long skos:prefLabel dcterms:isPartOf….
Example of a CHO with two contextual classes edm:Agent [identifier for person resource] "D arw in, C harles" edm:ProvidedCHO [identi eﬁ r for "real" object] skos:Concept [identifier for subject resource] "E volution"@ en "É volution"@ fr "12-02-1809" "12-04-1882" dc:creator dc:subject
Accessing and re-using Europeana data
How do users access Europeana content? Europeana aims to provide content in the users’ workflow – where they want it, when they want it. User focused channels: Europeana.eu portal, social media exports For programmers: API, search widget, semantic mark up, LOD pilot
Europeana’s infrastructure is open for re-use Europeana data available via API Search widgets Semantic mark-up (schema.org) on portal Linked Open Data pilot http://pro.europeana.eu/api http://data.europeana.eu
Some (approximate) numbers Europeana database – 30 Million objects LOD pilot – a subset of 20 Million objects • contained nearly 1 Billion RDF explicit statements • 4 Billion once you do all the RDF reasoning (sub-properties, sub-classes, etc) in OWLIM • Ontotext has already loaded a chunk of data and is working on the update of it, in Europeana Creative.
Possible benchmarking queries? Queries for exploring the dataset • e.g. to generate the complete ordered list of Europeana aggregators and the data providers they gather Queries for exploring the objects • e.g. a list of works with a matching location/creator/title • Simple graph traversal Expressing EDM constraints (that cannot be done in OWL) • Can RDF validation help e.g where at least one of two properties must be present (title or description)? Queries to assist in data quality improvement • Broken links, duplicates (or near duplicates), missing mandatory properties, missing thumbnails etc etc For Information: We are starting a data quality task force if you are interested!
Useful links Europeana portal europeana.eu Europeana Professional pro.europeana.eu • EDM documentation http://pro.europeana.eu/edm-documentation • Europeana API http://www.europeana.eu/portal/api-introduction.html • LOD pilot http://data.europeana.eu Data Quality task force – email@example.com Europeana Professional blog pro.europeana.eu/blog Facebook facebook.com/Europeana Twitter twitter.com/EuropeanaEU Europeana Thought Lab pro.europeana.eu/thoughtlab/ Europeana end-user blog blog.europeana.eu/
Thank you Robina Clayphan firstname.lastname@example.org
EDM design requirements Compatibility with different levels of description • Allow different levels of granularity • A book, a page, a detail of an image Standard metadata format that can be specialized • Allow the specification of domain specific application profiles • Enable the re-use of existing standards • Allow the extension of the initial model
EDM basis OAI ORE (Open Archives Initiative Object Reuse & Exchange) for organizing an object’s metadata and digital representation(s) Dublin Core for descriptive metadata SKOS (Simple Knowledge Organization System) for conceptual vocabulary representation CIDOC-CRM for the modeling of event and relationships between objects Use the Semantic Web representation principles • RDF • Re-use and mix different vocabularies together • Preserve original data and still allow for interoperability
EDM Properties (excluding ESE)
Two providers and two aggregations (the same object) 31 aggregation of DMF aggregation of Louvre v provenance metadata provenance metadata Cultural heritage object
Europeana aggregation Enriched metadata Landing page
Get involved, share your data or use ours. Develop. Labs. ... This is a Beta release of Europeana Collections and our thematic Music and Art History ...
Europeana Linked Open Data Linked Open Data is a way of publishing structured data that allows metadata to be connected and enriched, so that different ...
Data collections Our database contains over 50 million records. These featured datasets represent over a million of the best, openly licensed, directly ...
Use our data. Best practices; Case ... nominations for Europeana ... Heritage DiPP2016 is to provide open access to digitised cultural heritage and ...
This weekend on 22 February, the international Open Data Day is celebrated again. On this day, various initiatives relating to open data are organised to ...
Europeana is a single access point to millions of books, paintings, films, museum objects and archival records that have been digitized throughout Europe.
In 2010 the Europeana Data Model, ... "Europeana: Moving to Linked Open Data", article in Information Standards Quarterly Spring/Summer 2012, v.24, no. 2.
Metadata in Europeana ... • Open access to the digitized objects of European cultural ... • The Europeana data space will contain Digital Surrogate
Undeﬁned 0 (0) 1 1 IOS Press Europeana Linked Open Data – data.europeana.eu Editor(s): Pascal Hitzler, Kno.e.sis Center, Wright State University ...
Simple animation to explain what Linked Open Data is and why it's a good thing, both for users and for data providers. To find more information about ...