This is about structuring resources so that machines can reuse them more effectively. The data.bnf.fr project uses data created in various formats, such as Intermarc for the catalogue of printed books, XML-EAD for archive inventories and Dublin Core for the digital library.
This data is automatically gathered, modelled and enriched, then published in RDF, the semantic web language. The result is available on the website in different RDF syntaxes: RDF-XML, RDF-N3 and RDF-NT.
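To give an idea of what the line-based RDF-NT (N-Triples) syntax looks like to a program, here is a minimal sketch in Python. The sample triples use placeholder example.org URIs and are not real data.bnf.fr records, and `parse_simple_ntriples` is a hypothetical helper that only handles IRIs and plain string literals (no language tags, datatypes or blank nodes). A full RDF library would be the better choice for real dumps.

```python
import re

# Hypothetical sample: placeholder URIs, not actual data.bnf.fr records.
SAMPLE_NT = """\
<http://example.org/author/1> <http://xmlns.com/foaf/0.1/name> "Example Author" .
<http://example.org/author/1> <http://www.w3.org/2002/07/owl#sameAs> <http://example.org/other/1> .
"""

# One triple per line: IRI subject, IRI predicate, then an IRI or a plain literal.
TRIPLE_RE = re.compile(
    r'^<([^>]*)>\s+<([^>]*)>\s+(?:<([^>]*)>|"((?:[^"\\]|\\.)*)")\s*\.\s*$'
)


def parse_simple_ntriples(text):
    """Return (subject, predicate, object) tuples for a restricted N-Triples subset."""
    triples = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        match = TRIPLE_RE.match(line)
        if match is None:
            # Language tags, datatypes and blank nodes are deliberately unsupported.
            raise ValueError(f"unsupported line: {line!r}")
        subj, pred, obj_iri, obj_literal = match.groups()
        obj = obj_iri if obj_iri is not None else obj_literal
        triples.append((subj, pred, obj))
    return triples


triples = parse_simple_ntriples(SAMPLE_NT)
```

Each statement is a self-contained line, which is why this syntax is convenient for bulk processing with simple tools.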
Part of the data is matched with external value vocabularies: id.loc.gov for languages and nationalities, dewey.info for subjects, DCMI Type for document types. It is also matched with data sets that are identified by CKAN: DBpedia and VIAF. The pages for RAMEAU subject headings are matched with other thesauri, either from libraries (LCSH, DnB, BNE) or from more specialized sources (Agrovoc, Geonames, Thesaurus W).
Understanding data.bnf.fr data model
o host: pef.bnf.fr
o port: 21
o login: databnf
o password: databnf
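The connection settings above can be used from a script. The sketch below assumes the server speaks plain FTP (port 21 is the standard FTP port) and uses Python's standard `ftplib`. The helper names `connect` and `list_files` are hypothetical, and the directory layout of the server is not described here, so the listing simply starts from the login directory.

```python
from ftplib import FTP

# Connection settings as listed above.
HOST = "pef.bnf.fr"
PORT = 21
USER = "databnf"
PASSWORD = "databnf"


def connect(timeout=30):
    """Open an FTP session with the settings above (performs network I/O)."""
    ftp = FTP()
    ftp.connect(HOST, PORT, timeout=timeout)
    ftp.login(USER, PASSWORD)
    return ftp


def list_files(ftp, directory="."):
    """Return the sorted entry names in `directory` (assumed layout: unknown)."""
    return sorted(ftp.nlst(directory))
```

A typical session would call `ftp = connect()`, then `list_files(ftp)`, and finally `ftp.quit()` to close the connection.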
You can consult the licence that governs the use of our data.
CubicWeb is an open source platform for developing semantic web applications and is published under the LGPL licence.
Within the project, this software is used to:
It is based on the Python language and on the query language RQL (Relation Query Language), which is similar to the W3C's SPARQL.
In 2013, CubicWeb won the Dataconnexions award, organized by Etalab, a body attached to the French Prime Minister's office whose objective is to encourage public open data initiatives.