some weeks ago I was suprised, when I saw the Europeana Lab site that
there is a publicly available OAI-PMH server (probably I missed the
announcement). It works with Basic HTTP Authentication, so you should
ask for credentials .
I already shared my first impressions with the Europeana stuff, but
maybe it is interesting for you as well:
1) the OAI-PMH specification says: "For purposes of interoperability,
repositories must disseminate Dublin Core, without any qualification."
Europeana's OAI-PMH service doesn't implemented this mandatory
element, the ListMetadataFormat verb returns only EDM. It fits to my
use case, but Europeana should consider implementing Dublin Core
response if it wants to be standard-compilant.
2) I miss the "completeListSize" attribute from the "resumptionToken"
element (see https://www.openarchives.org/OAI/openarchivesprotocol.html#FlowControl).
This is an optional element, so you don't have to implement it, but it
is quite helpful on the client side, because that's what gives me a
picture about the actual size of the whole set I want to download, so
it gives you an idea where the process is standing, and you can
estimate when it will be done based on this number.
3) I did not had disk space anough to download all the records, but
based on my measurements it will take more than a week, which is not
quite fast, however it is quicker than Europeana API. I really like
DPLA's solution in which they simple makes database (Coachbase) dumps
downloadable , and the full database download tooks only one or two
hours. I miss it from Europeana.
I have created a PHP based client, so if you are interested in using
or just testing, here you are:
GWDG, Göttingen - eXtensible Catalog - The Code4Lib Journal