[EP-tech] DSpace Harvester and OAI_Bibliography.pm

Tomasz Neugebauer Tomasz.Neugebauer at concordia.ca
Wed Jun 17 19:22:46 BST 2020


Hi everyone... When harvesting EPrints repositories with the DSpace harvester, the following issue was reported in 2016:
http://dspace.2283337.n4.nabble.com/Harvesting-EPrints-repository-from-DSpace-td4681086.html
"What happens in this case is that EPrints has more than one entry for the supported metadata formats using OAI_DC (oai_bibl and oai_dc prefixes):

...
<metadataFormat>
  <metadataPrefix>oai_bibl</metadataPrefix>
  <schema>http://www.openarchives.org/OAI/2.0/oai_dc.xsd</schema>
  <metadataNamespace>http://www.openarchives.org/OAI/2.0/oai_dc/</metadataNamespace>
</metadataFormat>
<metadataFormat>
  <metadataPrefix>oai_dc</metadataPrefix>
  <schema>http://www.openarchives.org/OAI/2.0/oai_dc.xsd</schema>
  <metadataNamespace>http://www.openarchives.org/OAI/2.0/oai_dc/</metadataNamespace>
</metadataFormat>
...

DSpace's harvester is then selecting the first metadataPrefix, i.e. oai_bibl, for which EPrints is returning records with no metadata."
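
To make the quoted behaviour concrete, here is a minimal Python sketch. It is not DSpace's actual code: the function names (prefixes_for_namespace, first_match, preferred_match) and the wrapping ListMetadataFormats sample are assumptions made for illustration. It shows how matching formats by namespace alone yields two candidates, why taking the first one picks oai_bibl, and how preferring the exact prefix oai_dc would avoid that.

```python
import xml.etree.ElementTree as ET

# Namespace of OAI-PMH response elements and of the OAI Dublin Core format.
OAI_NS = "http://www.openarchives.org/OAI/2.0/"
DC_NS = "http://www.openarchives.org/OAI/2.0/oai_dc/"

# Illustrative response fragment modelled on the two entries quoted above.
SAMPLE = """<ListMetadataFormats xmlns="http://www.openarchives.org/OAI/2.0/">
  <metadataFormat>
    <metadataPrefix>oai_bibl</metadataPrefix>
    <schema>http://www.openarchives.org/OAI/2.0/oai_dc.xsd</schema>
    <metadataNamespace>http://www.openarchives.org/OAI/2.0/oai_dc/</metadataNamespace>
  </metadataFormat>
  <metadataFormat>
    <metadataPrefix>oai_dc</metadataPrefix>
    <schema>http://www.openarchives.org/OAI/2.0/oai_dc.xsd</schema>
    <metadataNamespace>http://www.openarchives.org/OAI/2.0/oai_dc/</metadataNamespace>
  </metadataFormat>
</ListMetadataFormats>"""


def prefixes_for_namespace(xml_text, namespace):
    """Return every metadataPrefix whose metadataNamespace equals `namespace`."""
    root = ET.fromstring(xml_text)

    def tag(name):
        return "{%s}%s" % (OAI_NS, name)

    found = []
    for fmt in root.iter(tag("metadataFormat")):
        prefix = fmt.findtext(tag("metadataPrefix"))
        ns = fmt.findtext(tag("metadataNamespace"))
        if prefix and ns and ns.strip() == namespace:
            found.append(prefix.strip())
    return found


def first_match(candidates):
    """Hypothetical 'take the first hit' strategy, as described in the quote."""
    return candidates[0] if candidates else None


def preferred_match(candidates, preferred="oai_dc"):
    """Hypothetical alternative: use the well-known prefix when it is offered."""
    if preferred in candidates:
        return preferred
    return first_match(candidates)


if __name__ == "__main__":
    candidates = prefixes_for_namespace(SAMPLE, DC_NS)
    print(candidates)
    print(first_match(candidates))
    print(preferred_match(candidates))
```

With the sample above, both prefixes match the Dublin Core namespace, so the outcome depends entirely on which selection strategy the harvester applies.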

Someone is having a similar issue now with EPrints repositories, so I'm wondering, is this still an issue, or was there a fix/modification added to EPrints for this?
I haven't tried the proposed solution of removing OAI_Bibliography.pm from the core files...

Tomasz

