[EP-tech] duplicate detection in EPrints 3.3
Tomasz Neugebauer
Tomasz.Neugebauer at concordia.ca
Thu Aug 20 17:58:59 BST 2015
I would like to run a script that will go through my repository (3.3.12) and report any likely duplicates based on title (and possibly author).
What is the best way of doing this?
I found the following two plugins in EPrints Files:
· Sebastien Francois’ EPrints 2 script: http://files.eprints.org/107/
· Jon Hallet’s EPrints 3>3.1 script: http://files.eprints.org/640/
In addition,
· There is a title_duplicates script in /cgi/users/lookup/ http://wiki.eprints.org/w/Cgi/users/lookup/
· Page 40 of this file (http://www.eprints.org/software/training/programming/api_techniques.pdf) refers to a duplicate detection script in the bin folder as an example – I couldn’t find this script – probably just an example of what could be done.
Is the Jon Hallett’s script in EPrints Files the most up-to-date version available?
Has anyone created a Bazaar version for duplicate detection and/or is there is something more recent that I am missing?
Tomasz
________________________________________________
Tomasz Neugebauer
Digital Projects & Systems Development Librarian
Libraries / Bibliothèques
Concordia University / Université Concordia
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20150820/9e971624/attachment.html
More information about the Eprints-tech
mailing list