<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<style type="text/css" style="display:none;"><!-- P {margin-top:0;margin-bottom:0;} --></style>
</head>
<body dir="ltr">
<div id="divtagdefaultwrapper" style="font-size:12pt; color:#000000; background-color:#FFFFFF; font-family:Calibri,Arial,Helvetica,sans-serif">
<p>Hi Tomasz,</p>
<p><br>
</p>
<p>Any reason not to use Issues for this?</p>
<p><br>
</p>
<p><a href="http://wiki.eprints.org/w/Issues" id="LPlnk622864" style="font-size:12pt">http://wiki.eprints.org/w/Issues</a><br>
</p>
<p><br>
</p>
<p>You can design your own issues to include authors too.</p>
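<p>For a quick check outside EPrints, the idea of grouping by title (and optionally author) can be sketched roughly as follows. This is only a minimal sketch in Python, not the Issues plugin or any EPrints API: it assumes the records have already been exported into a list of dicts, and the names normalise and likely_duplicates and the eprintid, title and authors keys are hypothetical.</p>
<pre>
```python
import re
import unicodedata
from collections import defaultdict


def normalise(text):
    """Lower-case, strip accents and punctuation, collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", text)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    words = re.findall(r"[a-z0-9]+", no_accents.lower())
    return " ".join(words)


def likely_duplicates(records, use_authors=False):
    """Group records whose normalised title (and optionally authors) match.

    `records` is a hypothetical list of dicts with "eprintid", "title" and
    "authors" keys, e.g. built from an export of the repository.
    """
    groups = defaultdict(list)
    for record in records:
        key = normalise(record["title"])
        if use_authors:
            # Sort so that author order does not affect the match.
            authors = sorted(normalise(name) for name in record["authors"])
            key = (key, tuple(authors))
        groups[key].append(record["eprintid"])
    # Keep only keys shared by more than one record.
    return [ids for ids in groups.values() if len(ids) != 1]
```
</pre>
<p>Exact matching on a normalised title only catches differences in case, accents and punctuation. Titles with typos or reworded subtitles would need a fuzzy comparison instead.</p>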
<p><br>
</p>
<p>Cheers,</p>
<p><br>
</p>
<p>Rory</p>
<p><br>
</p>
<div id="Signature">
<div id="divtagdefaultwrapper" style="font-size:12pt; color:#000000; background-color:#FFFFFF; font-family:Calibri,Arial,Helvetica,sans-serif">
<div style="font-family:Tahoma; font-size:13px">
<div style="font-size:13px">
<div><font face="Arial">Rory McNicholl</font></div>
<div><font face="Arial">Lead developer</font></div>
<div><font face="Arial">Digital Archives &amp; Research Technologies</font></div>
<div><font face="Arial">University of London Computer Centre</font></div>
<div><font face="Arial">Senate House</font></div>
<div><font face="Arial">Malet Street</font></div>
<div><font face="Arial">London</font></div>
<div><font face="Arial">WC1E 7HU</font></div>
<div><font face="Arial"><br>
</font></div>
<div><font face="Arial">t: +44 (0)20 7863 1344</font></div>
<div><font face="Arial">e: rory.mcnicholl@london.ac.uk</font></div>
<div><font face="Arial">w: http://www.ulcc.ac.uk/</font></div>
<div><font face="Arial"><br>
</font></div>
<div><font face="Arial">The University of London is an exempt charity in England and Wales.</font></div>
</div>
</div>
</div>
</div>
<br>
<br>
<div style="color:rgb(0,0,0)">
<hr tabindex="-1" style="display:inline-block; width:98%">
<div id="divRplyFwdMsg" dir="ltr"><font face="Calibri, sans-serif" color="#000000" style="font-size:11pt"><b>From:</b> eprints-tech-bounces@ecs.soton.ac.uk &lt;eprints-tech-bounces@ecs.soton.ac.uk&gt; on behalf of Tomasz Neugebauer &lt;Tomasz.Neugebauer@concordia.ca&gt;<br>
<b>Sent:</b> 20 August 2015 17:58<br>
<b>To:</b> eprints-tech@ecs.soton.ac.uk<br>
<b>Subject:</b> [EP-tech] duplicate detection in EPrints 3.3</font>
<div> </div>
</div>
<div>
<div>
<p style="margin:0cm 0cm 0.0001pt; font-size:11pt; font-family:Calibri,sans-serif">
<span lang="EN-US">I would like to run a script that will go through my repository (3.3.12) and report any likely duplicates based on title (and possibly author).</span></p>
<p style="margin:0cm 0cm 0.0001pt; font-size:11pt; font-family:Calibri,sans-serif">
<span lang="EN-US">What is the best way of doing this?</span></p>
<p style="margin:0cm 0cm 0.0001pt; font-size:11pt; font-family:Calibri,sans-serif">
<span lang="EN-US"> </span></p>
<p style="margin:0cm 0cm 0.0001pt; font-size:11pt; font-family:Calibri,sans-serif">
<span lang="EN-US">I found the following two plugins in EPrints Files:</span></p>
<p style="margin:0cm 0cm 0.0001pt; text-indent:-18pt; font-size:11pt; font-family:Calibri,sans-serif">
<span lang="FR-CA" style="font-family:Symbol"><span style="">·<span style="font:7.0pt 'Times New Roman'">
</span></span></span><span lang="FR-CA">Sebastien Francois’ EPrints 2 script: </span>
<span lang="EN-US"><a href="http://files.eprints.org/107/" style="color:rgb(5,99,193); text-decoration:underline"><span lang="FR-CA">http://files.eprints.org/107/</span></a></span><span lang="FR-CA"></span></p>
<p style="margin:0cm 0cm 0.0001pt; text-indent:-18pt; font-size:11pt; font-family:Calibri,sans-serif">
<span lang="EN-US" style="font-family:Symbol"><span style="">·<span style="font:7.0pt 'Times New Roman'">
</span></span></span><span lang="EN-US">Jon Hallett’s EPrints 3&gt;3.1 script: <a href="http://files.eprints.org/640/" style="color:rgb(5,99,193); text-decoration:underline">
http://files.eprints.org/640/</a> </span></p>
<p style="margin:0cm 0cm 0.0001pt; font-size:11pt; font-family:Calibri,sans-serif">
<span lang="EN-US"> </span></p>
<p style="margin:0cm 0cm 0.0001pt; font-size:11pt; font-family:Calibri,sans-serif">
<span lang="EN-US">In addition, </span></p>
<p style="margin:0cm 0cm 0.0001pt; text-indent:-18pt; font-size:11pt; font-family:Calibri,sans-serif">
<span lang="EN-US" style="font-family:Symbol"><span style="">·<span style="font:7.0pt 'Times New Roman'">
</span></span></span><span lang="EN-US">There is a title_duplicates script in /cgi/users/lookup/
<a href="http://wiki.eprints.org/w/Cgi/users/lookup/" style="color:rgb(5,99,193); text-decoration:underline">
http://wiki.eprints.org/w/Cgi/users/lookup/</a></span></p>
<p style="margin:0cm 0cm 0.0001pt; text-indent:-18pt; font-size:11pt; font-family:Calibri,sans-serif">
<span lang="EN-US" style="font-family:Symbol"><span style="">·<span style="font:7.0pt 'Times New Roman'">
</span></span></span><span lang="EN-US">Page 40 of this file (<a href="http://www.eprints.org/software/training/programming/api_techniques.pdf" style="color:rgb(5,99,193); text-decoration:underline">http://www.eprints.org/software/training/programming/api_techniques.pdf</a>)
refers to a duplicate detection script in the bin folder as an example. I couldn’t find this script, so it is probably just an illustration of what could be done.</span></p>
<p style="margin:0cm 0cm 0.0001pt; font-size:11pt; font-family:Calibri,sans-serif">
<span lang="EN-US"> </span></p>
<p style="margin:0cm 0cm 0.0001pt; font-size:11pt; font-family:Calibri,sans-serif">
<span lang="EN-US"> </span></p>
<p style="margin:0cm 0cm 0.0001pt; font-size:11pt; font-family:Calibri,sans-serif">
<span lang="EN-US" style="color:black">Is </span><span>Jon Hallett’s script in EPrints Files the most up-to-date version available?
</span></p>
<p style="margin:0cm 0cm 0.0001pt; font-size:11pt; font-family:Calibri,sans-serif">
<span>Has anyone created a Bazaar version for duplicate detection, or is there something more recent that I am missing?</span></p>
<p style="margin:0cm 0cm 0.0001pt; font-size:11pt; font-family:Calibri,sans-serif">
<span style="color:black"> </span></p>
<p style="margin:0cm 0cm 0.0001pt; font-size:11pt; font-family:Calibri,sans-serif">
<span>Tomasz</span></p>
<p style="margin:0cm 0cm 0.0001pt; font-size:11pt; font-family:Calibri,sans-serif">
<span lang="EN-GB" style="font-size:8.0pt; font-family:'Courier New'; color:#A6A6A6">________________________________________________</span></p>
<p style="margin:0cm 0cm 0.0001pt; font-size:11pt; font-family:Calibri,sans-serif">
<span style="font-size:9.0pt; font-family:'Arial',sans-serif; color:black">Tomasz Neugebauer<span style="background:white"><br>
</span>Digital Projects &amp; Systems Development Librarian<span style="background:white"><br>
</span>Libraries / Bibliothèques<br>
Concordia University / Université Concordia<b><br>
<br>
</b></span><span style="color:black"></span></p>
</div>
</div>
</div>
</div>
</body>
</html>