<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<!--[if !mso]><style>v\:* {behavior:url(#default#VML);}
o\:* {behavior:url(#default#VML);}
w\:* {behavior:url(#default#VML);}
.shape {behavior:url(#default#VML);}
</style><![endif]--><style><!--
/* Font Definitions */
@font-face
        {font-family:Helvetica;
        panose-1:2 11 6 4 2 2 2 2 2 4;}
@font-face
        {font-family:Wingdings;
        panose-1:5 0 0 0 0 0 0 0 0 0;}
@font-face
        {font-family:"Cambria Math";
        panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
        {font-family:Calibri;
        panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0cm;
        margin-bottom:.0001pt;
        font-size:11.0pt;
        font-family:"Calibri",sans-serif;}
a:link, span.MsoHyperlink
        {mso-style-priority:99;
        color:blue;
        text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
        {mso-style-priority:99;
        color:purple;
        text-decoration:underline;}
span.EmailStyle17
        {mso-style-type:personal;
        font-family:"Calibri",sans-serif;
        color:windowtext;}
span.EmailStyle19
        {mso-style-type:personal-reply;
        font-family:"Calibri",sans-serif;
        color:#1F497D;}
.MsoChpDefault
        {mso-style-type:export-only;
        font-size:10.0pt;}
@page WordSection1
        {size:612.0pt 792.0pt;
        margin:72.0pt 72.0pt 72.0pt 72.0pt;}
div.WordSection1
        {page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang="EN-GB" link="blue" vlink="purple">
<div style="padding-bottom: 10px; padding-top: 5px;">
<div style="padding:12px; border:1px solid #8D3970; background-color:#F7F9FA; color:#8D3970; font-size:14px; line-height:22px; font-family: Calibri, Arial, Helvetica, sans-serif;">
<strong>CAUTION:</strong> This e-mail originated outside the University of Southampton.
</div>
</div>
<div>
<div class="WordSection1">
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">Hi James,<br>
That's an 'interesting' set setup. The default (commented-out) offering for that set doesn't have the department.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">At a guess, it might have been added to create some disambiguation between authors of the same name, but in different departments - but that makes no sense, as it's using their IDs,
 not names.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US"><br>
<br>
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">To answer your question - it looks like a data-quality issue.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">The following are *<b>not</b>* the same thing:<o:p></o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">setName&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Person = Molecular and
<span style="background:yellow;mso-highlight:yellow">C</span>linical <span style="background:yellow;mso-highlight:yellow">
P</span>harmacology<o:p></o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">setSpec&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 706572736F6E3D4D6F6C6563756C617220616E6420<span style="background:yellow;mso-highlight:yellow">43</span>6C696E6963616C20<span style="background:yellow;mso-highlight:yellow">50</span>6861726D61636F6C6F6779<o:p></o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">setName&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Person = Molecular and
<span style="background:yellow;mso-highlight:yellow">c</span>linical <span style="background:yellow;mso-highlight:yellow">
p</span>harmacology<o:p></o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">setSpec&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 706572736F6E3D4D6F6C6563756C617220616E6420<span style="background:yellow;mso-highlight:yellow">63</span>6C696E6963616C20<span style="background:yellow;mso-highlight:yellow">70</span>6861726D61636F6C6F6779<o:p></o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">setName&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; Person =
<span style="background:yellow;mso-highlight:yellow">Department of</span> Molecular and Clinical Pharmacology<o:p></o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">setSpec&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 706572736F6E3D<span style="background:yellow;mso-highlight:yellow">4465706172746D656E74206F6620</span>4D6F6C6563756C617220616E6420436C696E6963616C20506861726D61636F6C6F6779<o:p></o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
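<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">NB the 'setSpec' is just 'person=' plus the name, hex-encoded (two hex digits per character), so any difference in case or wording in the name produces a separate set.<o:p></o:p></span></p>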
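<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">A minimal sketch of how the specs above could be decoded and checked for near-duplicates, assuming plain Perl outside EPrints. The variable names are illustrative only, and the case-insensitive grouping is just one possible definition of a 'duplicate':<o:p></o:p></span></p>
<pre>
```perl
#!/usr/bin/perl
use strict;
use warnings;

# The three setSpec values quoted above (hex-encoded "person=NAME").
my @set_specs = (
    "706572736F6E3D4D6F6C6563756C617220616E6420436C696E6963616C20506861726D61636F6C6F6779",
    "706572736F6E3D4D6F6C6563756C617220616E6420636C696E6963616C20706861726D61636F6C6F6779",
    "706572736F6E3D4465706172746D656E74206F66204D6F6C6563756C617220616E6420436C696E6963616C20506861726D61636F6C6F6779",
);

# Group the decoded values by a lower-cased key, so values that differ
# only in capitalisation land in the same bucket.
my %by_key;
for my $spec (@set_specs) {
    my $decoded = pack( "H*", $spec );    # two hex digits -> one byte
    push @{ $by_key{ lc $decoded } }, $decoded;
}

# Report only the buckets holding more than one spelling.
for my $key ( sort keys %by_key ) {
    my @variants = @{ $by_key{$key} };
    next unless @variants > 1;
    print "Near-duplicates of '$key':\n";
    print "  $_\n" for @variants;
}
```
</pre>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">Of the three specs above, only the first two should be reported together; the 'Department of' variant decodes to a different string, so it stays on its own.<o:p></o:p></span></p>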
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">My guidance would be:<o:p></o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">- feed your weblogs through a tool to analyse the OAI-PMH requests, and see who's using what. If no one is using the 'person' sets, I think removing their definitions would speed your
 OAI-PMH interface up. I guess they were added for a reason at some point though - hopefully someone somewhere will know something about them!<o:p></o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">- (possibly - based on the above) remove the 'Department' from that set definition.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">- add another set for 'divisions' based on the 'divisions' field you are using<o:p></o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">- on your test server add some sets for testing (see Andy's email) - this is a very useful approach for testing RT2 :-)<o:p></o:p></span></p>
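<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">For the first point, a rough sketch of the kind of log analysis meant, not a finished tool. It assumes Apache-style access-log lines arriving on standard input and the OAI-PMH interface living at /cgi/oai2 (as in the ListSets URL in this thread); the script and function names are made up for illustration:<o:p></o:p></span></p>
<pre>
```perl
#!/usr/bin/perl
use strict;
use warnings;

# Return the set named in one access-log line, "(no set)" for a request
# without a set parameter, or nothing if the line is not a fresh
# OAI-PMH request.
sub requested_set {
    my ($line) = @_;
    return unless $line =~ m{/cgi/oai2\?(\S+)};
    my $query = $1;

    # Follow-up requests carry only a resumptionToken, so skip them
    # rather than counting them as harvests without a set.
    return if $query =~ /\bresumptionToken=/;

    return "(no set)" unless $query =~ /\bset=([\w%:.\-]+)/;
    my $set = $1;
    $set =~ s/%([0-9A-Fa-f]{2})/chr(hex($1))/ge;    # undo URL-encoding
    return $set;
}

# Count requests per set across the whole log.
my %requests_per_set;
while ( defined( my $line = readline(*STDIN) ) ) {
    my $set = requested_set($line);
    $requests_per_set{$set}++ if defined $set;
}

printf "%6d  %s\n", $requests_per_set{$_}, $_ for sort keys %requests_per_set;
```
</pre>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">Saved under a hypothetical name such as oai_set_usage.pl, it could be run as: cat access_log | perl oai_set_usage.pl. Hex-looking specs in the output can be turned back into names with pack("H*", ...). If no 'person' specs show up over a decent period, that supports removing the definition.<o:p></o:p></span></p>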
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
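<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">As a sketch of the second and third points, the set definition might end up looking something like the fragment below. It follows the shape of the $oai-&gt;{sets} entry quoted in the original question and assumes the field really is called 'divisions'; any other entries already in the list would stay as they are:<o:p></o:p></span></p>
<pre>
```perl
# Excerpt from the repository's OAI configuration; $oai is the hash
# reference that file already defines. Untested sketch.
$oai->{sets} = [
    # 'department' dropped, so this set is built from IDs only
    { id=>"person", allow_null=>0, fields=>"contributors_id/editors_id" },
    # new set keyed on the divisions field (field name is an assumption)
    { id=>"divisions", allow_null=>0, fields=>"divisions" },
];
```
</pre>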
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">Cheers,<o:p></o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US">John<o:p></o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="color:#1F497D;mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<div>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal"><b><span lang="EN-US">From:</span></b><span lang="EN-US"> eprints-tech-bounces@ecs.soton.ac.uk [mailto:eprints-tech-bounces@ecs.soton.ac.uk]
<b>On Behalf Of </b>Andy Reid via Eprints-tech<br>
<b>Sent:</b> 20 January 2022 13:21<br>
<b>To:</b> eprints-tech@ecs.soton.ac.uk; James Kerwin &lt;jkerwin2101@gmail.com&gt;<br>
<b>Subject:</b> Re: [EP-tech] OAI Harvesting<o:p></o:p></span></p>
</div>
</div>
<p class="MsoNormal"><o:p>&nbsp;</o:p></p>
<div>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">Hi James,<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">When I was setting up RT2, I ignored the predefined sets in Elements, and created custom sets for testing and for production. I set up a cfg.d/zzz_symplectic_oai.pl, and split the production harvest
 into full-text-public, full-text-restricted, and full-text-none (metadata-only). I forget the thinking behind that split, but it does cover everything, I believe.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">I’m not sure if $c-&gt;{oai}-&gt;{custom_sets} is something that is set up and parsed by default, or if you might need to enable that first. It was there, and I could edit it, so I did.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">##############################&nbsp; PRODUCTION SETS ####################################################<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#&nbsp; These are used in earnest by Symplectic Repository Tools 2&nbsp;&nbsp;&nbsp;
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">####################################################################################################<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">push @{$c-&gt;{oai}-&gt;{custom_sets}}, { spec =&gt; &quot;full_text_none&quot;, name =&gt; &quot;full_text_none&quot;, filters =&gt; [<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; { meta_fields =&gt; [ &quot;full_text_status&quot; ], value=&gt;&quot;none&quot;, match=&gt;&quot;IN&quot;, merge=&gt;&quot;ANY&quot; },<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; { meta_fields =&gt; [ &quot;eprint_status&quot; ], value=&gt;&quot;archive&quot;, match=&gt;&quot;IN&quot;, merge=&gt;&quot;ANY&quot; },&nbsp; # live records only, not in review or deleted<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">] };<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">push @{$c-&gt;{oai}-&gt;{custom_sets}}, { spec =&gt; &quot;full_text_public&quot;, name =&gt; &quot;full_text_public&quot;, filters =&gt; [<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; { meta_fields =&gt; [ &quot;full_text_status&quot; ], value=&gt;&quot;public&quot;, match=&gt;&quot;IN&quot;, merge=&gt;&quot;ANY&quot; },<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; { meta_fields =&gt; [ &quot;eprint_status&quot; ], value=&gt;&quot;archive&quot;, match=&gt;&quot;IN&quot;, merge=&gt;&quot;ANY&quot; },<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <o:p>
</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">] };<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">push @{$c-&gt;{oai}-&gt;{custom_sets}}, { spec =&gt; &quot;full_text_restricted&quot;, name =&gt; &quot;full_text_restricted&quot;, filters =&gt; [<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; { meta_fields =&gt; [ &quot;full_text_status&quot; ], value=&gt;&quot;restricted&quot;, match=&gt;&quot;IN&quot;, merge=&gt;&quot;ANY&quot; },<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; { meta_fields =&gt; [ &quot;eprint_status&quot; ], value=&gt;&quot;archive&quot;, match=&gt;&quot;IN&quot;, merge=&gt;&quot;ANY&quot; },<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">] };<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">For testing I had a variety of scratch sets, using named users, years, or lists of Eprint IDs:
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">e.g.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">NAMED USER:<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">push @{$c-&gt;{oai}-&gt;{custom_sets}}, { spec =&gt; &quot;symplectic_andy_email&quot;, name =&gt; &quot;symplectic_andy_email&quot;, filters =&gt; [<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; { meta_fields =&gt; [ &quot;creators_id&quot; ], value=&gt;&quot;andy REID lshtm&quot;, match=&gt;&quot;IN&quot;, merge=&gt;&quot;ALL&quot; },<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">] };<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">SPECIFIC RECORDS:<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">push @{$c-&gt;{oai}-&gt;{custom_sets}}, { spec =&gt; &quot;symplectic_test&quot;, name =&gt; &quot;symplectic_test&quot;, filters =&gt; [<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; { meta_fields =&gt; [ &quot;eprintid&quot; ], value=&gt;&quot;<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 4645869
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;4645797
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;4645491
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;4645719
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;4645785<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 4363558<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 4398757<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 4433720
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;3451639
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;2783042
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;19260<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 1924927
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;333704<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 3172489<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 3174428<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; &nbsp;&nbsp;&nbsp;&nbsp;1878135<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 4646586<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 4645489<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 4647623<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; 4647670<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <o:p>
</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&quot;, <o:p>
</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;match=&gt;&quot;IN&quot;,
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;merge=&gt;&quot;ANY&quot; },<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">] };<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#4645869 = article, OA, 2017<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#4645797 = conference item, 2017<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#4645491 = thesis, 2017<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#4645719 = monograph<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#4645458 = other, OA guide , library<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#4363558 = book section [now recoded to article]<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#4398757 = [Accepted manuscript] of 4363558<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#3451639 = podcast<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#2783042 = video<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#2869451 = dataset<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#19260 = patent<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#1924927 = image<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#333704 = artefact<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#4646586 = exhibition<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#http://researchonline.lshtm.ac.uk/4645489/&nbsp; Teaching Resource<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#3172489 = [Accepted Manuscript]<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#3174428 = Final version of above<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">#1878135/ = [Inc; Grosskurth, H;]&nbsp; Manually added author&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">MULTIPLE FILTERS:<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">push @{$c-&gt;{oai}-&gt;{custom_sets}}, { spec =&gt; &quot;full_text_public_live_patel2016&quot;, name =&gt; &quot;full_text_public_live_patel2016&quot;, filters =&gt; [<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; { meta_fields =&gt; [ &quot;eprint_status&quot; ], value=&gt;&quot;archive&quot;, match=&gt;&quot;IN&quot;, merge=&gt;&quot;ANY&quot; },<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; { meta_fields =&gt; [ &quot;full_text_status&quot; ], value=&gt;&quot;public&quot;, match=&gt;&quot;IN&quot;, merge=&gt;&quot;ANY&quot; },<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; { meta_fields =&gt; [ &quot;view_date&quot; ], value=&gt;&quot;2016&quot;, match=&gt;&quot;IN&quot;, merge=&gt;&quot;ANY&quot; },&nbsp;
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; { meta_fields =&gt; [ &quot;creators_id&quot; ], value=&gt;&quot;vikram patel lshtm&quot;, match=&gt;&quot;IN&quot;, merge=&gt;&quot;ALL&quot; },&nbsp;&nbsp; # matches
<a href="mailto:Vikram.patel@lshtm.ac.uk">Vikram.patel@lshtm.ac.uk</a><o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <o:p>
</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">] };<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">Hope that is useful<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">Andy&nbsp;&nbsp;&nbsp;&nbsp;&nbsp; <o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p>&nbsp;</o:p></span></p>
<div style="border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal"><b><span style="font-size:12.0pt;color:black">From: </span></b><span style="font-size:12.0pt;color:black">&lt;<a href="mailto:eprints-tech-bounces@ecs.soton.ac.uk">eprints-tech-bounces@ecs.soton.ac.uk</a>&gt; on behalf of James Kerwin via Eprints-tech
 &lt;<a href="mailto:eprints-tech@ecs.soton.ac.uk">eprints-tech@ecs.soton.ac.uk</a>&gt;<br>
<b>Reply to: </b>&quot;<a href="mailto:eprints-tech@ecs.soton.ac.uk">eprints-tech@ecs.soton.ac.uk</a>&quot; &lt;<a href="mailto:eprints-tech@ecs.soton.ac.uk">eprints-tech@ecs.soton.ac.uk</a>&gt;, James Kerwin &lt;<a href="mailto:jkerwin2101@gmail.com">jkerwin2101@gmail.com</a>&gt;<br>
<b>Date: </b>Thursday, 20 January 2022 at 12:49<br>
<b>To: </b>&quot;<a href="mailto:eprints-tech@ecs.soton.ac.uk">eprints-tech@ecs.soton.ac.uk</a>&quot; &lt;<a href="mailto:eprints-tech@ecs.soton.ac.uk">eprints-tech@ecs.soton.ac.uk</a>&gt;<br>
<b>Subject: </b>[EP-tech] OAI Harvesting<o:p></o:p></span></p>
</div>
<div>
<p class="MsoNormal"><o:p>&nbsp;</o:p></p>
</div>
<div>
<p class="MsoNormal"><strong><span style="font-size:9.0pt;font-family:&quot;Helvetica&quot;,sans-serif;color:black">*** This message originated outside LSHTM ***</span></strong><span style="font-size:9.0pt;font-family:&quot;Helvetica&quot;,sans-serif;color:black"><o:p></o:p></span></p>
<div class="MsoNormal" align="center" style="text-align:center">
<hr size="1" width="100%" align="center">
</div>
</div>
<div>
<div>
<p class="MsoNormal">Hi All,<o:p></o:p></p>
<div>
<p class="MsoNormal"><o:p>&nbsp;</o:p></p>
</div>
<div>
<p class="MsoNormal">We're setting up RT2 (Elements) at the moment and working through some bugs. This is not a specific EPrints problem, but I'm hoping the collective wisdom of those here can provide some clarity...<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p>&nbsp;</o:p></p>
</div>
<div>
<p class="MsoNormal">In our OAI ListSets pages it has become apparent that we have duplicate sets. We appear to have a peculiar setup whereby we have:<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p>&nbsp;</o:p></p>
</div>
<div>
<p class="MsoNormal">$oai-&gt;{sets} = [<br>
{ id=&gt;&quot;person&quot;, allow_null=&gt;0, fields=&gt;&quot;contributors_id/editors_id/department&quot; }<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p>&nbsp;</o:p></p>
</div>
<div>
<p class="MsoNormal">This puts department in the person set. We don't even use department in our current EPrints records (we have Divisions which I've spoken about a LOT previously). What I'm curious about is:<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p>&nbsp;</o:p></p>
</div>
<div>
<p class="MsoNormal">1) How do duplicate sets come about? I thought the idea of a set would be if items have the same value they would be in the same set.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p>&nbsp;</o:p></p>
</div>
<div>
<p class="MsoNormal">2) Is there any easy way to identify the duplicate sets? Somebody from Symplectic that I'm working with was kind enough to point them out on our live repository and sure enough if I ctrl+f for &quot;Molecular and Clinical Pharmacology&quot; on
<a href="https://livrepository.liverpool.ac.uk/cgi/oai2?verb=ListSets">https://livrepository.liverpool.ac.uk/cgi/oai2?verb=ListSets</a> it appears twice.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p>&nbsp;</o:p></p>
</div>
<div>
<p class="MsoNormal">I've tried to learn about OAI, but it does unfortunately make my brain scream because I just do not understand it properly.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p>&nbsp;</o:p></p>
</div>
<div>
<p class="MsoNormal">Thanks,<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">James<o:p></o:p></p>
</div>
</div>
</div>
</div>
</div>
</div>
</body>
</html>