<html><head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
  </head>
  <body>
    <p>Hi James,</p>
    <p>Fortunately (or unfortunately) I have had quite a few thoughts on
      the matter.&nbsp; I have done my best to keep them to the point.<br>
    </p>
    <p>First, I don't think it is possible to account for the same item
      being in multiple repositories.&nbsp; As an individual institutional
      repository owner you have no control over other institutional
      repositories, which may share authors on publications and have
      the right to make the same publication available
      themselves.&nbsp; From my background in the Semantic
      Web, I know that determining whether two things with different unique
      identifiers are actually the same thing is a near-impossible
      problem to solve definitively.&nbsp; The best you can do is ensure the
      same unique identifier is not somehow used for two different
      things, and avoid creating and using more unique identifiers
      than are absolutely necessary.</p>
    <p>EPrints has always had a unique identifier in the form of a URI
      (e.g. <a class="moz-txt-link-freetext" href="http://eprints.example.org/id/eprint/123">http://eprints.example.org/id/eprint/123</a>).&nbsp; I would suggest
      this is the most appropriate unique identifier to use as every
      item in your repository will have one but not every item will
      necessarily have a DOI or similar unique identifier.&nbsp; You could
      configure your repository to use a DOI minting service (e.g. data
      repositories often use DataCite) but this rather breaks the rule
      of not creating more unique identifiers than are absolutely
      necessary.&nbsp; <br>
    </p>
    <p>One potential problem I have noted with EPrints URIs is that
      these were all originally http, but if you modify your HTTPS
      configuration to ensure HTTPS is used everywhere, then these URIs
      will likely also change to https, making them non-persistent,
      which is another big no-no.&nbsp; For this reason, early on in EPrints
      3.4 I introduced a configuration property, 'uri_url', so that
      you can modify a repository's HTTPS configuration and, with this
      option set, still keep the URIs as http.&nbsp; In
      the context of a unique identifier, you need to treat
      the URI as a string of characters: if this string
      changes, then it is no longer the same unique
      identifier, even though it still describes the same thing.&nbsp; <br>
    </p>
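    <p>To illustrate the string-comparison point, here is a minimal
      Python sketch.&nbsp; It is not EPrints code: URI_BASE, eprint_uri and
      same_identifier are hypothetical names made up for this example,
      and the only idea taken from above is that the identifier is
      built from a fixed base rather than from whatever the live site
      URL happens to be.</p>

```python
# Hypothetical sketch: none of these names are EPrints APIs.

# Assumed fixed base, chosen once and never changed afterwards.
URI_BASE = "http://eprints.example.org"


def eprint_uri(eprint_id, base=URI_BASE):
    """Build the identifier string from a fixed base, not from the live site URL."""
    return f"{base}/id/eprint/{eprint_id}"


def same_identifier(a, b):
    """Identifiers are compared as plain strings, character for character."""
    return a == b


original = eprint_uri(123)
# What the URI would become if it followed a switch of the site to HTTPS.
after_https_switch = eprint_uri(123, base="https://eprints.example.org")

print(same_identifier(original, eprint_uri(123)))     # True
print(same_identifier(original, after_https_switch))  # False
```

    <p>Both URIs would describe the same item, but only the first
      comparison succeeds, which is why the base needs to stay fixed
      once identifiers have been published.</p>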
    <p>I think you also identified another potential problem with the
      structure of an EPrints URI, namely a change to the
      hostname of the repository itself.&nbsp; Again, the uri_url option
      should allow you to ensure URIs do not change.&nbsp; Unfortunately,
      this may confuse users, who will wonder why the hostname
      in these URIs differs from the hostname of the repository.&nbsp;
      Also, depending on what happens to the old hostname's DNS
      registration, these URIs may become unresolvable.&nbsp; However, there
      is no requirement for a URI, or any other unique identifier, to be
      resolvable.<br>
    </p>
    <p>If an item has a DOI provided by a journal, an ISBN provided by a
      book publisher, etc. then this would typically be more useful than
      an institutional repository's URI, as this would be used in a
      general context (i.e. you would expect a DOI or ISBN to appear in
      the citation for such an item).&nbsp; However, I think that to provide the
      best possible coverage there is a need for both forms of unique
      identifier: the one from the original publisher (if that is not
      the institutional repository, which would likely be the case for
      theses, etc.) and one from the institutional repository.&nbsp; If you
      provide export formats that include both unique identifiers and
      can be ingested by third-party applications, these
      build a link between the two, and it becomes possible to build a network
      of unique identifiers for a particular item.&nbsp; Then when you get a
      journal article that has authors from multiple institutions, it
      will be possible to see that a publication from institution A is
      the same publication as from institution B.</p>
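    <p>As a rough sketch of how such a network could be assembled, the
      Python below groups identifiers that are linked through any
      shared record.&nbsp; Everything in it is an assumption for
      illustration: the record layout, the example hostnames, the DOI
      10.1234/example.1 and the function link_identifiers are all
      hypothetical, and a real export format would need its own
      parsing.</p>

```python
from collections import defaultdict

# Hypothetical export records: each lists every identifier one repository
# holds for a single item (its own URI plus any publisher identifier).
records = [
    {"source": "Institution A",
     "ids": ["http://eprints.a.example.org/id/eprint/123", "doi:10.1234/example.1"]},
    {"source": "Institution B",
     "ids": ["http://eprints.b.example.org/id/eprint/987", "doi:10.1234/example.1"]},
    {"source": "Institution A",
     "ids": ["http://eprints.a.example.org/id/eprint/124"]},
]


def link_identifiers(records):
    """Group identifiers that are connected through any shared record."""
    neighbours = defaultdict(set)
    for record in records:
        first, rest = record["ids"][0], record["ids"][1:]
        neighbours[first].update(rest)  # also registers items with one identifier
        for other in rest:
            neighbours[other].add(first)

    groups, seen = [], set()
    for start in neighbours:
        if start in seen:
            continue
        # Depth-first walk collecting everything reachable from this identifier.
        group, stack = set(), [start]
        while stack:
            node = stack.pop()
            if node in group:
                continue
            group.add(node)
            stack.extend(neighbours[node] - group)
        seen |= group
        groups.append(group)
    return groups


groups = link_identifiers(records)
for group in groups:
    print(sorted(group))
```

    <p>Here the two repository URIs from institutions A and B end up
      in the same group because both records carry the same DOI, while
      the item with no publisher identifier stays in a group of its
      own.</p>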
    <p>Regards</p>
    <p>David Newman<br>
    </p>
    <p><br>
    </p>
    <p>On 28/04/2021 10:02, James Kerwin via Eprints-tech wrote:<br>
    </p>
    <blockquote type="cite" cite="mid:EMEW3|78d372fc7a2e0bf131f446655293956ax3RA3j14eprints-tech-bounces|ecs.soton.ac.uk|CAKkNZ9BSXLBgD0SKnDo+Xdk66z7YVefph1y=bJi_zGGJJgvpRA@mail.gmail.com">
      
      <div>
        <div dir="ltr">Hi All,<br>
          <div><br>
          </div>
          <div>For once I have not broken anything, just looking for
            opinions and advice.</div>
          <div><br>
          </div>
          <div>As part of Plan S we need to have persistent identifiers
            for scholarly publications. I have read this EPrints wiki:<br>
            <br>
            <a href="https://wiki.eprints.org/w/Plan_S">https://wiki.eprints.org/w/Plan_S</a><br>
          </div>
          <div><br>
          </div>
          <div>At Liverpool we aren't 100% sure about this topic. DOI
            would be the obvious choice, but there are some on my team
            who reasonably point out that the same item could be in
            several repositories and end up having several separate DOIs
            associated with it. I'm not sure how much that matters.<br>
            <br>
            Does anybody have any thoughts on this point? We spoke with
            my predecessor, Adam, who was really helpful. Unconvinced
            team members have suggested using
            <a href="http://handle.net/">handle.net</a> which I think is
            overkill and doesn't necessarily meet the needs of Plan S in
            itself.<br>
            <br>
            Also, the URL/EPrints ID for each item, is this not a
            suitable persistent identifier? The wiki linked above does
            mention this. There's always the possibility a repository
            URL could change in the future, but I would expect some sort
            of redirect to overcome this.</div>
          <div><br>
          </div>
          <div>If there is a more suitable place for this type of
            discussion please send me there.<br>
          </div>
          <div><br>
          </div>
          <div>Thanks,</div>
          <div>James</div>
        </div>
      </div>
      <br>
      <fieldset class="mimeAttachmentHeader"></fieldset>
      <pre class="moz-quote-pre" wrap="">*** Options: <a class="moz-txt-link-freetext" href="http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech">http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech</a>
*** Archive: <a class="moz-txt-link-freetext" href="http://www.eprints.org/tech.php/">http://www.eprints.org/tech.php/</a>
*** EPrints community wiki: <a class="moz-txt-link-freetext" href="http://wiki.eprints.org/">http://wiki.eprints.org/</a></pre>
    </blockquote>
  </body>
</html>