<html><head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body>
<p>Hi James,</p>
<p>Yes, that does look like a bit if a complex issue. The URL it is
pointing to on EPrints seems to be its representation of the
record through Symplectic's RT1 rt4eprints handler. Thay URL
looks ok once you decode it but it is not the same URL as the file
URL for EPrints, although I would expect it to return the same
things. However, the rt4eprints handler may then parse the file
parameter in the get header is some interestingw ay. Semi-colons
are often used as a separator in various different ways so it may
think you have specified two files:</p>
<p>Induction with Thymoglobulin in High-Risk Renal Transplant
Patients</p>
<p>Beauty and the Beast.pdf</p>
<p>What is does in this situation is uncertain without reviewing
their code. It may just try to recover one or other of the
potential files neither of which exists, so it fails. I think you
are probably right that RT2 will fix this as there will no longer
be an rt4eprints handler to which the URL is refering.</p>
<p>Regards</p>
<p>David Newman<br>
</p>
<div class="moz-cite-prefix">On 17/02/2021 12:52, James Kerwin
wrote:<br>
</div>
<blockquote type="cite" cite="mid:CAKkNZ9CGHiukFZzGo5=0qANWLHTpAshqpyeupUuuPO2EsLBDcQ@mail.gmail.com">
<div style="padding-bottom: 10px; padding-top: 5px;">
<div style="padding:12px; border:1px solid #8D3970;
background-color:#F7F9FA; color:#8D3970; font-size:14px;
line-height:22px; font-family: Calibri, Arial, Helvetica,
sans-serif;">
<strong>CAUTION:</strong> This e-mail originated outside the
University of Southampton.
</div>
</div>
<div>
<div dir="ltr">Right it did not take long to refamiliarise
myself. The following EPrints record has one file with a
semicolon in the file name (";" for the avoidance of doubt). I
can access the file on EPrints, but in Elements via Chrome and
FireFox I get quite similar errors.
<div><br>
</div>
<div><b>EPrints Record:</b> <a href="https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Flivrepository.liverpool.ac.uk%2F3008387%2F&data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7Cfb1a3312d8574fab00e008d8d344cc3e%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637491639725645575%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=XPhhqgZpoilYwB9Lrb%2Bunz1L8iiBltyc%2BjRL5bG7dIY%3D&reserved=0" originalSrc="https://livrepository.liverpool.ac.uk/3008387/" shash="u9Evx6PcTwAtfm9iH17lixRIYff0ZibZjLrIVLe5DdYEvM+4z3x0utvp3M9ENfQkDFkKovwyDdAhUHMS0kISrG8O3opZQLDX2ywJ/r3MlgqHeqY+Npx1aNuQVySZAUKraG/0rxMU4+DRKPgUg27olNoUPF76JGHRPrJHFwnj4lg=" originalsrc="https://livrepository.liverpool.ac.uk/3008387/" shash="Oyrsof5KrtClBYLrFZ0Ip9Q9Uwfi8KczfEGMO06CPlAB2SxXq1h0jQsdQ+O1GlfVessysBdLy5grFVGo6appqxf8MHltoxYyRNDl1JKp0ouVzBgZEAmD7f/a9W4mhVMg8ylUW6L9ZEdhK/KyM1YECV3eJO2kZMBD2d9srMaP0us=" moz-do-not-send="true">https://livrepository.liverpool.ac.uk/3008387/</a></div>
<div><b>File link/url:</b> <a href="https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Flivrepository.liverpool.ac.uk%2F3008387%2F1%2FInduction%2520with%2520Thymoglobulin%2520in%2520High-Risk%2520Renal%2520Transplant%2520Patients%253B%2520Beauty%2520and%2520the%2520Beast.pdf&data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7Cfb1a3312d8574fab00e008d8d344cc3e%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637491639725650554%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=J%2FxCrgyq6f3U85kN%2BuIej0vJJbFIeITcSJH47fwMYPU%3D&reserved=0" originalSrc="https://livrepository.liverpool.ac.uk/3008387/1/Induction%20with%20Thymoglobulin%20in%20High-Risk%20Renal%20Transplant%20Patients%3B%20Beauty%20and%20the%20Beast.pdf" shash="t24BjrVNd7V+gkQLXkpAS/E4Evyos5tO0CdFMhAfSaBZJrGvv9n2gl0lhuI4M2Wcd/d+sXpWqiFqzZgpcLRwEARXROcTaVRTeMi0d1pRetGu+iovGZUbbkmkZDseb6/069bDD8M04eSTxEzFwdyYkk7Kf09HI84MIAc7OQlA2nw=" originalsrc="https://livrepository.liverpool.ac.uk/3008387/1/Induction%20with%20Thymoglobulin%20in%20High-Risk%20Renal%20Transplant%20Patients%3B%20Beauty%20and%20the%20Beast.pdf" shash="oxGoBUFe5WVtus5K6Brf05u64oOeIZx3UP+ORy7DIiH+/FWyRdXAC27IH+GSqF48uuntshV656NJVdabixPWpazL7SlbrCsSIvOMHDmpwYiidV4ZdfkR1Cl+PWRy8LjEThrmsym7rESp4X4Be+6X9NXL0qh8BA35+wMflnIVByw=" moz-do-not-send="true">https://livrepository.liverpool.ac.uk/3008387/1/Induction%20with%20Thymoglobulin%20in%20High-Risk%20Renal%20Transplant%20Patients%3B%20Beauty%20and%20the%20Beast.pdf</a></div>
<div><br>
</div>
<div>In Elements when I go to the record and click the file
icon I would expect it to download or open in the browser,
instead I get an error:</div>
<div><br>
</div>
<blockquote style="margin:0 0 0 40px;border:none;padding:0px">
<blockquote style="margin:0 0 0
40px;border:none;padding:0px">
<div>The web page at <a href="https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Felements.liverpool.ac.uk%2Frepository.html%3Fcom%3Dget-file%26publication-id%3D220114%26rfid%3Dhttps%253A%252F%252Flivrepository.liverpool.ac.uk%252Frt4eprints%252Ffile%252F106106%252FInduction%252520with%252520Thymoglobulin%252520in%252520High-Risk%252520Renal%252520Transplant%252520Patients%25253B%252520Beauty%252520and%252520the%252520Beast.pdf&data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7Cfb1a3312d8574fab00e008d8d344cc3e%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637491639725660509%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=vd30SuhKbzM8JJDWc8mUTdcN3bFKoUgsiW8ZxY8JZsU%3D&reserved=0" originalSrc="https://elements.liverpool.ac.uk/repository.html?com=get-file&publication-id=220114&rfid=https%3A%2F%2Flivrepository.liverpool.ac.uk%2Frt4eprints%2Ffile%2F106106%2FInduction%2520with%2520Thymoglobulin%2520in%2520High-Risk%2520Renal%2520Transplant%2520Patients%253B%2520Beauty%2520and%2520the%2520Beast.pdf" shash="YVRbcC5XC59OrTNKRJRolYXb6kBSFbrkVL/8aq0gV2hP9RfdhhkBU5f80YqT0VbG19beFbKWL/YmKaxE75ZFFgq8LrnSyKSg15IHiDkhsKE3EoN0J+ULEILDLhvf+TCFeKA5nWybNqEdxReTe01PQjLw8sIYgrai/Qz0yeZHSNY=" originalsrc="https://elements.liverpool.ac.uk/repository.html?com=get-file&publication-id=220114&rfid=https%3A%2F%2Flivrepository.liverpool.ac.uk%2Frt4eprints%2Ffile%2F106106%2FInduction%2520with%2520Thymoglobulin%2520in%2520High-Risk%2520Renal%2520Transplant%2520Patients%253B%2520Beauty%2520and%2520the%2520Beast.pdf" shash="EDkn2roh8FPTJRhpf/To7JOJfNuVsKcmnwdHM8E3RgCXPTMr/alLGgN40fudUai0F6EiwmhYp1w4I5pruGcvbs6Ypwg2gEPdUNbCchxqRMRye0s80h4fbn9bG/TxLra4u6KJ+eTU0lP+FqOOTykvWzwI/bPQXHGmzT4kPJ2tfco=" moz-do-not-send="true">
https://elements.liverpool.ac.uk/repository.html?com=get-file&publication-id=220114&rfid=https%3A%2F%2Flivrepository.liverpool.ac.uk%2Frt4eprints%2Ffile%2F106106%2FInduction%2520with%2520Thymoglobulin%2520in%2520High-Risk%2520Renal%2520Transplant%2520Patients%253B%2520Beauty%2520and%2520the%2520Beast.pdf</a></div>
<div><br>
</div>
</blockquote>
</blockquote>
Going through this with another file on another record that
has no ";" in the filename it works as expected.
<div><br>
</div>
<div>Looking at the links from both EPrints and Elements
there's some sort of difference in the link to the file:</div>
<div><br>
</div>
<div><a href="https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Flivrepository.liverpool.ac.uk%2F3008387%2F1%2FInduction%2520with%2520Thymoglobulin%2520in%2520High-Risk%2520Renal%2520Transplant%2520Patients%253B%2520Beauty%2520and%2520the%2520Beast.pdf&data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7Cfb1a3312d8574fab00e008d8d344cc3e%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637491639725665490%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=9XZyIZ2C0RGUgrR1yNVV6grezvyXNg86MeCyWiEUGpk%3D&reserved=0" originalSrc="https://livrepository.liverpool.ac.uk/3008387/1/Induction%20with%20Thymoglobulin%20in%20High-Risk%20Renal%20Transplant%20Patients%3B%20Beauty%20and%20the%20Beast.pdf" shash="GpmiiWQluR3PZdfNaVt6ZzwZM5D1+AzJBs355nuXSPutlpEbwS+ZgrSi5UMdcVEVpADoVqr5jJeHpVMwjf1Fz67susV4Fp6BAd7kN4LcFXVB9/EwM+7l/JDBe01/5ukdBL46qpbtUpJHV+Mq6u0ZlGNrlbis26qiSyN/kcjVcJ0=" originalsrc="https://livrepository.liverpool.ac.uk/3008387/1/Induction%20with%20Thymoglobulin%20in%20High-Risk%20Renal%20Transplant%20Patients%3B%20Beauty%20and%20the%20Beast.pdf" shash="Cua5ESthnVQYv94sZSywenzHUW2E+85lN8DjfZKhfCeq/HWsSeB0IbCRWctheqTV0PgKgPrtQiqS7jk9c6OLNC7qnqmIoaI1am4xLHRJy58HGy83M32ckMLa0kqfY1QxJ6gQneUzF9zRIp0OTnU03AcEdo7pCDYNb43Gz40dLjk=" moz-do-not-send="true">https://livrepository.liverpool.ac.uk/3008387/1/Induction%20with%20Thymoglobulin%20in%20High-Risk%20Renal%20Transplant%20Patients%3B%20Beauty%20and%20the%20Beast.pdf</a>
<div><br>
</div>
<div>https%3A%2F%<a href="https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2F2flivrepository.liverpool.ac.uk%2F&data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7Cfb1a3312d8574fab00e008d8d344cc3e%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637491639725670463%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=yAlzEISRO7xo6nHhQDWVWaTrTtLGWXtR6NhLs0U0RyI%3D&reserved=0" originalSrc="http://2flivrepository.liverpool.ac.uk/" shash="dIWb/xvCVCqOLVtsjP+jDK0uZTQ0ZyxvV/sI6Arg80cX6iyZ6OkNTM7pCROTe+Sd1N7SUxQMcBJ1oTEBgnc5QIewUBFmUuKrSFt+K8vZ3XCNCMBCqOLLnSTr6L9O74/qQdswl7hLtm6IJc7YYOtLgdHPevi/9T+RLqmu56Wa6OY=" originalsrc="http://2flivrepository.liverpool.ac.uk/" shash="xpERxHIhpznLZohsct/PxcUyYvvJXI/xkDx18o+ft1hZ0ZMKyyLnG643q7oyXddW168kjQ6i27XBRNT/qawKL8YJYdzL2INB80laUmpS3r3f7P7dA/KUiRkWziLVY7DRd3uoFYQ4yM8jUHclXVK1YIFw50FmULPpNppPc1teeP8=" moz-do-not-send="true">2Flivrepository.liverpool.ac.uk</a>%2Frt4eprints%2Ffile%2F106106%2FInduction%2520with%2520Thymoglobulin%2520in%2520High-Risk%2520Renal%2520Transplant%2520Patients%253B%2520Beauty%2520and%2520the%2520Beast.pdf
</div>
</div>
<div><br>
</div>
<div>This is an example of what we get for a working record:</div>
<div><br>
</div>
<div>Eprints: <a href="https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Flivrepository.liverpool.ac.uk%2F3115606%2F&data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7Cfb1a3312d8574fab00e008d8d344cc3e%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637491639725680420%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=CiGKYuD3qIXx3mhUemH9B2yjwlvqXhK%2FfpeJWfaCX%2BU%3D&reserved=0" originalSrc="https://livrepository.liverpool.ac.uk/3115606/" shash="EO6hRFDGAemQjFkzHMJJaILInEmmEUxAQPLAXYT1lSY6k71WJaPqDhfehBjrwRFe95s7E4wFMlqXkfrNo3VcvrKqSbdh5WxwVRyQ5UZS57xfgo7iz3IyJ4X7TgmZFxcOyOD1NaBEVNjA6paI9PFASOlBaDlUKgOn6X9ChpvUQic=" originalsrc="https://livrepository.liverpool.ac.uk/3115606/" shash="WF1IdW6vnyw6yUytyIMM+2OjLkCUMynpk8CzwDz5Y6RdwoandJWkva1oefB/J+YVRQUoyMw7m/GjmArLW6KQacW4hEb3v6VYIED7wf+2nEF23B124mxfQ2o2fKirpz35DClVt7eydm5Wddhl1TUTMIqxBlNR2SezLdtKqrcm8CQ=" moz-do-not-send="true">https://livrepository.liverpool.ac.uk/3115606/</a></div>
<div>EPrints File Link:</div>
<div><a href="https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Flivrepository.liverpool.ac.uk%2F3115606%2F1%2FLee%2520et%2520al%2520Erratum%25202021pdf.pdf&data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7Cfb1a3312d8574fab00e008d8d344cc3e%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637491639725685403%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=dL%2Fa52BhnaQinXDakmDog6aTniErF7PqNFuWdgFrufY%3D&reserved=0" originalSrc="https://livrepository.liverpool.ac.uk/3115606/1/Lee%20et%20al%20Erratum%202021pdf.pdf" shash="c5AfOGzoeQEbNxSZ5ReAOk6e8pTUUcGjS8MJby/NoOZare2N4dckxqaAQ3lt67GEQ7qQBdml5zasVdf/bxv5TSXqhkPSMxSoPFGeAm5MtC5lf6pK+9Tx512KXo5rwANui2IsrdF6BzjrrpEyTBO/MO0dLrVK5bPPhkAJJhfKUXw=" originalsrc="https://livrepository.liverpool.ac.uk/3115606/1/Lee%20et%20al%20Erratum%202021pdf.pdf" shash="qWoTlHmXlcgPmaBjvGPRYmd8wE/FBKGF6k7BUdSgDWQ7BSu7LKZcmDYRef+UBCqBAOqJdeDY/mZe66RhGUs6m/m6kBF37Rp8p3vATyD4twVVJ/Ldy2dqZCHjT6f2Bq+dZpJJBk4gCSMP8SShP1HcRRmE0mOYo+8KqVLwmR8q/qw=" moz-do-not-send="true">https://livrepository.liverpool.ac.uk/3115606/1/Lee%20et%20al%20Erratum%202021pdf.pdf</a><br>
</div>
<div>Elements File Link:</div>
<div><a href="https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Felements.liverpool.ac.uk%2Frepository.html%3Fcom%3Dget-file%26publication-id%3D486086%26rfid%3Dhttps%253A%252F%252Flivrepository.liverpool.ac.uk%252Frt4eprints%252Ffile%252F371659%252FLee%252520et%252520al%252520Erratum%2525202021pdf.pdf&data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7Cfb1a3312d8574fab00e008d8d344cc3e%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637491639725690377%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=Xiu6NuzQgAMSWPdsv7xCxzhm1IpjYJ1168NIxuh4T64%3D&reserved=0" originalSrc="https://elements.liverpool.ac.uk/repository.html?com=get-file&publication-id=486086&rfid=https%3A%2F%2Flivrepository.liverpool.ac.uk%2Frt4eprints%2Ffile%2F371659%2FLee%2520et%2520al%2520Erratum%25202021pdf.pdf" shash="n7Y0xl7yAEMOplzkwo5MogeJ1yKa8Itsw3RZ3nIgMqM99kfBNoo1sxbDeHZTXThMIqKxDxe8zD9u/pjTsjESwzBwJ7gd+5fyrK15iOvYVBV5YGabCk+0eomlFP8tIut9agX1MP+e67LfWBmLegC3a+1h23vQpf3X0/bksj5WrzQ=" originalsrc="https://elements.liverpool.ac.uk/repository.html?com=get-file&publication-id=486086&rfid=https%3A%2F%2Flivrepository.liverpool.ac.uk%2Frt4eprints%2Ffile%2F371659%2FLee%2520et%2520al%2520Erratum%25202021pdf.pdf" shash="g9JPNNeYfMyb17QliC17kLjaMulBJgWotQEAaf7UJ5C2EgPHKJV59wfSZkwQyIctzNJm/K8WMG3UCPeGLhjO6vxaZ7Xdh9KjoxQm4f1DzmSy9/spPeEA48ZuuezuFA6Br7wjmHPnZOS+VBoelhK4kCeAhBhyD3lukXbf1pTRNDg=" moz-do-not-send="true">https://elements.liverpool.ac.uk/repository.html?com=get-file&publication-id=486086&rfid=https%3A%2F%2Flivrepository.liverpool.ac.uk%2Frt4eprints%2Ffile%2F371659%2FLee%2520et%2520al%2520Erratum%25202021pdf.pdf</a><br>
</div>
<div><br>
</div>
<div>The only consistent difference is the presence of these
characters, but I can't really understand where it's going
wrong. I had wanted to take a proper look at it and get in
touch with Symplectic, but people have the audacity to keep
finding new problems and giving me more work... We are due
to upgrade to RT2 soonish (2nd half of this year probably)
and I am hoping all of this goes away with that.</div>
<div><br>
</div>
<div>Thanks,</div>
<div>James</div>
</div>
<br>
<div class="gmail_quote">
<div dir="ltr" class="gmail_attr">On Wed, Feb 17, 2021 at
12:18 PM James Kerwin <<a href="mailto:jkerwin2101@gmail.com" moz-do-not-send="true">jkerwin2101@gmail.com</a>>
wrote:<br>
</div>
<blockquote class="gmail_quote" style="margin:0px 0px 0px
0.8ex;border-left:1px solid
rgb(204,204,204);padding-left:1ex">
<div dir="ltr">Hi David,
<div><br>
</div>
<div>Thank you for your reply. Unfortunately I don't have
access to the Elements database(s) but I've explained
this issue to our Elements people and hopefully should
get a response. Meanwhile, some time ago Mr Salter gave
me the means to extract the Elements xml and transform
it via the crosswalks outside of EPrints, so I may do
that with the different records and see what we get.
Doing this has only just now occurred to me now so I'll
give it a go.</div>
<div><br>
</div>
<div>On the subject of the character in question... The
error code comes from (I think!):</div>
<div><br>
</div>
<div>eprints3/perl_lib/URI/Escape.pm</div>
<div><br>
</div>
<div>Specifically here in the _fail_hi sub:</div>
<div><br>
</div>
<blockquote style="margin:0px 0px 0px
40px;border:none;padding:0px">
<blockquote style="margin:0px 0px 0px
40px;border:none;padding:0px">
<div>
<blockquote style="margin:0px 0px 0px
40px;border:none;padding:0px">
<div>"sub uri_escape {<br>
my($text, $patn) = @_;<br>
return undef unless defined $text;<br>
if (defined $patn){<br>
unless (exists $subst{$patn}) {<br>
# Because we can't compile the regex
we fake it with a cached sub<br>
(my $tmp = $patn) =~ s,/,\\/,g;<br>
eval "\$subst{\$patn} = sub {\$_[0]
=~ s/([$tmp])/\$escapes{\$1} ||
_fail_hi(\$1)/ge; }";<br>
Carp::croak("uri_escape: $@") if $@;<br>
}<br>
&{$subst{$patn}}($text);<br>
} else {<br>
$text =~
s/($Unsafe{RFC3986})/$escapes{$1} ||
_fail_hi($1)/ge;<br>
}<br>
$text;<br>
}<br>
<br>
sub _fail_hi {<br>
my $chr = shift;<br>
Carp::croak(sprintf "Can't escape \\x{%04X},
try uri_escape_utf8() instead", ord($chr));"<br>
</div>
<div><br>
</div>
</blockquote>
</div>
</blockquote>
</blockquote>
<div>The FULL error log line says:</div>
<div><br>
</div>
<blockquote style="margin:0px 0px 0px
40px;border:none;padding:0px">
<blockquote style="margin:0px 0px 0px
40px;border:none;padding:0px">
<blockquote style="margin:0px 0px 0px
40px;border:none;padding:0px">
<div>Can't escape \\x{2019}, try uri_escape_utf8()
instead at /opt/eprints3/perl_lib/URI/Escape.pm
line 178.\n\tURI::Escape::_fail_hi('\xe2\x80\x99')
called at /opt/eprints3/perl_lib/URI/Escape.pm
line 171\n\tURI::Escape::uri_escape('Published by
the American Physical Society under the terms
of...') called at (eval 177) line
82\n\tEPrints::Config::uolrepo::__ANON__('dataset',
'EPrints::DataSet=HASH(0x7f21238f9358)',
'repository',
'Symplectic::Wrappers::EPrintsSession=HASH(0x7f2124610710)',
'dataobj',
'EPrints::DataObj::EPrint=HASH(0x7f21285879b0)',
'changed', 'HASH(0x7f212d684f18)') called at
/opt/eprints3/perl_lib/EPrints/DataSet.pm line
1517\n\tEPrints::DataSet::run_trigger('EPrints::DataSet=HASH(0x7f21238f9358)',
105, 'dataobj',
'EPrints::DataObj::EPrint=HASH(0x7f21285879b0)',
'changed', 'HASH(0x7f212d684f18)') called at
/opt/eprints3/perl_lib/EPrints/DataObj.pm line
669\n\tEPrints::DataObj::commit('EPrints::DataObj::EPrint=HASH(0x7f21285879b0)',
undef) called at
/opt/eprints3/perl_lib/EPrints/DataObj/EPrint.pm
line
1011\n\tEPrints::DataObj::EPrint::commit('EPrints::DataObj::EPrint=HASH(0x7f21285879b0)')
called at
/opt/eprints3/perl_lib/Symplectic/RepoProcess/MetadataManager.pm
line
355\n\tSymplectic::RepoProcess::MetadataManager::add_preferred_bibliographic('Symplectic::RepoProcess::MetadataManager=HASH(0x7f2123858468)',
'eprint',
'EPrints::DataObj::EPrint=HASH(0x7f21285879b0)',
'raw_record',
'XML::LibXML::Document=SCALAR(0x7f212858bb60)',
'types', 'ARRAY(0x7f21254315a0)', 'limit_to',
'ARRAY(0x7f21215fceb8)', ...) called at
/opt/eprints3/perl_lib/Symplectic/RepoProcess/MetadataManager.pm
line
240\n\tSymplectic::RepoProcess::MetadataManager::add_bibliographic('Symplectic::RepoProcess::MetadataManager=HASH(0x7f2123858468)',
'eprint',
'EPrints::DataObj::EPrint=HASH(0x7f21285879b0)',
'publication',
'Symplectic::PubsModel::Publication=HASH(0x7f212d6b7fe8)')
called at
/opt/eprints3/perl_lib/Symplectic/RepoProcess/IngestWorkflow.pm
line
203\n\tSymplectic::RepoProcess::IngestWorkflow::update_metadata('Symplectic::RepoProcess::IngestWorkflow=HASH(0x7f212858f348)',
'eprint',
'EPrints::DataObj::EPrint=HASH(0x7f21285879b0)',
'publication',
'Symplectic::PubsModel::Publication=HASH(0x7f212d6b7fe8)',
'auth_details',
'Symplectic::PubsModel::AuthDetails=HASH(0x7f212d785c38)',
'record',
'Symplectic::RepoModel::PublicationsRecord=HASH(0x7f212c73f510)',
...) called at
/opt/eprints3/perl_lib/Symplectic/RepoProcess/PublicationManager.pm
line
65\n\tSymplectic::RepoProcess::PublicationManager::get_deposit_representation('Symplectic::RepoProcess::PublicationManager=HASH(0x7f212d7ac290)',
'publication',
'Symplectic::PubsModel::Publication=HASH(0x7f212d6b7fe8)',
'auth_details',
'Symplectic::PubsModel::AuthDetails=HASH(0x7f212d785c38)')
called at
/opt/eprints3/perl_lib/Symplectic/Process/FileDepositProcessor.pm
line
148\n\tSymplectic::Process::FileDepositProcessor::handle('Symplectic::Process::FileDepositProcessor=HASH(0x7f212d6d73b0)',
'pid', 485375, 'auth_details',
'Symplectic::PubsModel::AuthDetails=HASH(0x7f212d785c38)',
'deposit_props',
'Symplectic::PubsModel::DepositProperties=HASH(0x7f212e8a0440)',
'atom', 'CGI::File::Temp=GLOB(0x7f212d7fae08)',
...) called at
/opt/eprints3/perl_lib/Symplectic/Handlers/RepositoryHandler.pm
line
235\n\tSymplectic::Handlers::RepositoryHandler::post_handler('session',
'Symplectic::Wrappers::EPrintsSession=HASH(0x7f2124610710)', 'request',
'Apache2::RequestRec=SCALAR(0x7f212e8a77a8)',
'auth_details',
'Symplectic::PubsModel::AuthDetails=HASH(0x7f212d785c38)')
called at
/opt/eprints3/perl_lib/Symplectic/Handlers/RepositoryHandler.pm
line
109\n\tSymplectic::Handlers::RepositoryHandler::handler_multi('Apache2::RequestRec=SCALAR(0x7f212e8a77a8)',
undef) called at
/opt/eprints3/perl_lib/Symplectic/Apache/Rewrite.pm
line
98\n\tSymplectic::Apache::Rewrite::__ANON__('Apache2::RequestRec=SCALAR(0x7f212e8a77a8)')
called at -e line 0\n\teval {...} called at -e
line 0\n</div>
</blockquote>
</blockquote>
</blockquote>
<div><br>
</div>
<div>I'm making some big assumptions, but I THINK the
"\\x{%04X}" is saying "take 4 characters from the result
of ord($chr) and put them here". I'm possibly very
wrong. I think any solution for this needs to belong in
the Symplectic code on the repo server. I don't fancy
altering core EPrints code for the sake of this. I'll be
in a whole world of hell before I know it. Yesterday
when tracing this I ended up at:</div>
<div><br>
</div>
<div>eprints3/symplectic/perl_lib/Symplectic/RepoProcess/MetadataManager.pm</div>
<div><br>
</div>
<div>Reading through the code it appears to identify the
preferred record and start processing it. Perhaps this
is a good opportunity to intervene and either swap bad
characters for good ones or encode/decode "properly" (as
if I know what I'm talking about). Complicated slightly
by not being able to thoroughly test it. I suppose
another option would be to see what XSLT etc. can do
with regard to this and so catch the problem within the
crosswalks.</div>
<div><br>
</div>
<div>If we verify the manual record in Elements it gets a
higher precedence than the Scopus record and so the
problem disappears.</div>
<div><br>
</div>
<div>Regarding the other problem with the file link I will
need to refamiliarise myself with it and I'll reply
later. Plus this email is already wordy enough as it is!</div>
<div><br>
</div>
<div>Thanks,</div>
<div>James</div>
<div><br>
</div>
<div><br>
</div>
</div>
<br>
<div class="gmail_quote">
<div dir="ltr" class="gmail_attr">On Wed, Feb 17, 2021 at
10:31 AM David R Newman <<a href="mailto:drn@ecs.soton.ac.uk" target="_blank" moz-do-not-send="true">drn@ecs.soton.ac.uk</a>>
wrote:<br>
</div>
<blockquote class="gmail_quote" style="margin:0px 0px 0px
0.8ex;border-left:1px solid
rgb(204,204,204);padding-left:1ex">
<div>
<p>Hi James,</p>
<p>I think you would need to look at this field in the
Elements record in its database to look how it is
being stored differently when there is an import
compared to where there is manual entry. As you
said I think the problem is in part that text box
entries get parsed and encoded before going into the
database but imports do not (or at very least the
process between input and output to the Elements
database is different). It would be useful to know
how they look different in the Elements database as
they may assist making EPrints more resilient to
unexpected encodings in future. <br>
</p>
<p>However "\\x{2019}" looks like an escaped version
of something that is not particularly valid. If
this was "\\u{2019}" this would probably work better
as \x I think can only be used to represent a
standard ASCII character that can be only two hex
digits like \x3a is a colon ":". \u is used for the
extended character set (i.e. UTF-16). \u{2019} in
UTF-8 would be \xE2\x80\x99.<br>
</p>
<p>It would be interesting to get a bit more
information about your other issue with regular
quote marks and semi-colons that are part of the
standard ASCII set rather than an extended
characters. These really should not be causing a
problem.</p>
<p>Regards</p>
<p>David Newman<br>
</p>
<div>On 17/02/2021 09:44, James Kerwin via
Eprints-tech wrote:<br>
</div>
<blockquote type="cite">
<div style="padding-bottom:10px;padding-top:5px">
<div style="padding:12px;border:1px solid
rgb(141,57,112);background-color:rgb(247,249,250);color:rgb(141,57,112);font-size:14px;line-height:22px;font-family:Calibri,Arial,Helvetica,sans-serif"><strong>CAUTION:</strong>
This e-mail originated outside the University of
Southampton.
</div>
</div>
<div>
<div dir="ltr">Hi All,<br>
<div><br>
</div>
<div>This is an Elements/EPrints question.
Apologies that it isn't purely EPrints, but
this is probably the best place to get an
answer. I want to know if others experience
this or it's some oddity to our setup.</div>
<div><br>
</div>
<div>We are using RT1 (for now) and EPrints
3.3.14 (also for now until upgrade).
Occasionally we get an Elements record that is
from Scopus, PubMed etc. that has an odd
character in it that prevents upload. When I
look in the Apache logs it tells me the
problem. Yesterday's one was the presence of:<br>
<br>
"Unicode Character “’” (U+2019)" <br>
<br>
Which showed in the logs as:<br>
<br>
"Can't escape \\x{2019}, try uri_escape_utf8()
instead at
/opt/eprints3/perl_lib/URI/Escape.pm"<br>
<br>
Importantly if I copy the problem characters
to the manual elements record it doesn't pose
a problem. There appears some processing to
properly encode characters entered via text
box, but not characters dragged in from other
sources into Elements.<br>
<br>
I've also had the issue with the files
containing "'" or" ";" etc not being
accessible via Elements (a very similar, but
different problem).<br>
<br>
I found where I COULD fix the former issue,
but it involves changing EPrints code when I
SHOULD be altering the Symplectic connector
code on the repo server.<br>
<br>
Anyway, I'm not specifically looking for a
solution, but has anybody else experienced
anything similar? If so, does it stop with
RT2? I hope to raise a ticket with Symplectic
over this eventually.</div>
<div><br>
</div>
<div>Thanks,</div>
<div>James<br>
<br>
<br>
</div>
</div>
</div>
<br>
<fieldset></fieldset>
<pre>*** Options: <a href="http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech" target="_blank" moz-do-not-send="true">http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech</a>
*** Archive: <a href="https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.eprints.org%2Ftech.php%2F&data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7Cfb1a3312d8574fab00e008d8d344cc3e%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637491639725700340%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=ZklZjXgYsipZJ5iPTN%2B7Ew7Y%2BygNyI0R5lvYHdp66uE%3D&reserved=0" originalSrc="http://www.eprints.org/tech.php/" shash="PCEki2CPJZ5sXr5MxPTRImUCruGao9RZ0apuzwKvR+XtR9GQv43JuAjdAOsP1wmE9C8YVnmJyCUWH7iHb5eVIt4AhvrZxP/APulqcNsPtKN1G8/qRoj62HljooNQqR4X/V3fKW3VuqcB/WZYSkMm+WpCp1mKLA9Ydi4pNA2Ea/A=" originalsrc="http://www.eprints.org/tech.php/" shash="PxPfReUwBIT1ar0uTZjd40gEEZFO0EERG9BfzHg5dncRR566P9qRjnitlgeImJiktOxKf4ijkto7owZcZrfjIwjWsqHnBKLUZbRS3ls7L0lmfra1aD/MizTMDV3Y6HpDgKvl39bu1xopxWSKODqsquM5X22BSwLndmbg1tnXAto=" target="_blank" moz-do-not-send="true">http://www.eprints.org/tech.php/</a>
*** EPrints community wiki: <a href="https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwiki.eprints.org%2F&data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7Cfb1a3312d8574fab00e008d8d344cc3e%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637491639725705311%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=bxN%2FTYMlPm3boCWtgspSFkELJwo3AnFV1t40F6D%2BxNw%3D&reserved=0" originalSrc="http://wiki.eprints.org/" shash="bjfu1Opfb5Re83X7BnNT0TOe78HenobxUbwjcNUu2FsbCyf1TB5NUxDSkvD6/n5DlvMS01KqkrmwQzgryCqmIGKFoIEjqpffv+3jSVeMy9/7QOHG5yIliIvxpb8rAR8m+gthN6ANHtou2loDrvpCk5sKmz4l7OipJ0lEdlFayAQ=" originalsrc="http://wiki.eprints.org/" shash="dVdhpGDIz4q3nChGZwrRTHKbuFToTYsbwnjLUoX8nou/wzDdiHq4JPjtcujnYzX8DLS1P5uBCOe6LK4dF1PmLpXOH5Qpv1l8gul5bKJ1WCjasLja87LqCn0cJ0bufHR6tsGraUdQMc4mFJbA+t4iFUIh/nO/Hn/UYw9b3zLQmu8=" target="_blank" moz-do-not-send="true">http://wiki.eprints.org/</a></pre>
</blockquote>
<div id="gmail-m_8232395957973897161gmail-m_8377127222243325055DAB4FAD8-2DD7-40BB-A1B8-4E2AA1F9FDF2"><br>
<table style="border-top:1px solid rgb(211,212,222)">
<tbody>
<tr>
<td style="width:55px;padding-top:13px"><a href="https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.avg.com%2Femail-signature%3Futm_medium%3Demail%26utm_source%3Dlink%26utm_campaign%3Dsig-email%26utm_content%3Demailclient&data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7Cfb1a3312d8574fab00e008d8d344cc3e%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637491639725710294%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=E4%2BQevytQyfrWSKNc%2FargRac0IUHZj3H90vmRQCZZLk%3D&reserved=0" originalSrc="http://www.avg.com/email-signature?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=emailclient" shash="AMxXm33/8Yue0yVsy2PQvyUsnJFsVfD1Vm9av811ghRVrjh03znRB13v8wsp2Hq+6Gds2KEOFOzbBN+0L6JOFzDEZztGI8UIyCrKSeG8/AVVNqq5ibb/JKNoDKGhgJF5UHw4EeN2XiFs5lq7T54TDrecba9qiazxo+/a8JhibnY=" originalsrc="http://www.avg.com/email-signature?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=emailclient" shash="OD1aE+s+3Wa0MLkl8awV8DMYJ2rKKetiCUmpdNVL+bB0Glb8EAKRocsib4snaCJj1THz/5/BXIxNS/YPa+Te/PIVS+/ThcQ/B02jo75c0/bo39pQdUOdEjhnbPhNj9L07p4DIDwLuGBzYVXXeRGuTpHXyhiQ0oi/ogy3eeW0sFQ=" target="_blank" moz-do-not-send="true"><img src="https://ipmcdn.avast.com/images/icons/icon-envelope-tick-green-avg-v1.png" alt="" style="width: 46px; height:
29px;" moz-do-not-send="true" width="46" height="29"></a></td>
<td style="width:470px;padding-top:12px;color:rgb(65,66,78);font-size:13px;font-family:Arial,Helvetica,sans-serif;line-height:18px">Virus-free.
<a href="https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.avg.com%2Femail-signature%3Futm_medium%3Demail%26utm_source%3Dlink%26utm_campaign%3Dsig-email%26utm_content%3Demailclient&data=04%7C01%7Ceprints-tech%40ecs.soton.ac.uk%7Cfb1a3312d8574fab00e008d8d344cc3e%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637491639725715269%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=ycuEM%2FoGXYL%2FxP4ad0WKz9E6JQzHXhT62lM8s3mTKUY%3D&reserved=0" originalSrc="http://www.avg.com/email-signature?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=emailclient" shash="zydBcPvEbSMyH36RvK9Q3YWOFvhZLBMHNgXpiziu+hKo5S/2xmHDBcPKLdPM/3+ngvNxNAypawlscUJWxSHRieuch8h+arGvkQnbW6VWhWmHYpfetp/aMk+zLwublrzeXLUie2vD/ItsxuX8JC9E+zcUjiWuLliyQD8LvGmYBBw=" originalsrc="http://www.avg.com/email-signature?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=emailclient" shash="kyXR6T+GBTMHZtNr7u8LGjdplqjnjhWfvWHi6GSmyDes8aIWEwCH1+2s4SLl2sKHAAgJFQObqm8HN9Wwg7bK1Fli42KPOeU6YRH5HGhOW+rPWdZme7/n2ggRxTdC7Q/6UN5PRIZDSXJEGcAubKSeqxbxY3sb5zFSKQFocYYJQiY=" style="color:rgb(68,83,234)" target="_blank" moz-do-not-send="true">
www.avg.com</a> </td>
</tr>
</tbody>
</table>
</div>
</div>
</blockquote>
</div>
</blockquote>
</div>
</div>
</blockquote>
</body>
</html>