[EP-tech] DataCite/minting problem

David R Newman drn at ecs.soton.ac.uk
Fri Dec 4 11:56:58 GMT 2020


Hi James,

At very cursory glance the hyphen used in 1189 looks like a long hyphen 
(e.g. an em dash) when as 1199 looks to be a regular hypen (i.e. 
typically on th key to the right of the 0 key on a regular keyboard).  I 
know that EPrints wraps up the metadata it sends to DataCite in XML.  It 
is possible that an encoding issue caused a problem with generating 
valid XML to send to DataCite's API.

I would try fixing the long hyphen in the title (for 1189) and also 
checking author's names and other fields that may have special 
characters (i.e. cannot be type form a standard QWERTY UK/US keyboard) 
and replace these if you can.  I am not exactly sure off hand what 
fields you need to fix, although the abstract may be worth a check as well.

It is likely there is a failed event queue task for minting a DOI for 
1189, so rather than clicking the "Coin DOI" button in the actions menu 
for this item, it is worth going into the event queue via the admin menu 
and resetting the appropriate task to "Waiting" and see if this runs 
successfully a second time.

In general, I have encountered encoding issues a lot when working with 
third party applications.  I have been trying to do a few things to 
address encoding issues in general for the next release of EPrints 3.4 
(i.e. the wide character error message that often occurs in the Apache 
error logs) but I have ot had a chance to see if these improvements help 
with third-party applications like DataCite and Twitter.

Regards

David Newman

On 04/12/2020 11:17, James Kerwin via Eprints-tech wrote:
> *CAUTION:* This e-mail originated outside the University of Southampton.
> Hi everyone,
>
> I hope we are all in good health and good spirits.
>
> I'm having some difficulty with DataCite minting DOIs for a batch of 
> records I uploaded to our data repository. I was awake until around 
> 3am trying to sort it and did not get far.
>
> These are records I imported myself by parsing an excel spreadsheet, 
> writing the EPrints XML and importing them. I am almost 100% sure that 
> the fault lies with me.
>
> Some are minting and some aren't. I did suspect that it may be 
> characters in the title field such as brackets, colons, hyphens etc. 
> but that doesn't appear to be the case. (Please excuse the state of 
> the abstract pages, I'm working on it.)
>
> Minted: https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fdatacat.liverpool.ac.uk%2F1199%2F&data=04%7C01%7C%7C5304de78fd8f48db533608d8984bb5f0%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637426798224606925%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=8jEUsWIbWrssD4VV1VqpzG6EumExhi%2F7qo%2BVcDbTECk%3D&reserved=0 
> <https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fdatacat.liverpool.ac.uk%2F1199%2F&amp;data=04%7C01%7C%7C5304de78fd8f48db533608d8984bb5f0%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637426798224606925%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=8jEUsWIbWrssD4VV1VqpzG6EumExhi%2F7qo%2BVcDbTECk%3D&amp;reserved=0>
>
> Not Minted: https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fdatacat.liverpool.ac.uk%2F1189%2F&amp;data=04%7C01%7C%7C5304de78fd8f48db533608d8984bb5f0%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637426798224606925%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=hvJ1zzm4yvVx6yTNlJHwJp8nli56SxFqszSxkWDsfqU%3D&amp;reserved=0 
> <https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fdatacat.liverpool.ac.uk%2F1189%2F&amp;data=04%7C01%7C%7C5304de78fd8f48db533608d8984bb5f0%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637426798224606925%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=hvJ1zzm4yvVx6yTNlJHwJp8nli56SxFqszSxkWDsfqU%3D&amp;reserved=0>
>
> Oddly it populates the DOI field and shows on the abstract. The 
> unminted DOIs are not on the DataCite dashboard.
>
> Any pointers would be welcome, otherwise I might need to mint 100 DOIs 
> manually, which I'm not super excited about.
>
> Thanks,
> James
>
> *** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
> *** Archive: https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.eprints.org%2Ftech.php%2F&amp;data=04%7C01%7C%7C5304de78fd8f48db533608d8984bb5f0%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637426798224606925%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=IW%2Fvae4H1eEfEaEV4zfBHOJJ%2B8AifeL8fryArXsB484%3D&amp;reserved=0
> *** EPrints community wiki: https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwiki.eprints.org%2F&amp;data=04%7C01%7C%7C5304de78fd8f48db533608d8984bb5f0%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637426798224606925%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=aoru8Vr8XAjKBBHCifqbNQJUZ79kpUiDpTZ%2Fk%2FODAq8%3D&amp;reserved=0


-- 
This email has been checked for viruses by AVG.
https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.avg.com%2F&amp;data=04%7C01%7C%7C5304de78fd8f48db533608d8984bb5f0%7C4a5378f929f44d3ebe89669d03ada9d8%7C0%7C0%7C637426798224606925%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&amp;sdata=KlAVbm0FWKSAblH6bcGDNIGZYCP7m27UNcV1Z2pj%2BNo%3D&amp;reserved=0
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20201204/dead3c16/attachment.html 


More information about the Eprints-tech mailing list