[EP-tech] Re: Normalize characters for correct sorting

Ian Stuart Ian.Stuart at ed.ac.uk
Tue Jun 9 08:57:36 BST 2015


I suspect this is a Perl problem rather than an EPrints problem. I
would expect Perl to sort by Unicode code point (so U+0386 before U+0391).
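
To illustrate the code-point ordering described above, here is a minimal
sketch. It is written in Python rather than Perl and is not EPrints code;
the word list is made up for the example. It only shows what a plain
code-point sort does with U+0386 and U+0391.

```python
# Illustration only: Python's sorted() compares strings code point by
# code point, the same kind of ordering described above for a plain sort.
ALPHA_TONOS = "\u0386"  # GREEK CAPITAL LETTER ALPHA WITH TONOS
ALPHA = "\u0391"        # GREEK CAPITAL LETTER ALPHA
BETA = "\u0392"         # GREEK CAPITAL LETTER BETA

# Hypothetical sample titles, not taken from any repository.
words = [ALPHA + "γορά", ALPHA_TONOS + "ρθρο", BETA + "ιβλίο", ALPHA + "θήνα"]

by_code_point = sorted(words)

for word in by_code_point:
    # Show the code point of the first letter next to each word.
    print(f"U+{ord(word[0]):04X}  {word}")
```

The word starting with U+0386 comes out ahead of every word starting with
U+0391, so the accented and unaccented alphas end up in separate runs.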

On 09/06/15 08:40, pgasinos pgs wrote:
> Is there any configuration file in EPrints where one can normalize
> UTF-8 characters so that they sort correctly in non-English languages?
> For example the Unicode entities: Ά (U+0386) GREEK CAPITAL LETTER ALPHA
> WITH TONOS and
> Α (U+0391) GREEK CAPITAL LETTER ALPHA are the same and they have to be
> sorted together, not in separate lists.
> The vowels are even more complicated. All of the forms below are the
> same letter and have to be in the same list:
> U+03C5  υ  GREEK SMALL LETTER UPSILON
> U+03CD  ύ  GREEK SMALL LETTER UPSILON WITH TONOS
> U+03CB  ϋ  GREEK SMALL LETTER UPSILON WITH DIALYTIKA
> U+03B0  ΰ  GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND TONOS
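
One way to get the forms listed above to sort together is to sort on a
normalized key instead of the raw string. The following is a minimal sketch
in Python, not Perl and not an EPrints setting; the function name `sort_key`
and the sample words are made up for the example. It decomposes each string
(NFD), drops the combining marks, and case-folds the rest. In Perl, the
Unicode::Normalize module provides NFD and Unicode::Collate provides full
Unicode collation; whether and where EPrints could hook those in is not
covered here.

```python
import unicodedata


def sort_key(text):
    """Hypothetical helper: accent-insensitive, case-insensitive sort key."""
    # NFD splits e.g. U+03B0 into UPSILON + DIALYTIKA + TONOS.
    decomposed = unicodedata.normalize("NFD", text)
    # Drop the combining marks (general category Mn), keep the base letters.
    base = "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")
    return base.casefold()


# The four upsilon forms from the list above.
upsilons = ["\u03c5", "\u03cd", "\u03cb", "\u03b0"]
upsilon_keys = {sort_key(ch) for ch in upsilons}

# Made-up sample titles; U+0386 and U+0391 are the two alphas.
words = ["\u0391γορά", "\u0386ρθρο", "\u0392ιβλίο", "\u0391θήνα"]
grouped = sorted(words, key=sort_key)

print(upsilon_keys)
print(grouped)
```

All four upsilon forms map to the same key, and the word starting with
U+0386 now sorts among the words starting with U+0391. This is only an
approximation of proper collation: it ignores language-specific rules,
which a real collation algorithm handles.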


-- 

Ian Stuart.
Developer: ORI, RJ-Broker, and OpenDepot.org
Bibliographics and Multimedia Service Delivery team,
EDINA,
The University of Edinburgh.

http://edina.ac.uk/
