<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1">
<style type="text/css" style="display:none;"> P {margin-top:0;margin-bottom:0;} </style>
</head>
<body dir="ltr">
<div style="padding-bottom: 10px; padding-top: 5px;">
<div style="padding:12px; border:1px solid #8D3970; background-color:#F7F9FA; color:#8D3970; font-size:14px; line-height:22px; font-family: Calibri, Arial, Helvetica, sans-serif;">
<strong>CAUTION:</strong> This e-mail originated outside the University of Southampton.
</div>
</div>
<div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Hello Sonu,</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
In general, I think that repository software, metadata standards, and search engines need to do a better job of making internationalized, multilingual content accessible. I've been thinking a lot about this as part of an Ideas Challenge team for the upcoming
Open Repositories conference. I wasn't aware of the multilingual Bazaar package, so thanks for mentioning it.</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
There are different "levels" of internationalization/translation, and it isn't clear from your question which one(s) you need:</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<ul>
<li>Interface: Translations of the repository <b>interface</b> and metadata field
<b>labels</b></li><li>Metadata: Translations of metadata field <b>values</b>, for example a translation of the
<b>title</b>, <b>abstract</b>, or <b>keywords</b> (the three fields that have conventionally received this treatment) into a language other than that of the full text; this is provided for accessibility</li><li>Content: Translations of the <b>full-text</b>.</li></ul>
</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Here are some points on "best practices" to keep in mind, as far as I was able to learn:</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<ul>
<ul>
<li><b style="font-weight:normal">
<h4 dir="ltr" style="line-height:1.38;margin-top:14pt;margin-bottom:4pt"><span style="font-size:12pt;font-family:Arial;color:#666666;font-weight:400">Language Codes (ISO)</span></h4>
<p dir="ltr" style="line-height:1.38;margin-top:0pt;margin-bottom:0pt"><span style="font-size:11pt;font-family:Arial;color:#000000;font-weight:400">Use either 2-letter (ISO 639-1) or 3-letter (ISO 639-2) language codes. See:
</span><a href="http://www.loc.gov/standards/iso639-2/php/code_list.php"><span style="font-size:11pt;font-family:Arial;color:#1155cc;font-weight:400;text-decoration:underline;text-decoration-skip-ink:none">http://www.loc.gov/standards/iso639-2/php/code_list.php</span></a></p>
<p dir="ltr" style="line-height:1.38;margin-top:0pt;margin-bottom:0pt"><span style="font-size:11pt;font-family:Arial;color:#000000;font-weight:400">BCP 47, whose language subtag registry IANA maintains, uses the 2-letter code whenever one exists and a 3-letter code only when it does not.<br>
</span></p>
</b></li><li>
<p dir="ltr" style="font-weight: normal; line-height: 1.38; background-color: rgb(255, 255, 255); margin-top: 0pt; margin-bottom: 0pt;">
<b style=""><span style="font-size:12pt;font-family:Calibri,sans-serif;color:#000000;font-weight:400">SciELO, PubMed Central, and many publishers use JATS, the Journal Article Tag Suite (</span><a href="https://en.wikipedia.org/wiki/Journal_Article_Tag_Suite"><span style="font-size:12pt;font-family:Calibri,sans-serif;color:#1155cc;font-weight:400;text-decoration:underline;text-decoration-skip-ink:none">https://en.wikipedia.org/wiki/Journal_Article_Tag_Suite</span></a><span style="font-size:12pt;font-family:Calibri,sans-serif;color:#000000;font-weight:400">),
a NISO standard for encoding scholarly articles. More detail is available here:
</span><a href="https://jats4r.org/"><span style="font-size:12pt;font-family:Calibri,sans-serif;color:#1155cc;font-weight:400;text-decoration:underline;text-decoration-skip-ink:none">https://jats4r.org/</span></a></b></p>
</li><li>
<p dir="ltr" style="line-height: 1.38; background-color: rgb(255, 255, 255); margin-top: 0pt; margin-bottom: 0pt;">
<span style="color: rgb(102, 102, 102); font-size: 11pt;"><b>xml:lang attribute</b></span></p>
</li><ul>
<li>
<p dir="ltr" style="font-weight: normal; line-height: 1.38; background-color: rgb(255, 255, 255); margin-top: 0pt; margin-bottom: 0pt;">
<span style="color: rgb(102, 102, 102); font-size: 11pt;"><b style="color:rgb(0, 0, 0);font-size:16px"><span style="margin:0px;font-weight:400;font-size:12pt;font-family:Calibri, sans-serif">The DTDs of the JATS schema (</span><a href="https://github.com/JATS4R/jats-dtds/tree/a53dd76b4dd393028015de00e5760b39b36176e2/schema" style="margin:0px"><span style="margin:0px;font-weight:400;font-size:12pt;font-family:Calibri, sans-serif;color:rgb(17, 85, 204);text-decoration:underline;text-decoration-skip-ink:none">https://github.com/JATS4R/jats-dtds/tree/a53dd76b4dd393028015de00e5760b39b36176e2/schema</span></a><span style="margin:0px;font-weight:400;font-size:12pt;font-family:Calibri, sans-serif">)
show that the </span><span style="margin:0px;font-weight:700;font-size:12pt;font-family:Calibri, sans-serif">xml:lang</span><span style="margin:0px;font-weight:400;font-size:12pt;font-family:Calibri, sans-serif"> attribute can be applied
to almost any element; see: </span><a href="https://jats.nlm.nih.gov/articleauthoring/tag-library/1.2/attribute/xml-lang.html" style="margin:0px"><span style="margin:0px;font-weight:400;font-size:12pt;font-family:Calibri, sans-serif;color:rgb(17, 85, 204);text-decoration:underline;text-decoration-skip-ink:none">https://jats.nlm.nih.gov/articleauthoring/tag-library/1.2/attribute/xml-lang.html</span></a></b><br>
</span></p>
</li></ul>
<li>
<p dir="ltr" style="line-height: 1.38; background-color: rgb(255, 255, 255); margin-top: 0pt; margin-bottom: 0pt;">
<span style="font-weight: 400; font-size: 10.5pt; font-family: Calibri, sans-serif; color: rgb(0, 0, 0);">I didn't find any recommendations for multilingual metadata (translated title, abstract, or keywords) in the
</span><b><span style="font-size: 10.5pt; font-family: Calibri, sans-serif; color: rgb(0, 0, 0);">OpenAIRE guidelines for literature repositories
</span></b><span style="font-weight: 400; font-size: 10.5pt; font-family: Calibri, sans-serif; color: rgb(0, 0, 0);">(</span><a href="https://openaire-guidelines-for-literature-repository-managers.readthedocs.io/en/v4.0.0/application_profile.html" style="font-weight: normal;"><span style="font-size:10.5pt;font-family:Calibri,sans-serif;color:#1155cc;font-weight:400;text-decoration:underline;text-decoration-skip-ink:none">https://openaire-guidelines-for-literature-repository-managers.readthedocs.io/en/v4.0.0/application_profile.html</span></a><span style="font-weight: 400; font-size: 10.5pt; font-family: Calibri, sans-serif; color: rgb(0, 0, 0);">).</span></p>
<p dir="ltr" style="font-weight: normal; line-height: 1.38; background-color: rgb(255, 255, 255); margin-top: 0pt; margin-bottom: 0pt;">
<span style="font-size:10.5pt;font-family:Calibri,sans-serif;color:#000000;font-weight:400">As far as I can tell, they record language only at the "content" level and do not explain how to provide translated titles, abstracts, or keywords. So only content-level language information can be supplied, not metadata-level.</span></p>
</li><li>
<p dir="ltr" style="font-weight: normal; line-height: 1.38; background-color: rgb(255, 255, 255); margin-top: 0pt; margin-bottom: 0pt;">
<span style="font-size:10.5pt;font-family:Calibri,sans-serif;color:#000000;font-weight:400">Search engines like Google Scholar appear to favor full-text translations (content level) and have difficulty, whether as a matter of policy or technology, indexing translated metadata (metadata level), especially multilingual content that isn't clearly partitioned into "pages" containing only one language each.</span></p>
</li></ul>
</ul>
</div>
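<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
As a rough illustration of the language-code point above, here is a minimal Python sketch that prefers the 2-letter code when one exists and otherwise keeps the 3-letter code. The names ISO_639_2_TO_1 and preferred_language_code are hypothetical, and the table is a small hand-picked subset of the code list, not the full standard.</div>
<pre>
```python
# Hypothetical, hand-picked subset of the ISO 639 code list. A real system
# would load the complete list instead of typing it by hand.
ISO_639_2_TO_1 = {
    "eng": "en",  # English
    "hin": "hi",  # Hindi
    "kan": "kn",  # Kannada
    "tam": "ta",  # Tamil
}


def preferred_language_code(code):
    """Return the 2-letter code when one is known, else the code as given."""
    normalized = code.strip().lower()
    if len(normalized) == 2:
        # Already a 2-letter code, nothing to shorten.
        return normalized
    # Fall back to the 3-letter code when no 2-letter equivalent is listed.
    return ISO_639_2_TO_1.get(normalized, normalized)


for code in ("hin", "kan", "ta", "bho"):
    print(code, preferred_language_code(code))
```
</pre>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Bhojpuri ("bho") is included because it has only a 3-letter code, so the function leaves it unchanged.</div>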
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
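<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
To make the xml:lang point concrete, here is a small Python sketch that builds a JATS-style title group with translated titles tagged by language. The element names (title-group, article-title, trans-title-group, trans-title) follow my reading of the JATS tag library; the function name build_title_group is hypothetical, and the titles are placeholder strings rather than real translations.</div>
<pre>
```python
import xml.etree.ElementTree as ET

# Clark-notation name that ElementTree serializes as the xml:lang attribute.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def build_title_group(title, translations):
    """Build a JATS-style title-group with one trans-title-group per language.

    translations maps a language code (for example "hi") to the translated title.
    """
    group = ET.Element("title-group")
    ET.SubElement(group, "article-title").text = title
    for lang, translated in translations.items():
        trans_group = ET.SubElement(group, "trans-title-group")
        # The language tag goes on the wrapper of each translated title.
        trans_group.set(XML_LANG, lang)
        ET.SubElement(trans_group, "trans-title").text = translated
    return group


# Placeholder strings stand in for real translations.
group = build_title_group(
    "Example title in English",
    {"hi": "Hindi translation of the title", "ta": "Tamil translation of the title"},
)
print(ET.tostring(group, encoding="unicode"))
```
</pre>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
The same pattern should carry over to translated abstracts and keyword groups, since the attribute is allowed on almost any element.</div>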
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
I think there is a useful discussion to be had here about how we can improve our systems and infrastructure to support multilingual access. It is important, and I believe we can and need to do better.</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
Tomasz</div>
<div style="font-family: Calibri, Arial, Helvetica, sans-serif; font-size: 12pt; color: rgb(0, 0, 0);">
<br>
</div>
<div>
<div id="appendonsend"></div>
<div style="font-family:Calibri,Arial,Helvetica,sans-serif; font-size:12pt; color:rgb(0,0,0)">
<br>
</div>
<hr tabindex="-1" style="display:inline-block; width:98%">
<div id="divRplyFwdMsg" dir="ltr"><font face="Calibri, sans-serif" color="#000000" style="font-size:11pt"><b>From:</b> eprints-tech-bounces@ecs.soton.ac.uk &lt;eprints-tech-bounces@ecs.soton.ac.uk&gt; on behalf of Sonu Yadav via Eprints-tech &lt;eprints-tech@ecs.soton.ac.uk&gt;<br>
<b>Sent:</b> Thursday, May 20, 2021 4:24 AM<br>
<b>To:</b> eprints-tech@ecs.soton.ac.uk &lt;eprints-tech@ecs.soton.ac.uk&gt;<br>
<b>Subject:</b> [EP-tech] about the multi-lingual metadata field</font>
<div> </div>
</div>
<div>
<div>
<div dir="ltr">Dear all,
<div><br>
</div>
<div>I have documents in Hindi, English, Kannada, Tamil, etc.</div>
<div>On the summary page, I need to show the metadata field names, such as title, abstract, and contributors, in Hindi and other vernacular languages. I imported the multilingual Bazaar package, but it only translates the title and abstract field names into Hindi.
I need to do this for all metadata field names.</div>
<div><br>
</div>
<div>What is the best practice for this, and how do I do it?</div>
<div><br>
</div>
<div>Thanks, and Regards,</div>
<div>Sonu</div>
</div>
</div>
</div>
</div>
</div>
</body>
</html>