<html><body>
<p><font size="2" face="sans-serif">Dear Adam,</font><br>
<br>
<font size="2" face="sans-serif">Because Jens is out of the office for a few days, I am jumping in.</font><br>
<br>
<font size="2" face="sans-serif">I took one of the records that the saved search finds and that has a German umlaut in the creator's name (Krüger, G):</font><br>
<br>
<font size="2" face="sans-serif">bin/export zora eprint XMLforCMS2 95663</font><br>
<br>
<font size="2" face="sans-serif">This exports just fine, and we obtain the expected XML:</font><br>
<br>
<font size="2" face="sans-serif"><?xml version="1.0" encoding="utf-8" ?></font><br>
<font size="2" face="sans-serif"><eprints xmlns="http://eprints.org/ep2/data/2.0"></font><br>
<font size="2" face="sans-serif"> </font><br>
<font size="2" face="sans-serif"> <eprint id="http://www.zora.uzh.ch/id/eprint/95663"></font><br>
<font size="2" face="sans-serif"> <eprintid>95663</eprintid></font><br>
<font size="2" face="sans-serif"> <title>Krabben, Würmer, Schwein und Hund. Wie machen Tiere Geschichte?</title></font><br>
<font size="2" face="sans-serif"> <date>2014-04</date></font><br>
<font size="2" face="sans-serif"> <year_from_date>2014</year_from_date></font><br>
<font size="2" face="sans-serif"> <creators__editors_if_edited_scientific_work>Krüger, Gesine</creators__editors_if_edited_scientific_work></font><br>
<font size="2" face="sans-serif"> <first_creator__or__first_editor_if_edited_scientific_work>Krüger, Gesine</first_creator__or__first_editor_if_edited_scientific_work></font><br>
<font size="2" face="sans-serif"> <type_in_text>Book Section</type_in_text></font><br>
<font size="2" face="sans-serif"> <citation>Krüger, Gesine (2014). &lt;a href="http://www.zora.uzh.ch/95663" target="_blank" class="uzh" title="zoracitationlink 95663"&gt;Krabben, Würmer, Schwein und Hund. Wie machen Tiere Geschichte?&lt;/a&gt; In: Grumblies, Florian; Weise, Anton. Unterdrückung und Emanzipation in der Weltgeschichte. Zum Ringen um Freiheit, Kaffee und Deutungshoheit. Hannover, 26-41. ISBN 978-3-944342-47-4.</citation></font><br>
<font size="2" face="sans-serif"> <coins>url_ver=Z39.88-2004&amp;rft_id=http%3A%2F%2Fwww.zora.uzh.ch%2F95663&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&amp;rft.au=Kr%C3%BCger%2C+Gesine&amp;rft.aulast=Kr%C3%BCger&amp;rft.aufirst=Gesine&amp;rft.date=April+2014&amp;rft.isbn=978-3-944342-47-4&amp;rft.title=Krabben%2C+W%C3%BCrmer%2C+Schwein+und+Hund.+Wie+machen+Tiere+Geschichte%3F&amp;rft.btitle=Unterdr%C3%BCckung+und+Emanzipation+in+der+Weltgeschichte.+Zum+Ringen+um+Freiheit%2C+Kaffee+und+Deutungshoheit&amp;rft.genre=bookitem&amp;rft.place=Hannover</coins></font><br>
<font size="2" face="sans-serif"> </eprint></font><br>
<font size="2" face="sans-serif"> </font><br>
<font size="2" face="sans-serif"></eprints></font><br>
<br>
<br>
<font size="2" face="sans-serif">If we open the saved search with the "offending" umlaut (by clicking the link in the "Name of search" column), the search is executed and yields a result list.</font><br>
<font size="2" face="sans-serif">You can then export the results by choosing an export plugin from the drop-down menu. All export plugins (including XMLforCMS2) work this way.</font><br>
<br>
<br>
<font size="2" face="sans-serif">In the last column of the saved search table there is a special button that calls cgi/saved_search, passing savedsearch_id as a parameter.</font><br>
<font size="2" face="sans-serif">This button and the saved_search CGI script (seem to) have been extended by EPrints Services for us.</font><br>
<br>
<font size="2" face="sans-serif">Jens has opened a support case with Justin to check this script. We assume that the problem arises somewhere in the line </font><br>
<br>
<font size="2" face="Menlo-Regular">print $saved_search->make_searchexp->perform_search->export( $format );</font><br>
<br>
<font size="2" face="sans-serif">when a "virtual" dataset is passed to the export plugin and there is an umlaut in the originating query.</font><br>
<br>
<font size="2" face="sans-serif">This problem is not limited to the XMLforCMS2 export: it happens with any export format that is passed to the extended saved_search CGI script.</font><br>
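<br>
<font size="2" face="sans-serif">To illustrate the kind of encoding mismatch we suspect, here is a minimal sketch in Python (not the Perl of the CGI script). It decodes the saved search spec from Jens's report once as UTF-8 and once as Latin-1. The helper name search_value and the way the exp value is split on "|" and ":" are assumptions read off the spec string, not EPrints API. Whether the script really decodes the query this way is exactly what the support case has to confirm.</font><br>
<br>

```python
from urllib.parse import parse_qs

# Saved search spec as stored in the database (copied from the report below).
SPEC = (
    "?plugin=Internal&searchid=advanced&dataset=archive"
    "&exp=0%7C1%7C-date%2Fcreators_name%2Ftitle%7Carchive%7C-%7C"
    "creators_name%2Feditors_name%3Acreators_name%2Feditors_name%3AALL%3AEQ%3A"
    "Kr%C3%BCger%2C+G%7C-%7Ceprint_status%3Aeprint_status%3AANY%3AEQ%3Aarchive"
    "%7Cmetadata_visibility%3Ametadata_visibility%3AANY%3AEQ%3Ashow"
)


def search_value(spec: str, encoding: str = "utf-8") -> str:
    """Return the search value of the first field term in the spec.

    Hypothetical helper: it assumes the 'exp' parameter is a '|'-separated
    list and that a field term looks like 'id:fields:ALL:EQ:value'.
    """
    # Percent-decode the query string using the given character encoding.
    params = parse_qs(spec.lstrip("?"), encoding=encoding)
    parts = params["exp"][0].split("|")
    # The first part containing ':' is taken to be the user's search term.
    term = next(part for part in parts if ":" in part)
    return term.rsplit(":", 1)[-1]


if __name__ == "__main__":
    # Bytes C3 BC read as UTF-8 give the umlaut the user typed.
    print(search_value(SPEC))                      # Krüger, G
    # The same bytes read as Latin-1 give two unrelated characters.
    print(search_value(SPEC, encoding="latin-1"))  # KrÃ¼ger, G
```

<br>
<font size="2" face="sans-serif">A search for the second string would match no creator name, which would be consistent with the empty export, but this remains a guess until the script has been checked.</font><br>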
<br>
<font size="2" face="sans-serif">Best regards,</font><br>
<br>
<font size="2" face="sans-serif">Martin</font><br>
<br>
<br>
<font size="2" face="sans-serif">--</font><br>
<font size="2" face="sans-serif">Dr. Martin Brändle</font><br>
<font size="2" face="sans-serif">Zentrale Informatik</font><br>
<font size="2" face="sans-serif">Universität Zürich</font><br>
<font size="2" face="sans-serif">Winterthurerstr. 190</font><br>
<font size="2" face="sans-serif">CH-8057 Zürich</font><br>
<br>
<font size="2" face="sans-serif">mail: martin.braendle@id.uzh.ch</font><br>
<font size="2" face="sans-serif">phone: +41 44 63 56705</font><br>
<font size="2" face="sans-serif">fax: +41 44 63 54505</font><br>
<font size="2" face="sans-serif"><a href="http://www.id.uzh.ch">http://www.id.uzh.ch</a></font><br>
<br>
<br>
<font size="1" color="#5F5F5F" face="sans-serif">From:        </font><font size="1" face="sans-serif">"Field A.N." <af05v@ecs.soton.ac.uk></font><br>
<font size="1" color="#5F5F5F" face="sans-serif">To:        </font><font size="1" face="sans-serif">eprints-tech@ecs.soton.ac.uk</font><br>
<font size="1" color="#5F5F5F" face="sans-serif">Date:        </font><font size="1" face="sans-serif">22/01/2015 15:56</font><br>
<font size="1" color="#5F5F5F" face="sans-serif">Subject:        </font><font size="1" face="sans-serif">[EP-tech] Re: {Disarmed} International characters in advanced search fail for XML-Export</font><br>
<font size="1" color="#5F5F5F" face="sans-serif">Sent by:        </font><font size="1" face="sans-serif">eprints-tech-bounces@ecs.soton.ac.uk</font><br>
<hr width="100%" size="2" align="left" noshade style="color:#8091A5; "><br>
<br>
<br>
<tt><font size="2">What happens if you export the record on the command line?<br>
<br>
<br>
--<br>
Adam Field<br>
Business Relationship Manager and Community Lead<br>
EPrints Services<br>
<br>
<br>
<br>
<br>
On 20 Jan 2015, at 16:13, jens.vieler@id.uzh.ch wrote:<br>
<br>
> Hi all,<br>
> <br>
> (using EPrints 3.3.12)<br>
> <br>
> We found some strange behaviour in the combination of Advanced Search, Saved Search, and XML export in the context of international characters: if we use a saved search on an author/creator with German umlauts (international encoding), the XML export plugin returns an empty XML dataset. The database entry savedsearch|spec looks like valid UTF-8 to us (see the end of this message).<br>
> <br>
> Does anybody know this behaviour, or better, know how to fix it? :)<br>
> <br>
> Cheers<br>
> Jens<br>
> <br>
> <br>
> In detail:<br>
> <br>
> 1.) Creating an Advanced Search for an author/creator WITHOUT German umlauts (e.g. "Vieler") <br>
> <br>
> - Database shows spec: <br>
> <br>
> ?plugin=Internal&searchid=advanced&dataset=archive&exp=0%7C1%7C-date%2Fcreators_name%2Ftitle%7Carchive%7C-%7Ccreators_name%3Acreators_name%3AALL%3AEQ%3AVieler%7C-%7Ceprint_status%3Aeprint_status%3AANY%3AEQ%3Aarchive%7Cmetadata_visibility%3Ametadata_visibility%3AANY%3AEQ%3Ashow<br>
> <br>
> - Screen-View:<br>
> <br>
> </font></tt><tt><font size="2">http://www</font></tt><tt><font size="2">.<eprint-server>.ch/id/saved_search/<savedsearch_id> <br>
> <br>
> will be redirected to <br>
> <br>
> </font></tt><tt><font size="2">http://www</font></tt><tt><font size="2">.<eprint-server>.ch/cgi/search/archive/advanced?_action_search=1&dataset=archive&exp=0|1|-date%2Fcreators_name%2Ftitle|archive|-|creators_name%3Acreators_name%3AALL%3AEQ%3AVieler|-|eprint_status%3Aeprint_status%3AANY%3AEQ%3Aarchive&order=-date%2Fcreators_name%2Ftitle <br>
> <br>
> and works!<br>
> <br>
> - XML-Export for our CMS:<br>
> <br>
> </font></tt><tt><font size="2">https://www</font></tt><tt><font size="2">.<eprint-server>.ch/cgi/users/home?screen=Workflow%3A%3AExportSavedSearchResults&dataset=saved_search&dataobj=<savedsearch_id><br>
> <br>
> will be redirected to<br>
> <br>
> </font></tt><tt><font size="2">https://www</font></tt><tt><font size="2">.<eprint-server>.ch/cgi/saved_search/export_zora_XMLforCMS2.xml?savedsearchid=<savedsearch_id>&_action_export=1&_output=XMLforCMS2<br>
> <br>
> and works!<br>
> <br>
> <br>
> 2.) Creating an Advanced Search for an author/creator WITH German umlauts (e.g. "Krüger,G") <br>
> <br>
> - Database shows spec:<br>
> <br>
> ?plugin=Internal&searchid=advanced&dataset=archive&exp=0%7C1%7C-date%2Fcreators_name%2Ftitle%7Carchive%7C-%7Ccreators_name%2Feditors_name%3Acreators_name%2Feditors_name%3AALL%3AEQ%3AKr%C3%BCger%2C+G%7C-%7Ceprint_status%3Aeprint_status%3AANY%3AEQ%3Aarchive%7Cmetadata_visibility%3Ametadata_visibility%3AANY%3AEQ%3Ashow<br>
> <br>
> (so "Kr%C3%BCger" looks like correctly percent-encoded UTF-8 to me)<br>
> <br>
> - Screen-View:<br>
> <br>
> </font></tt><tt><font size="2">http://www</font></tt><tt><font size="2">.<eprint-server>.ch/id/saved_search/<savedsearch_id><br>
> <br>
> will be redirected to <br>
> <br>
> </font></tt><tt><font size="2">http://www</font></tt><tt><font size="2">.<eprint-server>.ch/cgi/search/archive/advanced?_action_search=1&dataset=archive&exp=0|1|-date%2Fcreators_name%2Ftitle|archive|-|creators_name%2Feditors_name%3Acreators_name%2Feditors_name%3AALL%3AEQ%3AKr%C3%BCger%2C+G|-|eprint_status%3Aeprint_status%3AANY%3AEQ%3Aarchive&order=-date%2Fcreators_name%2Ftitle<br>
> <br>
> and works!<br>
> <br>
> - XML-Export for our CMS:<br>
> <br>
> </font></tt><tt><font size="2">https://www</font></tt><tt><font size="2">.<eprint-server>.ch/cgi/users/home?screen=Workflow%3A%3AExportSavedSearchResults&dataset=saved_search&dataobj=<savedsearch_id><br>
> <br>
> will be redirected to<br>
> <br>
> </font></tt><tt><font size="2">https://www</font></tt><tt><font size="2">.<eprint-server>.ch/cgi/saved_search/export_zora_XMLforCMS2.xml?savedsearchid=<savedsearch_id>&_action_export=1&_output=XMLforCMS2<br>
> <br>
> and fails, or rather, the result is empty:<br>
> <br>
> <?xml version="1.0" encoding="utf-8" ?><br>
> <eprints xmlns="</font></tt><tt><font size="2"><a href="http://eprints.org/ep2/data/2.0">http://eprints.org/ep2/data/2.0</a></font></tt><tt><font size="2">"><br>
> </eprints><br>
> <br>
> -- <br>
> Jens Vieler<br>
> Informatikdienste<br>
> Universität Zürich<br>
> Winterthurerstr. 190<br>
> CH-8057 Zürich<br>
> <br>
> mail: jens.vieler@id.uzh.ch<br>
> phone: +41 44 63 56777<br>
> </font></tt><tt><font size="2"><a href="http://www.id.uzh.ch">http://www.id.uzh.ch</a></font></tt><tt><font size="2"><br>
> <br>
> *** Options: </font></tt><tt><font size="2"><a href="http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech">http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech</a></font></tt><tt><font size="2"><br>
> *** Archive: </font></tt><tt><font size="2"><a href="http://www.eprints.org/tech.php/">http://www.eprints.org/tech.php/</a></font></tt><tt><font size="2"><br>
> *** EPrints community wiki: </font></tt><tt><font size="2"><a href="http://wiki.eprints.org/">http://wiki.eprints.org/</a></font></tt><tt><font size="2"><br>
> *** EPrints developers Forum: </font></tt><tt><font size="2"><a href="http://forum.eprints.org/">http://forum.eprints.org/</a></font></tt><tt><font size="2"><br>
<br>
<br>
</font></tt><br>
</body></html>