<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40"><head><meta http-equiv=Content-Type content="text/html; charset=utf-8"><meta name=Generator content="Microsoft Word 15 (filtered medium)"><!--[if !mso]><style>v\:* {behavior:url(#default#VML);}
o\:* {behavior:url(#default#VML);}
w\:* {behavior:url(#default#VML);}
.shape {behavior:url(#default#VML);}
</style><![endif]--><style><!--
/* Font Definitions */
@font-face
        {font-family:"Cambria Math";
        panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
        {font-family:Calibri;
        panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0cm;
        margin-bottom:.0001pt;
        font-size:12.0pt;
        font-family:"Times New Roman",serif;}
a:link, span.MsoHyperlink
        {mso-style-priority:99;
        color:blue;
        text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
        {mso-style-priority:99;
        color:purple;
        text-decoration:underline;}
p
        {mso-style-priority:99;
        mso-margin-top-alt:auto;
        margin-right:0cm;
        mso-margin-bottom-alt:auto;
        margin-left:0cm;
        font-size:12.0pt;
        font-family:"Times New Roman",serif;}
tt
        {mso-style-priority:99;
        font-family:"Courier New";}
p.msonormal0, li.msonormal0, div.msonormal0
        {mso-style-name:msonormal;
        mso-style-priority:99;
        mso-margin-top-alt:auto;
        margin-right:0cm;
        mso-margin-bottom-alt:auto;
        margin-left:0cm;
        font-size:12.0pt;
        font-family:"Times New Roman",serif;}
span.E-MailFormatvorlage20
        {mso-style-type:personal;
        font-family:"Calibri",sans-serif;
        color:#1F497D;}
span.E-MailFormatvorlage21
        {mso-style-type:personal-reply;
        font-family:"Calibri",sans-serif;
        color:#1F497D;}
.MsoChpDefault
        {mso-style-type:export-only;
        font-size:10.0pt;}
@page WordSection1
        {size:612.0pt 792.0pt;
        margin:72.0pt 72.0pt 72.0pt 72.0pt;}
div.WordSection1
        {page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]--></head><body lang=DE link=blue vlink=purple><div class=WordSection1><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D;mso-fareast-language:EN-US'>Hi John and Martin,<o:p></o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D;mso-fareast-language:EN-US'><o:p>&nbsp;</o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D;mso-fareast-language:EN-US'>you´re right - i think there is a mismatch of word extraction. The patch you have posted solved my problem.<o:p></o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D;mso-fareast-language:EN-US'><o:p>&nbsp;</o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D;mso-fareast-language:EN-US'>@Martin: many thanks for the link to the Xapian Plugin, i will have a look at it. <o:p></o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D;mso-fareast-language:EN-US'><o:p>&nbsp;</o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D;mso-fareast-language:EN-US'>Thanks and best,<o:p></o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D;mso-fareast-language:EN-US'>Holger<o:p></o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D;mso-fareast-language:EN-US'><o:p>&nbsp;</o:p></span></p><p class=MsoNormal><span style='font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D;mso-fareast-language:EN-US'><o:p>&nbsp;</o:p></span></p><div><div style='border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0cm 0cm 0cm'><p class=MsoNormal><b><span style='font-size:11.0pt;font-family:"Calibri",sans-serif'>Von:</span></b><span style='font-size:11.0pt;font-family:"Calibri",sans-serif'> eprints-tech-bounces@ecs.soton.ac.uk [mailto:eprints-tech-bounces@ecs.soton.ac.uk] <b>Im Auftrag von </b>John Salter<br><b>Gesendet:</b> Mittwoch, 13. Juli 2016 17:29<br><b>An:</b> eprints-tech@ecs.soton.ac.uk<br><b>Betreff:</b> Re: [EP-tech] Antwort: stop words in simple search dont stop<o:p></o:p></span></p></div></div><p class=MsoNormal><o:p>&nbsp;</o:p></p><p class=MsoNormal><span lang=EN-GB style='font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D;mso-fareast-language:EN-US'>Also, could this be related: <a href="https://github.com/eprints/eprints/issues/182">https://github.com/eprints/eprints/issues/182</a><o:p></o:p></span></p><p class=MsoNormal><span lang=EN-GB style='font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D;mso-fareast-language:EN-US'>I think there is (or was?) a mismatch of word extraction in the search, and the indexing?<o:p></o:p></span></p><p class=MsoNormal><span lang=EN-GB style='font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D;mso-fareast-language:EN-US'><o:p>&nbsp;</o:p></span></p><p class=MsoNormal><span lang=EN-GB style='font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D;mso-fareast-language:EN-US'>Cheers,<o:p></o:p></span></p><p class=MsoNormal><span lang=EN-GB style='font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D;mso-fareast-language:EN-US'>John<o:p></o:p></span></p><p class=MsoNormal><span lang=EN-GB style='font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D;mso-fareast-language:EN-US'><o:p>&nbsp;</o:p></span></p><p class=MsoNormal><span lang=EN-GB style='font-size:11.0pt;font-family:"Calibri",sans-serif;color:#1F497D;mso-fareast-language:EN-US'><o:p>&nbsp;</o:p></span></p><div><div style='border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0cm 0cm 0cm'><p class=MsoNormal><b><span lang=EN-US style='font-size:11.0pt;font-family:"Calibri",sans-serif'>From:</span></b><span lang=EN-US style='font-size:11.0pt;font-family:"Calibri",sans-serif'> <a href="mailto:eprints-tech-bounces@ecs.soton.ac.uk">eprints-tech-bounces@ecs.soton.ac.uk</a> [<a href="mailto:eprints-tech-bounces@ecs.soton.ac.uk">mailto:eprints-tech-bounces@ecs.soton.ac.uk</a>] <b>On Behalf Of </b><a href="mailto:martin.braendle@id.uzh.ch">martin.braendle@id.uzh.ch</a><br><b>Sent:</b> 13 July 2016 16:15<br><b>To:</b> <a href="mailto:eprints-tech@ecs.soton.ac.uk">eprints-tech@ecs.soton.ac.uk</a><br><b>Subject:</b> [EP-tech] Antwort: stop words in simple search dont stop<o:p></o:p></span></p></div></div><p class=MsoNormal><span lang=EN-GB><o:p>&nbsp;</o:p></span></p><p><span lang=EN-GB style='font-size:10.0pt;font-family:"Arial",sans-serif'>Hi Holger,</span><span lang=EN-GB><br><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Arial",sans-serif'>looks like you use the standard SQL database for simple search.</span><span lang=EN-GB><br><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Arial",sans-serif'>Not sure what happens there. It might be a mistake in the configuration.&nbsp;You may inspect lib/defaultcfg/cfg.d/indexing.pl&nbsp;or archives/{archive}/cfg/cfg.d/indexing.pl and also do a epadmin reindex for all your eprints.</span><span lang=EN-GB><br><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Arial",sans-serif'>Another option is to install and use Search::Xapian extension for simple search. See </span><span lang=EN-GB><a href="https://wiki.eprints.org/w/API:EPrints/Plugin/Search/Xapian"><span style='font-size:10.0pt;font-family:"Arial",sans-serif'>https://wiki.eprints.org/w/API:EPrints/Plugin/Search/Xapian</span></a><br><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Arial",sans-serif'>Best regards,</span><span lang=EN-GB><br><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Arial",sans-serif'>Martin</span><span lang=EN-GB><br><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Arial",sans-serif'>--</span><span lang=EN-GB><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Arial",sans-serif'>Dr. Martin Brändle</span><span lang=EN-GB><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Arial",sans-serif'>Zentrale Informatik</span><span lang=EN-GB><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Arial",sans-serif'>Universität Zürich</span><span lang=EN-GB><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Arial",sans-serif'>Stampfenbachstr. 73</span><span lang=EN-GB><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Arial",sans-serif'>CH-8006 Zürich</span><span lang=EN-GB><br><br><br><img border=0 width=16 height=16 id="_x0000_i1025" src="cid:image001.gif@01D1DD30.9BAAB640" alt="Inactive hide details for &quot;holger.berth@mdc-berlin.de&quot; ---13/07/2016 16:13:47---Hi @ all,"></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Arial",sans-serif;color:#424282'>&quot;<a href="mailto:holger.berth@mdc-berlin.de">holger.berth@mdc-berlin.de</a>&quot; ---13/07/2016 16:13:47---Hi @ all,</span><span lang=EN-GB><br><br></span><span lang=EN-GB style='font-size:7.5pt;font-family:"Arial",sans-serif;color:#5F5F5F'>Von: </span><span lang=EN-GB style='font-size:7.5pt;font-family:"Arial",sans-serif'>&quot;<a href="mailto:holger.berth@mdc-berlin.de">holger.berth@mdc-berlin.de</a>&quot; &lt;<a href="mailto:holger.berth@mdc-berlin.de">holger.berth@mdc-berlin.de</a>&gt;</span><span lang=EN-GB><br></span><span lang=EN-GB style='font-size:7.5pt;font-family:"Arial",sans-serif;color:#5F5F5F'>An: </span><span lang=EN-GB style='font-size:7.5pt;font-family:"Arial",sans-serif'>&quot;<a href="mailto:eprints-tech@ecs.soton.ac.uk">eprints-tech@ecs.soton.ac.uk</a>&quot; &lt;<a href="mailto:eprints-tech@ecs.soton.ac.uk">eprints-tech@ecs.soton.ac.uk</a>&gt;</span><span lang=EN-GB><br></span><span lang=EN-GB style='font-size:7.5pt;font-family:"Arial",sans-serif;color:#5F5F5F'>Datum: </span><span lang=EN-GB style='font-size:7.5pt;font-family:"Arial",sans-serif'>13/07/2016 16:13</span><span lang=EN-GB><br></span><span lang=EN-GB style='font-size:7.5pt;font-family:"Arial",sans-serif;color:#5F5F5F'>Betreff: </span><span lang=EN-GB style='font-size:7.5pt;font-family:"Arial",sans-serif'>[EP-tech] stop words in simple search dont stop</span><span lang=EN-GB><br></span><span lang=EN-GB style='font-size:7.5pt;font-family:"Arial",sans-serif;color:#5F5F5F'>Gesendet von: </span><span lang=EN-GB style='font-size:7.5pt;font-family:"Arial",sans-serif'><a href="mailto:eprints-tech-bounces@ecs.soton.ac.uk">eprints-tech-bounces@ecs.soton.ac.uk</a></span><span lang=EN-GB><o:p></o:p></span></p><div><div class=MsoNormal><span lang=EN-GB><hr size=2 width="100%" noshade style='color:#8091A5' align=left></span></div></div><p class=MsoNormal style='margin-bottom:12.0pt'><span lang=EN-GB><br><br><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Calibri",sans-serif'>Hi @ all,</span><span lang=EN-GB><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Calibri",sans-serif'>&nbsp;</span><span lang=EN-GB><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Calibri",sans-serif'>if i use the simple search and fill in the search field with a full title of an article, i get no search result. If i use the advanced search, the article will be found. </span><span lang=EN-GB><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Calibri",sans-serif'>I think the simple search don´t ignores the predefined stop words, but it says „Ignoring: &quot;the&quot;, &quot;of&quot;, &quot;by&quot;“ If i use the simple search with the smaller title without the stop words, i get an result.</span><span lang=EN-GB><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Calibri",sans-serif'>&nbsp;</span><span lang=EN-GB><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Calibri",sans-serif'>Does anybody has an idea why this do not work?</span><span lang=EN-GB><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Calibri",sans-serif'>&nbsp;</span><span lang=EN-GB><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Calibri",sans-serif'>Thanks &amp; best,</span><span lang=EN-GB><br></span><span lang=EN-GB style='font-size:10.0pt;font-family:"Calibri",sans-serif'>Holger[Anhang &quot;smime.p7s&quot; gelöscht von Martin Brändle/at/UZH] </span><tt><span lang=EN-GB style='font-size:10.0pt'>*** Options: <a href="http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech">http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech</a></span></tt><span lang=EN-GB style='font-size:10.0pt;font-family:"Courier New"'><br><tt>*** Archive: <a href="http://www.eprints.org/tech.php/">http://www.eprints.org/tech.php/</a></tt><br><tt>*** EPrints community wiki: <a href="http://wiki.eprints.org/">http://wiki.eprints.org/</a></tt><br><tt>*** EPrints developers Forum: <a href="http://forum.eprints.org/">http://forum.eprints.org/</a></tt></span><span lang=EN-GB><o:p></o:p></span></p></div></body></html>