<html><body>
<p><font size="2" face="sans-serif">Hi Gilles,</font><br>
<br>
<font size="2" face="sans-serif">Our repository has about 80,000 records, 56% of them with full text, so it is comparable to yours.</font><br>
<br>
<font size="2" face="sans-serif">Advanced Search for thermograph* in:</font><br>
<br>
<font size="2" face="sans-serif">title: immediate (1-2 seconds)</font><br>
<font size="2" face="sans-serif">documents (full text): 20-30 seconds. The MySQL daemon goes up to 70-100% CPU load.</font><br>
<br>
<font size="2" face="sans-serif">Quick search (Xapian):</font><br>
<br>
<font size="2" face="sans-serif">title:thermograph* : immediate</font><br>
<font size="2" face="sans-serif">thermograph* : immediate</font><br>
<br>
<font size="2" face="sans-serif">On our help page (</font><a href="http://www.zora.uzh.ch/help/"><font size="2" face="sans-serif">http://www.zora.uzh.ch/help/</font></a><font size="2" face="sans-serif">) we recommend Quick Search as the tool of choice and Advanced Search only for very precise searches.</font><br>
<br>
<br>
<font size="2" face="sans-serif">From a recent debugging session (on another issue) I know that, behind the scenes, EPrints translates an advanced search query into a series of dozens of complicated SQL statements. It may be that these are not optimized for certain cases.</font><br>
<br>
<font size="2" face="sans-serif">If it were as simple as</font><br>
<br>
<font size="2" face="Menlo-Regular">select distinct ei.eprintid from eprint__rindex ei, eprint e where ei.field='documents' and ei.word like 'thermograph%' and e.eprint_status='archive' and e.eprintid=ei.eprintid;</font><br>
<br>
<font size="2" face="sans-serif">then the query would be answered in a fraction of a second. But it isn't, and can't be: the EPrints software engineers have surely put a lot of effort into the database layer so that it covers all possible situations.</font><br>
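<br>
<font size="2" face="sans-serif">As a rough illustration of what the simplified query above computes, here is a minimal sketch that runs it against a toy SQLite database from Python. The table and column names are taken from the query; everything else (the reduced schema, the index name rindex_field_word, the sample rows, and the helper names build_demo_db and prefix_search) is made up for the example. It uses SQLite rather than MySQL and does not reproduce the statements EPrints actually generates.</font><br>
<br>
<pre>
```python
import sqlite3

# Toy stand-ins for the two tables named in the query above. The real
# EPrints schema has many more columns, so this layout is an assumption.
SCHEMA = """
CREATE TABLE eprint (eprintid INTEGER PRIMARY KEY, eprint_status TEXT);
CREATE TABLE eprint__rindex (field TEXT, word TEXT, eprintid INTEGER);
CREATE INDEX rindex_field_word ON eprint__rindex (field, word);
"""


def build_demo_db():
    """Create an in-memory database with a handful of made-up rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany(
        "INSERT INTO eprint VALUES (?, ?)",
        [(1, "archive"), (2, "archive"), (3, "buffer"), (4, "archive")],
    )
    conn.executemany(
        "INSERT INTO eprint__rindex VALUES (?, ?, ?)",
        [
            ("documents", "thermography", 1),
            ("documents", "thermographie", 1),
            ("documents", "thermograph", 2),
            ("documents", "thermography", 3),  # eprint 3 is not in the live archive
            ("documents", "thermometer", 4),   # does not match the prefix
            ("title", "thermography", 4),      # matches, but in another field
        ],
    )
    return conn


def prefix_search(conn, field, prefix):
    """Return ids of archived eprints that have an indexed word starting with prefix."""
    rows = conn.execute(
        """
        SELECT DISTINCT ei.eprintid
        FROM eprint__rindex ei
        JOIN eprint e ON e.eprintid = ei.eprintid
        WHERE ei.field = ?
          AND ei.word LIKE ? || '%'
          AND e.eprint_status = 'archive'
        ORDER BY ei.eprintid
        """,
        (field, prefix),
    ).fetchall()
    return [row[0] for row in rows]


if __name__ == "__main__":
    # prints [1, 2]
    print(prefix_search(build_demo_db(), "documents", "thermograph"))
```
</pre>
<font size="2" face="sans-serif">The sketch only shows which records such a query returns, not how fast it is. In MySQL, a LIKE pattern with a trailing wildcard only can generally use a B-tree index on the column, which is why a single statement of this shape would be expected to return quickly.</font><br>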
<br>
<font size="2" face="sans-serif">Best regards,</font><br>
<br>
<font size="2" face="sans-serif">Martin</font><br>
<br>
<font size="2" face="sans-serif">--</font><br>
<font size="2" face="sans-serif">Dr. Martin Brändle</font><br>
<font size="2" face="sans-serif">Zentrale Informatik</font><br>
<font size="2" face="sans-serif">Universität Zürich</font><br>
<font size="2" face="sans-serif">Winterthurerstr. 190</font><br>
<font size="2" face="sans-serif">CH-8057 Zürich</font><br>
<br>
<br>
<font size="2" color="#424282" face="sans-serif">Gilles Fournié ---21/04/2015 10:22:27---Hi, I have a question about right-hand truncation in advanced searches.</font><br>
<br>
<font size="1" color="#5F5F5F" face="sans-serif">From:        </font><font size="1" face="sans-serif">Gilles Fournié &lt;gilles.fournie@cirad.fr&gt;</font><br>
<font size="1" color="#5F5F5F" face="sans-serif">To:        </font><font size="1" face="sans-serif">eprints-tech@ecs.soton.ac.uk</font><br>
<font size="1" color="#5F5F5F" face="sans-serif">Date:        </font><font size="1" face="sans-serif">21/04/2015 10:22</font><br>
<font size="1" color="#5F5F5F" face="sans-serif">Subject:        </font><font size="1" face="sans-serif">[EP-tech] Use of truncation in advanced searches</font><br>
<font size="1" color="#5F5F5F" face="sans-serif">Sent by:        </font><font size="1" face="sans-serif">eprints-tech-bounces@ecs.soton.ac.uk</font><br>
<hr width="100%" size="2" align="left" noshade style="color:#8091A5; "><br>
<br>
<br>
<tt><font size="2">Hi,<br>
<br>
I have a question about right-hand truncation in advanced searches.<br>
<br>
If we search (in the title field, for example) for:<br>
<br>
thermography<br>
<br>
the search runs for 1 to 3 seconds before returning results.<br>
<br>
If we extend our search to:<br>
<br>
thermography thermographie<br>
<br>
the search time is about the same.<br>
<br>
But if we try to use a wildcard:<br>
<br>
thermograph*<br>
<br>
the search takes a very long time (several minutes)!<br>
<br>
Has anybody else experienced such delays?<br>
Any clues about what we can do to solve this problem?<br>
<br>
(Our archive contains about 91,000 eprints.)<br>
<br>
Best regards,<br>
GF<br>
</font></tt><br>
</body></html>