[EP-tech] Antwort: stop words in simple search dont stop

holger.berth at mdc-berlin.de holger.berth at mdc-berlin.de
Wed Jul 13 17:01:42 BST 2016


Hi John and Martin,

 

You're right, I think there is a mismatch in word extraction. The patch you posted solved my problem.

 

@Martin: Many thanks for the link to the Xapian plugin, I will have a look at it.

 

Thanks and best,

Holger

 

 

From: eprints-tech-bounces at ecs.soton.ac.uk [mailto:eprints-tech-bounces at ecs.soton.ac.uk] On Behalf Of John Salter
Sent: Wednesday, 13 July 2016 17:29
To: eprints-tech at ecs.soton.ac.uk
Subject: Re: [EP-tech] Antwort: stop words in simple search dont stop

 

Also, could this be related: https://github.com/eprints/eprints/issues/182

I think there is (or was?) a mismatch between the word extraction used in the search and the one used in the indexing.

 

Cheers,

John
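
The mismatch described above can be illustrated with a minimal sketch. This is not EPrints code: the stop-word list, the function names and the sample title are all hypothetical, and the real behaviour depends on the indexing.pl configuration and the search plugin in use. It only shows why a full-title query can return nothing when the indexer drops stop words but the search side still requires them.

```python
import re

# Hypothetical stop-word list; the thread only mentions "the", "of", "by".
STOP_WORDS = {"the", "of", "by"}


def extract_words(text, drop_stop_words):
    """Lowercase, split on non-alphanumerics, optionally drop stop words."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    if drop_stop_words:
        words = [w for w in words if w not in STOP_WORDS]
    return words


def build_index(titles):
    """Map each indexed word to the set of document ids containing it."""
    index = {}
    for doc_id, title in titles.items():
        # The indexer drops stop words, so they never reach the index.
        for word in extract_words(title, drop_stop_words=True):
            index.setdefault(word, set()).add(doc_id)
    return index


def search(index, query, drop_stop_words):
    """Return ids of documents that contain every extracted query word."""
    words = extract_words(query, drop_stop_words)
    if not words:
        return set()
    results = None
    for word in words:
        hits = index.get(word, set())
        results = hits if results is None else results & hits
    return results


# Hypothetical sample record, for illustration only.
titles = {1: "The Origin of Species by Natural Selection"}
index = build_index(titles)
full_title = titles[1]

# Search-side extraction keeps stop words: "the" is not in the index,
# so the AND over all query words yields no result.
mismatched = search(index, full_title, drop_stop_words=False)

# Search-side extraction matches the indexer: the record is found.
consistent = search(index, full_title, drop_stop_words=True)

print(mismatched, consistent)
```

With the same extraction on both sides the full title matches. With the mismatch it matches only once the stop words are left out of the query, which is the behaviour reported at the start of the thread.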

 

 

From: eprints-tech-bounces at ecs.soton.ac.uk [mailto:eprints-tech-bounces at ecs.soton.ac.uk] On Behalf Of martin.braendle at id.uzh.ch
Sent: 13 July 2016 16:15
To: eprints-tech at ecs.soton.ac.uk
Subject: [EP-tech] Antwort: stop words in simple search dont stop

 

Hi Holger,

It looks like you use the standard SQL database for simple search.

I am not sure what happens there. It might be a mistake in the configuration. You may inspect lib/defaultcfg/cfg.d/indexing.pl or archives/{archive}/cfg/cfg.d/indexing.pl and also do an epadmin reindex for all your eprints.

Another option is to install and use the Search::Xapian extension for simple search. See https://wiki.eprints.org/w/API:EPrints/Plugin/Search/Xapian

Best regards,

Martin

--
Dr. Martin Brändle
Zentrale Informatik
Universität Zürich
Stampfenbachstr. 73
CH-8006 Zürich


From: holger.berth at mdc-berlin.de
To: eprints-tech at ecs.soton.ac.uk
Date: 13/07/2016 16:13
Subject: [EP-tech] stop words in simple search dont stop
Sent by: eprints-tech-bounces at ecs.soton.ac.uk





Hi all,

If I use the simple search and fill in the search field with the full title of an article, I get no search result. If I use the advanced search, the article is found.
I think the simple search does not ignore the predefined stop words, even though it says: Ignoring: "the", "of", "by". If I use the simple search with the shortened title, without the stop words, I get a result.

Does anybody have an idea why this does not work?
 
Thanks & best,
Holger

[Attachment "smime.p7s" deleted by Martin Brändle/at/UZH]

*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/
