[EP-tech] Antwort: Antwort: Re: Xapian Search

martin.braendle at id.uzh.ch martin.braendle at id.uzh.ch
Thu Sep 3 14:11:57 BST 2015


Just inspected our Xapian index with the delve utility.
Datestamps are indexed like this, e.g.

datestamp:2008-02-11 12:11:33

However, you can't search on the - character. Currently, it's only possible
to search like this: datestamp:(2008*)  but not datestamp:(2008-02*) or
datestamp:(2008-02-11*) or with double quotes.

Can lib/search_xapian.pl be modified so that datestamps are indexed
differently?
Has something gone on with Seb's work https://github.com/eprints/xapianv2 ?

Best regards,

Martin

--
Dr. Martin Brändle
Zentrale Informatik
Universität Zürich
Stampfenbachstr. 73
CH-8006 Zürich




Von:	martin.braendle at id.uzh.ch
An:	eprints-tech at ecs.soton.ac.uk
Datum:	03/09/2015 13:39
Betreff:	[EP-tech]  Antwort:  Re: Xapian Search
Gesendet von:	eprints-tech-bounces at ecs.soton.ac.uk



Hi Josée,

for author searches, don't put the names into double quotes, but into
brackets:

creators_name:(Noyer, Jean-Louis)

For datestamps, I'm not sure how they are indexed; in our case it just
works with years:

date:2014

We have written a search help, http://www.zora.uzh.ch/help/  that details
some Xapian options. Feel free to use what is usable.

Best regards,

Martin

--
Dr. Martin Brändle
Zentrale Informatik
Universität Zürich
Stampfenbachstr. 73
CH-8006 Zürich

Inactive hide details for Lessard Josée ---03/09/2015 11:54:22---Sorry, I
think that I have to make my question more understandLessard Josée
---03/09/2015 11:54:22---Sorry, I think that I have to make my question
more understandable...  We have configured our search

Von: Lessard Josée <josee.lessard at cirad.fr>
An: eprints-tech at ecs.soton.ac.uk
Datum: 03/09/2015 11:54
Betreff: [EP-tech] Re: Xapian Search
Gesendet von: eprints-tech-bounces at ecs.soton.ac.uk



Sorry, I think that I have to make my question more understandable...


We have configured our search this way :

      $c->{search}->{simple} = {
        search_fields => [
                           {
                             id => "q",
                             meta_fields => [
                                "documents",
                                "eprintid",
                                "title",
                                "creators_name",
                                "abstract",
                                "datestamp",
                                                          …



So all these fields should be indexed by Xapian


We have reloaded the config (epadmin), restarted Apache and reindexed our
eprints.


But now we get two problems :


            1. We can't search by datestamp.


                        We tried "datestamp:2015-08-*",
"datestamp:2015/08/*", "datestamp:201508*", ...
                        Nothing seems to work. We always get empty results.


                        Do you know what is the right syntax to do a Xapian
search based in this field ?


            2. We can't search by creators_name


                        We can search by last name : creators_name:"Noyer"
                        Or by first name : creators_name:"Jean-Louis"


                        But we can't combine firsrt and last name :
                        creators_name:"Noyer Jean-Louis" or
creators_name:"Noyer, Jean-Louis"
                        both  give and empty result set.


                        Do you know how to do such a search using both
first and last name ?



Thank you for your help




Le 01/09/2015 15:25, Field A.N. a écrit :
      Have you restarted the webserver?  Changes to the .pl files do not
      load automatically when the file changes.  You'll need to reload the
      configuration (/<eprints_root>/bin/epadmin reload [repositoryid]) or
      restart apache before the changes will be visible at the front end.

      --
      Adam Field
      Business Relationship Manager and Community Lead
      EPrints Services
      +44 (0)23 8059 8814





      On 27 Aug 2015, at 17:17, Lessard Josée wrote:

            Hello,
            I have two questions on research Xapian :

            1.   How query the fields : datestamp ?

            2.   How query the fields :  creators_name (name_family with
            name_given ) ?



            We added fields (datestamp and creators_name)  in the file
            cfg.d/eprints_search_simple.pl and reindexed the database.
            $c->{search}->{simple} =
            {
                               search_fields => [
                                                    {
                                                                        id
            => "q",

            meta_fields => [
            "documents",
            "eprintid",
            "title",
            "creators_name",
            "abstract",
            "datestamp",
            …

            Thank you for your help

            Josée Lessard


            *** Options:
            http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
            *** Archive: http://www.eprints.org/tech.php/
            *** EPrints community wiki: http://wiki.eprints.org/
            *** EPrints developers Forum: http://forum.eprints.org/




      *** Options:
      http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
      *** Archive: http://www.eprints.org/tech.php/
      *** EPrints community wiki: http://wiki.eprints.org/
      *** EPrints developers Forum: http://forum.eprints.org/


*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/
*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20150903/35c7581d/attachment-0001.html 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: graycol.gif
Type: image/gif
Size: 105 bytes
Desc: not available
Url : http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20150903/35c7581d/attachment-0001.gif 


More information about the Eprints-tech mailing list