[EP-tech] Antwort: Re: Antwort: Re: Xapian install on Ubuntu 12.04
martin.braendle at id.uzh.ch
martin.braendle at id.uzh.ch
Tue Dec 23 15:12:35 GMT 2014
Thank you Seb,
indeed, testing the installation at 3) shows that it is fast!
Best regards,
Martin
--
Dr. Martin Brändle
Zentrale Informatik
Universität Zürich
Winterthurerstr. 190
CH-8057 Zürich
mail: martin.braendle at id.uzh.ch
phone: +41 44 63 56705
fax: +41 44 63 54505
http://www.id.uzh.ch
Von: sf2 <sf2 at ecs.soton.ac.uk>
An: eprints-tech at ecs.soton.ac.uk
Datum: 23/12/2014 13:59
Betreff: [EP-tech] Re: Antwort: Re: Xapian install on Ubuntu 12.04
Gesendet von: eprints-tech-bounces at ecs.soton.ac.uk
Hello Martin,
So:
1- not real issues, no - but it's a fairly basic implementation - I think
it indexes fields which shouldn't be indexed (eg search for "00" will match
every records cos the "dir" field is indexed). And there's no advanced
features like record caching, faceting, fields collapsing, suggestions, ...
all supported by xapian though.
2- nope - I think EPS have one install that uses it but that was being
implemented around the time I left so don't know where this went. It's at
the "please test this" stage I'd say. But I appreciate it's not an easy
task to take on as it requires knowledge of xapian (or solr or else) "under
the hood".
3- my test dataset used 22k records: http://vmdev1.eprints.org/cgi/xapian
(note I don't control that url anymore) - seems fast. It scales in O(n*m)
if I recall correctly, with n the number of (matching) records and m the
max number of facet values (m=1 for single fields and max(m) = 5 by default
for multiple).
Hope this helps,
Seb
On 23.12.2014 08:55, martin.braendle at id.uzh.ch wrote:
Hi Seb,
can you answer the following questions:
- what do you mean by "be careful using the default eprints-xapian
indexing" (shipped with eprints 3.3.12) ? Are there any known problems?
- in as far can the code on https://github.com/eprints/xapianv2 be
considered as finished and be recommended for production?
- was faceting tested on a real-world repo having 10'000s of records (and
not only on 93 as with http://puffin.ecs.soton.ac.uk/) ? Is performance
still good? E.g., on http://www.zora.uzh.ch/, depending on the search
terms used, one may obtain 1000s of records.
Best regards,
Martin
--
Dr. Martin Brändle
Zentrale Informatik
Universität Zürich
Winterthurerstr. 190
CH-8057 Zürich
mail: martin.braendle at id.uzh.ch
phone: +41 44 63 56705
fax: +41 44 63 54505
http://www.id.uzh.ch
Inactive hide details for sf2 ---19/12/2014 21:51:32--- Sure thing..
install libxapian, libsearch-xapian-perl (yup that'ssf2 ---19/12/2014
21:51:32--- Sure thing.. install libxapian, libsearch-xapian-perl (yup
that's
Von: sf2 <sf2 at ecs.soton.ac.uk>
An: eprints-tech at ecs.soton.ac.uk
Datum: 19/12/2014 21:51
Betreff: [EP-tech] Re: Xapian install on Ubuntu 12.04
Gesendet von: eprints-tech-bounces at ecs.soton.ac.uk
Sure thing.. install libxapian, libsearch-xapian-perl (yup that's
Search::Xapian) and voila. Then I'd install xapian-tools because some of
their utility are damned useful to debup/map a xapian DB.
Then as a word of caution, I'd say be careful in using the default
eprints-xapian indexing (what's shipped with eprints 3.3.x basically) cos
it's very basic. Perhaps look up https://github.com/eprints/xapianv2 to do
more advanced stuff such as faceting.
Seb
On 19.12.2014 20:25, Tomasz Neugebauer wrote:
We have the following instructions for installing Xapian on Ubuntu 12.04:
Install Xapian:
wget http://oligarchy.co.uk/xapian/1.2.13/xapian-core-1.2.13.tar.gz
wget http://oligarchy.co.uk/xapian/1.2.13/xapian-omega-1.2.13.tar.gz
wget http://oligarchy.co.uk/xapian/1.2.13/xapian-bindings-1.2.13.tar.gz
tar zxvf xapian-core-1.2.13.tar.gz
tar zxvf xapian-omega-1.2.13.tar.gz
tar zxvf xapian-bindings-1.2.13.tar.gz
cd xapian-core-1.2.13
sudo apt-get install uuid-dev
sudo ./configure
sudo make
sudo make install
cd xapian-omega-1.2.13
sudo apt-get install libpcre3-dev
sudo ./configure
sudo make
sudo make install
cd xapian-bindings-1.2.13
sudo ./configure
sudo make
sudo make install
sudo cpan Search::Xapian
We were wondering if it is preferable to use the Ubuntu packages instead?
We found these packages:
libept-dev - High-level library for managing Debian package information
libxapian-dev - Development files for Xapian search engine library
libxapian22 - Search engine library
libxapian22-dbg - Debugging symbols for the Xapian Search engine library
xapian-doc - Core Xapian documentation
xapian-examples - Xapian simple example programs
libsearch-xapian-perl - Perl bindings for the Xapian search library
xapian-omega - CGI search interface and indexers using Xapian
xapian-tools - Basic tools for Xapian search engine library
libsearch-xapian-perl looks to be the equivalent to CPAN’s Search::Xapian?
Does anyone have any experience with installing xapian on Ubuntu this way?
Thanks!
Tomasz
________________________________________________
Tomasz Neugebauer
Digital Projects & Systems Development Librarian
tomasz.neugebauer at concordia.ca
Concordia University Libraries
1400 de Maisonneuve West (LB 341-3)
Tel.: (514) 848-2424 ex. 7738
Montreal, Canada
*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/
*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/
*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/
*** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
*** Archive: http://www.eprints.org/tech.php/
*** EPrints community wiki: http://wiki.eprints.org/
*** EPrints developers Forum: http://forum.eprints.org/
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20141223/8dc6c65a/attachment-0001.html
-------------- next part --------------
A non-text attachment was scrubbed...
Name: graycol.gif
Type: image/gif
Size: 105 bytes
Desc: not available
Url : http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20141223/8dc6c65a/attachment-0001.gif
More information about the Eprints-tech
mailing list