[EP-tech] indexer/tokenizer config
Matthew Kerwin
matthew.kerwin at qut.edu.au
Tue Nov 12 00:28:05 GMT 2013
Hi EPrints world,
I was looking at how prevalent the common typo "seperator" (for
"separator") is in EPrints trunk, and discovered that
perl_lib/EPrints/Index/Tokenizer.pm defines a hashref,
$EPrints::Index::FREETEXT_SEPERATOR_CHARS, which is almost exactly the same
as the config value $c->{indexing}->{freetext_seperator_chars} in
cfg.d/indexing.pl. The former includes one extra character, and I
can't work out how either of them is referenced (if at all) in the
codebase.
Could someone clarify which one is used where and how, and how the two
could be cleaned up or better integrated?
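
For anyone who wants to check this against their own checkout, here is a rough sketch of the search I have in mind. It is a standalone Python helper, not part of EPrints. The function names (find_references, extra_chars) are made up for illustration, and it assumes the relevant sources are .pm and .pl files. It matches both spellings of the name, case-insensitively, so it should catch the package variable and the config key alike.

```python
import os
import re

# Matches both names from this message, plus the correctly spelled
# "separator" variants in case any exist. Case-insensitive so that the
# upper-case package variable and the lower-case config key both match.
PATTERN = re.compile(r"freetext_sep[ae]rator_chars", re.IGNORECASE)


def find_references(root):
    """Return sorted (relative_path, line_number, stripped_line) tuples for
    every line in a .pm or .pl file under root that mentions either name."""
    hits = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            # Assumption: only Perl modules and scripts are of interest.
            if not name.endswith((".pm", ".pl")):
                continue
            path = os.path.join(dirpath, name)
            with open(path, encoding="utf-8", errors="replace") as fh:
                for lineno, line in enumerate(fh, start=1):
                    if PATTERN.search(line):
                        rel = os.path.relpath(path, root)
                        hits.append((rel, lineno, line.strip()))
    return sorted(hits)


def extra_chars(first, second):
    """Return the characters present in first but missing from second."""
    return sorted(set(first) - set(second))
```

Calling find_references() on the root of a checkout lists every file and line that mentions either name. Once the keys of the two hashes have been copied out by hand, extra_chars() reports which character appears in one set but not the other.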
Cheers
--
Matthew Kerwin | Library eServices Developer |
<https://wiki.qut.edu.au/display/lib/Digital+Repository+Team> Applications &
Development Team | Library eServices | Queensland University of
Technology | Level 3, R Block, Kelvin Grove | ph 3138 3910 |
matthew.kerwin at qut.edu.au | CRICOS No 00213J