[EP-tech] indexer/tokenizer config

Matthew Kerwin matthew.kerwin at qut.edu.au
Tue Nov 12 00:28:05 GMT 2013


Hi EPrints world,

 

I was having a look at the prevalence of the common typo "seperator" (for
"separator") in EPrints trunk, and discovered that
perl_lib/EPrints/Index/Tokenizer.pm defines a hashref
$EPrints::Index::FREETEXT_SEPERATOR_CHARS which is almost exactly the same
as the config value $c->{indexing}->{freetext_seperator_chars} in
cfg.d/indexing.pl, however the former includes an extra character, and I
can't work out how either of them are referenced (if at all) in the
codebase.

 

Could someone provide some clarification here on which is used where/how,
and how they could be cleaned up or better integrated?

 

Cheers

-- 

Matthew Kerwin  |  Library eServices Developer  |
<https://wiki.qut.edu.au/display/lib/Digital+Repository+Team> Applications &
Development Team  |  Library eServices  |  Queensland University of
Technology  |  Level 3, R Block, Kelvin Grove  |  ph  <tel:+61731383910>
3138 3910  |   <mailto:matthew.kerwin at qut.edu.au> matthew.kerwin at qut.edu.au
|  CRICOS No 00213J

 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20131112/18ca9e12/attachment-0001.html 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 6087 bytes
Desc: not available
Url : http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20131112/18ca9e12/attachment-0001.bin 


More information about the Eprints-tech mailing list