[EP-tech] Re: Question about full text search (Documents in Advanced Search page)

Michael Street mstreet at yorku.ca
Fri Jan 22 21:00:38 GMT 2016


Hi again,

Does anyone have any idea why these documents are not showing up in the 
search results?

Any suggestions would really be appreciated.  I'm at a loss as to why 
it's not returning results that clearly have the search term in the pdf 
(and the converted text document).

--Mike Street

On 1/15/2016 11:05 AM, Michael Street wrote:
> Hi John,
>
> Thanks very much for your response.  Please find my answers below:
>
> 1)  Indexer is running and confirmed to be working.  The documents that
> don't show up are some of the oldest and are available through other
> links.  Newly deposited items also show up in the Views.
>
> 2)  I have tried pdftotext on the system and had no issues with
> converting it.  I also was able to find the search term within the
> document easily.
>
> 3)  I run a cronjob that updates the DB and switches everything to be
> visible, every 15 minutes.  My client does not want anything to be
> hidden, especially previous versions of eprints, so this was the easiest
> way to achieve that, for me.  Also, the eprints in question do show up
> in the Views, which shows they're set to visible.
>
> So if you have any other ideas, I'd really appreciate it.  I'm at a loss
> here.
>
> Thanks,
> Mike.
>
>
> On 1/14/2016 4:35 PM, John Salter wrote:
>> Hi,
>> I'd check that your indexer is running, and that the task queue is being processed.
>>
>> I'd also check that the PDFs aren't restricted in some way (maybe see what something like pdftotext returns when run against one of the not-returned PDFs).
>>
>> Also, as was mentioned in a different thread recently, check what the 'metadata visibility' flag for the EPrint is.
>>
>> If none of that gets you anywhere, let us know and we'll put our collective thinking caps on!
>>
>> Cheers,
>> John
>>
>> ________________________________________
>> From: eprints-tech-bounces at ecs.soton.ac.uk <eprints-tech-bounces at ecs.soton.ac.uk> on behalf of Michael Street <mstreet at yorku.ca>
>> Sent: 14 January 2016 16:04
>> To: eprints-tech at ecs.soton.ac.uk
>> Subject: [EP-tech] Question about full text search (Documents in Advanced Search page)
>>
>> Hi,
>>
>> I've got some pdfs in the repository that include the phrase 'bohm' many
>> times but the Advanced Search page is only returning 4 out of probably
>> 25+ eprints as hits on the phrase.  I'm using the Documents search box,
>> which I believe is the full-text search box.  Is there something I'm
>> missing?
>>
>> Any help would be appreciated thanks,
>> Mike.
>>
>> *** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
>> *** Archive: http://www.eprints.org/tech.php/
>> *** EPrints community wiki: http://wiki.eprints.org/
>> *** EPrints developers Forum: http://forum.eprints.org/
