[EP-tech] Re: Question about full text search (Documents in Advanced Search page)
Michael Street
mstreet at yorku.ca
Fri Jan 22 21:00:38 GMT 2016
Hi again,
Does anyone have any idea why these documents are not showing up in the
search results?
Any suggestions would really be appreciated. I'm at a loss as to why
it's not returning results that clearly have the search term in the pdf
(and the converted text document).
--Mike Street
On 1/15/2016 11:05 AM, Michael Street wrote:
> Hi John,
>
> Thanks very much for your response. Please find my answers below:
>
> 1) Indexer is running and confirmed to be working. The documents that
> don't show up are some of the oldest and are available through other
> links. Newly deposited items also show up in the Views.
>
> 2) I have tried pdftotext on the system and had no issues with
> converting it. I also was able to find the search term within the
> document easily.
>
> 3) I run a cronjob that updates the DB and switches everything to be
> visible, every 15 minutes. My client does not want anything to be
> hidden, especially previous versions of eprints, so this was the easiest
> way to achieve that, for me. Also, the eprints in question do show up
> in the Views, which shows they're set to visible.
>
> So if you have any other ideas, I'd really appreciate it. I'm at a loss
> here.
>
> Thanks,
> Mike.
>
>
> On 1/14/2016 4:35 PM, John Salter wrote:
>> Hi,
>> I'd check that you indexer is running, and that the task queue is processed.
>>
>> I'd also check that the PDFs aren't restricted in some way (maybe see what something like pdftotext returns when run against one of the not-returned PDFs.
>>
>> Also, as was mentioned in a different thread recently, check what the 'metadata visibility' flag for the EPrint is.
>>
>> If none of that gets you anywhere, let us know and we'll put our collective thinking caps on!
>>
>> Cheers,
>> John
>>
>> ________________________________________
>> From: eprints-tech-bounces at ecs.soton.ac.uk <eprints-tech-bounces at ecs.soton.ac.uk> on behalf of Michael Street <mstreet at yorku.ca>
>> Sent: 14 January 2016 16:04
>> To: eprints-tech at ecs.soton.ac.uk
>> Subject: [EP-tech] Question about full text search (Documents in Advanced Search page)
>>
>> Hi,
>>
>> I've got some pdfs in the repository that include the phrase 'bohm' many
>> times but the Advanced Search page is only returning 4 out of probably
>> 25+ eprints as hits on the phrase. I'm using the Documents search box,
>> which I believe it the full-text search box. Is there something I'm
>> missing?
>>
>> Any help would be appreciated thanks,
>> Mike.
>>
>> *** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
>> *** Archive: http://www.eprints.org/tech.php/
>> *** EPrints community wiki: http://wiki.eprints.org/
>> *** EPrints developers Forum: http://forum.eprints.org/
>>
>> *** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
>> *** Archive: http://www.eprints.org/tech.php/
>> *** EPrints community wiki: http://wiki.eprints.org/
>> *** EPrints developers Forum: http://forum.eprints.org/
> *** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
> *** Archive: http://www.eprints.org/tech.php/
> *** EPrints community wiki: http://wiki.eprints.org/
> *** EPrints developers Forum: http://forum.eprints.org/
More information about the Eprints-tech
mailing list