[EP-tech] Experimental integration with MS Azure Cognitive Services

David R Newman drn at ecs.soton.ac.uk
Mon Jun 29 10:05:31 BST 2020


One thing I forgot to mention was that this call is probably better as 
an event queue task rather than a before commit trigger.  As if you are 
relying on a third-party application you don't want to be waiting on 
this before the page can reload.  If you are only send the abstract that 
is probably going to be fairly rapid.  However, sometimes abstracts can 
be rather long and I can imagine the service being less responsive at 
times. So you may wait a while before you get a response back, so you 
can reload the page.  Obviously the disadvantage of having an event 
queue task is that you will end up with two revisions rather than one.  
However, your code at the moment suggests this service will only be 
called once, as once there are keywords it cannot be run again.


Just another thought: If you were to change this so keywords could be 
updated, obviously you would want to check to see the fields that were 
being sent for keywords analysis had changed and only call this service 
if they had.  You would also need some code to parse the current and 
returned keywords to merge them together.


On 29/06/2020 09:54, David R Newman via Eprints-tech wrote:
>
> Hi Liam,
>
>
> Looks interesting.  I have been working on improving search on 
> keywords within EPrints by introducing a new Keywords MetaField type 
> that is backwards with the Longtext MetaField that is currently the 
> type with the keywords field.  I have also introduced a Idci (short 
> for ID Case Insentive) field that could be used for keywords when set 
> to be a multiple field. Hopefully, I will be able to make the official 
> release of 3.4.2 that includes these available this week.
>
>
> Regards
>
>
> David Newman
>
>
> On 29/06/2020 09:44, Liam Green-Hughes via Eprints-tech wrote:
>> Hi everyone,
>>
>> I've been experimenting with integrating EPrints with an off the 
>> shelf AI solution to generate keywords. The service I used was the 
>> Text Analytics service element of the Microsoft Azure Text Analytics. 
>> I've only gone as far as experimenting with the Key Phrases endpoint 
>> so far, but the Named Entities endpoint looks like it could add real 
>> value to EPrints records too.
>>
>> The integration file is here: 
>> https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fliamgh%2Feprints-ai-expt%2Fblob%2Fmaster%2Fz_azure_keywords.pl&data=01%7C01%7C%7Cf62a5359dded42515c4f08d81c0b941c%7C4a5378f929f44d3ebe89669d03ada9d8%7C0&sdata=qHjFvDL6h%2FEGl5Me2NCg9fpXWU6KepvORh%2BTZWUgS08%3D&reserved=0 
>> <https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fliamgh%2Feprints-ai-expt%2Fblob%2Fmaster%2Fz_azure_keywords.pl&amp;data=01%7C01%7C%7Cf62a5359dded42515c4f08d81c0b941c%7C4a5378f929f44d3ebe89669d03ada9d8%7C0&amp;sdata=qHjFvDL6h%2FEGl5Me2NCg9fpXWU6KepvORh%2BTZWUgS08%3D&amp;reserved=0>. 
>> It isn't really production ready, but it is a starting point.
>>
>> Let me know your thoughts!
>>
>> Thanks
>> Liam
>>
>> *** Options:http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
>> *** Archive:https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.eprints.org%2Ftech.php%2F&amp;data=01%7C01%7C%7Cf62a5359dded42515c4f08d81c0b941c%7C4a5378f929f44d3ebe89669d03ada9d8%7C0&amp;sdata=0iYKNx1aFzBjvyxJ8AS%2FQcrrMjsINM770y7uImosSkg%3D&amp;reserved=0
>> *** EPrints community wiki:https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwiki.eprints.org%2F&amp;data=01%7C01%7C%7Cf62a5359dded42515c4f08d81c0b941c%7C4a5378f929f44d3ebe89669d03ada9d8%7C0&amp;sdata=xOtix3mtbRZ63uJwb1%2B3MPEQp6jWEhr%2FaBZoSRaZYw8%3D&amp;reserved=0
>
> <https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.avg.com%2Femail-signature%3Futm_medium%3Demail%26utm_source%3Dlink%26utm_campaign%3Dsig-email%26utm_content%3Demailclient&amp;data=01%7C01%7C%7Cf62a5359dded42515c4f08d81c0b941c%7C4a5378f929f44d3ebe89669d03ada9d8%7C0&amp;sdata=QwFh3Qwdw1mJDBbHYiTZB2hWA6TkV204e8cJ128Xeuc%3D&amp;reserved=0> 
> 	Virus-free. https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.avg.com%2F&amp;data=01%7C01%7C%7Cf62a5359dded42515c4f08d81c0b941c%7C4a5378f929f44d3ebe89669d03ada9d8%7C0&amp;sdata=Xx3E37BEG2wO%2FTCSVkEGgFcE2CY5yZjbX6XenNZ%2B8wA%3D&amp;reserved=0 
> <https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.avg.com%2Femail-signature%3Futm_medium%3Demail%26utm_source%3Dlink%26utm_campaign%3Dsig-email%26utm_content%3Demailclient&amp;data=01%7C01%7C%7Cf62a5359dded42515c4f08d81c0b941c%7C4a5378f929f44d3ebe89669d03ada9d8%7C0&amp;sdata=QwFh3Qwdw1mJDBbHYiTZB2hWA6TkV204e8cJ128Xeuc%3D&amp;reserved=0> 
>
>
> <#DAB4FAD8-2DD7-40BB-A1B8-4E2AA1F9FDF2>
>
> *** Options: http://mailman.ecs.soton.ac.uk/mailman/listinfo/eprints-tech
> *** Archive: https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwww.eprints.org%2Ftech.php%2F&amp;data=01%7C01%7C%7Cf62a5359dded42515c4f08d81c0b941c%7C4a5378f929f44d3ebe89669d03ada9d8%7C0&amp;sdata=0iYKNx1aFzBjvyxJ8AS%2FQcrrMjsINM770y7uImosSkg%3D&amp;reserved=0
> *** EPrints community wiki: https://eur03.safelinks.protection.outlook.com/?url=http%3A%2F%2Fwiki.eprints.org%2F&amp;data=01%7C01%7C%7Cf62a5359dded42515c4f08d81c0b941c%7C4a5378f929f44d3ebe89669d03ada9d8%7C0&amp;sdata=xOtix3mtbRZ63uJwb1%2B3MPEQp6jWEhr%2FaBZoSRaZYw8%3D&amp;reserved=0


-- 
This email has been checked for viruses by AVG.
https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.avg.com%2F&amp;data=01%7C01%7C%7Cf62a5359dded42515c4f08d81c0b941c%7C4a5378f929f44d3ebe89669d03ada9d8%7C0&amp;sdata=viqL7zl8iF3XqWcwegPuChJqlIhcwCbzMo0sumGY2mY%3D&amp;reserved=0
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ecs.soton.ac.uk/pipermail/eprints-tech/attachments/20200629/4b520c61/attachment.html 


More information about the Eprints-tech mailing list