[GOAL] Re: One way to expand the OA movement: be more inclusive
Peter Murray-Rust
pm286 at cam.ac.uk
Mon Jun 1 21:17:01 BST 2015
On Mon, Jun 1, 2015 at 6:47 PM, Heather Morrison <
Heather.Morrison at uottawa.ca> wrote:
> Question and challenge
>
> Question: Didn't the UK recently change its legislation explicitly to
> allow for data and text mining?
>
Yes,
The Statutory Instrument came into force in June2014 following
recommendations In Prof Hargreaves' report. They basically provide for
"private research for non-commercial puposes". They allow miners to
override restrictive clauses inserted by publishers in contracts. However
it is unclear what can be published as a result of mining and it is unclear
how it interacts with the European sui generis directive on Database rights.
We are mining the literature under the new Instrument. However this has not
been tested in court.
A full and authoritative account is given by Prof Charles Oppenheim who is
an active adviser to TheContentMine
> Challenge: My research blog and data verses are both fully open with no
> CC license at all. They are All Rights Reserved, and yet posted on the web,
> in the case of the dataverse deliberately so that people can go ahead and
> download and manipulate the data. I challenge anyone to go ahead and try
> some text and data mining. If you think there are legalities preventing you
> from doing this, please explain what they are.
>
This language makes little sense. The material is not "fully open". It is
posted on your web site and could be withdrawn at any time. This is not
Open - it is temporarily free-to-view. You posses the copyright.
ContentMine inevitably requires copying from your site which is potentially
an infringement of copyright and in most jurisdictions, including Canada,
you would have the right to sue a ContentMiner.
Please accept that posting on the web, with whatever good intentions but
without explicit licence, gives no rights to any potential user.
> ...
>
> I think we need to understand what barriers exist to data and text mining
> and resolve them, rather than assuming that pushing everyone to make their
> work CC-BY is the answer.
>
We understand many of the barriers already and they prevent us mining the
content without an agreement. Only with CC-BY or CC0 can we do this knowing
that we can do it without the permission of the copyright owner. The UK
legislation gives UK researchers an additional resource, which we are now
using.
> For example, if my blog were CC-BY licensed, this wouldn't help with
> Wordpress not being set up to search the comments. Another example: there
> is nothing to stop the Licensor (as opposed to the downstream user) to put
> TPMs in a CC-BY or CC-0 work that would effectively prevent people from
> data and text mining.
>
A CC BY document, with only one copy behind the LIcensor's firewall is not
accessible and is therefore operationally closed. If one copy is published,
then it can be copied and cannot be revoked by the licensor.
>
> If one is legally prevented from data and text mining works that are in
> the open, no doubt as a law-abiding citizen you're not using any internet
> search engine.
>
the legality of search engines is unclear in many cases. Many have
agreements with large content providers - most STM publishers allow Google
to search and index their content. This does not mean that everyone can
spider everything and if you try it you will get pushback.
>
>
--
Peter Murray-Rust
Reader in Molecular Informatics
Unilever Centre, Dep. Of Chemistry
University of Cambridge
CB2 1EW, UK
+44-1223-763069
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mailman.ecs.soton.ac.uk/pipermail/goal/attachments/20150601/a6b80b82/attachment.html
More information about the GOAL
mailing list