[GOAL] Re: CC-BY: the wrong goal for open access, and neither necessary nor sufficient for data and text mining
Steve Hitchcock
sh94r at ecs.soton.ac.uk
Wed Oct 10 13:17:38 BST 2012
This brings to mind the idea of the data paper, described here
http://www.pensoft.net/J_FILES/Pensoft_Data_Publishing_Policies_and_Guidelines.pdf
This seems to have been pioneered by this publisher. There is also a data paper journal in archaeology, JOAD
http://openarchaeologydata.metajnl.com/submit-a-data-paper/
How might this impact on repositories and the suggestion of green CC-BY? Interestingly there is one IR in JOAD's list of recommended repositories for data deposit, I think because UCL is also home to JOAD
http://openarchaeologydata.metajnl.com/repositories/
Might IRs become the primary source of data papers? Will these data papers become the primary publications or supplementary publications? If the former then presumably there would be competition with journals to publish data papers.
Through initiatives such as the JISC MRD programme, IRs in the UK are investigating how they may become data sources. Some may be looking specifically at data publications.
We are anticipating substantial changes in author practices here. I would agree that IRs should be exploring the options here and looking to innovate, but developments are likely to be small scale initially.
Are data papers, openly licenced, machine readable with data that can be manipulated, the real target here, rather than the typical published paper where the data may not have these features whether CC licenced or not?
Steve Hitchcock
DataPool Joint-Project Manager
WAIS Group, Building 32
Faculty of Electronics and Computer Science
University of Southampton, SO17 1BJ, UK
Email: sh94r at ecs.soton.ac.uk
Tel: +44 (0)23 8059 9379 Fax: +44 (0)23 8059 2865
http://datapool.soton.ac.uk/
Twitter @jiscdatapool
On 10 Oct 2012, at 10:24, Peter Murray-Rust wrote:
>
> On Wed, Oct 10, 2012 at 9:14 AM, Dan Stowell <dan.stowell at eecs.qmul.ac.uk> wrote:
> Hi all,
>
> Some points re this discussion:
>
> Helen wrote:
>
> Heather??
>
> > 1. CC-BY is not necessary for data and text-mining. Internet search engines such as google and social media companies do extensive data and text mining, and they do not limit themselves to CC-BY material. This is true even in the EU, so is not prevented by the EU's support for copyright of data. To illustrate: if data and text-mining is not permissible without CC-BY, then Google must shut down, immediately.
>
> This point is a bit weird. Firstly, just because Google is doing
> something and getting away with it, doesn't mean a lone academic can be
> confident of doing something similar and getting away with it. I was
> always amazed by how brazenly Youtube set up its service *before* making
> agreements with the major media companies, when I would have assumed
> they would have been sued out of existence.
>
> Completely agree with Dan. Licences are critical for data. That is why we spent 2 years creating the Panton Principles for Scientific data with the result that we strongly recommend and explicit licence such as CCZero. The large content-oriented companies get away with a huge amount and there is a Faustian acceptance that they do a good job for academia by apparenty breaking rights, while academics are prevented from doing the same.
>
> This is why it is urgent that repositories work as hard as possible to make the content available in as Open a form as possible. Many repos do the reverse, adding additional clauses of their own preventing re-use. They are often of the form "You cannot re-use this content in any way without the permission of the individual copyright owners".
>
>
> The Open Database Licence also appears to assert "that digital material
> must be made available in a readily machine-interpretable form"
> <http://opendatacommons.org/licenses/odbl/summary/> though I'm less
> familiar with that (see the "Keep open" part of the summary).
>
> Yes. This means that a database provider who wishes to conform should make it as easy as possible to discover and re-use the data.
>
> --
> Peter Murray-Rust
> Reader in Molecular Informatics
> Unilever Centre, Dep. Of Chemistry
> University of Cambridge
> CB2 1EW, UK
> +44-1223-763069
> _______________________________________________
> GOAL mailing list
> GOAL at eprints.org
> http://mailman.ecs.soton.ac.uk/mailman/listinfo/goal
More information about the GOAL
mailing list