[GOAL] Re: Open data

Andrew A. Adams aaa at meiji.ac.jp
Wed May 9 00:53:55 BST 2012


Jan Velterop wrote:
> The trouble with focussing on 'green', rather than on full
> BOAI-compliant OA for research literature, is that it has become an a
> priori concession and an end in itself. That only confuses matters (as
> do ill-defined labels such as 'gratis' and 'libre'). 

> We should insist on BOAI-compliant OA (CC-BY or CC-0) for all research
> articles, including for self-archived articles. And if anything, we
> should insist on institutional repositories to actually be searchable
> and accessible also for text mining. Human-readable OA is a conditio
> sine qua non, but it is not sufficient for modern science.

The trouble with focussing on this high level is that there isn't agreement 
amongst scientists that this is what is needed, and on exactly what the 
limits of this are. Indeed, opening up one's data can involve significantly 
more work for the scientist/scholar. Green OA requires a few keystrokes per 
paper. Perhaps five minutes work (so long as one's repository is set up well 
and one keeps focussed on providing the paper's text and does not get too 
hung up about more than citing meta-data).

I have a PhD student, for example, who has just finished her thesis. The 
thesis and papers from it contain various statistics and quotes drawn from 
her field work. We are still working on further papers from the thesis with 
an expectatin of two more to come. The data has been appropriately developed 
for the publications written at present, but the fullw interviews from which 
quotes are drawn have not had their source seudonymised; the numeric data has 
only been put systematised for the precise analyses used in the thesis and 
the papers. Some of it is in incompatible file formats with chunks of the raw 
data put into different tools in different (overlapping but not a single set 
in any one tool).

What rights to first publication of specific analysis on this data do my 
student and her supervisors have? Which elements of the data are required to 
be made available?

If we wait until we can answer these questions before providing the 
additional access to the existing outputs then we are likely to wait another 
twenty years or more before achieving full access to the papers. Yes, in a 
few fields perhaps, the data must be in a publishable form before a paper can 
be published, but there are currently no social mechanisms, and indeed few 
technological mechanisms, that can cope with providing this data at present. 
There is an easy, simple solution to providing access to the text of papers: 
put a pdf, word, html, rtf, odt or even plain text of the author's final 
submitted text in an institutional repository or the opendepot.

Human-readable OA is within our grasp but we're not grasping it! Let us grasp 
this first and THEN go on to sort out the more difficult issues. Otherwise 
we're just fiddling while Rome burns (struggling to reform the whole of 
scholarly and scientific communications in one go rather than doing what is 
simple and achievable now with little in the way of controversy about its 
beneficial effects on science and scholarship and then and only then dealing 
with the more difficult issues).


-- 
Professor Andrew A Adams                      aaa at meiji.ac.jp
Professor at Graduate School of Business Administration,  and
Deputy Director of the Centre for Business Information Ethics
Meiji University, Tokyo, Japan       http://www.a-cubed.info/





More information about the GOAL mailing list