<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Science in the Open</title>
	<atom:link href="http://cameronneylon.net/feed/" rel="self" type="application/rss+xml" />
	<link>http://cameronneylon.net</link>
	<description>The online home of Cameron Neylon</description>
	<lastBuildDate>Wed, 09 May 2012 18:52:26 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>Parsing the Willetts Speech on Access to UK Research Outputs</title>
		<link>http://cameronneylon.net/blog/parsing-the-willetts-speech-on-access-to-uk-research-outputs/</link>
		<comments>http://cameronneylon.net/blog/parsing-the-willetts-speech-on-access-to-uk-research-outputs/#comments</comments>
		<pubDate>Thu, 03 May 2012 19:24:06 +0000</pubDate>
		<dc:creator>Cameron Neylon</dc:creator>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[David Willetts]]></category>
		<category><![CDATA[Jimmy Wales]]></category>
		<category><![CDATA[mandate]]></category>
		<category><![CDATA[open access]]></category>
		<category><![CDATA[open data]]></category>
		<category><![CDATA[policy]]></category>
		<category><![CDATA[Research Councils UK]]></category>
		<category><![CDATA[scipolicy]]></category>

		<guid isPermaLink="false">http://cameronneylon.net/?p=16217</guid>
		<description><![CDATA[Yesterday David Willetts, the UK Science and Universities Minister gave a speech to the Publishers Association that has got wide coverage. However it is worth pulling apart both the speech and the accompanying opinion piece from the Guardian because there are some interesting elements in there, and also some things have got a little confused.
The first really key point is that there is nothing new here. This is basically a re-announcement of the previous position from the December Innovation Strategy on moving towards a freely accessible literature and a more public announcement ...]]></description>
			<content:encoded><![CDATA[<div class="wp-caption alignleft" style="width: 210px"><a href="http://commons.wikipedia.org/wiki/File:David_Willetts.jpg" target="_blank"><img class="zemanta-img-inserted zemanta-img-configured" title="David Willetts speaking at the Big Society pol..." src="http://cameronneylon.net/wp-content/uploads/2012/05/300px-David_Willetts14.jpg" alt="David Willetts speaking at the Big Society pol..." width="200" /></a><p class="wp-caption-text">David Willetts speaking at the Big Society policy launch, Coin St, London. (Photo credit: Wikipedia)</p></div>
<p>Yesterday <a class="zem_slink" title="David Willetts" href="http://en.wikipedia.org/wiki/David_Willetts" rel="wikipedia" target="_blank">David Willetts</a>, the UK Science and Universities Minister gave <a href="http://www.bis.gov.uk/news/speeches/david-willetts-public-access-to-research">a speech</a> to the Publishers Association that has got wide coverage. However it is worth pulling apart both the speech and the accompanying <a href="http://www.guardian.co.uk/commentisfree/2012/may/01/open-free-access-academic-research">opinion piece</a> from the <a class="zem_slink" title="The Guardian" href="http://www.guardian.co.uk/" rel="homepage" target="_blank">Guardian</a> because there are some interesting elements in there, and also some things have got a little confused.</p>
<p>The first really key point is that there is nothing new here. This is basically a re-announcement of the previous position from the December <a href="http://www.bis.gov.uk/innovatingforgrowth">Innovation Strategy</a> on moving towards a freely accessible literature and a more public announcement of the Gateway to Research project previously mentioned in <a href="http://www.rcuk.ac.uk/media/news/2011news/Pages/111208.aspx">the RCUK response to the Innovation Statement</a>.</p>
<p>The Gateway to Research project is a joint venture of the <a href="http://www.bis.gov.uk/">Department of Business Innovation and Skills</a> and <a class="zem_slink" title="Research Councils UK" href="http://www.rcuk.ac.uk" rel="homepage" target="_blank">Research Councils UK</a> to provide a one stop shop for information on UK research <em>funding</em> as well as pointers to outputs. It will essentially draw information directly from sources that already exist (the Research Outputs System and eVal) as well as some new ones with the intention of helping the UK public and enterprise find research and researchers that is of interest to them, and see how they are funded.</p>
<p>The new announcement was that <a class="zem_slink" title="Jimmy Wales" href="http://jimmywales.com/" rel="homepage" target="_blank">Jimmy Wales</a> of <a class="zem_slink" title="Wikipedia" href="http://en.wikipedia.org/wiki/Wikipedia" rel="wikipedia" target="_blank">Wikipedia</a> fame will be advising on the GTR portal. This is a good thing and he is well placed to provide both technical and social expertise on the provision of public facing information portals as well as providing a more radical perspective than might come out of <a class="zem_slink" title="Department for Business, Innovation and Skills" href="http://www.bis.gov.uk/" rel="homepage" target="_blank">BIS</a> itself. While this might in part be cynically viewed as another example of bringing in celebrities to advise on policy this is a celebrity with relevant expertise and real credibility based on making similar systems work.</p>
<p>The rest of the information that we can gather relates to government efforts in moving towards making the UK research literature accessible. Wales also gets a look in here, and will be &#8220;advising us on [..] common standards to ensure information is presented in a readily reusable form&#8221;. My reading of this is that the Minister understands the importance of interoperability and my hope is that this will mean that government is getting good advice on appropriate licensing approaches to support this.</p>
<p>However, many have read this section of the speech as saying that GTR will act as some form of national repository for research articles. I do not believe this is the intention, and reading between the lines the comment that it will &#8220;provide direct <em>links</em> to actual research outputs such as data sets and publications&#8221; [my emphasis] is the key. The point of GTR is to make UK research more easily <em>discoverable</em>. Access is a somewhat orthogonal issue. This is better read as an expression of Willetts&#8217; and the wider government&#8217;s agenda on transparency of public spending than as a mechanism for providing access.</p>
<p>What else can we tell from the speech? Well the term &#8220;open access&#8221; is used several times, something that was absent from the innovation statement, but still the emphasis is on achieving &#8220;public access&#8221; in the near term with &#8220;open access&#8221; cast as the future goal as I read it. It&#8217;s not clear to me whether this is a well informed distinction. There is a somewhat muddled commentary on Green vs Gold OA but not that much more muddled than what often comes from our own community. There are also some clear statements on the challenges for all involved.</p>
<p>As an aside I found it interesting that Willetts gave a parenthetical endorsement of usage metrics for the research literature when speaking of his own experience.</p>
<blockquote><p>As well as reading some of the articles set by my tutors, I also remember browsing through the pages of the leading journals to see which articles were well-thumbed. It helped me to spot the key ones I ought to be familiar with – a primitive version of crowd-sourcing. The web should make that kind of search behaviour far easier.</p></blockquote>
<p>This is the most sophisticated appreciation of the potential for the combination of measurement and usage data in discovery that I have seen from any politician. It needs to be set against his endorsement of rather cruder filters earlier in the speech but it nonetheless gives me a sense that there is a level of understanding within government that is greater than we often fear.</p>
<p>Much of the rest of the speech is hedging. Options are discussed but not selected and certainly not promoted. The key message: wait for the <a href="http://www.researchinfonet.org/publish/wg-expand-access/">Finch Report</a> which will be the major guide for the route the government will take and the mechanisms that will be put in place to support it.</p>
<p>But there are some clearer statements. There is a strong sense that <a href="http://www.ipo.gov.uk/ipreview.htm">Hargreave&#8217;s recommendations</a> on enabling text mining should be implemented. And the logic for this is well laid out. The speech and the policy agenda is embedded in a framework of enabling innovation &#8211; making it clear what kinds of evidence and argument we will need to marshal in order to persuade. There is also a strong emphasis on data as well as an appreciation that there is much to do in this space.</p>
<p>But the clearest statement made here is on the end goals. No-one can be left in any doubt of Willetts&#8217; ultimate target. Full access to the outputs of research, ideally at the time of publication, in a way that enables them to be fully exploited, manipulated and modified for any purpose by any party. Indeed the vision is strongly congruent with the Berlin, Bethesda, and Budapest declarations on Open Access. There is still much to be argued about the route and and its length, but in the UK at least, the destination appears to be in little doubt.</p>
<h6 class="zemanta-related-title">Related articles</h6>
<ul class="zemanta-article-ul zemanta-article-ul-image">
<li class="zemanta-article-ul-li-image zemanta-article-ul-li"><a href="http://blogs.nature.com/news/2012/05/key-questions-in-the-uks-shift-to-open-access-research.html" target="_blank"><img src="http://cameronneylon.net/wp-content/uploads/2012/05/noimg_1715.jpg" alt="" /></a><a href="http://blogs.nature.com/news/2012/05/key-questions-in-the-uks-shift-to-open-access-research.html" target="_blank">Key questions in the UK&#8217;s shift to open-access research</a><span>(blogs.nature.com)</span>
<div>
<hr />
</div>
</li>
</ul>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://scholarlykitchen.sspnet.org/2012/05/03/crossing-the-rubicon-is-the-uk-going-to-enable-open-access-for-all-taxpayer-funded-research-by-2014/" target="_blank">Crossing the Rubicon &#8211; Is the UK Going to Enable Open Access for All Taxpayer-Funded Research by 2014?</a> (scholarlykitchen.sspnet.org)</li>
<li class="zemanta-article-ul-li"><a href="http://svpow.com/2012/05/03/uk-government-on-open-access-better-than-i-could-have-hoped/" target="_blank">UK Government on open access: better than I could have hoped</a> (svpow.com)</li>
<li class="zemanta-article-ul-li"><a href="http://occamstypewriter.org/scurry/2012/05/03/willetts-speech-on-open-access-analysis/">Willetts&#8217; Speech on Open Access: Analysis</a> (reciprocal space)</li>
</ul>
<div class="zemanta-pixie"><a class="zemanta-pixie-a" title="Enhanced by Zemanta" href="http://www.zemanta.com/"><img class="zemanta-pixie-img" src="http://img.zemanta.com/zemified_e.png?x-id=42bc1672-387e-43ce-965f-b252319427dd" alt="Enhanced by Zemanta" /></a></div>
 <p><a href="http://cameronneylon.net/?flattrss_redirect&amp;id=16217&amp;md5=70bd0755d0765c29018c491c2cbf7822" title="Flattr" target="_blank"><img src="http://cameronneylon.net/wp-content/plugins/flattrss/img/flattr-badge-large.png" alt="flattr this!"/></a></p>]]></content:encoded>
			<wfw:commentRss>http://cameronneylon.net/blog/parsing-the-willetts-speech-on-access-to-uk-research-outputs/feed/</wfw:commentRss>
		<slash:comments>9</slash:comments>
		<atom:link rel="payment" href="https://flattr.com/submit/auto?user_id=27882&amp;popout=1&amp;url=http%3A%2F%2Fcameronneylon.net%2Fblog%2Fparsing-the-willetts-speech-on-access-to-uk-research-outputs%2F&amp;language=en_GB&amp;category=text&amp;title=Parsing+the+Willetts+Speech+on+Access+to+UK+Research+Outputs&amp;description=Yesterday+David+Willetts%2C+the+UK+Science+and+Universities+Minister+gave+a+speech+to+the+Publishers+Association+that+has+got+wide+coverage.+However+it+is+worth+pulling+apart+both+the+speech...&amp;tags=David+Willetts%2CJimmy+Wales%2Cmandate%2Copen+access%2Copen+data%2Cpolicy%2CResearch+Councils+UK%2Cscipolicy%2Cblog" type="text/html" />
	</item>
		<item>
		<title>Some brief responses to the Sage Bionetworks Congress</title>
		<link>http://cameronneylon.net/blog/some-brief-responses-to-the-sage-bionetworks-congress/</link>
		<comments>http://cameronneylon.net/blog/some-brief-responses-to-the-sage-bionetworks-congress/#comments</comments>
		<pubDate>Sun, 22 Apr 2012 22:50:33 +0000</pubDate>
		<dc:creator>Cameron Neylon</dc:creator>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Featured]]></category>

		<guid isPermaLink="false">http://cameronneylon.net/?p=16112</guid>
		<description><![CDATA[I attended the first Sage Bionetworks Congress in 2010 and it left a powerful impression on my thinking. I have just attended the third congress in San Francisco and again the challenging nature of views, the real desire to make a difference, and the standard of thinking in the room will take me some time to process. But a series of comments, and soundbites over the course of the meeting have made me realise just how seriously bad our situation is.

Attempts by a variety of big pharma to replicate disease ...]]></description>
			<content:encoded><![CDATA[<p>I attended the first <a class="zem_slink" title="Sage Bionetworks" href="http://www.sagebase.org/index.html" rel="homepage" target="_blank">Sage Bionetworks</a> Congress in 2010 and it <a href="http://cameronneylon.net/blog/engage-or-become-irrelevant/">left a powerful impression on my thinking</a>. I have just attended the third congress in <a class="zem_slink" title="San Francisco" href="http://maps.google.com/maps?ll=37.7793,-122.4192&amp;spn=0.1,0.1&amp;q=37.7793,-122.4192 (San%20Francisco)&amp;t=h" rel="geolocation" target="_blank">San Francisco</a> and again the challenging nature of views, the real desire to make a difference, and the standard of thinking in the room will take me some time to process. But a series of comments, and soundbites over the course of the meeting have made me realise just how seriously bad our situation is.</p>
<ul>
<li>Attempts by a variety of big pharma to replicate disease relevant results published by academic labs failed in ~80% of cases (see for instance <a href="http://www.ahrp.org/cms/content/view/839/9/">this story</a> about <a href="http://www.nature.com/nature/journal/v483/n7391/full/483531a.html">this commentary in Nature</a>[$])</li>
<li>When a particular blood cancer group was asked what factor about their disease was most to them, they said gastro-intestinal problems. No health professional had ever even considered this as a gastro-intestinal disease.</li>
<li><a class="zem_slink" title="James Heywood (chief executive)" href="http://en.wikipedia.org/wiki/James_Heywood_%28chief_executive%29" rel="wikipedia" target="_blank">Jamie Heywood</a> spent $25M of his own money on <a href="http://informahealthcare.com/doi/abs/10.1080/17482960701856300">attempting to replicate around 500 published results</a> that were therapeutically relevant to ALS and could not repeat the findings in a single case.</li>
<li>A cancer patient, advocate, and fundraiser of 25 years standing said the following to me: &#8220;We&#8217;ve been at this for 25 years, we&#8217;ve raised over $2B for research, and new patients today get the same treatment I did. What&#8217;s the point?&#8221;</li>
</ul>
<p>In a room full of very smart people absolutely committed to making a difference there were very few new ideas on how we actually cut through the thicket of perverse incentives, institutional inertia, disregard for replicability, and personal ego-stroking which is perpetuating these problems. I&#8217;ve been uncertain for some time whether change from within our existing structures and systems is really possible. I&#8217;m leaning further and further to the view that it is not. That doesn&#8217;t mean that we can&#8217;t do anything &#8211; just that it may be more effective to simply bypass existing institutions to do it.</p>
<h6 class="zemanta-related-title">Related articles</h6>
<ul class="zemanta-article-ul">
<li class="zemanta-article-ul-li"><a href="http://radar.oreilly.com/2012/04/sage-congress-the-synthesis-of.html" target="_blank">Sage Congress: The synthesis of open source with genetics</a> (radar.oreilly.com)</li>
</ul>
<div class="zemanta-pixie"><a class="zemanta-pixie-a" title="Enhanced by Zemanta" href="http://www.zemanta.com/"><img class="zemanta-pixie-img" src="http://img.zemanta.com/zemified_e.png?x-id=8a9c1105-0fa5-4b48-a942-f8b3bb03dba5" alt="Enhanced by Zemanta" /></a></div>
 <p><a href="http://cameronneylon.net/?flattrss_redirect&amp;id=16112&amp;md5=01e9742bbb4438cdf2039f2cbc662083" title="Flattr" target="_blank"><img src="http://cameronneylon.net/wp-content/plugins/flattrss/img/flattr-badge-large.png" alt="flattr this!"/></a></p>]]></content:encoded>
			<wfw:commentRss>http://cameronneylon.net/blog/some-brief-responses-to-the-sage-bionetworks-congress/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		<atom:link rel="payment" href="https://flattr.com/submit/auto?user_id=27882&amp;popout=1&amp;url=http%3A%2F%2Fcameronneylon.net%2Fblog%2Fsome-brief-responses-to-the-sage-bionetworks-congress%2F&amp;language=en_GB&amp;category=text&amp;title=Some+brief+responses+to+the+Sage+Bionetworks+Congress&amp;description=I+attended+the+first+Sage+Bionetworks+Congress+in+2010+and+it+left+a+powerful+impression+on+my+thinking.+I+have+just+attended+the+third+congress+in+San+Francisco+and+again...&amp;tags=blog" type="text/html" />
	</item>
		<item>
		<title>A big leap and a logical step: Moving to PLoS</title>
		<link>http://cameronneylon.net/blog/a-big-leap-and-a-logical-step-moving-to-plos/</link>
		<comments>http://cameronneylon.net/blog/a-big-leap-and-a-logical-step-moving-to-plos/#comments</comments>
		<pubDate>Tue, 27 Mar 2012 14:48:23 +0000</pubDate>
		<dc:creator>Cameron Neylon</dc:creator>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Headline]]></category>
		<category><![CDATA[job]]></category>
		<category><![CDATA[open access]]></category>
		<category><![CDATA[plos]]></category>

		<guid isPermaLink="false">http://cameronneylon.net/?p=15925</guid>
		<description><![CDATA[As a child I was very clear I wanted to be a scientist. I am not sure exactly where the idea came from. In part I blame Isaac Asimov but it must have been a combination of things. I can&#8217;t remember not having a clear idea of wanting to go into research.
I started off a conventional career with big ideas &#8211; understanding the underlying physics, chemistry, and information theory that limits molecular evolution &#8211; but my problem was always that I was interested in too many things. I kept getting ...]]></description>
			<content:encoded><![CDATA[<div class="wp-caption alignleft" style="width: 250px"><a href="http://www.flickr.com/photos/14829735@N00/3928195989" target="_blank"><img class="zemanta-img-inserted zemanta-img-configured" title="PLoS: The Public Library of Science" src="http://cameronneylon.net/wp-content/uploads/2012/03/3928195989_d1ee018e86_m3.jpg" alt="PLoS: The Public Library of Science" width="240" height="240" /></a><p class="wp-caption-text">PLoS: The Public Library of Science (Photo credit: dullhunk)</p></div>
<p>As a child I was very clear I wanted to be a scientist. I am not sure exactly where the idea came from. In part I blame Isaac Asimov but it must have been a combination of things. I can&#8217;t remember not having a clear idea of wanting to go into research.</p>
<p>I started off a conventional career with big ideas &#8211; understanding the underlying physics, chemistry, and information theory that limits molecular evolution &#8211; but my problem was always that I was interested in too many things. I kept getting distracted. Along with this I also started to wonder how much of a difference the research I was doing was really making. This led to a shift towards working on methods development &#8211; developing tools that would support many researchers to do better and more efficient work. In turn it lead to my current position, with the aim of developing the potential of neutron scattering as a tool for the biosciences. I got gradually more interested in the question of  how to make the biggest difference I could, rather than just pursuing one research question.</p>
<p>And at the same time I was developing a growing interest in the power of the web and how it had the potential, as yet unrealized, to transform the effectiveness of the research community. This has grown from side interest to hobby to something like a full time job, on top of the other full time job I have. This wasn&#8217;t sustainable. At the same time I&#8217;ve realized I am pretty good at the strategy, advocacy, speaking and writing; at articulating a view of where we might go, and how we might get there. That in this space I can make a bigger difference. If we can increase the efficiency of research by just 5%, reduce the time for the developing world to bring a significant research capacity on stream by just a few years, give a few patients better access to information, or increase the wider public interest and involvement in science just a small amount, then this will be a far reader good than I could possibly make doing my own research.</p>
<p>Which is why, from July <a href="http://blogs.plos.org/plos/2012/03/cameron-neylon-to-join-plos-as-director-of-advocacy/">I will be moving to PLoS to take up the role of Advocacy Director</a>.</p>
<p>PLoS is an organization that right from the beginning has had a vision, not just of making research papers more accessible but of transforming research communication, of making it ready for, making it of the 21st century. This is a vision I share and one that I am very excited to playing a part in.</p>
<p>In the new role I will obviously be doing a lot of advocacy, planning, speaking, and writing on open access. There is a lot to play for over the next few years with FRPAA in the US, new policies being developed in Europe, and a growing awareness of the need to think hard about data as a form of publication. But I will also be taking the long view, looking out on a ten year horizon to try and identify the things we haven&#8217;t seen yet, the opportunities that are already there and how we can navigate a path between them. Again there is huge potential in this space, gradually turning from ideas and vaporware into real demos and even products.</p>
<p>The two issues, near term policy and longer term technical development are inextricably linked. The full potential of networked research cannot be realized except in a world of open content, open standards, APIs, process, and data. Interoperability is crucial, technical interoperability, standards interoperability, social interoperability, and legal interoperability. It is being at the heart of the community that is working to link these together and make them work that really excites me about this position.</p>
<p>PLoS has been an engine of innovation since it was formed, changing the landscape of scholarly publishing in a way that no-one would have dreamed was possible. Some have argued that this hasn&#8217;t been so much the case in the last few years. But really things have just been quiet, plans have been laid, and I think you will find the next few years exciting.</p>
<p>Inevitably, I will be leaving some things behind. I won&#8217;t be abandoning research completely, I hope to keep my toe in a range of projects but I will be scaling back a lot. I will be stepping down as an Academic Editor for PLoS ONE (and apologies for all those reviews and editorial requests for PLoS ONE that I&#8217;ve turned down in the last few months) because this would be a clear conflict of interest. I&#8217;ve got a lot to clear up before July.</p>
<p>I will be sad to leave behind some of those roles but above all I am excited and looking forward to working in a great organisation, with people I respect doing things I believe are important. Up until now I&#8217;ve been trying to fit these things in, more or less as a hobby around the research. Now I can focus on them full time, while still staying at least a bit connected. It&#8217;s a big leap for me, but a logical step along the way to trying to make a difference.</p>
<p>&nbsp;</p>
<div class="zemanta-pixie"><a class="zemanta-pixie-a" title="Enhanced by Zemanta" href="http://www.zemanta.com/"><img class="zemanta-pixie-img" src="http://img.zemanta.com/zemified_e.png?x-id=44474633-0a15-4175-aff7-1cbcdb1e28e6" alt="Enhanced by Zemanta" /></a></div>
 <p><a href="http://cameronneylon.net/?flattrss_redirect&amp;id=15925&amp;md5=32ed90549c6e46f7d55b11fe5a3772c9" title="Flattr" target="_blank"><img src="http://cameronneylon.net/wp-content/plugins/flattrss/img/flattr-badge-large.png" alt="flattr this!"/></a></p>]]></content:encoded>
			<wfw:commentRss>http://cameronneylon.net/blog/a-big-leap-and-a-logical-step-moving-to-plos/feed/</wfw:commentRss>
		<slash:comments>17</slash:comments>
		<atom:link rel="payment" href="https://flattr.com/submit/auto?user_id=27882&amp;popout=1&amp;url=http%3A%2F%2Fcameronneylon.net%2Fblog%2Fa-big-leap-and-a-logical-step-moving-to-plos%2F&amp;language=en_GB&amp;category=text&amp;title=A+big+leap+and+a+logical+step%3A+Moving+to+PLoS&amp;description=As+a+child+I+was+very+clear+I+wanted+to+be+a+scientist.+I+am+not+sure+exactly+where+the+idea+came+from.+In+part+I+blame+Isaac+Asimov+but...&amp;tags=job%2Copen+access%2Cplos%2Cblog" type="text/html" />
	</item>
		<item>
		<title>They. Just. Don&#8217;t. Get. It&#8230;</title>
		<link>http://cameronneylon.net/blog/they-just-dont-get-it/</link>
		<comments>http://cameronneylon.net/blog/they-just-dont-get-it/#comments</comments>
		<pubDate>Tue, 06 Mar 2012 15:39:10 +0000</pubDate>
		<dc:creator>Cameron Neylon</dc:creator>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[boai]]></category>
		<category><![CDATA[cc-by]]></category>
		<category><![CDATA[open access]]></category>
		<category><![CDATA[Scholarly communication]]></category>
		<category><![CDATA[scholarly-publishing]]></category>
		<category><![CDATA[text-mining]]></category>

		<guid isPermaLink="false">http://cameronneylon.net/?p=15771</guid>
		<description><![CDATA[

&#8230;although some are perhaps starting to see the problems that are going to arise.
Last week I spoke at a Question Time style event held at Oxford University and organised by Simon Benjamin and Victoria Watson called &#8220;The Scientific Evolution: Open Science and the Future of Publishing&#8221; featuring Tim Gowers (Cambridge), Victor Henning (Mendeley), Alison Mitchell (Nature Publishing Group), Alicia Wise (Elsevier), and Robert Winston (mainly in his role as TV talking head on science issues). You can get a feel for the proceedings from Lucy Pratt&#8217;s summary but I want to focus ...]]></description>
			<content:encoded><![CDATA[<div class="mceTemp">
<div class="wp-caption alignleft" style="width: 310px"><a href="http://commons.wikipedia.org/wiki/File:Trafficjamdelhi.jpg" target="_blank"><img class="zemanta-img-inserted zemanta-img-configured" title="English: Traffic Jam in Delhi Français : Un em..." src="http://cameronneylon.net/wp-content/uploads/2012/03/300px-Trafficjamdelhi2.jpg" alt="English: Traffic Jam in Delhi Français : Un em..." width="300" height="225" /></a><p class="wp-caption-text">Image via Wikipedia</p></div>
</div>
<p>&#8230;although some are perhaps starting to see the problems that are going to arise.</p>
<p>Last week I spoke at a Question Time style event held at Oxford University and organised by Simon Benjamin and Victoria Watson called &#8220;The Scientific Evolution: Open Science and the Future of Publishing&#8221; featuring <a class="zem_slink" title="Timothy Gowers" href="http://en.wikipedia.org/wiki/Timothy_Gowers" rel="wikipedia" target="_blank">Tim Gowers</a> (Cambridge), <a class="zem_slink" title="Victor Henning" href="http://twitter.com/mendeley_com" rel="twitter" target="_blank">Victor Henning</a> (Mendeley), Alison Mitchell (Nature Publishing Group), Alicia Wise (Elsevier), and <a class="zem_slink" title="Robert Winston" href="http://en.wikipedia.org/wiki/Robert_Winston" rel="wikipedia" target="_blank">Robert Winston</a> (mainly in his role as TV talking head on science issues). You can get a feel for the proceedings from Lucy Pratt&#8217;s <a href="http://f1000research.com/2012/03/02/open-science-and-the-future-of-publishing-a-round-up-of-this-weeks-debate/">summary</a> but I want to focus on one specific issue.</p>
<p>As is common for me recently I emphasised the fact that <a href="http://cameronneylon.net/blog/network-enabled-research/">networked research communication needs to be different</a> to what we are used to. I made a comparison to the fact that when the printing press was developed one of the first things that happened was that people created facsimiles of hand written manuscripts. It took hundreds of years for someone to come up with the idea of a newspaper and to some extent our current use of the network is exactly that &#8211; digital facsimiles of paper objects, not truly networked communication.</p>
<p>It&#8217;s difficult to predict exactly what form a real networked communication system will take, in much the same way that asking a 16th century printer how newspaper advertising would work would not provide a detailed and accurate answer, but there are some principles of successful network systems that we can see emerging. Effective network systems distribute control and avoid centralisation, they are loosely coupled, and distributed. Very different to the centralised systems for control of access and control we have today.</p>
<p>This is a difficult concept and one that scholarly publishers simply don&#8217;t get for the most part. This is not particularly suprising because truly disruptive innovation rarely comes from incumbent players. Large and entrenched organisations don&#8217;t generally enable the kind of thinking that is required to see the new possibilities. This is seen in publishers statements that they are providing &#8220;more access than ever before&#8221; via &#8220;more routes&#8221;, but all routes that are under tight centralised control, with control systems that don&#8217;t scale. By insisting on centralised control over access publishers are setting themselves up to fail.</p>
<p>Nowhere is this going to play out more starkly than in the area of text mining. Bob Campbell from Wiley-Blackwell walked into this &#8211; but few noticed it &#8211; with the now familiar claim that &#8220;text mining is not a problem because people can ask permission&#8221;. Centralised control, failure to appreciate scale, and failure to understand the <em>necessity </em>of distribution and distributed systems. I have with me a device capable of holding the text of perhaps 100,000 papers It also has the processor power to mine that text. It is my phone. In 2-3 years our phones, hell our watches, will have the capacity to not only hold the world&#8217;s literature but also to mine it, in context for what I want right now. Is Bob Campbell ready for every researcher, indeed every interested person in the world, to come into his office and discuss an agreement for text mining? Because the mining I want to do and the mining that <a class="zem_slink" title="Peter Murray-Rust" href="http://en.wikipedia.org/wiki/Peter_Murray-Rust" rel="wikipedia" target="_blank">Peter Murray-Rust</a> wants to do will be different, and what I will want to do tomorrow is different to what I want to do today. This kind of personalised mining is going to be the accepted norm of handling information online very soon and will be at the very centre of how we discover the information we need. Google will provide a high quality service for free, subscription based scholarly publishers will charge an arm and a leg for a deeply inferior one &#8211; because Google is built to exploit network scale.</p>
<p>The problem of scale has also just played out in fact. Heather Piwowar <a href="http://researchremix.wordpress.com/2012/03/05/talking-text-mining-with-elsevier/">writing yesterday</a> describes a call with <em>six </em>Elsevier staffers to discuss her project and needs for text mining. Heather of course now has to have this same conversation with Wiley, NPG, ACS, and all the other subscription based publishers, who will no doubt demand different conditions, creating a nightmare patchwork of different levels of access on different parts of the corpus. But the bit I want to draw out is at the bottom of the post where Heather describes the concerns of Alicia Wise:</p>
<blockquote><p>At the end of the call, I stated that I’d like to blog the call… it was quickly agreed that was fine. Alicia mentioned her only hesitation was that she might be overwhelmed by requests from others who also want text mining access. Reasonable.</p></blockquote>
<p>Except that it isn&#8217;t. It&#8217;s perfectly reasonable for every single person who wants to text mine to want a conversation about access. Elsevier, because they demand control, have set themselves up as the bottleneck. This is really the key point, because the subscription business model implies an imperative to extract income from all possible uses of the content it sets up a need for control of access for differential uses. This means in turn that each different use, and especially each new use, has to be individually negotiated, usually by humans, apparently about six of them. This <em>will</em> fail because it <em>cannot</em> scale in the same way that the demand <em>will</em>.</p>
<p>The technology exists today to make this kind of mass distributed text mining trivial. Publishers could push content to bit torrent servers and then publish regular deltas to notify users of new content. The infrastructure for this <em>already exists. </em>There is no infrastructure investment required. The problems that publishers raise of their servers not coping is one that they have created for themselves. The catch is that distributed systems can&#8217;t be controlled from the centre and giving up control requires a different business model. But this is also an opportunity. The publishers also save money  if they give up control &#8211; no more need for six people to sit in on each of hundreds of thousands of meetings. I often wonder how much lower subscriptions would be if they didn&#8217;t need to cover the cost of access control, sales, and legal teams.</p>
<p>We are increasingly going to see these kinds of failures. Legal and technical incompatibility of resources, contractual requirements at odds with local legal systems, and above all the claim &#8220;you can just ask for permission&#8221; without the backing of the hundreds or thousands of people that would be required to provide a timely answer. And that&#8217;s before we deal with the fact that the most common answer will be &#8220;mumble&#8221;. A centralised access control system is simply not fit for purpose in a networked world. As demand scales, people making legitimate requests for access will have the effect of a distributed denial of service attack. The clue is in the name; the demand is distributed. If the access control mechanisms are manual, human and centralised, they <em>will </em>fail. But if that&#8217;s what it takes to get subscription publishers to wake up to the fact that the networked world is different then so be it.</p>
<div class="zemanta-pixie"><a class="zemanta-pixie-a" title="Enhanced by Zemanta" href="http://www.zemanta.com/"><img class="zemanta-pixie-img" src="http://img.zemanta.com/zemified_e.png?x-id=61ce6e30-dfdb-4492-b484-631a2d11bed0" alt="Enhanced by Zemanta" /></a></div>
 <p><a href="http://cameronneylon.net/?flattrss_redirect&amp;id=15771&amp;md5=223b5ae21c88f4ea82d3e7824d84cf19" title="Flattr" target="_blank"><img src="http://cameronneylon.net/wp-content/plugins/flattrss/img/flattr-badge-large.png" alt="flattr this!"/></a></p>]]></content:encoded>
			<wfw:commentRss>http://cameronneylon.net/blog/they-just-dont-get-it/feed/</wfw:commentRss>
		<slash:comments>22</slash:comments>
		<atom:link rel="payment" href="https://flattr.com/submit/auto?user_id=27882&amp;popout=1&amp;url=http%3A%2F%2Fcameronneylon.net%2Fblog%2Fthey-just-dont-get-it%2F&amp;language=en_GB&amp;category=text&amp;title=They.+Just.+Don%26%238217%3Bt.+Get.+It%26%238230%3B&amp;description=%26%238230%3Balthough+some+are+perhaps+starting+to+see+the+problems+that+are+going+to+arise.+Last+week+I+spoke+at+a+Question+Time+style+event+held+at+Oxford+University+and+organised...&amp;tags=boai%2Ccc-by%2Copen+access%2CScholarly+communication%2Cscholarly-publishing%2Ctext-mining%2Cblog" type="text/html" />
	</item>
		<item>
		<title>Github for science? Shouldn’t we perhaps build TCP/IP first?</title>
		<link>http://cameronneylon.net/blog/github-for-science-shouldn%e2%80%99t-we-perhaps-build-tcpip-first/</link>
		<comments>http://cameronneylon.net/blog/github-for-science-shouldn%e2%80%99t-we-perhaps-build-tcpip-first/#comments</comments>
		<pubDate>Wed, 22 Feb 2012 09:59:23 +0000</pubDate>
		<dc:creator>Cameron Neylon</dc:creator>
				<category><![CDATA[Blog]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[Fb4Sci]]></category>
		<category><![CDATA[GitHub]]></category>
		<category><![CDATA[github for science]]></category>
		<category><![CDATA[John Wilbanks]]></category>
		<category><![CDATA[research object]]></category>
		<category><![CDATA[stack]]></category>
		<category><![CDATA[web]]></category>

		<guid isPermaLink="false">http://cameronneylon.net/?p=15644</guid>
		<description><![CDATA[Github for science sounds like a great plan? But do we have the underlying stack of equivalent services needed to provide "http for science" and "tcp/ip" for science. I argue that until we do we will struggle to really deliver on the excitement that examples (rightly) inspire.]]></description>
			<content:encoded><![CDATA[<div class="wp-caption alignleft" style="width: 310px"><a href="http://commons.wikipedia.org/wiki/File:TCP-IP.jpg"><img class="zemanta-img-inserted zemanta-img-configured" title="Mapa mental do TCP/IP" src="http://cameronneylon.net/wp-content/uploads/2012/02/300px-TCP-IP2.jpg" alt="Mapa mental do TCP/IP" width="300" height="186" /></a><p class="wp-caption-text">Image via Wikipedia</p></div>
<p>It’s one of those throw away lines, “Before we can talk about a github for science we really need to sort out a TCP/IP for science”, that’s geeky, sharp, a bit needly and goes down a treat <a href="https://twitter.com/#!/CameronNeylon/status/171688972710453248">on Twitter</a>. But there is a serious point behind it. And its not intended to be dismissive of the <a href="http://marciovm.com/i-want-a-github-of-science/index.html">ideas</a> that are <a href="http://www.trevorbedford.com/">swirling</a> around about scholarly communication at the moment either. So it seems worth exploring in a bit more detail.</p>
<p>The line is stolen almost wholesale from <a class="zem_slink" title="John Wilbanks" href="http://del-fi.org" rel="wikipedia">John Wilbanks</a> who used it (I think) in <a href="http://www.slideshare.net/wilbanks/seattle-sc-symposium-2010">the talk he gave</a> at a Science Commons meetup in Redmond a few years back. At the time I think we were awash in “Facebooks for Science” so that was the target but the sentiment holds. As once was the case with Facebook and now is for <a class="zem_slink" title="GitHub" href="http://github.com" rel="homepage">Github</a>, or <a class="zem_slink" title="Wikipedia" href="http://www.wikipedia.org" rel="homepage">Wikipedia</a>, or <a class="zem_slink" title="Stack Overflow" href="http://stackoverflow.com/" rel="homepage">StackOverflow</a>, the possibilities opened up by these new services and technologies to support a much more efficient and effective research process look amazing. And they are. But you’ve got to be a little careful about taking the analogy too far.</p>
<p>If you look at what these services provide, particularly those that are focused on coding, they deliver commentary and documentation, nearly always in the form of text about code – which is also basically text. The web is very good at transferring text, and code, and data. The stack that delivers this is built on a set of standards, with each layer building on the layer beneath it. StackOverflow and Github are built on a set of services, that in turn sit on top of the web standards of http, which in turn are built on network standards like TCP/IP that control the actual transfer of bits and bytes.</p>
<p>The fundamental <em>stuff</em> of these coding sites and Wikipedia is text, and text is really well supported by the stack of web technologies. Open Source approaches to software development didn’t just develop because of the web, they <em>developed</em> the web so its not surprising that they fit well together. They grew up together and nurtured each other. But the bottom line is that the stack is optimized to transfer the grains of material, text and code, that make up the core of these services.</p>
<p>When we look at research we can see that when we dig down to the granular level it isn’t just made up of text. Sure most research could be represented as text but we don’t have the standardized forms to do this. We don’t have standard <em>granules</em> of research that we can transfer from place to place. This is because its complicated to transfer the stuff of research. I picked on TCP/IP specifically because it is the <em>transfer </em>protocol that supports moving bits and bytes from one place to another. What we need are protocols that support moving the substance of a piece of my research from one place to another.</p>
<p>Work on <a href="http://wiki.myexperiment.org/index.php/Research_Objects">Research Objects</a> [see also <a href="http://eprints.ecs.soton.ac.uk/18555/">this paper</a>], intended to be self-contained but useable pieces of research is a step in this direction, as are the <a href="http://www.wf4ever-project.org/">developing set of workflow tools</a>, that will ultimately allow us to describe and share the process by which we’ve transformed at least some parts of the research process into others. <a href="http://www.mylabnotebook.ac.uk/index.shtml">Laboratory recording systems</a> will help us to capture and workflow-ify records of the physical parts of the research process. But until we can agree how to transfer these in a standardized fashion then I think it is premature to talk about Githubs for research.</p>
<p>Now there is a flip side to this, which is that where there are such services that do support the transfer of pieces of the research process we absolutely should be  experimenting with them. But in most cases the type-case itself will do the job. Github is great for sharing research code and some people are doing <a href="https://github.com/ehec-outbreak-crowdsourced/BGI-data-analysis/wiki">terrific things with data there as well</a>. But if it does the job for those kinds of things why do we need one for researchers? The scale that the consumer web brings, and the exposure to a much bigger community, is a powerful counter argument to building things ‘just for researchers’. To justify a service focused on a small community you need to have very strong engagement or very specific needs. By the time that a mainstream service has mindshare and researchers are using it, your chances of pulling them away to a new service just for them are very small.</p>
<p>So yes, we should be inspired by the possibilities that these new services open up, and we should absolutely build and experiment but while we are at it can we also focus on the lower levels of the stack?They aren’t as sexy and they probably won’t make anyone rich, but we’ve got to get serious about the underlying mechanisms that will transfer our research in comprehensible packages from one place to another.</p>
<p>We have to think carefully about capturing the context of research and presenting that to the next user. Github works in large part because the people using it know how to use code, can recognize specific languages, and know how to drive it. It’s actually pretty poor for the user who just wants to do something – we’ve had to build up another set of services at different levels, the <a class="zem_slink" title="Python Package Index" href="http://pypi.python.org/pypi" rel="homepage">Python Package Index</a>, tools for making and distributing executables, that help provide the context required for different types of user. This is going to be much, much harder, for all the different types of use we might want to put research to.</p>
<div class="mceTemp"></div>
<p>But if we can get this right – if we can standardize transfer protocols and build in the context of the research into those ‘packets’ that lets people use it then what we have seen on the wider web will happen naturally. As we build the stack up these services that seem so hard to build at the moment will become as easy today as throwing up a blog, downloading a rubygem, or firing up a machine instance. If we can achieve that then we’ll have much more than a github for research, we’ll have a whole web for research.</p>
<p><em>There’s nothing new here that wasn’t written some time ago by John Wilbanks and others but it seemed worth repeating. In particular I recommend these posts [<a href="http://scienceblogs.com/commonknowledge/2009/10/open_source_science_or_distrib.php">1</a>, <a href="http://scienceblogs.com/commonknowledge/2009/11/distributed_science_part_2.php">2</a>] from John.</em></p>
<div class="zemanta-pixie"><a class="zemanta-pixie-a" title="Enhanced by Zemanta" href="http://www.zemanta.com/"><img class="zemanta-pixie-img" src="http://img.zemanta.com/zemified_e.png?x-id=cc14b125-7430-411e-98e5-b69fe3456e9c" alt="Enhanced by Zemanta" /></a></div>
 <p><a href="http://cameronneylon.net/?flattrss_redirect&amp;id=15644&amp;md5=4f16ebeb07b43757c30a755882ed9195" title="Flattr" target="_blank"><img src="http://cameronneylon.net/wp-content/plugins/flattrss/img/flattr-badge-large.png" alt="flattr this!"/></a></p>]]></content:encoded>
			<wfw:commentRss>http://cameronneylon.net/blog/github-for-science-shouldn%e2%80%99t-we-perhaps-build-tcpip-first/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
		<atom:link rel="payment" href="https://flattr.com/submit/auto?user_id=27882&amp;popout=1&amp;url=http%3A%2F%2Fcameronneylon.net%2Fblog%2Fgithub-for-science-shouldn%25e2%2580%2599t-we-perhaps-build-tcpip-first%2F&amp;language=en_GB&amp;category=text&amp;title=Github+for+science%3F+Shouldn%E2%80%99t+we+perhaps+build+TCP%2FIP+first%3F&amp;description=It%E2%80%99s+one+of+those+throw+away+lines%2C+%E2%80%9CBefore+we+can+talk+about+a+github+for+science+we+really+need+to+sort+out+a+TCP%2FIP+for+science%E2%80%9D%2C+that%E2%80%99s+geeky%2C+sharp%2C+a...&amp;tags=Fb4Sci%2CGitHub%2Cgithub+for+science%2CJohn+Wilbanks%2Cresearch+object%2Cstack%2Cweb%2Cblog" type="text/html" />
	</item>
	</channel>
</rss>

<!-- Dynamic page generated in 0.724 seconds. -->
<!-- Cached page generated by WP-Super-Cache on 2012-05-09 21:26:36 -->

