Home » Blog, Featured

P ≠ NP and the future of peer review

10 August 2010 2,451 views 75 Comments
Decomposition method (constraint satisfaction)
Image via Wikipedia

“We demonstrate the separation of the complexity class NP from its subclass P. Throughout our proof, we observe that the ability to compute a property on structures in polynomial time is intimately related to the statistical notions of conditional independence and sufficient statistics. The presence of conditional independencies manifests in the form of economical parametrizations of the joint distribution of covariates. In order to apply this analysis to the space of solutions of random constraint satisfaction problems, we utilize and expand upon ideas from several fields spanning logic, statistics, graphical models, random ensembles, and statistical physics.”

Vinay Deolalikar [pdf]

No. I have no idea either, and the rest of the document just gets more confusing for a non-mathematician. Nonetheless the online maths community has lit up with excitement as this document, claiming to prove one of the major outstanding theorems in maths circulated. And in the process we are seeing online collaborative post publication peer review take off.

It has become easy to say that review of research after it has been published doesn’t work. Many examples have failed, or been partially successful. Most journals with commenting systems still get relatively few comments on the average paper. Open peer review tests have generally been judged a failure. And so we stick with traditional pre-publication peer review despite the lack of any credible evidence that it does anything except cost around a few billion pounds a year.

Yesterday, Bill Hooker, not exactly a nay-sayer when it comes to using the social web to make research more effective wrote:

“…when you get into “likes” etc, to me that’s post-publication review — in other words, a filter. I love the idea, but a glance at PLoS journals (and other experiments) will show that it hasn’t taken off: people just don’t interact with the research literature (yet?) in a way that makes social filtering effective.”

But actually the picture isn’t so negative. We are starting to see examples of post-publication peer review and see it radically out-perform traditional pre-publication peer review. The rapid demolition [1, 2, 3] of the JACS hydride oxidation paper last year (not least pointing out that the result wasn’t even novel) demonstrated the chemical blogosphere was more effective than peer review of one of the premiere chemistry journals. More recently 23andMe issued a detailed, and at least from an outside perspective devastating, peer review (with an attempt at replication!) of a widely reported Science paper describing the identification of genes associated with longevity. This followed detailed critiques from a number of online writers.

These, though were of published papers, demonstrating that a post-publication approach can work, but not showing it working for an “informally published” piece of research such as a a blog post or other online posting. In the case of this new mathematical proof, the author Vinay Deolalikar, apparently took the standard approach that one does in maths, sent a pre-print to a number of experts in the field for comments and criticisms. The paper is not in the ArXiv and was in fact made public by one of the email correspondents. The rumours then spread like wildfire, with widespread media reporting, and widespread online commentary.

Some of that commentary was expert and well informed. Firstly a series of posts appeared stating that the proof is “credible”. That is, that it was worth deeper consideration and the time of experts to look for holes. There appears a widespread skepticism that the proof will be correct, including a $200,000 bet from Scott Aaronson, but also a widespread view that it nonetheless is useful, that it will progress the field in a helpful way even if it is wrong.

After this first round, there have been summaries of the proof, and now the identification of potential issues is occurring (see RJLipton for a great summary). As far as I can tell these issues are potentially extremely subtle and will require the attention of the best domain experts to resolve. In a couple of cases these experts have already potentially “patched” the problem, adding their own expertise to contribute to the proof. And in the last couple of hours as Michael Nielsen pointed out to me there is the beginning of a more organised collaboration to check through the paper.

This is collaborative, and positive peer review, and it is happening at web scale. I suspect that there are relatively few experts in the area who aren’t spending some of their time on this problem this week. In the market for expert attention this proof is buying big, as it should be. An important problem is getting a good going over and being tested, possibly to destruction, in a much more efficient manner than could possibly be done by traditional peer review.

There are a number of objections to seeing this as a generalizable to other research problems and fields. Firstly, maths has a strong pre-publication communication and review structure which has been strengthened over the years by the success of the ArXiv. Moreover there is a culture of much higher standards of peer review in maths, review which can take years to complete. Both of these encourage circulation of drafts to a wider community than in most other disciplines, priming the community for distributed review to take place.

The other argument is that only high profile work will get this attention, only high profile work will get reviewed, at this level, possibly at all. Actually I think this is a good thing. Most papers are never cited, so why should they suck up the resource required to review them? Of those that are or aren’t published whether they are useful to someone, somewhere, is not something that can be determined by one or two reviewers. Whether they are useful to you is something that only you can decide. The only person competent to review which papers you should look at in detail is you. Sorry.

Many of us have argued for some time that post-publication peer review with little or no pre-publication review is the way forward. Many have argued against this on practical grounds that we simply can’t get it to happen, there is no motivation for people to review work that has already been published. What I think this proof, and the other stories of online review tell us is that these forms of review will grow of their own accord, particularly around work that is high profile. My hope is that this will start to create an ecosystem where this type of commenting and review is seen as valuable. That would be a more positive route than the other alternative, which seems to be a wholesale breakdown of the current system as the workloads rise too high and the willingness of people to contribute drops.

The argument always brought forward for peer review is that it improves papers. What interests me about the online activity around Deolalikar’s paper is that there is a positive attitude. By finding the problems, the proof can be improved, and new insights found, even if the overall claim is wrong. If we bring a positive attitude to making peer review work more effectively and efficiently then perhaps we can find a good route to improving the system for everyone.

Enhanced by Zemanta

1 Star2 Stars3 Stars4 Stars5 Stars (2 votes, average: 4.50 out of 5)
Loading ... Loading ...

  • http://friendfeed.com/jcbradley Jean-Claude Bradley

    Cameron – you mention the NaH discussions. The actual outcome from the blogosphere was much more interesting (and useful) than true or false. Instead of linking to these discussions the journal just retracted it without explanation, which isn’t very useful for advancing knowledge.

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/cameronneylon Cameron Neylon

    JC, that’s a good point. Not only was it quicker and more comprehensive, but also much more useful and informative.

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/cameronneylon Cameron Neylon

    JC, that’s a good point. Not only was it quicker and more comprehensive, out was also much more useful and informative.

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/egonw Egon Willighagen

    JC++

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/joergkurtwegner joergkurtwegner

    OMG, *shivering*, theoretical computer science exam memories are coming back, can this be true? Honestly just reaching that proof level is already unbelievable, and if the gaps could be fixed, it would be just mindblowing. So, "Vinay Deolalikar you rock!", and @community "That might keep you, the smartest brains, busy for a while".

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/joergkurtwegner joergkurtwegner

    OMG, *shivering*, theoretical computer science exam memories are coming back, can this be true? Honestly just reaching that proof level is already unbelievable, and if the gaps could be fixed, it would be just mindblowing. So, "Vinay Deolalikar you rock!" (or you are , and @community "That might keep you, the smartest brains, busy for a while".

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/joergkurtwegner joergkurtwegner

    Besides, please help voting for non-deletion on WP! http://en.wikipedia.org/wiki/Talk:Vinay_Deolalikar

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/billhooker Bill Hooker

    I vote special case — it’s such an important problem that post-pub peer review works great because everyone is interested.

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/cameronneylon Cameron Neylon

    I certainly agree that it is a special case. I guess with some of these other examples I’m becoming hopeful that its not _so_ special that we won’t start to see more of this going on – indeed there has been extremely little commentary that I’ve seen that seems remotely surprised by the process that seems to be being followed by the community. It’s the fact that most seem to be assuming that this is a natural process that I find most interesting in some ways…

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/joergkurtwegner joergkurtwegner

    Does anyone have a PDF copy, proof was removed from his web site? http://www.hpl.hp.com/personal/Vinay_Deolalikar/Papers/pnp_updated.pdf

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/cpikas Christina Pikas

    I don’t think it’s one of a kind, but I think that it is extraordinary or out of the ordinary. Probably

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/bonnieswoger Bonnie Swoger

    If we agree that this is a special case, it seems to me we are seeing a lot more "special cases" than there used to be, making this kind of review a practical possibility, not just a theoretical idea.

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/axiomsofchoice Dan Hagon

    One issue I haven’t seen being given a lot of attention by bloggers is the issue of copyright. I’m assuming Vinay Deolalikar will have copyright over the TeX manuscript and PDF pre-prints, but maybe HP will have some sort of claim here? What happens if there’s a bidding war amongst journals? Have scribd and the other hosting sites been issued with take down notices? Could the issue of copyright affect these collaborative peer review efforts if they quote sizable chunks or even the whole thing? What about the ethics of "leaking" the pre-print? (See also: http://dabacon.org/pontiff/?p=4292)

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/billhooker Bill Hooker

    Further to the above, it’s also possible that a handful of super-special special cases will whet the public appetite for this kind of process, so that we start to see it carried out for more and more "routine" sorts of discoveries. It would be nice even to get to the point where "extraordinary claims require extraordinary evidence" (say, the top 10 or 20% of odd/suprising/controversial/etc papers) brings to mind not just stringent peer review or submission to a Prestige Journal, but subjection of the claims in question to the scrutiny of the "peer crowd"…

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/billhooker Bill Hooker

    Further to the above, it’s also possible that a handful of super-special special cases will whet the public appetite for this kind of process, so that we start to see it carried out for more and more "routine" sorts of discoveries. It would be nice even to get to the point where "extraordinary claims require extraordinary evidence" (say, the top 10 or 20% of odd/suprising/controversial/etc papers) brings to mind not just stringent peer review or submission to a Prestige Journal, but subjection of the claims in question to the scrutiny of the "peer crowd"… [edit: which was the whole point of Cameron’s OP. Mea culpa, I’ll try to remember to rtfp before I open my yap in future.)

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/mariadroujkova Maria Droujkova

    Upon reading, "Most journals with commenting systems still get relatively few comments on the average paper" my immediate comment was, "The average paper is rarely worth reading, let alone reviewing." If people can’t get at least several friendly colleagues to review their work, it tells us something about the scope and the quality, right there. You get to this point toward the end of the essay. Thank you for the story!

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/mersin33 ZN Moment

    @joergkurtwegner I added to Google Docs for reading. I am not sure this is legal, but you can download from http://docs.google.com/fileview?id=0B8GrXDgFZOiaY2VmOGQ0NjQtOWYzNS00MmVkLWI2NzgtZGIzMTVmYzg2MGI3&hl=en I will change the share setting to private soon.

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/claudiakoltzenburg Claudia Koltzenburg

    every movement needs a figurehead, so why not let’s take this example

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/claudiakoltzenburg Claudia Koltzenburg

    every movement needs a figurehead, so why not take this example to figure well in Open Science

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/brembs Björn Brembs

    @ Maria: "If people can’t get at least several friendly colleagues to review their work, it tells us something about the scope and the quality, right there." Not necessarily. Obviously, currently out of fashion research will attract less interest. But little interest doesn’t necessarily imply lack of scope or quality – it could simply be ahead of its time. So, unfortunately, this reasoning only works one way.

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/joergkurtwegner joergkurtwegner

    +1 Claudia – Agreed, everything is in there, science, controversy, copyright issues. I would prefer a better life science example, but some maths+informatics example will do as well. Though, lets be honest the IP situations and relevance are totally different in those fields. So, we should never assume "one movement", its not that simple.

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/claudiakoltzenburg Claudia Koltzenburg

    @joergkurtwegner good point, it might still work in general, though, if the most general points (relevant for interdisciplinary perspectives, I mean) could be picked up for every example, btw there’s quite a few useful bytes for this in http://michaelnielsen.org/blog/cameron-neylon-on-practical-steps-toward-open-science/

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/claudiakoltzenburg Claudia Koltzenburg

    @Björn "it could simply be ahead of its time" yepp, fair point!

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/claudiakoltzenburg Claudia Koltzenburg

    see an interesting discussion here http://ff.im/p0HGj

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/cameronneylon Cameron Neylon

    I’m not sure that it matters. One of the points of post-publication peer review is that it doesn’t matter when it happens. Something might be harder to discover when it becomes relevant five years down the track than it would otherwise be but once it is discovered and reviewed that will improve. Much better than it not being published at all!

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/claudiakoltzenburg Claudia Koltzenburg

    hm, certainly at the stage of encouraging someone to get it out there, the "ahead of tis time" factor is important if I seek critical feedback from my most immediate peers that I trust in a first tentative round, don’t you think? Once it IS up there: I agree with your point @Cameron

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/claudiakoltzenburg Claudia Koltzenburg

    hm, certainly at the stage of encouraging someone to get it out there, the "ahead of tis time" factor is important if I seek critical feedback from my most immediate peers that I trust in a first tentative round, don’t you think? Once it IS up there: agree with your point Cameron

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/claudiakoltzenburg Claudia Koltzenburg

    hm, certainly at the stage of encouraging someone to get it out there, the "ahead of its time" factor is important, maybe crucial – imagine I wish to seek critical feedback from my most immediate peers who I trust for a first tentative round, what if I do not have them because of the "ahead of its time" factor? — Once it IS up there: yes, I agree with your point @Cameron

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/claudiakoltzenburg Claudia Koltzenburg

    @the-"ahead of its time"-factor: maybe like the inventors of double-blind peer review we might have to discuss anonymous authorship for Open Science post-publication review scenarios – anyone seen ideas & discussions on this one?

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/claudiakoltzenburg Claudia Koltzenburg

    like the inventors of double-blind peer review we might have to discuss anonymous authorship for Open Science post-publication review scenarios – anyone seen ideas discussions onthis one?

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/jessykate jessykate

    wondering if you guys have seen this article about the european research effort "liquidpubication"? features the sentiments, "don’t print it, post it" and "don’t review it, use it". the latter especially is a really interesting point, although perhaps easier in computer science than, say, high energy physics research– i’m not going to just go and build my own LHC… but, "using" it could also mean referencing it in another paper that builds on those results. of course, you don’t want to build on them if they are wrong…

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/jessykate jessykate

    perhaps a form of "using it" could also be in showing it’s wrong. that’s interesting because it gets at the question of incentive, and also of quality. if i review a paper and reject it (especially in an anonymous peer review system), what professional credit (incentive) do i get? especially in an open peer review system where i probably don’t even get my name on a list of editors. but if i write a (micro?)paper showing it is right or wrong, i keep the emphasis on implementation (which is arguably the point ultimately), i create link back to the previous paper’s efforts (right or wrong), which creates a record for science, an acknowledgement (in the peer review "graph") of their work, and some incentive/recognition for me, too.

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/claudiakoltzenburg Claudia Koltzenburg

    @jessykate, re @liquidpub see http://ff.im/oW73i

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/claudiakoltzenburg Claudia Koltzenburg

    @jessykate, re @liguidpub see http://ff.im/oW73i

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/claudiakoltzenburg Claudia Koltzenburg

    very interesting point about ”using it" @jessykate – let’s look at what disciplines may share or maybe not: what is a use case in a certain field, and what is the "it", respectively?

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/claudiakoltzenburg Claudia Koltzenburg

    very interesting point about ”using it" @jessykate – let’s look at what disciplines may share or maybe not: what is a use case in a given field, and what is the "it", respectively?

    This comment was originally posted on FriendFeed

  • http://friendfeed.com/cameronneylon Cameron Neylon

    Claudia – agree this will be a difficulty, but if someone really can’t get feedback from _anyone_ on work then I think they’ve got bigger problems than being ahead of their time. No-one works alone any more – if nothing else you should be able to get local colleagues to take a look. @jessykate My view is that using it is inherently a form of review, as is citation, or any other reference, if we could capture them. The LiquidPub people are certainly pushing this angle and I agree with them philosophically. I’m less sure about their technical approach but I have to admit I haven’t looked in detail for a while at what they are doing.

    This comment was originally posted on FriendFeed

  • Anonymous

    I think that one reason the commenting and “likes” etc post-review process hasn’t taken off is that the infrastructure for making those comments is poor. As a researcher reviewing online, you would want a system that gives you credit for the work you are doing, also somewhere you can go to see all the comments and reviews you have made on different articles across different journals and blogs. You would want to be in control of those reviews. I have made a number of comments both on PLoS and Biomed Central but there is no way of me accessing, reviewing or deleting those comments I made and have never received any comments back from authors. If a system like DISQUS was set-up across different journals/publishers, you would have a way of being in control of the work you have done and be able to build a reputation in the research community by making reviews/comments and getting liked or disliked for it.

  • http://friendfeed.com/claudiakoltzenburg Claudia Koltzenburg

    Cameron, partly agree, which disciplines do you have in mind?

    This comment was originally posted on FriendFeed