NOTE: This has been running two weeks at the top of WUWT, discussion has slowed, so I’m placing it back in regular que. – Anthony
UPDATES:
Statistician William Briggs weighs in here
Eduardo Zorita weighs in here
Anonymous blogger “Deep Climate” weighs in with what he/she calls a “deeply flawed study” here
After a week of being “preoccupied” Real Climate finally breaks radio silence here. It appears to be a prelude to a dismissal with a “wave of the hand”
Supplementary Info now available: All data and code used in this paper are available at the Annals of Applied Statistics supplementary materials website:
http://www.imstat.org/aoas/supplements/default.htm
=========================================
Sticky Wicket – phrase, meaning: “A difficult situation”.
Oh, my. There is a new and important study on temperature proxy reconstructions (McShane and Wyner 2010) submitted into the Annals of Applied Statistics and is listed to be published in the next issue. According to Steve McIntyre, this is one of the “top statistical journals”. This paper is a direct and serious rebuttal to the proxy reconstructions of Mann. It seems watertight on the surface, because instead of trying to attack the proxy data quality issues, they assumed the proxy data was accurate for their purpose, then created a bayesian backcast method. Then, using the proxy data, they demonstrate it fails to reproduce the sharp 20th century uptick.
Now, there’s a new look to the familiar “hockey stick”.
Before:

After:

Not only are the results stunning, but the paper is highly readable, written in a sensible style that most laymen can absorb, even if they don’t understand some of the finer points of bayesian and loess filters, or principal components. Not only that, this paper is a confirmation of McIntyre and McKitrick’s work, with a strong nod to Wegman. I highly recommend reading this and distributing this story widely.
Here’s the submitted paper:
(PDF, 2.5 MB. Backup download available here: McShane and Wyner 2010 )
It states in its abstract:
We find that the proxies do not predict temperature significantly better than random series generated independently of temperature. Furthermore, various model specifications that perform similarly at predicting temperature produce extremely different historical backcasts. Finally, the proxies seem unable to forecast the high levels of and sharp run-up in temperature in the 1990s either in-sample or from contiguous holdout blocks, thus casting doubt on their ability to predict such phenomena if in fact they occurred several hundred years ago.
Here are some excerpts from the paper (emphasis in paragraphs mine):
This one shows that M&M hit the mark, because it is independent validation:
In other words, our model performs better when using highly autocorrelated
noise rather than proxies to ”predict” temperature. The real proxies are less predictive than our ”fake” data. While the Lasso generated reconstructions using the proxies are highly statistically significant compared to simple null models, they do not achieve statistical significance against sophisticated null models.
We are not the first to observe this effect. It was shown, in McIntyre
and McKitrick (2005a,c), that random sequences with complex local dependence
structures can predict temperatures. Their approach has been
roundly dismissed in the climate science literature:
To generate ”random” noise series, MM05c apply the full autoregressive structure of the real world proxy series. In this way, they in fact train their stochastic engine with significant (if not dominant) low frequency climate signal rather than purely non-climatic noise and its persistence. [Emphasis in original]
Ammann and Wahl (2007)
…
On the power of the proxy data to actually detect climate change:
This is disturbing: if a model cannot predict the occurrence of a sharp run-up in an out-of-sample block which is contiguous with the insample training set, then it seems highly unlikely that it has power to detect such levels or run-ups in the more distant past. It is even more discouraging when one recalls Figure 15: the model cannot capture the sharp run-up even in-sample. In sum, these results suggest that the ninety-three sequences that comprise the 1,000 year old proxy record simply lack power to detect a sharp increase in temperature. See Footnote 12
Footnote 12:
On the other hand, perhaps our model is unable to detect the high level of and sharp run-up in recent temperatures because anthropogenic factors have, for example, caused a regime change in the relation between temperatures and proxies. While this is certainly a consistent line of reasoning, it is also fraught with peril for, once one admits the possibility of regime changes in the instrumental period, it raises the question of whether such changes exist elsewhere over the past 1,000 years. Furthermore, it implies that up to half of the already short instrumental record is corrupted by anthropogenic factors, thus undermining paleoclimatology as a statistical enterprise.
…

We plot the in-sample portion of this backcast (1850-1998 AD) in Figure 15. Not surprisingly, the model tracks CRU reasonably well because it is in-sample. However, despite the fact that the backcast is both in-sample and initialized with the high true temperatures from 1999 AD and 2000 AD, it still cannot capture either the high level of or the sharp run-up in temperatures of the 1990s. It is substantially biased low. That the model cannot capture run-up even in-sample does not portend well for its ability
to capture similar levels and run-ups if they exist out-of-sample.
…
Conclusion.
Research on multi-proxy temperature reconstructions of the earth’s temperature is now entering its second decade. While the literature is large, there has been very little collaboration with universitylevel, professional statisticians (Wegman et al., 2006; Wegman, 2006). Our paper is an effort to apply some modern statistical methods to these problems. While our results agree with the climate scientists findings in some
respects, our methods of estimating model uncertainty and accuracy are in sharp disagreement.
On the one hand, we conclude unequivocally that the evidence for a ”long-handled” hockey stick (where the shaft of the hockey stick extends to the year 1000 AD) is lacking in the data. The fundamental problem is that there is a limited amount of proxy data which dates back to 1000 AD; what is available is weakly predictive of global annual temperature. Our backcasting methods, which track quite closely the methods applied most recently in Mann (2008) to the same data, are unable to catch the sharp run up in temperatures recorded in the 1990s, even in-sample.
As can be seen in Figure 15, our estimate of the run up in temperature in the 1990s has
a much smaller slope than the actual temperature series. Furthermore, the lower frame of Figure 18 clearly reveals that the proxy model is not at all able to track the high gradient segment. Consequently, the long flat handle of the hockey stick is best understood to be a feature of regression and less a reflection of our knowledge of the truth. Nevertheless, the temperatures of the last few decades have been relatively warm compared to many of the thousand year temperature curves sampled from the posterior distribution of our model.
Our main contribution is our efforts to seriously grapple with the uncertainty involved in paleoclimatological reconstructions. Regression of high dimensional time series is always a complex problem with many traps. In our case, the particular challenges include (i) a short sequence of training data, (ii) more predictors than observations, (iii) a very weak signal, and (iv) response and predictor variables which are both strongly autocorrelated.
The final point is particularly troublesome: since the data is not easily modeled by a simple autoregressive process it follows that the number of truly independent observations (i.e., the effective sample size) may be just too small for accurate reconstruction.
Climate scientists have greatly underestimated the uncertainty of proxy based reconstructions and hence have been overconfident in their models. We have shown that time dependence in the temperature series is sufficiently strong to permit complex sequences of random numbers to forecast out-of-sample reasonably well fairly frequently (see, for example, Figure 9). Furthermore, even proxy based models with approximately the same amount of reconstructive skill (Figures 11,12, and 13), produce strikingly dissimilar historical backcasts: some of these look like hockey sticks but most do not (Figure 14).
Natural climate variability is not well understood and is probably quite large. It is not clear that the proxies currently used to predict temperature are even predictive of it at the scale of several decades let alone over many centuries. Nonetheless, paleoclimatoligical reconstructions constitute only one source of evidence in the AGW debate. Our work stands entirely on the shoulders of those environmental scientists who labored untold years to assemble the vast network of natural proxies. Although we assume the reliability of their data for our purposes here, there still remains a considerable number of outstanding questions that can only be answered with a free and open inquiry and a great deal of replication.
===============================================================
Commenters on WUWT report that Tamino and Romm are deleting comments even mentioning this paper on their blog comment forum. Their refusal to even acknowledge it tells you it has squarely hit the target, and the fat lady has sung – loudly.
(h/t to WUWT reader “thechuckr”)

Pamela Gray-
There is a gigantic difference between taking a grad level stats class, and being a statistician. Mann proves that, for sure!
No, no joke. Even the authors state as much.
Lay the graphs over each other if you need “eye-ball confirmation”.
@ur momisugly Chris H,
I completely agree with you re: medical research. I’ve served as a reviewer once and it’s very time consuming to do the job well. You literally have to dissect each and every aspect of the study. I’ve never reviewed a paper by authors I had even heard of. Frequently in medical research (and drug studies in particular) the researchers employ the assistance of respected statisticians to crunch their numbers. You seldom encounter running gun battles over statistical methodology in medical research (usually it’s stuff like appropriate end points or sampling methodology).
In medical research theories and findings are challenged and restudied constantly. Something like the MBH98 paper would never survive for over a decade without being challenged. But then, we’re science, not religion.
Doug McGee says:
August 15, 2010 at 2:33 pm
Not sure why everyone is jumping up and down. The “hockey stick” is still within the uncertainties cited by these authors.
So is this.
http://tinyurl.com/28x3hl7
Lucy Skywalker says:
August 15, 2010 at 1:27 am
I cannot get the pdf page 21 to show up without disrupting Adobe. Unfortunately it’s the nice graphs page. Had to whisk past it. Anyone else had probs??
I have had no problems with the document but then i dont use Adobe, I use Foxit reader.
Try it, it’s a free download and it’s faster than Adobe Reader.
At such a pivotal moment in this clash of titans, one wants – as in any important historical event – to be able to say “I was there. I witnessed it.” Which is why I step briefly into the great light cast by the the ‘deniers’/ heroes and their supporters in order to say that I was here.
The debt owed to the stalwart seekers of truth is great.
To that end, Anthony’s contribution has been and remains immense.
(And Smokey. You’re smokin’! ;P
‘The people who understand and develop the data are the reliable sources,’
The only data you can develop are the ones that you yourself create as in computer modeling. Mann et al are using data that other people measure, store, and manage, so in essence Mann et al use other peoples’ measured data to create their own data.
‘Besides… Species are migrating north.’
Then come winter they’re migrating south again. And besides if the SH gets colder all them animals are all welcome to come stay in the NH, unless of course you’re an animal hater?
‘Glaciers and Arctic ice are melting at unheard of rates.’
Really how many of the handful of thousands of the 110 000 and then some glaciers are melting at an unheard of rate? How many are growing at an unheard of rate you think? And apparently arctic has stopped melting early this summer, and besides the damn ice grows still during winter time.
‘The ocean is becoming more acidic, and has experienced a 40% decline in fish biomass since 1950 due to CO2′s effect on phytoplankton.’
Right and you can prove that, and that overfishing didn’t’ve anything to do with it at all. In any case I hope, for you sake, you don’t eat fish then.
And do you know how limestone form and grow?
Pamela Gray says:
August 15, 2010 at 2:34 pm
============
Pam, and Chris
I didn’t say I was in your field of study and I didn’t say the way it works in mine is the same as anyone elses.
I said, it’s the way it works in my field, and I stand by it because it’s true….
Yea verily, in the beginning there was only noise. The Not So Great Statistician molded that noise into the image of a Mann and Mann said, “It is good! I will call it ‘hockey stick’.”
Then the Really Great Statistician molded that noise into a Womann and Womann said, “It is good. I will call it anything I wish; today a plow share; tomorrow a plump chicken or perhaps some new dancing shoes!”
Mann was not pleased. The Not So Great Statistician looked like a deer in the headlights.
CH
But it’s universally agreed that increases in manmade CO2 emissions significant enough to affect global temperature didn’t begin until 1950.
In order for the “hockey stick” to be alarming, the shaft must be flat and horizontal, making the current warming trend uniquely steep and the current temperature uniquely warm. As Geo says:
D. King,
Prove it?
I mean really, posts like that and “skeptics” wonder why they viewed with contempt and aren’t taken seriously?
John A says:
August 15, 2010 at 12:38 pm
“Prediction: Mann will claim this paper has already been debunked, […]”
LOL!
I eagerly await confirmation of your prediction and it shouldn’t take long ;o)
I emailed Kuccinelli this article. Maybe he’ll find something he can use – at some point being consistently incompetent in the direction of a single conclusion smacks of fraud.
Mike says:
August 15, 2010 at 3:03 pm
“According to Steve McIntyre, this is one of the “top statistical journals”. ”
“A very minor point here, the journal is new, only in its forth year. So, it is not likely to be a top journal yet.”
Yes, it is. It is a high impact journal, which by a recognized standard, is a “top” journal. And it’s “fourth” year. The number of years a journal has published is not a global criteria for determining status.
“It takes many years to establish a reputation.”
Actually, it has a very good reputation after only 4 years of pubs.
“The journal claims to have the 6th highest impact factor among stat journals, but it is not clear what that really means.”
Clear to those who know what it means and can check. But your previous claims would have more weight by saying that.
“The authors of the paper work in business schools (good ones) and have little background in science.”
Really? How much background, and what is “little” and what is “enough”? Does everyone that “works in business schools” lack science background?
Doug McGee says at 4:09 pm:
“D. King…
I mean really, posts like that and ‘skeptics’ wonder why they viewed with contempt and aren’t taken seriously?”
I LOL’d when I clicked on the link in D. King’s post @3:33 pm. I guess some folks lack a sense of humor.
Re: skeptics being viewed with ‘contempt,’ let me explain something to you.
Without skeptics there would be no Scientific Method. With no Scientific Method you would be going to your neighborhood witch doctor to cure cancer.
It is the job of skeptics to shoot holes in a hypothesis if they can. So far, skeptics have been doing an excellent job of deconstructing the CO2=CAGW hypothesis, to the point where it is now only a conjecture; an opinion.
A skeptic is the only honest kind of scientist. Why do you have contempt for honest scientists?
Mike Roddy says: August 14, 2010 at 7:13 pm
The authors of the 20- odd studies that confirmed Mann’s data are not really interested in what professional statisticians and mathematicians are saying about it.
_____________________________________
Sonicfrog says: August 14, 2010 at 7:32 pm
Yet they rely in stats and math to deduce the state of climate….. do you realize just what you’re saying?
_____________________________________
I know I just about bust a gut laughing at that statement. Mike Roddy just acknowledge that Mann and the other ” authors of the 20- odd studies” are not scientists but advocates!
Thanks Mike for openly acknowledging that Mann and Co. are psuedo-scientists intent on dishing out propaganda.
All those Ph.D.’ed statistician professors in “good” business schools would have a thing or two to say about whether or not they have a background in science.
Smokey,
I was laughing (because I believe he believes his statement is true), just not with him.
And there was a reason “skeptic” was in quotes. By the uncritical and automatic acceptance of this paper (just based on the title it seems) one can scroll through the comments and discern who the real skeptics are. [snip]
A skeptic critically examines all claims, not just the ones they find ideologically uncomfortable.
Ulf says at Aug. 15 10:47 AM:
“Generally speaking, this sort of advance does not necessarily cast previous work in disrepute, even though it may overturn their conclusions. Authors of previous work can, OTOH, cast themselves in disrepute by refusing to accept that their results were wrong, even if confronted with convincing evidence.”
MM, Wegman and Beenstock and Reingewertz have all demonstrated that Mann and his fellow climatological statisticians were wrong yet this same group has refused to accept that their results were wrong and they were certainly aware of the evidence.
James Allison says:
August 15, 2010 at 2:53 pm
Mikael Pihlström says:
August 15, 2010 at 11:33 am
But, the accusation that Mann and others neglected to do so, just to be
able to manipulate and distort results;I don’t believe it.”
Does the quote “We have to get rid of the Medieval Warm Period.” ring a bell?
Doug McGee says:
August 15, 2010 at 4:09 pm
D. King,
Prove it?
I mean really, posts like that and “skeptics” wonder why they viewed with contempt and aren’t taken seriously?
I’m glad you didn’t take it seriously!
latitude says:
August 15, 2010 at 7:11 am
…
If this paper proves to be true, then it can only mean one of two things:
1 Mann lied and cheated
2 Mann doesn’t know what he’s doing and is inept
Which of these, or both, were removed from consideration at his review at PSU?
duckster says:
August 14, 2010 at 7:48 pm
Looking at the paper above…
No medieval warming period, I see. And no temperature decline post-1998?? I thought you were arguing that the world was getting cooler, and arctic ice was recovering? [Cough, cough].
I guess we can put those ones to rest then, can’t we? After the way you’ve embraced this paper!……
____________________________________________
You totally misunderstand what this paper is all about. Anthony even TELLS you up front:
“…instead of trying to attack the proxy data quality issues, they assumed the proxy data was accurate for their purpose, then created a bayesian backcast method. Then, using the proxy data, they demonstrate it fails to reproduce the sharp 20th century uptick….”
The paper is not about the temperature readings at all. It is about the mathematics and statistics used in the proxy reconstructions to generate the “hockey stick graph” and the paper shows the proxy reconstructions are worthless. “We find that the proxies do not predict temperature significantly better than random series generated independently of temperature.”
@Glenn Skankey
I said: “The journal claims to have the 6th highest impact factor among stat journals, but it is not clear what that really means.”
Glenn said: “Clear to those who know what it means and can check. But your previous claims would have more weight by saying that.”
The impact factor is a measure of how often the articles in a journal are cited. But if a paper is cited a lot because others are criticising it, that counts the same as when it is cited by others praising it. That is one reason impact factors are not clear indicators of quality. See http://www.ams.org/notices/200603/comm-milman.pdf for another view of this. (A sub may be required.)
I will repeat that the M&W article seems interesting and should be judged on its merits.
AGW opponents may mourn the passage of the ‘Hockey Stick’ as this was one of the first and most obvious indications that the climate company store was selling a rotten bill of goods.
Perhaps we are seeing reports like this because many modern scientists have been required to take courses like ‘Ecology 101’ or ‘Environment 101’ which may be crossing the line between science and indoctrination in a modern form of nature and animal worship. It seems that a new cadre of educated professionals may have arrived who believe that there is an urgent need to halt the impact modern industry is having on this planet and the genesis of the ‘hockey stick’ seems to show that they are willing to use questionable scientific methods and perhaps outright falsehood in a desperate effort to prove their belief to be correct.