New paper makes a hockey sticky wicket of Mann et al 98/99/08

NOTE: This has been running two weeks at the top of WUWT, discussion has slowed, so I’m placing it back in regular que.  – Anthony

UPDATES:

Statistician William Briggs weighs in here

Eduardo Zorita weighs in here

Anonymous blogger “Deep Climate” weighs in with what he/she calls a “deeply flawed study” here

After a week of being “preoccupied” Real Climate finally breaks radio silence here. It appears to be a prelude to a dismissal with a “wave of the hand”

Supplementary Info now available: All data and code used in this paper are available at the Annals of Applied Statistics supplementary materials website:

http://www.imstat.org/aoas/supplements/default.htm

=========================================

Sticky Wicket – phrase, meaning: “A difficult situation”.

Oh, my. There is a new and important study on temperature proxy reconstructions (McShane and Wyner 2010) submitted into the Annals of Applied Statistics and is listed to be published in the next issue. According to Steve McIntyre, this is one of the “top statistical journals”. This paper is a direct and serious rebuttal to the proxy reconstructions of Mann. It seems watertight on the surface, because instead of trying to attack the proxy data quality issues, they assumed the proxy data was accurate for their purpose, then created a bayesian backcast method. Then, using the proxy data, they demonstrate it fails to reproduce the sharp 20th century uptick.

Now, there’s a new look to the familiar “hockey stick”.

Before:

Multiproxy reconstruction of Northern Hemisphere surface temperature variations over the past millennium (blue), along with 50-year average (black), a measure of the statistical uncertainty associated with the reconstruction (gray), and instrumental surface temperature data for the last 150 years (red), based on the work by Mann et al. (1999). This figure has sometimes been referred to as the hockey stick. Source: IPCC (2001).

After:

FIG 16. Backcast from Bayesian Model of Section 5. CRU Northern Hemisphere annual mean land temperature is given by the thin black line and a smoothed version is given by the thick black line. The forecast is given by the thin red line and a smoothed version is given by the thick red line. The model is fit on 1850-1998 AD and backcasts 998-1849 AD. The cyan region indicates uncertainty due to t, the green region indicates uncertainty due to β, and the gray region indicates total uncertainty.

Not only are the results stunning, but the paper is highly readable, written in a sensible style that most laymen can absorb, even if they don’t understand some of the finer points of bayesian and loess filters, or principal components. Not only that, this paper is a confirmation of McIntyre and McKitrick’s work, with a strong nod to Wegman. I highly recommend reading this and distributing this story widely.

Here’s the submitted paper:

A Statistical Analysis of Multiple Temperature Proxies: Are Reconstructions of Surface Temperatures Over the Last 1000 Years Reliable?

(PDF, 2.5 MB. Backup download available here: McShane and Wyner 2010 )

It states in its abstract:

We find that the proxies do not predict temperature significantly better than random series generated independently of temperature. Furthermore, various model specifications that perform similarly at predicting temperature produce extremely different historical backcasts. Finally, the proxies seem unable to forecast the high levels of and sharp run-up in temperature in the 1990s either in-sample or from contiguous holdout blocks, thus casting doubt on their ability to predict such phenomena if in fact they occurred several hundred years ago.

Here are some excerpts from the paper (emphasis in paragraphs mine):

This one shows that M&M hit the mark, because it is independent validation:

In other words, our model performs better when using highly autocorrelated

noise rather than proxies to ”predict” temperature. The real proxies are less predictive than our ”fake” data. While the Lasso generated reconstructions using the proxies are highly statistically significant compared to simple null models, they do not achieve statistical significance against sophisticated null models.

We are not the first to observe this effect. It was shown, in McIntyre

and McKitrick (2005a,c), that random sequences with complex local dependence

structures can predict temperatures. Their approach has been

roundly dismissed in the climate science literature:

To generate ”random” noise series, MM05c apply the full autoregressive structure of the real world proxy series. In this way, they in fact train their stochastic engine with significant (if not dominant) low frequency climate signal rather than purely non-climatic noise and its persistence. [Emphasis in original]

Ammann and Wahl (2007)

On the power of the proxy data to actually detect climate change:

This is disturbing: if a model cannot predict the occurrence of a sharp run-up in an out-of-sample block which is contiguous with the insample training set, then it seems highly unlikely that it has power to detect such levels or run-ups in the more distant past. It is even more discouraging when one recalls Figure 15: the model cannot capture the sharp run-up even in-sample. In sum, these results suggest that the ninety-three sequences that comprise the 1,000 year old proxy record simply lack power to detect a sharp increase in temperature. See Footnote 12

Footnote 12:

On the other hand, perhaps our model is unable to detect the high level of and sharp run-up in recent temperatures because anthropogenic factors have, for example, caused a regime change in the relation between temperatures and proxies. While this is certainly a consistent line of reasoning, it is also fraught with peril for, once one admits the possibility of regime changes in the instrumental period, it raises the question of whether such changes exist elsewhere over the past 1,000 years. Furthermore, it implies that up to half of the already short instrumental record is corrupted by anthropogenic factors, thus undermining paleoclimatology as a statistical enterprise.

FIG 15. In-sample Backcast from Bayesian Model of Section 5. CRU Northern Hemisphere annual mean land temperature is given by the thin black line and a smoothed version is given by the thick black line. The forecast is given by the thin red line and a smoothed version is given by the thick red line. The model is fit on 1850-1998 AD.

We plot the in-sample portion of this backcast (1850-1998 AD) in Figure 15. Not surprisingly, the model tracks CRU reasonably well because it is in-sample. However, despite the fact that the backcast is both in-sample and initialized with the high true temperatures from 1999 AD and 2000 AD, it still cannot capture either the high level of or the sharp run-up in temperatures of the 1990s. It is substantially biased low. That the model cannot capture run-up even in-sample does not portend well for its ability

to capture similar levels and run-ups if they exist out-of-sample.

Conclusion.

Research on multi-proxy temperature reconstructions of the earth’s temperature is now entering its second decade. While the literature is large, there has been very little collaboration with universitylevel, professional statisticians (Wegman et al., 2006; Wegman, 2006). Our paper is an effort to apply some modern statistical methods to these problems. While our results agree with the climate scientists findings in some

respects, our methods of estimating model uncertainty and accuracy are in sharp disagreement.

On the one hand, we conclude unequivocally that the evidence for a ”long-handled” hockey stick (where the shaft of the hockey stick extends to the year 1000 AD) is lacking in the data. The fundamental problem is that there is a limited amount of proxy data which dates back to 1000 AD; what is available is weakly predictive of global annual temperature. Our backcasting methods, which track quite closely the methods applied most recently in Mann (2008) to the same data, are unable to catch the sharp run up in temperatures recorded in the 1990s, even in-sample.

As can be seen in Figure 15, our estimate of the run up in temperature in the 1990s has

a much smaller slope than the actual temperature series. Furthermore, the lower frame of Figure 18 clearly reveals that the proxy model is not at all able to track the high gradient segment. Consequently, the long flat handle of the hockey stick is best understood to be a feature of regression and less a reflection of our knowledge of the truth. Nevertheless, the temperatures of the last few decades have been relatively warm compared to many of the thousand year temperature curves sampled from the posterior distribution of our model.

Our main contribution is our efforts to seriously grapple with the uncertainty involved in paleoclimatological reconstructions. Regression of high dimensional time series is always a complex problem with many traps. In our case, the particular challenges include (i) a short sequence of training data, (ii) more predictors than observations, (iii) a very weak signal, and (iv) response and predictor variables which are both strongly autocorrelated.

The final point is particularly troublesome: since the data is not easily modeled by a simple autoregressive process it follows that the number of truly independent observations (i.e., the effective sample size) may be just too small for accurate reconstruction.

Climate scientists have greatly underestimated the uncertainty of proxy based reconstructions and hence have been overconfident in their models. We have shown that time dependence in the temperature series is sufficiently strong to permit complex sequences of random numbers to forecast out-of-sample reasonably well fairly frequently (see, for example, Figure 9). Furthermore, even proxy based models with approximately the same amount of reconstructive skill (Figures 11,12, and 13), produce strikingly dissimilar historical backcasts: some of these look like hockey sticks but most do not (Figure 14).

Natural climate variability is not well understood and is probably quite large. It is not clear that the proxies currently used to predict temperature are even predictive of it at the scale of several decades let alone over many centuries. Nonetheless, paleoclimatoligical reconstructions constitute only one source of evidence in the AGW debate. Our work stands entirely on the shoulders of those environmental scientists who labored untold years to assemble the vast network of natural proxies. Although we assume the reliability of their data for our purposes here, there still remains a considerable number of outstanding questions that can only be answered with a free and open inquiry and a great deal of replication.

===============================================================

Commenters on WUWT report that Tamino and Romm are deleting comments even mentioning this paper on their blog comment forum. Their refusal to even acknowledge it tells you it has squarely hit the target, and the fat lady has sung – loudly.

(h/t to WUWT reader “thechuckr”)

Share

0 0 votes
Article Rating

Discover more from Watts Up With That?

Subscribe to get the latest posts sent to your email.

1.2K Comments
Inline Feedbacks
View all comments
Mike G
August 14, 2010 8:32 pm

@duckster
CAGW has been falsified to my satisfaction.

August 14, 2010 8:33 pm

Rocky Road,
Here is the Phil Jones chart. It shows recurring natural cycles. Only a scientific charlatan would point to the last ramp and say, “Look! AGW!”
Duckster says:
“Asking me to accept a medieval warming period (which is what I have been asked to do here) means showing how and where it got warmer, and then how and when it got cooler. A steady downward temperature trend is not a warming period.”
No one knows exactly how the planet gets warmer and colder. There are hypotheses, and conjectures like the CO2=AGW assumption. But it is not necessary to know the mechanics in detail to observe the MWP. Science doesn’t work like that.
Here are eighteen proxies showing the MWP. The warming peak around 1000 A.D. is the same as the McShane and Wyner paper shows, and the same as the GISP2 ice cores show. The ice cores are empirical observations that trump all MWP speculation.
You really need to get up to speed. I recommend doing a search of the WUWT archives for “MWP.” There is plenty there for you to learn.

Mike G
August 14, 2010 8:35 pm

@duckster
Open your eyes. Half the graph is hotter than today. Not that it means anything because they took mann’s garbage at face value, for the sake of the exercise. But, garbage they found it to be.

Robinson
August 14, 2010 8:36 pm

So where exactly would you place a medieval warming period here? Asking me to accept a medieval warming period (which is what I have been asked to do here) means showing how and where it got warmer, and then how and when it got cooler. A steady downward temperature trend is not a warming period.

I interpret the paper as saying given the data we have, what’s the best we can make of it?, rather than the proxies are all strong indicators of the temperature record, let’s process them correctly. I’m not sure but I think you may be missing this subtle but important distinction as you scan their graph for a MWP.

Mike G
August 14, 2010 8:38 pm

Mike Jowsey says:
August 14, 2010 at 8:32 pm
duckster says:
August 14, 2010 at 7:48 pm
Looking at the paper above…
The main point of this paper is to debunk the maths Mann used. You can get similar hockey sticks by using random numbers. Speak to that subject please.
By shifting focus to whether or not the graph shows a MWP is a strawman and is completely irrelevant to the point of the paper. Besides, the graph (fig.16) uses the same proxy data Mann used, with correct maths. Mann’s proxy data (and maths) explicitly set out to remove the MWP so it is no surprise that his biased proxy selections camouflage the MWP. Nevertheless, fig.16 does show temperatures 1000 years ago were on a par with today’s (according to Mann’s proxies).
————
Is it possible to get what mann got, instead of this, if you’re trying to do it right? I don’t think so.

Zeke the Sneak
August 14, 2010 8:40 pm

“Similarly, climate scientists are getting bored with arguments from…individuals that the “trace gas” CO2 does not play the major role in the recent and rapid temperature increases. “
Is that right? Is that so.
They no longer amuse us, either.

Dave F
August 14, 2010 8:41 pm

ZT @August 14, 2010 at 8:16 pm says:
The paper has some witty one liners too, such as:
“We assume that the data selection, collection, and processing performed by climate scientists meets the standards of their discipline.”

I caught that, too, but I took it as an attempt to stay away from that issue, with the problems that have been raised here and other places, but at the same time note that there may be issues there that are not addressed in the paper.

Michael Jankowski
August 14, 2010 8:42 pm

—Nick Stokes said: “The fundamental problem is that there is a limited amount of proxy data which dates back to 1000 AD; what is available is weakly predictive of global annual temperature. “
But they give a backcast anyway?—
You clearly missed one of the major points of the paper. I’m guessing you probably missed all of its points.
In any case, Mann “gave a backcast anyway,” too…as did the other temperature reconstructions going back to 1000 AD. Why don’t/didn’t you have a problem with those?

RockyRoad
August 14, 2010 8:44 pm

Duckster, the LIA was between 1560 and 1850 (note in the graph that the temperature line is lowest for this interval) while the MWP was before that. We’re only now back up to what is considered the MWP (AD 800 to 1300); note that the graph only goes back to about AD 950, so you don’t see the beginning up-tick in temperatures leading to the MWP.
But the critical aspect here is the vast difference between the shape of the temperature curve from this mathematical analysis and the shape of Mann’s Hokey Stick (my personal vernacular). James Sexton above makes the point clearly—Mann SCREWED UP! So when correct mathematics is applied to Mann’s data, the LIA and MWP are clearly seen and there is no horrific, unprecedented upswing in the temperature graph at the end. Mann’s pseudo-statistics is blatently obvious.

geo
August 14, 2010 8:46 pm

I’m tempted to head for the basement, take out my trumpet (unplayed for many years), turn it upside down, and play Taps on it –except that Taps is meant to be a sign of respect (back in the day, I played it at many veterans funerals), and so it would not be appropriate here.

Michael Jankowski
August 14, 2010 8:47 pm

Robinson,
While the paper does include a discusson of “MBH98” and the original “hockey stick” controversy, it does also cover Mann et all 2008 – which is quite recent and relevant – along with any number of other proxy reconstructions.

Michael Jankowski
August 14, 2010 8:47 pm

I wonder how many emails went back-and-forth between team members today?

Evan Jones
Editor
August 14, 2010 8:48 pm

Mike:
It does not change the fact that CO2 warms the earth and we need to be thinking about what to do about our CO2 emissions.
I think we need to fine down the forcing and make a determination as to feedbacks. then we will know what actions we have to or do not have to take.
I don’t see how it adds up. Even if the adjusted temperatures are correct (which I doubt), and all the warming is from CO2 increase (also unlikely), the 0.7 degree 20th Century warming form a 40% increase of CO2 does not compute.
Doubling of CO2 is supposed to hike temps by 1.2C and positive feedbacks are supposed to almost triple that. So temperature increase should be around +2.0C, not +0.7C
If it demonstrably does not add up over the past century, why would it add up over the next century?

chris y
August 14, 2010 8:50 pm

Mike Roddy-
You mean untrained individuals like Andy Revkin- “So climate super-extremes are inevitable, the number of people is doubling, and greenhouse-driven change, given the uncertainties, is — at best — a tertiary wild card.”
Meanwhile, Prof. Richard Alley sounds like an undergraduate polysci major when he makes asinine comments like this to congress- “What is going on in the Arctic now is the biggest and fastest thing that nature has ever done.”
Everyone now recognizes your CAGW fingerprint list for what it is- Gaian phrenology masquerading as science.

duckster
August 14, 2010 8:55 pm

@Jowser By shifting focus to whether or not the graph shows a MWP is a strawman and is completely irrelevant to the point of the paper.
No. I am saying that by accepting this paper you need to either show why it doesn’t show a MWP or you need to discard one of your major arguments. You can’t just choose any paper that casts doubt on CAGW because it casts doubt on global warming. You need to show that it is consistent with the other arguments you have made that cast doubt on CAGW.
Now a quick look back through the archives here shows a graph of when the MWP occurred. This doesn’t look anything like the graph above:
MWP compared to Mann
If you are going to accept the article above, then the graph I have linked to is wrong. Isn’t it? Be consistent!!

Beth Cooper
August 14, 2010 8:55 pm

McShane and Wynes’s published rebutal of Mann’s Hockey Stick in the Annals of Applied Statistics is welcome support for Steve McIntyre’s findings. They have taken what is called in military map making, a ‘cross bearing’ of the terrain.

John Blake
August 14, 2010 8:57 pm

Andrew30,
True, alas too true… but Prof. Orozngo’s 1880 – 2100 chart (termed Figure 3) as depicted by WUWT last April has a curiously aesthetic look of finality about it. Speaking of the estimable Mike Roddy, as Charlie Brown said: “How can we lose when we’re so sincere?” (To have suffered a thunderbolt from AW himself is a great coup.)

August 14, 2010 8:58 pm

“Furthermore, it is hard to argue that a procedure is truly skillful if it cannot consistently outperform noise–no matter how artfully structured.”—another heh.
For duckster, We plot these backcasts in Figure 14 in grey and show the CRU average in black. As can be seen, while these models all perform similarly in terms of cross-validation, they have wildly different implications about climate history. and
According to some of them (e.g., the ten proxy principal component model given in green or the two stage model featuring one local temperature principal component and ten proxy principal components featured in blue), the recent run-up in temperatures is not that abnormal, and similarly high temperatures would have been seen over the last millennium. Interestingly, the blue backcast seems to feature both a Medieval Warm Period and a Little Ice Age whereas the green one shows only increasing temperatures
going back in time.
——-back to reading.

GrantB
August 14, 2010 8:58 pm

In Conclusions p 41 – “Consequently, the long flat handle
of the hockey stick is best understood to be a feature of regression and less a reflection of our knowledge of the truth”

August 14, 2010 9:04 pm

Michael Jankowski says: August 14, 2010 at 8:42 pm
Actually there’s quite a lot in this paper I agree with, including the suggestion that uncertainty levels may be higher than often thought. I suspect, though, that people here will get more excited over the shape of the reconstruction than over the observation of its uncertainty.
I particularly liked this observation:
This effort to reconstruct our planet’s climate history has become linked to the topic of Anthropogenic Global Warming (AGW). On the one hand, this is peculiar since paleoclimatological reconstructions can provide evidence only for the detection of AGW and even then they constitute only one such source of evidence. The principal sources of evidence for the detection of global warming and in particular the attribution of it to anthropogenic factors come from basic science as well as General Circulation Models (GCMs) that have been fit to data accumulated during the instrumental period (IPCC, 2007). These models show that carbon dioxide, when released into the atmosphere in sufficient concentration, can force temperature increases.

August 14, 2010 9:05 pm

Duckster, you don’t seem to get the fact that this new paper is not using all available evidence. It is using ONLY what Michael Mann cherry-picked [read The Hockey Stick Illusion by A.W. Montford to clearly see Mann’s cherry-picking shenanigans].
Do you see? It’s statistics, using only Mann’s selected data. Is this starting to sink in?

Lew Skannen
August 14, 2010 9:08 pm

Nice article.
“instead of trying to attack the proxy data quality issues, they assumed the proxy data was accurate for their purpose, then created a bayesian backcast method”
What I especially like about this particular article is that it now moves the battlefield to where we want it. Rather than squabble over how thick the ice is this year or how hot last April was in South Tuvalu we need to get to the heart of the matter and ask – How accurate are the models?
It is quite clear that any half believable model will have to include thresholds, tipping points, runaway processes etc. ie they will be highly non-linear and most likely chaotic.
I want to see more work in this area – exposing the models for the hamfisted guesses that they are.

Dave F
August 14, 2010 9:10 pm

…and hence have been overconfident in their models…
Are not these models the basis of much of the work done in attributing climate change? Boy, if they go down, the flag is soon to follow…

duckster
August 14, 2010 9:20 pm

@Smokey
Duckster, you don’t seem to get the fact that this new paper is not using all available evidence
So is this how you get around the fact that McShane and Wyner is showing almost 2 degrees of warming since 1850? This is way beyond what Mann et al show – and would be truly unprecedented, wouldn’t it?
Do you see? It’s statistics, using only Mann’s selected data.
OK. So your job now would be to show consistency by fitting it into the available evidence so that it doesn’t contradict the other points you have made against CAGW. There is no point at all in destroying Mann if you have to throw out half of the all the other things that have been said on this blog in order to do so.
REPLY: Sorry “duckster” but you are wrong, there is not “almost 2 degrees since 1850” – about 0.7C maybe 1C if I were to be generous. Have a look at the intersecting blue lines to the red mean line for 1850 and the most recent data point:
Annotated by Anthony - Fig 16
Also, Figure 15 from the paper shows essentially the same:
Figure 15
You really can’t argue on the basis of noise, or annual values. The mean line is the message. – Anthony

August 14, 2010 9:21 pm

Mike Roddy says:
August 14, 2010 at 7:13 pm

Here’s the definitive article on questions about the Mann Hockey Stick:
http://www.realclimate.org/index.php/archives/2009/09/hey-ya-mal/

But no actual discussion is allowed….

The authors of the 20- odd studies that confirmed Mann’s data are not really interested in what professional statisticians and mathematicians are saying about it. The people who understand and develop the data are the reliable sources, including actual climate scientists who produce their own outlier charts of the upward march of temperatures (are there any?)

Head In Sand much?
“The people who understand and develop the data are the reliable sources” but very poor statisticians, as has been demonstrated widely now. They somehow refuse to see this, however.

Besides… Species are migrating north. Glaciers and Arctic ice are melting at unheard of rates. The ocean is becoming more acidic, and has experienced a 40% decline in fish biomass since 1950 due to CO2′s effect on phytoplankton.

Species move all the time. There is no proof whatsoever that any migration is due to any warming (real or not) as opposed to natural variation and other changes we make to the environment that have nothing to do with CO2.

Similarly, climate scientists are getting bored with arguments from untrained individuals that the “trace gas” CO2 does not play the major role in the recent and rapid temperature increases. This role was proven in a laboratory in the 19th century by Arrhenius, and has not been seriously disputed since.

And we are getting bored with untrained statisticians wilfully ignoring fatal statistical flaws in their work so that they can continue a political agenda.
Arrhenius proved that CO2 is a greenhouse gas. He also disputed his own findings in the 20th century and found its influence to me much lower that he first thought (in the 19th century), but most alarmists fail to mention that minor point. That proof, however, does not demonstrate any catastrophic effect on climate. He thought it would be good in fact! The required forcing and positive feedbacks that warmists need to create scary scenarios are nowhere near observed or proven, so Arrhenius was probably right – it is a GOOD THING!

Verified by MonsterInsights