Guest Post by Willis Eschenbach
The discussion of the 1998 Mann “Hockeystick” seems like it will never die. (The “Hockeystick” was Dr. Michael Mann’s famous graph showing flatline historical temperatures followed by a huge modern rise.) Claims of the Hockeystick’s veracity continue apace, with people doggedly wanting to believe that the results are “robust”. I thought I’d revisit something I first posted and then expanded on at ClimateAudit a few years ago, which are the proxies in Michael Mann et al.’s 2008 paper, “Proxy-based reconstructions of hemispheric and global surface temperature variations over the past two millennia” (M2008). This was another salvo in Mann’s unending attempt to revive his fatally flawed 1998 “Hockeystick” paper. I used what is called “Cluster Analysis” to look at the proxies. Cluster analysis creates a tree-shaped structure called a “dendrogram” that shows the similarity between the individual datasets involved. Figure 1 shows the dendrogram of the 95 full-length proxies used in the M2008 study:
Figure 1. Cluster Dendrogram of the 95 proxies in the Mann 2008 dataset which extend from the year 1001 to 1980. The closer together two proxies are in the dendrogram, the more similar they are. Absolute similarity is indicated by the left-right position of the fork connecting two datasets. The names give the dataset abbreviation as used by Mann2008, the type (e.g. tree ring, ice core) the location as lat/long, the name of the princiipal investigator, and if tree rings the species abbreviation (e.g. PIBA, PILO).
What can we learn from this dendrogram showing the results of the cluster analysis of the Mann 2008 proxies?
First let me start by describing how the dendrogram is made. The program compares all possible pairs of proxies, and measures their similarity. It selects the most similar pair, and draws a “fork” that connects the two.
Take a look at the “forks” in the dendrogram. The further to the left the fork occurs, the more similar are the pairs. The two most similar proxy datasets in the whole bunch are ones that are furthest to the left. They turn out to be the Tiljander “lightsum” and “thicknessmm” datasets.
Once these two are identified, they are then averaged. The individual proxy datasets are replaced by the average of the two. Then the procedure is repeated. This time it compares all possible remaining pairs, including the average of the first two as a single dataset. Again the most similar pair is selected, marked with a “fork” (slightly to the right of the first fork), and averaged. In the dataset above, the most similar pair is again among the Tiljander proxies. In this case, the pair consists of the “darksum” proxy on the one hand, and the average of the two Tiljander proxies from the first step on the other hand. These two are then removed and replaced with their average.
This procedure is repeated over and over again, until all of the available proxies have been averaged together and added to the dendrogram and it is complete.
In this case, the clustering is clearly not random. In general a cluster is composed of measurements of similar things in a single geographical area (e.g. Argentinian Cypress tree rings). In addition, the proxies tend to cluster by proxy type (e.g. speleothems and sediments vs. tree rings).
Next, the dendrogram can be read from the bottom up to show which groups of proxies are most dissimilar to the others. The more outlying and more unusual group a group is, the nearer it is to the top of the dendrogram.
Next, note that many of the groups of proxies are much more similar to each other than they are to any of the other proxies. In particular the bristlecone “stripbark pines” end up right at the top of the dendrogram, because they are the most atypical group of the lot. Only when there is absolutely no other choice are the bristlecones at the top of the dendrogram added to the dendrogram.
So how does this type of analysis clarify whether the “Hockeystick” is real? The question at issue all along has been, is the “hockeystick” shape something that can be seen in a majority of the proxies, or is it limited to a few proxies? This is usually phrased as whether the results are “robust” to the removal of subsets of the proxies. And as usual in climate science, there are several backstories to this question of “robustness”.
The first backstory on this question is that well prior to this study, the National Research Council (2006) “Surface Temperature Reconstructions for the Last 2,000 Years” recommended that the bristlecone and related “stripbark” pines not be used in paleotemperature reconstructions. This recommendation had also been made previously by other experts in the field. The problem for Mann, of course, is that the hockeystick signal doesn’t show up much when one leaves out the bristlecones. So like a junkie unable to resist going back for one last fix, Dr. Mann and his adherents have found it almost impossible to give up the bristlecones.
The next backstory is that a number of the bristlecone proxy records collected by Graybill have failed replication, as shown by the Ababneh Thesis. Not only that, but one of the authors of M2008 (Malcolm Hughes) had to have known that, because he was on her PhD committee … so the M2008 study used proxies that were not only not recommended for use, but proxies not recommended for use that they knew had failed replication. Bad scientists, no cookies.
The final backstory is that the Tiljander proxies a) were said by the original authors to be hopelessly compromised in recent times and who advised against their use as temperature proxies, and b) were used upside-down by Mann (what he called warming the proxies actually showed as cooling).
With all of that as prologue, Figure 2 shows the average signals of the clusters of normalized proxies (averaged after each proxy is normalized to an average of zero and a standard deviation of one). See if you can tell where the Hockeystick shaped signal is located …
Figure 2. Left column shows average signals of the clusters of proxies shown in Figure 1, from the year 1001 to 1980. Averages are of the cluster to which each is connected by a short black line.
You can see the problems with the various Tiljander series, which are obviously contaminated … they go off the charts in the latter part of the record. In addition, if the Tiljander data were real it would be saying record cold, not record hot, but the computational method of Mann et al. flipped it over.
The reason for the unending addiction of Mann and his adherents to certain groups of proxies becomes obvious in this analysis. The hockeystick shape is entirely contained in a few clusters—the Greybill bristlecones and related stripbark species, the upside-down Tiljander proxies, and a few Asian tree ring records. The speleothems and lake sediments tell a very different story, one of falling temperatures … and in most of the clusters, there’s not much of a common signal at all. Which is why the attempts to rescue the original 1998 “hockeystick” have re-used and refuse to stop re-using those few proxies, proxies which are known to be unsuitable for use in paleotemperature reconstructions. They refuse to stop recycling them for a simple reason … you can’t make hockeysticks without those few proxies.
To sum up. Is the mining of “hockeystick” shaped climate reconstructions from this dataset a “robust” finding?
Not for me, not one bit. While you can get a hockeystick if you waterboard this data long enough, the result is a chimera, a false result of improper analysis. The hockeystick shaped signal is far too localized, and occurs in far too few of the clusters, to call it “robust” in any sense of the word.
w.
PS – The entire saga of the Ababneh Thesis, along with lots and lots of other interesting information, is available over at ClimateAudit. People who want to improve their knowledge about things like the proxy records and the Climategate FOI requests and the whole climate saga should certainly do their homework at ClimateAudit first … because in the marvelous world of Climate Science, things are rarely what they seem.
[UPDATE] Some commenters asked for the data, my apologies for not providing it. It is located at the NOAA Paleoclimate repository here.
Discover more from Watts Up With That?
Subscribe to get the latest posts sent to your email.
Nice review – I hadn’t seen this analysis of the hockey stick data sets laid out this neatly before. Too bad material like this never makes the news – it might help reduce the nonsense.
Nice. Very clear. Thanks Willis!
This is a very useful breakdown of the work (cough) that Mann does. Mann is exactly like a kid who just can’t stop taking cookies from the cookie jar. It doesn’t matter what happens, he just keeps going back for more.
I have never found a reconstruction that was independent of Mann that has EVER shown a hockey stick. That is everything that needs to be known about the *robust* method of Mann.
Great explanation for people like me who have no technical background. It never ceases to amaze me how the team continue to cling to their precious icon, even as it melts slowly into the category of science fiction.
Another interesting and very informative piece, keep em coming Willis
I take it that Mann ignored the data from “Speleotherms and Lake Sediments” which shows an upside down hockey stick and the data from everything else which shows precisely nothing. This is the “science” of Man Made Global Warming.
Excellent analysis – very well presented. Kill it with fire indeed.
Nice work Willis. Climategate got me interested in this stuff, but it has been a long-haul since then getting up to speed. Things like how Mann et al have treated Tiljander alone are mind-boggling – if it weren’t for the apparently blatant incompetence of applying the data upside-down, one would be tempted to infer outright fraud.
I remember a heated thread over at CA where the Tiljander defenders came out on one side, amac and others on the other side and I just could not believe seemingly intelligent people (gosh they may very well have been University academics) arguing for the Teljander proxies. I believe later S. Mosher did a very funny parody of that post here at WUWT based on Abbot and Costello’s “Who’s on First”.
That was another turning point in my understanding – (apparently) well educated, well spoken individuals are entirely willing to vigorously defend an obvious error…something weird is going on in Climate Science.
This is high-level tree geek stuff. Thanks for the analysis.
Nice Willis. Thanks.
The dendrogram is an interesting concept that seems able to group foibles together. I see the big spike at the end of the Tiljander-derived chart that seems even more hockeystick-ish than the Greybill bristlecone chart above it. Interesting comment that the Tiljander data was “haplessly compromised in recent times” according to the original authors, but that Mann used them anyhow in a sort of inverse manner. It would be interesting to learn why, if properly used, the Tiljander proxies would have indicated cooling instead of warming.
But maybe tide is turning — our Governor Christie has just pulled NJ out of the “Cap And Trade” agreement that has been in effect for several northeastern States (excluding PA) that Christie said was costing the average NJ utility user (like me) an EXTRA $3.50 PER MONTH just to fund the damaging anti-carbon war unleashed by Gore, Mann et al. I wonder how many other uninformed utility-bill payers like me were unaware of this EnviroTax Rip-off.
Andrew H says:
May 30, 2011 at 12:19 pm
Not true. A group of random proxies will be dominated if there is a cohesive group of similar outliers. In this case, what we have is a lot of proxies that mostly do nothing except cancel each other out. So in any kind of averaging, whether you use sophisticated weighting techniques or not, once those have cancelled out what’s left is the cohesive (but false) signal of the Graybill bristlecone and Tiljander and Tornetrask proxies.
What I’m saying is that Mann didn’t “ignore” those other proxies, because there is no need to ignore them. They just cancel out overall, leaving the Graybill etc. data to dominate the landscape. Once that stack of proxies is chosen, the die is cast, and even a simple average of all of the proxies will show a hockeystick.
w.
bob paglee says:
May 30, 2011 at 12:31 pm
Google “Tiljander site:climateaudit.org”, there’s heaps of backstory, I’ve only touched the surface. Mann not only used Tiljander upside-down once. Unbelievably, some of Mann’s co-authors re-used it upside-down in a subsequent paper after being informed of it … talk about unending hubris.
Anyhow, Steve McIntyre has the whole saga over at climateaudit. I highly recommend it.
w.
Great work. Another great contribution to the defense of science. Thanks. Too bad that defense of science is pretty much limited to WUWT, ClimateAudit, Montford’s blog and a few others.
It was not until the Tiljander episode that I fully understood the lengths that the team were willing to go to fudge the record.
A mistake in science, particularly in a method so dependent on programming and messy data, is not unusual and is forgivable. The team’s treatment of Steve McIntyre was bad, but was not yet in my mind unforgivable – egos being what they are.
But to goof up so fundamentally on a dataset whose collectors warned of its contamination and then to dissemble the way the team did on Tiljander was what convinced me that something other than good faith science was at work and tempted me to use the “f” word to describe their actions. Noting else could explain their actions.
I wonder when the truth will finally dawn on politicians that they have been well and truly hoodwinked by the team? Some will likely refuse to see the truth, but how long will it take before the majority get a clue?
We should ask the NHL players if their hockey sticks are made of stripbark pine. It take the best wood for the best Hockey sticks.
Thanks for pointing that out to me Willis, I stand corrected.
I read some time ago that the methodology of Mann’s statistical analysis meant that even telephone numbers inputted from the Yellow Pages at random would produce a hockey stick. Your explanation confirms this?
[REPLY] Steve McIntyre demonstrated this most conclusively. My explanation has nothing to do with that.
The program compares all possible pairs of proxies, and measures their similarity.
How is ‘similarity’ measured in your analysis?
Willis, can you confirm that in your cluster analysis the variables are measurements (ring width, density, etc) for the individual years between 1001 and 1980? I’m assuming you normalized each series to a mean of zero and stdev of one before doing the CA.
Can you speculate on why the SW USA Bristlecones split out into three clusters? Is there some pattern in tree location or lab processing? Or does it really come down to strip-bark specimens alone?
Did they use the data upside down? How come this is not widely known?
Oh, the wonders of modern parsing theory and the software that uses it! I scanned the article but did not notice any mention of what software package was used? BTW, this is a type of analysis that I am pretty familiar with as it is similar to what taxonomists use to produce cladograms.
Thanks for all this tasty meat to chew on.
Here is a sceptics intro to dendrochronology and data keeping.
http://uts.cc.utexas.edu/~wd/courses/373F/notes/lec20den.html
http://scienceblogs.com/aardvarchaeology/2009/06/open_source_dendrochronology.php
Leif Svalgaard
May 30, 2011 at 1:29 pm
How is ‘similarity’ measured in your analysis?
###
Good question. I am assuming wavelet analysis. A simple Haar would probably work, give the proper definition of the normal.
When I first saw the hockey stick alarm bells started to ring. It was completely contrary to everything Hubert Lamb had written in a series of books. H Lamb was the climate scientist responsible for the setting up of climate research at the UEA – he lived in East Anglia. If Mann was right everything known about the last 2 thousand years was wrong – how could Mann have been right. My thanks to Steve McIntyre for providing the evidence to rebut the nonsense science of Mann and his pals
just wondering whether James Annan ever returned to this area?
Willis puts it in a nutshell — much appreciated article.