Note: Steve McIntyre is also quite baffled by the Marcott et al paper, finding it currently unreproducible given the current information available. I’ve added some comments from him at the bottom of this post – Anthony
Guest Post by Willis Eschenbach
I don’t know what it is about proxies that makes normal scientists lose their senses. The recent paper in Science (paywalled of course) entitled A Reconstruction of Regional and Global Temperature for the Past 11,300 Years” (hereinafter M2012) is a good example. It has been touted as the latest hockeystick paper. It is similar to the previous ones … but as far as I can see it’s only similar in how bizarre the proxies are.
Nowhere in the paper do they show you the raw data, although it’s available in their Supplement. I hate it when people don’t show me their starting point. So let me start by remedying that oversight:

Figure 1. All of the proxies from M2012. The colors are only to distinguish individual records, they have no meaning otherwise.
I do love the fact that from that collection of temperature records they draw the conclusion that:
Current global temperatures of the past decade have not yet exceeded peak interglacial values but are warmer than during ~75% of the Holocene temperature history.
Really? Current global temperature is about 14°C … and from those proxies they can say what the past and present global average temperatures are? Well, let’s let that claim go for a moment and take a look at the individual records.
Here’s the first 25 of them:
Figure 2. M2012 proxies 1 to 25. Colors as in Figure 1. Note that each panel has its own vertical axis. Numbers to the left of each title are row/column.
Well … I’d start by saying that it seems doubtful that all of those are measuring the same thing. Panel 3/1 (row 3, column 1) shows the temperature decreasing for the last ten thousand years. Panels 4/4 and 4/5 show the opposite, warming for the last ten thousand years. Panel 4/3 shows four thousand years of warming and the remainder cooling.
Let’s move on to the next 25 contestants:
Figure 3. M2012 proxies 26 to 50. Colors as in Figure 1. Note that each panel has its own vertical axis. Numbers to the left of each title are row/column.
Here we see the same thing. Panels 1/1 and 4/1 show five thousand years of warming followed by five thousand years of cooling. Panel 1/5 shows the exact opposite, five thousand cooling years followed by five thousand of warming. Panel 4/5 show steady warming, panel 5/2 shows steady cooling, and panel 2/2 has something badly wrong near the start. Panel 2/4 also contains visible bad data.
Onwards, we near the finish line …
Figure 4. M2012 proxies 51 to 73. Colors as in Figure 1. Note that each panel has its own vertical axis. Numbers to the left of each title are row/column.
Panel 2/1 shows steadily rising temperatures for ten thousand years, as does panel 3/4. Panels 4/1 and 5/1, on the other hand, show steadily decreasing temperatures. Panel 4/2 has a hump in the middle. but panel 1/2 shows a valley in the middle.
Finally, here’s all the proxies, with each one shown as anomalies about the average of its last 2,000 years of data:

Figure 5. All Marcott proxies, expressed as anomalies about their most recent 2,000 years of record. Black line shows 401-point Gaussian average. N=9,288.
A fine example of their choice of proxies can be seen in the fact that they’ve included a proxy which claims a cooling about nine degrees in the last 10,000 years … although to be fair, they’ve also included some proxies that show seven degrees of warming over the same period …
I’m sorry, guys, but I’m simply not buying the claim that we can tell anything at all about the global temperatures from these proxies. We’re deep into the GIGO range here. When one proxy shows rising temperatures for ten thousand years and another shows dropping temperatures for ten thousand years, what does any kind of average of those two tell us? That the temperature was rising seven degrees while it was falling nine degrees?
And finally, their claim of turning that dogs breakfast shown in Figure 1 into an absolute global temperature and comparing it to the current 14°C average temperature estimate?
Don’t make me laugh.
I say the reviewers of this paper didn’t use their Mark I eyeball. The first thing to do when dealing with a multi-proxy study is to establish ex-ante criteria for the selection of the proxies (“ex-ante” meaning choose your criteria before looking at the proxies). Here are their claimed criteria …
This study is based on the following data selection criteria:
• Sampling resolution is typically better than ~300 yr.
• At least four age-control points span or closely bracket the full measured interval.
• Chronological control is derived from the site itself and not primarily based on tuning to other sites. Layer counting is permitted if annual resolution is plausibly confirmed (e.g., ice-core chronologies). Core tops are assumed to be 1950 AD unless otherwise indicated in original publication.
• Each time series spans greater than 6500 years in duration and spans the entire 4500 – 5500 yr B.P. reference period.
• Established, quantitative temperature proxies
• Data are publicly available (PANGAEA, NOAA-Paleoclimate) or were provided directly by the original authors in non-proprietary form.
• All datasets included the original sampling depth and proxy measurement for complete error analysis and for consistent calibration of age models (Calib 6.0.1 using INTCAL09 (1)).
Now, that sounds all very reasonable … except that unfortunately, more than ten percent of the proxies don’t meet the very first criterion, they don’t have sampling resolution that is better than one sample per 300 years. Nice try, but eight of the proxies fail their own test.
I must say … when a study puts up its ex-ante proxy criteria and 10% of their own proxies fail the very first test … well, I must say, I don’t know what to say.
In any case, then you need to LOOK AT EACH AND EVERY PROXY. Only then can you begin to see if the choices make any sense at all. And in this case … not so much. Some of them are obviously bogus. Others, well, you’d have to check them one by one.
Final summary?
Bad proxies, bad scientists, no cookies for anyone.
Regards,
w.
==============================================================
Steve McIntyre writes in a post at CA today:
Marcott et al 2013 has received lots of publicity, mainly because of its supposed vindication of the Stick. A number of commenters have observed that they are unable to figure out how Marcott got the Stick portion of his graph from his data set. Add me to that group.
The uptick occurs in the final plot-point of his graphic (1940) and is a singleton. I wrote to Marcott asking him for further details of how he actually obtained the uptick, noting that the enormous 1920-to-1940 uptick is not characteristic of the underlying data. Marcott’s response was unhelpful: instead of explaining how he got the result, Marcott stated that they had “clearly” stated that the 1890-on portion of their reconstruction was “not robust”. I agree that the 20th century portion of their reconstruction is “not robust”, but do not feel that merely describing the recent portion as “not robust” does full justice to the issues. Nor does it provide an explanation.
Read Steve’s preliminary analysis here:
[UPDATE] In the comments, Steve McIntyre suggested dividing the proxies by latitude bands. Here are those results:
Note that there may be some interesting things buried in there … just not what Marcott says.
Also, regarding the reliability of his recent data, he describes it as “not robust”. It is also scarce. Only 0.6% of the data points are post 1900, for example. This raises the question of how he compared modern temperatures to the proxies, since there is so little overlap.
Finally, about a fifth of the proxies (14 of 73) have the most recent date as exactly 1950 … they said:
Core tops are assumed to be 1950 AD unless otherwise indicated in original publication.
Seems like an assumption that is almost assuredly wrong. I don’t know if that’s a difference that makes a difference, depends on how wrong it is. If we take the error as half the distance to the next data point for each affected proxy, it averages about ninety years … pushing 1950 back to 1860 … yeah, I’ll go with “not robust” for that.
[UPDATE 2] Yes, I am shoveling gravel, one ton down, six to go … and I do get to take breaks. Here’s the result of my break, the Marcott proxies by type:
And here’s a picture of yr. unbending author playing what we used to call the “Swedish Banjo”.
Best to all,
w.


Willis an interesting summary.
When dealing with proxies, one needs a very large dolop of caution. They are notoriously unreliable, with wide error margins. As regards averaging, it appears that the usual position in climate science is that an average of a collection of sow’s ears produces a silk purse, whereas, in reality, average of ‘crap’ remains ‘crap’.
I would have thought that the starting point with respect to each and every one of the proxies is to precisely identify what the proxy is, to detail precisiely from where it was taken, and how it was taken. The study author ought to then set out, on a proxy by proxy basis, what he thinks that the proxy (in question) is measuring, how and why it can be concluded that it is a metric for the measurement ascribed to it by the study author. Finally, the study aithor should evaluate each and every one of the proxies used and set out what he considers to be the reliability and the error margin of the proxy and why he holds such analysis.
What bugs me the most about this is how the authors can claim “Global temperatures are warmer than at any time in at least 4,000 years” and “Global temperature….. has risen from near the coldest to the warmest levels of the Holocene within the past century.” A heat spike like this has never happened before, at least not in the last 11,300 years”” when those very authors acknowledge that the 1890-on portion of their reconstruction was “NOT ROBUST”
In other words, on their own admission, there is no robust evidence to support the contention that it is now warmer than anytime in the past 4,000 years, nor that in the past century, the temperature has risen from near the coldest to the warmest levels, nor that a heat spike like this has never happened before etc.
The upshot of their own admission is that there study simply suggests that there is a stick and does not evidence that there is a blade attached to the stick.
Why MSM were not told that there is no robust evidence for the 1890s onwards beggars belief.
Willis, thanks for the effort …
since you have the data available, have you tried leaving out the 8 “failed their own criteria” proxies, and what effect that would have on your fig. 5 ?
tia
Willis,
Re the 300 years condition, I presume your numbers on resolution come from col 7 of Table S1 of the SM. These are headed just resolution, and I think they are not the sampling resolution.
If you take the Dome F d180O, it lists a resolution of 500 years. But if you look at the raw data, you’ll see values listed every 250 yr. Each 250 years is typically 7-12 m depth in the last 10000 yr, and in the header, it says that they analyze 10 cm slices.
So I think sampling resolution is frequent. The discussion on p 6-7 of the SI of the cited reference seems to focus on “age model error” limiting the resolution.
“A number of commenters have observed that they are unable to figure out how Marcott got the Stick portion of his graph from his data set.”
Mikes “Nature Trick”???????
Thank you Willis. This is a far better way to debunk the Marcott study than what Easterbrook was doing. You just need to look at their own data, as you’ve done here.
But there is something wrong with your horizontal axis labelling. In their notation year 0 is 1950. If you are using the same numbering system your plots go about 1000 years into the future. Please could you check\clarify? 🙂
Here is a plot of their proxies over the last 300 years (note my time axis goes the other way) showing that there is no 20th century upturn in their data.
Later today I will do an averaging plot like your fig 5.
So in other words they are saying that ~75% of the Holocene temperature is less than the current average global temperature of 14°C? I could be wrong here but I vaguely recall that previous inter-glacials were warmer than the Holocene. Should I be worried?
“I must say … when a study puts up its ex-ante proxy criteria and 10% of their own proxies fail the very first test … well, I must say, I don’t know what to say.”
The word you are groping for is “Bullshit”……
Did they or did they not splice the instrument data onto the proxy data ?
I haven’t studied the paper behind the pay-wall but I am assuming they first performed an area weighted average of the proxy data, to yield a global proxy anomaly. Looking at your curves Willis, I see no evidence of a spike in the proxy data.
So the question is are they claiming a spike in the proxy only data, or just in the proxy + instrument data ? If it is the latter then they are comparing apples and pairs, because the former have a resolution of ~100years and the latter a resultion of 1 year.
One of the main flaws is this particular criterion:
“Chronological control is derived from the site itself and not primarily based on tuning to other sites. ”
This is akin to correlating wells in the Gulf of Mexico using only paleo reports and without correlating the well logs… Which would be the same as not correlating the wells.
Willis,
Cannot the National Science Foundation (NSF) who financed this paper be called to account or to publicly explain these glaring anomalies? Is there any peer reviewed paper that can be published to refute this nonsense?
Spaghetti factory explosion springs to mind, really this is too much, how can anybody look at this garbage and pretend it is constructive science. But I suppose it will in some people’s warped minds back up the original hockey stick debacle.
Another way to produce toilet paper.
Thanks Willis.
Thanks, Willis, for a presentation that even someone as mathematically challenged as I am can readily grasp.
It really is a complete load of doggy-doo!
How can any, intelligent, being believe that all (ANY?) of these “proxies” do really measure temperature?
When I compare your figure 5 with Marcott’s HockeyStick
http://i90.photobucket.com/albums/k247/dhm1353/Marcott1a_zpsed12aa62.png
their 1 degree uptick is well within the noice of their proxys. So imho it has no information value whatsoever. Not to mention that the inherent smoothing of the proxys already flattened the proxys… So the 1 degree uptick is stealthy enlarged and still does not show anything significant. Yet the climate scene is cheering. How desperate must they all be?
Marcott’s HockeyStick seems to be a complete and utter scientific failure.
“I’m sorry, guys, but I’m simply not buying the claim that we can tell anything at all about the global temperatures from these proxies. We’re deep into the GIGO range here. When one proxy shows rising temperatures for ten thousand years and another shows dropping temperatures for ten thousand years, what does any kind of average of those two tell us?”
So right Willis!
Why, why, why are so many reputable scientists sitting back and letting this utter NONSENSE go through???? Are they all corrupt? Is western science rotten to its core? Can’t they see that this kind of outright LIE does absolutely nothing for their cause? I just don’t get it. I’m just shaking my head in total disbelief.
When some said they just wanted to get a hockey stick in for IPCC 5, I was skeptical. So I double-checked. Yep, sure enough – the deadline is March 15, 2013 to be included in IPCC 5.
Willis, you say “… today and yesterday I spent shoveling about 12 tonnes of gravel. And writing that post …”
So…more shovelling then. Just a different consistency of material – and colour!
Great work!!!
On the timescales they’re looking at, do not other long-term effects come into play? For example, rift valley activity in eastern Africa may have significantly changed climate for a fairly large region, which in turn may have had global impacts.
Seems most of the proxies are ocean proxies, so would generally NOT show up any major changes in temperature (there is one heck of a lot of water)
This totally explains the lack of major peaks and troughs.
Willis, can you determine if those proxies that are wildly inconsistent are land or sea proxies ?
A lot of things have happened during the Earth’s history that might explain some of those inconsistencies, but I really can’t imagine events that would disrupt ocean based proxies all that much. Land temps , maybe, but water is a great regulator, and there is one heck of a lot of it. !
Theo Goodwin says:
March 13, 2013 at 11:06 pm
NSF funded this disaster and one of their program directors hyped it for the media. If there is congressional oversight over NSF, the overseers should come down hard right now.
—————————————
When you’re watching a magic show, I don’t believe the emcee is going to tell the audience how the magician did the trick or even that it was a trick. The sponsors don’t care either as long as their product is sold by whatever means.
In short, the overseers won’t come down at all much less, “hard”.
cn
I think the next step is to throw out the random proxies – the Tex86 and mg/ca -based ones. We already threw out the tree-rings (climate science has almost done so now); these proxy methods are next.
But maybe before that, the main question is how does Marcott get the uptick. Marcott writes back to McIntyre that the results past 1890 are not robust.
Oh I see, “Not robust”. Then why do 5 news releases and 100 headlines around the world and a dozen media interviews sound so clear that recent temperatures are higher than at any time in the past 11,300 years. The spin sounded robust enough.
I don’t think this is right. I don’t think this is ethical. Why should we believe scientists that show they have a propensity to act unethically. Its simple; we should not. We should double-check and be clear about what the results really are.
Thanks Willis. This took alot of effort. Especially after shoveling gravel which is probably the most taxing thing a person can do.
Instead of taking proxies and splicing recent instrumental records onto the end, wouldn’t it be more logical to calibrate the proxies using the instrumental record? I mean, a proxy by definition is not actual data, it’s some OTHER thing being recorded that is a representation of the value you actually want to determine. Since the value we want to determine (temperature) is so far off from the values recorded, it would make far more sense to assume the proxies are inaccurate and should be adjusted to match the real observed world.
This would, of course, completely eliminate the entire concept of a hockey stick.
Then again, what we are seeing here is the same level of dishonesty that is required to show “average” sea ice extent while excluding the last 10 years of data from the “average”. When I was in school, an average meant adding up all of the data available for a given number of years, then dividing by the number of years. This is an “average”. You don’t get to cherry pick a few years that you KNOW from anecdotal evidence were unusual, then cackle about how the years since have been way off of that “average”. If the last decade of lower sea ice extent was included in the “average”, then the “average” would become lower, which would make the current trends a lot less frightening to the weak minded.
These people wonder why “we” are skeptical of their claims. Well, we are skeptical because the claims are not credible, and are easily shown to be ridiculous by, well, even my 8 year old.
It really doesn’t matter how many proxies you add to the mix. Because a proxy IS NOT DATA, conclusions drawn from multi-proxy spaghetti rate low on the credibility scale.
No known temperature proxy can possibly be accurate other than on the most macro scale. We’ve already figured out that even with modern instruments and even satellites, we have no really valid current planetary temperature… the thought that any kind of proxy is any more accurate is starting to look more like insanity.
Rachael Maddow showed the graph on her show, celebrated by Mann, here:
http://www.facebook.com/photo.php?fbid=503540216368852&l=eac15ddfb7
majormike1 says:
March 13, 2013 at 10:36 pm
How they got to where they say they did is a fundamental mystery …
————————————-
If that is a riddle, then I propose as possible answers:
“Leap of Faith”
or
“Jumping to forgone conclusions”