Throwing down the gauntlet on reproducibility in Climate Science – Forest et al. (2006)

After spending a year trying to get the data from the author without success, Nic Lewis has sent a letter to the editor of Geophysical Research Letters (GRL) and has written to me to ask that I bring attention to his letter published at Judith Curry’s website, and I am happy to do so. He writes:

I would much appreciate it if you could post a link at WUWT to an article of mine (as attached) that has just been published at Climate Etc. It concerns the alteration of data used in an important climate sensitivity study, Forest 2006, with a radical effect on the resulting climate sensitivity estimated PDF.

I’m including the foreword here (bolding mine) and there is a link to the entire letter to the editor of GRL.

Questioning the Forest et al. (2006) sensitivity study

By Nicholas Lewis

Re: Data inconsistencies in Forest, Stone and Sokolov (2006) GRL paper 2005GL023977 ‘Estimated PDFs of climate system properties including natural and anthropogenic forcings‘

In recent years one of the most important methods of estimating probability distributions for key properties of the climate system has been comparison of observations with multiple model simulations, run at varying settings for climate parameters. Usually such studies are formulated in Bayesian terms and involve ‘optimal fingerprints’. In particular, equilibrium climate sensitivity (S), effective vertical deep ocean diffusivity (K_v) and total aerosol forcing (F_aer) have been estimated in this way. Although such methods estimate climate system properties indirectly, the models concerned, unlike AOGCMs, have adjustable parameters controlling those properties that, at least in principle, are calibrated in terms of those properties and which enable the entire parameter space to be explored.

In the IPCC’s Fourth Assessment Report (AR4), an appendix to WGI Chapter 9, ‘Understanding and attributing climate change’[i], was devoted to these methods, which provided six of the chapter’s eight estimated probability density functions (PDFs) for S inferred from observed changes in climate. Estimates of climate properties derived from those studies have been widely cited and used as an input to other climate science work. The PDFs for S were set out in Figure 9.20 of AR4 WG1, reproduced below.

The results of Forest 2006 and its predecessor study Forest 2002 are particularly important since, unlike all other studies utilising model simulations, they were based on direct comparisons thereof with a wide range of instrumental data observations – surface, upper air and deep-ocean temperature changes – and they provided simultaneous estimates for K_v and F_aer as well as S. Jointly estimating K_v and F_aer together with S is important, as it avoids dependence on existing very uncertain estimates of those parameters. Reflecting their importance, the IPCC featured both Forest studies in Figure 9.20. The Forest 2006 PDF has a strong peak which is in line with the IPCC’s central estimate of S = 3, but the PDF is poorly constrained at high S.

I have been trying for over a year, without success, to obtain from Dr Forest the data used in Forest 2006. However, I have been able to obtain without any difficulty the data used in two related studies that were stated to be based on the Forest 2006 data. It appears that Dr Forest only provided pre-processed data for use in those studies, which is understandable as the raw model dataset is very large.

Unfortunately, Dr Forest reports that the raw model data is now lost. Worse, the sets of pre-processed model data that he provided for use in the two related studies, while both apparently deriving from the same set of model simulation runs, were very different. One dataset appears to correspond to what was actually used in Forest 2006, although I have only been able to approximate the Forest 2006 results using it. In the absence of computer code and related ancillary data, replication of the Forest 2006 results is problematical. However, that dataset is compatible, when using the surface, upper air and deep-ocean data in combination, with a central estimate for climate sensitivity close to S = 3, in line with the Forest 2006 results.

The other set of data, however, supports a central estimate of S = 1, with a well constrained PDF.

I have written the below letter to the editor-in-chief of the journal in which Forest 2006 was published, seeking his assistance in resolving this mystery. Until and unless Dr Forest demonstrates that the model data used in Forest 2006 was correctly processed from the raw model simulation run data, I cannot see that much confidence can be placed in the validity of the Forest 2006 results. The difficulty is that, with the raw model data lost, there is no simple way of proving which version of the processed model data, if either, is correct. However, so far as I can see, the evidence points to the CSF 2005 version of the key surface temperature model data, at least, being the correct one. If I am right, then correct processing of the data used in Forest 2006 would lead to the conclusion that equilibrium climate sensitivity (to a doubling of CO2 in the atmosphere) is close to 1°C, not 3°C, implying that likely future warming has been grossly overestimated by the IPCC.

This sad state of affairs would not have arisen if Dr Forest had been required to place all the data and computer code used for the study in a public archive at the time of publication. Imposition by journals of such a requirement, and its enforcement, is in my view an important step in restoring trust in climate science amongst people who base their beliefs on empirical, verifiable, evidence.

Nic Lewis

==============================================================

Just let me say that there’s movement afoot to address the issues brought up about reproducibility in journal publications in the last paragraph. I’ll have more on this at a future date.

Here’s the foreword and letter to the GRL editor in PDF form: Post on Forest 2006 GRL letter final

This figure from that letter by Lewis suggests a lower climate sensitivity to a doubling of CO2 than the original:

-Anthony

0 0 votes

Article Rating

82 Comments

Inline Feedbacks

View all comments

tonyb

June 25, 2012 11:32 am

Pamela Gray
Phil Jones also seemed to have lost his data so Forster is in good company.
A journalist described Hansens office as ‘comically cluttered’ and he was concerned enough to email her saying it was much better than it used to be.
It seems the higher up the food chain the more haphazard the treatment of the data. Personally I’m not sure I could rely on a paper produced by someone who tries to work in a ‘comically cluttered’; office.
tonyb

G. Karst

June 25, 2012 11:37 am

“Dr Forest reports that the raw model data is now lost.”

Someone has been reading Climategate E-mails, as an instruction manual. GK

Tom Murphy

June 25, 2012 11:38 am

I continue to be amazed that neither the journals nor their peers mandate the release of the data used to support a climate researcher’s or group’s resulting paper. The golden rule of auditing (in any discipline) is that if you don’t write it down, it never happened. In today’s over-hyped Information Age, this presumption that the journal or peer should trust the researcher or group is antiquated to the point of naiveté.
I’m reminded what Stephen J. Gould stated in his own (controversial) book “The Mismeasure of Man” says it best, “Phony psychics like Uri Geller have had particular success in bamboozling scientists with ordinary stage magic, because only scientists are arrogant enough to think that they always observe with rigorous and objective scrutiny, and therefore could never be so fooled – while ordinary mortals know perfectly well that good performers can always find a way to trick people.” The same, I think, could be said of some climate scientists.
They seemingly and desperately want to believe that humankind is solely responsible for this “catastrophe,” which has been predicted by the models. At day’s end, it’s really rather sad to witness such educated persons failing so publicly, while they remain “eyes wide shut” to the last.

E. Z. Duzzit

June 25, 2012 11:43 am

Data that have been lost, destroyed, secreted or otherwise unavailable are no different than data that is non-existent. Conclusions based on non-existent data are useless.

atheok

June 25, 2012 11:51 am

What a surprise! How utterly original!
Phil can’t figure out what he did with the original data.
UVA claims they lost the emails, oops, unfortunately (for UVA) they were found. It appears that delete key doesn’t always work.
Now the trees (data) can’t be seen and the forest got lost. One does wonder if the data ever really existed. If it did exist, one then wonders if the original data suffered from a reaction to delete key pressing or if it got lost in the recycling (strictly paper bound).
The trouble with the data going the recycle route, just what/when was it electronic data so that computer manipulation was possible? I bet those darn backup servers still have copies.

RHS

June 25, 2012 12:03 pm

I don’t think the dog ate the homework, I think his virus ate the data…

kakatoa

June 25, 2012 12:22 pm

I assume that Dr. Forest, et al. will be using their models to predict, make that simulate via a few scenarios, the effect of CO2 levels for AR5. I hope that more robust means of data management will be followed this time around.
I can’t imagine Mr. Putin agreeing to modify his countries behavior in regards to CO2 if the scientific experts in his government can’t review the details……..

TallDave

June 25, 2012 12:35 pm

Calm down everyone, data is right here.

timetochooseagain

June 25, 2012 12:36 pm

Nic Lewis- Is there a reason to prefer the median of these distributions as a “central estimate” to the mode?
Well, could be worse, they could have gone with the mean, which would really skew right.

björn

June 25, 2012 12:37 pm

You do not lose your data, period!
It is impossible to tell when and if or why you want to use them again!
Besides, ir feels really good to have a huge stack of data, makes you proud of your efforts.
I would feel terrible losing all that work, even if only for sentimenral reasons.

Nic Lewis

June 25, 2012 12:56 pm

timetochooseagain: ‘Is there a reason to prefer the median of these distributions as a “central estimate” to the mode?’
I suppose that the median reflects the full distribution to a greater extent than the mode does. But I’m not sure that any ‘central estimate’ is that useful with wide, skewed distributions like these. I prefer to see the full PDF. That has an added advantage: if its shape is peculiar, it warns you to regard the study involved with some suspicion.

Follow the Money

June 25, 2012 1:07 pm

Don’t be so harsh, people. They lose and forget lots of things at Penn State.

Lucy Skywalker

June 25, 2012 1:14 pm

So Dr Forrest faffs for a year… and after a year of faffing, ~~admits~~ claims he’s lost the data…
What did he gain by his paper? Quoting in IPCC and consequent kudos…
Dog-Ate-My-DataGate

Mike Jowsey

June 25, 2012 1:23 pm

Excellent research Mr. Lewis. Clearly you have put an enormous amount of (unpaid) work into this study. We await with interest a response from the GRL editor.

timetochooseagain

June 25, 2012 1:25 pm

Nic Lewis says: “I prefer to see the full PDF. That has an added advantage: if its shape is peculiar, it warns you to regard the study involved with some suspicion.”
The shape of the distributions is not surprising, or suspicious, in and of itself, I think. Sensitivity scales with the feedback factor as 1/(1-f), so if the estimate of f is normally distributed, and the mean is greater than zero, you inevitably get the fat tail for the distribution for estimated sensitivity. But even if we aren’t suspicious, we should check studies to see if the distributions that can be derived from the data really meet the conditions which would lead to a fat tail (mean of estimated f greater than zero is, I think, the crucial condition, or close enough, but it depends on the variance of the estimate of estimates of f) and if they don’t, the fat tail should disappear.

Latimer Alder

June 25, 2012 1:29 pm

Its probably just slipped down the back of the sofa and will turn up soon. Or maybe Forest put it somewhere safe and forgot where it was.
After all he’s not expected to be a well-organised and analytical professional scientist or anything is he? Anybody can lose the data associated with the most important paper they’ll ever write. It’s just so forgettable. And you only remember you’ve lost it when somebody asks to see it…….

Lucy Skywalker

June 25, 2012 1:33 pm

from Judith Curry’s comment:
Nic Lewis’ academic background is mathematics, with a minor in physics, at Cambridge University (UK). His career has been outside academia. Two or three years ago, he returned to his original scientific and mathematical interests and, being interested in the controversy surrounding AGW, started to learn about climate science. He is co-author of the paper that rebutted Steig et al. Antarctic temperature reconstruction (Ryan O’Donnell, Nicholas Lewis, Steve McIntyre and Jeff Condon, 2011)…
I have been discussing this issue with Nic over the past two weeks. Particularly based upon his past track record of careful investigation, I take seriously any such issue that Nic raises. Forest et al. (2006) has been an important paper, cited over 100 times and included in the IPCC AR4...
This particular situation raises some thorny issues, that are of particular interest especially in light of the recent report on Open Science from the Royal Society:
.. assuming for the sake of argument that there is a serious error in the paper: should a paper be withdrawn from a journal, after it has already been heavily cited?..

Nic Lewis

June 25, 2012 1:47 pm

timetochooseagain: “The shape of the distributions is not surprising, or suspicious, in and of itself”
I agree that a fat tail distribution is to be expected. I don’t regard a fat tail in itself as a peculiarity, but I do regard multiple peaks and strange bumps and shoulders in the PDF as being peculiar.
Only one of the distributions in the IPCC figure, Gregory 02, is genuinely consistent with a normally distributed estimate of f – and the Gregory 02 is missing nearly half of its probability mass, due to being cut off at f=1. The Forster/Gregory 06 PDF represents a normally distributed estimate for f, but the IPCC experts decided to multiply the resulting climate sensitivity PDF by sensitivity squared – supposedly to make it comaprable to the other PDFs!

Berényi Péter

June 25, 2012 2:10 pm

“Unfortunately, Dr Forest reports that the raw model data is now lost.”
Unfortunate indeed. For Dr. Forest the honest course of action to follow at this point is
1. withdraw the paper from GLR immediately, as results described in it are irreproducible
2. remove all references to it from the IPCC AR4 report retroactively
3. have all other researchers withdraw their papers, who have relied on it
4. pay back all grant money gained for this and subsequent research
5. serve proper jail term for animal abuse, letting the dog eat raw data instead of cooked ones

Stacey

June 25, 2012 2:21 pm

Is it worth trying his co authors Messrs Stone and Sokolov surely they must have a copy of the data?

HankH

June 25, 2012 2:26 pm

I work with volumes of clinical research data all the time. In the course of a research project I might have several subsets of the original data as queries of the original data produce output that looks at how the experimental variable(s) affect different stratifications of the sample group. Each dataset must be properly validated, versioned, systematically stored according to “best practices.” Further, all data is mirrored and stored in two data centers and warehoused with a data vault company. Such data is considered so precious that such controls are an absolute requirement.
Anyone who outright looses the original data has such bad organization and lack of controls in place that any results of their work must be called into question. I continue to be astounded at the shoddy research practices of these climatologists and even more astounded that their work is not thrown in the waste bin by the publishing journal when such gross negligence is discovered.

Manfred

June 25, 2012 2:43 pm

Small planet
Dr. Forest is now with the Department of Meteorology at the Pennsylvania State University.
Before that, he was with MIT, his thesis advisors were Kerry A. Emanuel and Peter Molnar.
http://ploneprod.met.psu.edu/people/cef13/

Nic Lewis

June 25, 2012 2:45 pm

Stacey: “Is it worth trying his co authors Messrs Stone and Sokolov surely they must have a copy of the data?”
I have tried. I understand Dr Stone was seriously ill when I emailed last year, so I have let him be, poor chap.
I have failed to obtain any response from Dr Sokolov, who is the expert on the MIT 2D climate model. Maybe he thinks that it is entirely Dr Forest’s responsibility to respond, or perhaps he doesn’t like a non-academic poking his nose in.

timetochooseagain

June 25, 2012 2:48 pm

Nic Lewis-Yes, you are correct, I should have said I don’t regard the fat tail itself as suspicious, but like you I do find the odd shoulders or extra local maxima (secondary “modes”) as curious and suspicious. In this regard the worst offender appears to be “Knutti 02” which gives the most outrageous estimate for sensitivity of all of them, surely!

Hot under the collar

June 25, 2012 2:54 pm

@Stacey says,
I suspect the dogs paw accidentally hit the delete button on the co authors computer.

« Previous 1 2 3 4 Next »

wpDiscuz

Welcome to Watts Up With That, one of the most well-known climate blogs! We gather the latest scientific research, news, and expert opinion to help you understand how our planet is changing and what implications it may have for humanity. Our approach is based on facts, objective analysis, and open discussions about one of the most critical issues of our time. Watts up with that climate and what changes await us – let’s figure it out together!

Watts Up With That covers a wide range of topics related to climate change and its impact on the world. Here’s what’s important to us:

Global warming – its causes, consequences, and future forecasts.
Analysis of current climate research and its findings.
Climate change news.
Extreme weather events – hurricanes, droughts, floods, and their connection to climate change.
The impact of different energy sources on the environment and the development of sustainable technologies.
Political and economic aspects and how states and international organizations respond to climate change.

Watts Up With That?

Throwing down the gauntlet on reproducibility in Climate Science – Forest et al. (2006)

Questioning the Forest et al. (2006) sensitivity study

Like this:

Questioning the Forest et al. (2006) sensitivity study

Share this:

Like this:

Related Posts

Stefani on the Sun vs. CO2 as climate drivers

Turning “What If” into “How Many”: The Rhetorical Alchemy of Climate Modeling

Models & Lab Studies

Peer-Reviewing Peer Review