The Berkeley Earth Surface Temperature project puts PR before peer review

UPDATE: see this new story

BEST: What I agree with and what I disagree with – plus a call for additional transparency to prevent “pal” review

=======================================================

Readers may recall this post last week where I complained about being put in a uncomfortable quandary by an author of a new paper. Despite that, I chose to honor the confidentiality request of the author Dr. Richard Muller, even though I knew that behind the scenes, they were planning a media blitz to MSM outlets. In the past few days I have been contacted by James Astill of the Economist, Ian Sample of the Guardian, and Leslie Kaufman of the New York Times. They have all contacted me regarding the release of papers from BEST today.

There’s only one problem: Not one of the BEST papers have completed peer review.

Nor has one has been published in a journal to my knowledge, nor is the one paper I’ve been asked to comment on in press at JGR, (where I was told it was submitted) yet BEST is making a “pre-peer review” media blitz.

One willing participant to this blitz, that I spent the last week corresponding with, is James Astill of The Economist, who presumably wrote the article below, but we can’t be sure since the Economist has not the integrity to put author names to articles:

The full article is here. Apparently, Astill has never heard of the UAH and RSS Global Temperature records, nor does he apparently know that all the surface temperature records come from one source, NCDC.

Now compare that headline and subtitle to this line in the article:

It will be interesting to see whether this makes it past the review process.

And, The Economist still doesn’t get it. The issue of “the world is warming” is not one that climate skeptics question, it is the magnitude and causes.

I was given a pre-release draft copy of one of the papers, related to my work as a courtesy. It contained several errors, some minor (such as getting the name of our paper wrong i.e. Fell et al in several places, plus a title that implied global rather than USA) some major enough to require revision (incorrect time period comparisons).

I made these errors known to all the players, including the journal editor, and the hapless Astill, who despite such concerns went ahead with BEST’s plan for a media blitz anyway. I was told by a BEST spokesperson that all of this was “coordinated to happen on October 20th”.

My response, penned days ago, went unheeded as far as I can tell, because I’ve received no response from Muller or the Journal author. Apparently, PR trumps the scientific process now, no need to do that pesky peer review, no need to address the errors with those you ask for comments prior to publication, just get it to press.

This is sad, because I had very high hopes for this project as the methodology is looked very promising to get a better handle on station discontinuity issues with their “scalpel” method. Now it looks just like another rush to judgement, peer review be damned.

Below is my response along with the draft paper from BEST, since the cat is publicly out of the bag now, I am not bound by any confidentiality requests. Readers should note I have not seen any other papers (there may be up to 4, I don’t know the BEST website is down right now) except the one that concerns me.

My response as sent to all media outlets who sent requests for comment to me:

===========================================================

In contradiction to normal scientific method and protocol, I have been asked to provide public commentary to a mass media outlet (The Economist) on this new paper. The lead author,  Dr. Richard Muller has released me from a previous request of confidentiality on the matter in a written communication on 10/14/2011. 10/15/2011 at 4:07PM PST in an email.  The paper in question is:

Earth Atmospheric Land Surface Temperature and  Station Quality [Tentative title, may have changed] by Muller et al 2011, submitted to the AGU JGR Atmospheres Journal, which apparently has neither completed peer review on the paper nor has it been accepted for publication by JGR.

Since the paper has not completed peer review yet, it would be inappropriate for me to publicly comment on the conclusions, especially in light of a basic procedural error that has been discovered in the methodology that will likely require a rework of the data and calculations, and thus the conclusions may also change. The methodology however does require comment.

The problem has to do with the time period of the data used, a time period which is inconsistent with two prior papers cited as this Muller et al paper being in agreement with. They are:

Fall et al (2011), Analysis of the impacts of station exposure on the U.S. Historical Climatology Network temperatures and temperature trends J. Geophys. Res.

http://pielkeclimatesci.files.wordpress.com/2011/07/r-367.pdf

and

Menne et al  (2010), On the reliability of the U.S. surface temperature record, J. Geophys. Res.

Both papers listed above (and cited by Muller et al) do an analysis over a thirty year time period while the Muller et al paper uses data for comparison from 1950 – 2010 as stated on lines 142-143:

“We calculated the mean temperature from 1950 to the present for each of these sites, and subtracted the mean of the poor sites from the OK sites.”

I see this as a basic failure in understanding the limitations of the siting survey we conducted on the USHCN, rendering the Muller et al paper conclusions highly uncertain, if not erroneous.

There is simply no way siting quality can be established as static for that long. The USHCN survey was based on photographs and site surveys starting in of 2007, plus historical metadata. Since the siting of COOP stations change as volunteers move, die, or discontinue their service, we know the record of siting stability to be tenuous over time. This is why we tracked only from 1979 and excluded stations whose locations were unknown prior to 2002. 1979 represented the practical limit of which we assumed we could reasonably ascertain siting conditions by our survey.

We felt that the further back the station siting changes occurred, the more uncertainty was introduced into the analysis, thus we limited meaningful comparisons of temperature data to siting quality to thirty years, starting in 1979.

Our ratings from surfacestations.org are assumed to be valid for the 1979 – 2008 period, but with Muller et all doing analysis from 1950, it renders the station survey data moot since neither Menne et al nor Fall et al made any claim of the station survey data being representative prior to 1979. The comparisons made in Muller et al are inappropriate because they are outside of the bounds of our station siting quality data set.

Also, by using a 60 year period, Muller et al spans two 30 year climate normals periods, thus further complicating the analysis. Both Menne et al and Fall et al spanned only one.

Because of the long time periods involved in Muller et al analysis, and because both Menne et al and Fall et al made no claims of knowing anything about siting quality prior to 1979, I consider the paper fatally flawed as it now stands, and thus I recommend it be removed from publication consideration by JGR until such time that it can be reworked.

For me to comment on the conclusions of Muller et al would be inappropriate until this time period error is corrected and the analysis reworked for time scale appropriate comparisons.

The Berkeley Earth Surface Temperature analysis methodology is new, and may yield some new and potentially important results on siting effects once the appropriate time period comparisons are made. I welcome the BEST effort provided that appropriate time periods are used that match our work. But, by using time period mismatched comparisons, it becomes clear that the Muller et al paper in its current form lost the opportunity for a meaningful comparison.

As I was invited by The Economist to comment publicly, I would recommend rejecting Muller et al in the current form and suggest that it be resubmitted with meaningful and appropriate 30 year comparisons for the same time periods used by the Menne et al and Fall et al cited papers. I would be happy to review the paper again at that time.

I also believe it would be premature and inappropriate to have a news article highlighting the conclusions of this paper until such time meaningful data comparisons are produced and the paper passes peer review. Given the new techniques from BEST, there may be much to gain from a rework of the analysis limited to identical thirty year periods used in Menne et al and Fall et al.

Thank you for your consideration, I hope that the information I have provided will be helpful in determining the best course of action on this paper.

Best Regards,

Anthony Watts

cc list: James Astill, The Economist, Dr. Joost DeGouw, JGR Atmospheres editor, Richard A. Muller, Leslie Kaufman, Ian Sample

===========================================================

Despite my concerns, The Economist author James Astill told me that “the issue is important” and decided to forge ahead, and presumably produced the article above.

Here is the copy of the paper I was provided by Richard Muller. I don’t know if they have addressed my concerns or not, since I was not given any follow up drafts of the paper.

BEST_Station_Quality (PDF 1.2 MB)

I assume the journalists that are part of the media blitz have the same copy.

I urge readers to read it in entirety and to comment on it, because as Dr. Muller wrote to me:

I know that is prior to acceptance, but in the tradition that I grew up in (under Nobel Laureate Luis Alvarez) we always widely distributed “preprints” of papers prior to their publication or even submission.  That guaranteed a much wider peer review than we obtained from mere referees.

Please keep it confidential until we post it ourselves.

They want it widely reviewed. Now that The Economist has published on it, it is public knowledge.

There might be useful and interesting work here done by BEST, but I find it troubling that they can’t wait for science to do its work and run the peer review process first. Is their work so important, so earth shattering, that they can’t be bothered to run the gauntlet like other scientists? This is post normal science at its absolute worst.

In my opinion, this is a very, very, bad move by BEST. I look forward to seeing what changes might be made in peer review should these papers be accepted and published.

==============================================================

UPDATE: Judith Curry, who was co-author to some of these papers, has a post on it here

Also I know that I’ll be critcized for my position on this, since I said back in March that I would accept their findings whatever they were, but that was when I expected them to do science per the scientific process.

When BEST approached me, I was told they were doing science by the regular process, and that would include peer review. Now it appears they have circumvented the scientific process in favor of PR.

For those wishing to criticize me on that point, please note this caveat in my response above:

The Berkeley Earth Surface Temperature analysis methodology is new, and may yield some new and potentially important results on siting effects once the appropriate time period comparisons are made. I welcome the BEST effort provided that appropriate time periods are used that match our work. But, by using time period mismatched comparisons, it becomes clear that the Muller et al paper in its current form lost the opportunity for a meaningful comparison.

Given the new techniques from BEST, there may be much to gain from a rework of the analysis limited to identical thirty year periods used in Menne et al and Fall et al.

My issue has to do with the lost opportunity of finding something new, the findings may agree, or they may be different if run on the same time periods. I think it is a fair question to ask since my peer reviewed paper (Fall et al) and NOAA’s (Menne et al) paper both used 30 year periods.

If BEST can run their comparison on the 30 year period for which our data is valid, instead of 60 years, as stated before, I’ll be happy to accept the results, whatever they are. I’m only asking for the correct time period to be used. Normally things like this are addressed in peer review, but BEST has blown that chance by taking it public first before such things can be addressed.

As for the other papers supposedly being released today, I have not seen them, so I can’t comment on them. There may be good and useful work here, but it is a pity they could not wait for the scientific process to decide that.

================================================================

UPDATE2: 12:08 PM BEST has sent out their press release, below:

The Berkeley Earth team has completed the preliminary analysis of the land surface temperature records, and our findings are now available on the Berkeley Earth website, together with the data and our code at

www.BerkeleyEarth.org/resources.php.

Four scientific papers have been submitted to peer reviewed journals, covering the following topics:

1. Berkeley Earth Temperature Averaging Process

2. Influence of Urban Heating on the Global Temperature Land Average

3. Earth Atmospheric Land Surface Temperature and Station Quality in the United States

4. Decadal Variations in the Global Atmospheric Land Temperatures

By making our work accessible and transparent to both professional and amateur exploration, we hope to encourage feedback and further analysis of the data and our findings.  We encourage every substantive question and challenge to our work in order to enrich our understanding of global land temperature change, and we will attempt to address as many inquiries as possible.

If you have questions or reflections on this phase of our work, please contact, info@berkeleyearth.org.  We look forward to hearing from you.

All the best,

Elizabeth

Elizabeth Muller

Founder and Executive Director

Berkeley Earth Surface Temperature

www.berkeleyearth.org

=========================================================

I’m still happy to accept the results, whatever they might be, all I’m asking for is an “apples to apples” comparison of data on the 30 year time period.

They have a new technique, why not try it out on the correct time period?

UPDATE4: Apparently BEST can’t be bothered to fix basic errors, even though I pointed them out, They can’t even get the name of our paper right:

http://berkeleyearth.org/Resources/Berkeley_Earth_Station_Quality

I sent an email over a week ago advising of the error in names, got a response, and they still have not fixed it, what sort of quality is this? Fell et all? right under figure 1

And repeated six times in the document they released today.

Sheesh. Why can’t they be troubled to fix basic errors? This is what peer review is for. Here’s my email from October 6th

—–Original Message—–
From: Anthony Watts- ItWorks
Date: Thursday, October 06, 2011 3:25 PM
To: Richard A Muller
Subject: Re: Our paper is attached
Dear Richard,
Thank you for the courtesy, correction:  Fell et al needs to be corrected to
Fall et al in several occurrences.
When we complete GHCN (which we are starting on now) we’ll have a greater
insight globally.
Best Regards,
Anthony Watts

Here is the reply I got from Dr. Muller

—–Original Message—–
From: Richard A Muller
Date: Friday, October 14, 2011 3:35 PM
To: Anthony Watts- ItWorks
Subject: Re: Our paper is attached
Anthony,
We sent a copy to only one media person, from The Economist, whom we trust to keep it confidential.  I sent a copy to you because I knew you would also keep it confidential.
I apologize for not having gotten back to you about your comments.  I particularly like your suggestion about the title; that is an improvement.
Rich
On Oct 14, 2011, at 3:04 PM, Anthony Watts- ItWorks wrote:
> Dear Richard,
>
> I sent a reply with some suggested corrections. But I have not heard back
> from you.
>
> Does the preprints peer review you speak of for this paper include sending
> copies to media?
>
> Best Regards,
>
>
> Anthony Watts

==========================================================

UPDATE 5: The Guardian writer Ian Samples writes in this article:

The Berkeley Earth project has been attacked by some climate bloggers, who point out that one of the funders is linked to Koch Industries, a company Greenpeace called a “financial kingpin of climate science denial“.

Reader AK writes at Judth Curry’s blog:

I’ve just taken a quick look at the funding information for the BEST team, which is:

Funded through Novim, a 501(c)(3) corporation, the Berkeley Earth Surface Temperature study has received a total of $623,087 in financial support.

Major Donors include:

– The Lee and Juliet Folger Fund ($20,000)

– William K. Bowes, Jr. Foundation ($100,000)

– Fund for Innovative Climate and Energy Research (created by Bill Gates) ($100,000)

– Charles G. Koch Charitable Foundation ($150,000)

– The Ann & Gordon Getty Foundation ($50,000)

We have also received funding from a number of private individuals, totaling $14,500 as of June 2011.

In addition to donations:

This work was supported in part by the Director, Office of Science, of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231 ($188,587)

So now (pending peer-review and publication) we have the interesting situation of a Koch institution, a left-wing boogy-man, funding an unbiased study that confirms the previous temperature estimates, “consistent with global land-surface warming results previously reported, but with reduced uncertainty.

The identities of the people involved with these two organizations can be found on their websites. Let the smirching begin.

The climate data they don't want you to find — free, to your inbox.
Join readers who get 5–8 new articles daily — no algorithms, no shadow bans.
0 0 votes
Article Rating
409 Comments
Inline Feedbacks
View all comments
Steve Garcia
October 20, 2011 9:13 pm

Muller: “I know that is prior to acceptance, but in the tradition that I grew up in (under Nobel Laureate Luis Alvarez) we always widely distributed “preprints” of papers prior to their publication or even submission. That guaranteed a much wider peer review than we obtained from mere referees.”
Is he serious? Peer review is BY THE REVIEWERS SELECTED BY THE JOURNAL, not by a wide number of scientists.
No. What he is saying is that he wants to spoil the objectivity of the possible reviewer population before the official reviewers get a chance to review it.
Shame on this man. He is a total insider in the politics of science (see his Wikipedia entry), and Anthony was set up to believe that he was going to be given a part in it as a pursuit of objective truth. It has been, and will quickly even more so become, a political stab in the back. Muller was never going to pull the rug out from under those the government and the U.N. favor.
Anthony, ya got taken, Dude.

FleshNotMachine
Reply to  Steve Garcia
October 20, 2011 9:37 pm

feet2thefire commented
Is he serious? Peer review is BY THE REVIEWERS SELECTED BY THE JOURNAL, not by a wide number of scientists.
On this Muller is right. Physicists (at least) have a long history of sending their pre-peer-reviewed papers around, to friends and colleagues and, in the last several decades, to many institutions as preprints. And they give seminars. Their work gets evaluated via these routes just as it does in peer-review.
It’s important to remember that a paper that passes peer review isn’t necessarily right. Peer review just means it isn’t obviously wrong, that it recognizes past work, and that it meets scholarly standards. Lots of peer reviewed papers published in journals are wrong. It is the community who ultimately decides on a paper’s correctness and worth, in a long, crooked process.

jc
October 20, 2011 9:17 pm

I am a relatively new observer to this “debate”. I am quite ready believe that AGW is occuring if there is clear evidence to support this, and if other possible variables are fully investigated and understood.
Over the past month I have not read posts here or elsewhere with the conceit that through my evaluation of the underlying science that I personally will be able to balance various factors to the extent that I can independently form a scientifically based view. Others feel they can and perhaps that is justified, or at least, their contribution may be significant in refining an understanding.
The above discussion has been inspired by an evaluation of the credibilty or otherwise of temperature data collection methodology. Some of the above seems to consist of repetition – perhaps inevitable given the format – and more or less mindless pointscoring associated with evasion and basic intellectual dishonesty. The balance gives the impression of sincerly seeking clarity, whether pursued coherently or not. There is no doubt that a forensic examination of any claims is obligitory to understanding anything.
What is striking to me about this “debate” is how easily the basic point is lost or obscured. The most basic question about this study is: why is it neccessary? How is it remotely possible that only now, after a quarter of a century of an escalating crisis, that this most rudimentary evaluation is occuring? How is it that it would seem to be occuring only because one man from outside the established scientific and educational institutions realized that what should have been automatic – the evaluation of any implications of modifications to data collection methodology – was not done? How can such a situation have possibly arisen? Billions upon billions of dollars have been expended on this area of inquiry. To ensure the capacity to accurately measure what is occuring is the very first priority. Without that not only is it impossible to know with certainty what is occuring, but it is also impossible to make progress on understanding what might be effecting this. This is not a requirement for science alone, it is the mechanism by which people utilize their intelligence. Scientific method is merely the formalization of this.
The point of this post is to urge all participants to always look at the basics first and to maintain an overview. Regardless of the legitimacy – or otherwise- of the above study this recent process should bring to centre stage the core question of why this was not done 25 years ago. This is beyond extraordinary, it is frightening.

October 20, 2011 9:22 pm

John Dodds – you’re kidding me, right? The exact same physics that describes how CO2 moderates temperature is responsible for ensuring that heat seeking rockets work right and that industrial CO2 lasers function. It’s all one and the same. You can’t have one without the other.

kadaka (KD Knoebel)
October 20, 2011 9:29 pm

From David Ball on October 20, 2011 at 8:50 pm:

Anthony (and anyone else planning to attend), I am hoping you will go into the Trenberth arena with eyes wide open. (…)

Dear Dr. Ball,
As I noted in a comment above, Anthony has stated the meeting was canceled.

Gordon
October 20, 2011 9:31 pm

“I’m prepared to accept whatever result they produce, even if it proves my premise wrong.“
— Anthony Watts

October 20, 2011 9:34 pm

FNM says:
“BEST provides all this. What are they lacking?”
Transparency. They purport to use fictinal data that was totally fabricated. Get a clue. Peruse the Harry_read_me file. You need to get up to speed on climate fraud. Check out the “Climategate” header.

FleshNotMachine
Reply to  dbstealey
October 20, 2011 11:17 pm

Smokey commented
Transparency. They purport to use fictional data that was totally fabricated.
Please specify exactly what data you claim was fabricated.

Brian H
October 20, 2011 9:36 pm

FleshNotMachine says:
October 20, 2011 at 8:28 pm
Smokey commented
-> “…and will commenters and posters here accept the BEST results if
-> the paper DOES pass peer-review?”
Absolutely… IF the results are per the scientific method, and the data is testable and verifiable.

The second is already satisfied, no? BEST has made their data and programs available on their Web site: http://berkeleyearth.org/dataset.php
So we only have to focus on your first point. Now, what does “per the scientific method” mean?

No, a thousand times NO! “testable and verifiable” does NOT mean “tested and verified”, merely that it is capable of being so. Peer-review is a first, minimal hurdle for “presentability”, and says little or nothing about validity. THAT comes later, from surviving all the slings and arrows the related scientific community can throw at it.
And, furthermore, passing peer review isn’t even a guarantee of minimum presentability. All sorts of junk that happens to stroke the reviewers’ prejudices and preconceptions makes it through. Time and well-crafted challenges are the only real “proof”. To use the pudding analogy, the scent is the peer review, but the replication and testing (and application) are the “eating”.

Not Chuck Collins
October 20, 2011 9:38 pm

Jim Spice says:
October 20, 2011 at 4:12 pm
“…I would accept their findings whatever they were…”
WHAT’S UP with the crossed fingers. HA HA HA HA HA HA HA HA HA HA HA HA! And I’ll use my real name.

Jim Spice is just the name you post to blogs under. Your real name is Chuck Collins. You and your wife Linda live in Wisconsin. You’re a dishonest leftist troll.

Chuck Nolan
October 20, 2011 9:59 pm

I understand that for we climate followers this is big news but, the public was already told the earth is warming, NY will be under water and the consensus is in. I don’t think this will be big news to the general folks. They have put CAGW behind them. The problem is Obama and the EPA. This administration wants another/new revenue stream providing more money to spend. Plus, they can just make up the value of the credits at will. (and they will)

James of the West
October 20, 2011 9:59 pm

It is their paper after all, so they can publish when they want and accept the consequences. If they are crazy enough to go public before basic reviews are completed and simple errors are corrected it will only reflect on them and their lack of attention to detail and needless haste – surely its no skin off anybody elses nose though.

October 20, 2011 10:14 pm

Muller used to be a card carrying “skeptic”. What happened to him?

October 20, 2011 10:49 pm

I always suspected that BEST was set up just to nullify the findings of your surfacestations project. A lot of effort was put into creeping down the historic temps and relocating stations to take full advantage of UHI, then you come along to upset the applecart. Something must be done, and the rapid response team doesn’t have time to waste.

October 20, 2011 10:53 pm

It is time to revisit some comments made March 31 – April 3, 2011 in WUWT Expect the Best, Prepare for the Worst. – Willis Eschenbach.
Part of the comments were objections to a “scalpel and suture” technique and how that might destroy the critical low frequency data in the records.
In Dr. Muller’s PR blitz, he writes this in a WSJ Opinion piece.

Many of the records were short in duration, and to use them Mr. Rohde and a team of esteemed scientists and statisticians developed a new analytical approach that let us incorporate fragments of records. …. Rather than try to correct for the discontinuities in the records, we simply sliced the records where the data cut off, thereby creating two records from one.

— Dr. Muller, WSJ Eur Oct. 21, 2011.
The word “Scalpel” does not do justice to this work. “Meat Ax” is more fitting.
Anyone with a passing knowledge of the Fourier domain must realize that the low frequency components of the temperature record spectrum are now gone and any low frequency content at the end is purely a fabrication of the analyst who somehow spliced them back together.
Dr. Muller wants peer review? Here it is. Take it to any signal processor. Take it to someone who knows information theory. If the technique is “simply” as his own words attest, it is a fatal flaw in the data processing.

Nylo
October 20, 2011 10:54 pm

The news is already in the spanish media. They are putting the following words in the mouth of Richard Muller:
“Mi esperanza es que este estudio sirva para convencer a los sanamente escépticos y a los que expresaron sus legítimas dudas”, admite ahora Richard Muller. “A los negacionistas no les vamos a convencer nunca: no les interesa la ciencia”.
Translated:
“My hope is that this study will be useful to convince those healthily sceptics and those who expressed legitimate doubts”, admits now Richard Muller. “We won’t be able to ever convince the negationists, they are not interested in science”.

Neil Jones
October 20, 2011 10:57 pm

BBC radio news has just implied that this paper “proves” that the CRU did not manipulate data to show warming.

pauline
October 20, 2011 11:10 pm

Whilst mutttering darkly at the TV last night, a programme called Climate Wars, my teenager asked a question which highlights well the problem, ‘mum, these guys are all experts and scientists, you aren’t, therfore they must be right and you are wrong’, I didn’t expect my teenager to be of this opinion as she has a curious mind, so what chance does the rest of the world stand when confronted with wall to wall AGW and other illusions from a body of people they see as experts and esteem as part of the scientific community. I did wonder what the fall out would be if and when these scientists are proved wrong.
Convincing said teenager that mum might be right for once was futile, hints and suggestion welcome

Phil
October 20, 2011 11:13 pm

From: Berkeley_Earth_Averaging_Process
The authors appear to criticize Folland et. al. 2001 (from Page 12):

We note that the correlation in the limit of zero distance, R(0) = 0.8802, has a natural and important physical interpretation. It is an estimate of the correlation that one expects to see between two typical weather monitors placed at the same location. By extension, if we assume such stations would report the same temperature except that each is subject to random and uncorrelated error, then it follows that 1 − R(0) = 12.0% of the non-seasonal variation in the typical station record is caused by measurement noise that is unrelated to the variation in the underlying temperature field. Since the average root-mean-square non-seasonal variability is ~2.0 C, it follows that an estimate of the short-term instrumental noise for the typical month at a typical station is ~0.47 C at 95% confidence. This estimate is much larger than the approximately 0.06 C typically used for the random monthly measurement error (Folland et al. 2001). Our correlation analysis suggests that such estimates may understate the amount of random noise introduced by local and instrumental effects.(emphasis added)

Although the authors do not reference Frank, Patrick 2011: Imposed and Neglected Uncertainty in the Global Average Surface Temperature Index, the following appears to touch on Frank 2011 (from page 12):

However, our correlation estimate would not generally include long-term biases that cause a station to be persistently too hot or too cold, and so the estimates are not entirely comparable.

Beginning on page 14, there is a section titled “Homogenization and the Scalpel.” This section is very important in that it distinguishes the methodology of this approach from prior approaches. There are two key points, however, that may affect the validity of this paper:

The first step, cutting records at times of apparent discontinuities, is a natural extension of our fitting procedure that determines the relative offsets between stations … as an intrinsic part of our analysis. We call this cutting procedure the scalpel. Provided that we can identify appropriate breakpoints, the necessary adjustment will be made automatically as part of the fitting process.(emphasis added)

There is no discussion of what effect there would be on the conclusions of the paper if the requirement of “identify(ing) appropriate breakpoints” is not met or only partially met.
Second, the authors, for this paper, did not use the “scalpel” to create breakpoints due to station moves and instrumentation changes. Given the importance which station moves and instrumentation changes have been given in the literature, the conclusions of this paper maybe should be marked with an asterisk. One could presume that this may be evidence of rushing the paper out to the public.
There is also a third point and it is one that the authors did not discuss when perhaps they should have done so. They discuss creating breakpoints unnecessarily, but they do not discuss the effect of missing breakpoints that are necessary.
They state that unnecessary breakpoints should be trend neutral, but they could amplify noise and increase the resulting uncertainty, but, it seems to me, that would not create overstatements in the conclusions. However, it has been well documented that missing necessary breakpoints, such as station moves and instrumentation changes, is not trend neutral and that, following the discussion in the paper, missing such breakpoints may result in underestimating uncertainty.
In the section on “Outlier Weighting,” a parameter is chosen (from page 17):

This choice of target threshold, 2.5e, is partly arbitrary but was selected with the expectation that most of the measured data should be unaffected.

A sensitivity analysis of this parameter might be a good idea to see how the final conclusions may be affected. In Steig, et al. 2009, the addition of an outlier filter changed a slight cooling trend in Antarctica in Comiso et al. 2000 to a slight warming trend. Much was made of this change.
In the section on “Reliability Weighting,” good record keeping is rewarded with greater weighting, but there is no discussion as to whether the better records are also representative. To illustrate using a perhaps unfair comparison, a station that is poorly sited with faulty instrumentation may be given a high weighting due to great diligence of the record keepers.
Normally, when an experiment is conducted, this does not become an issue, as location and calibration of sensors is done beforehand and data that is found to not comply with location and calibration standards is never used. In climate science, however, poor adherence to standards or lack of performing the most minimal quality assurance does not generally mean that such compromised data is disregarded. Indeed on page 18, the authors state:

Due to the limits on outliers from the previous section, the station weight has a range between 1/13 and 2, effectively allowing a “perfect” station record to receive up to 26 times the weight of a “terrible” record.(emphasis added)

It cannot be emphasized enough that “perfect” does not mean that the station is well sited or that the instrumentation is properly calibrated or that recalibrations are or were done periodically. In fact, prior to 1979, there is essentially no reliable information as to siting quality and there is little to no information on instrumentation calibration for all years except probably at airport stations. On page 19, the authors state:

Implicit in the discussion of station reliability considerations are several assumptions. First, we assume that the local weather function constructed from many station records, (the “weather” term), will be a better estimate of the local temperature than any individual record could be. This assumption is generally characteristic of all averaging techniques; however, we can’t rule out the possibility of large-scale systematic biases. Our reliability adjustment techniques can work well when one or a few records are noticeably inconsistent with their neighbors, but large scale biases affecting many stations could cause such comparative estimates to fail. Second, we assume that the reliability of a station is largely invariant over time. This will in general be false; however, the scalpel procedure discussed previously will help us here (unless necessary breakpoints are missed). (emphasis and comment added)

Again, the inability to rule out “large-scale systematic biases” seems to touch on Frank 2011. The authors should discuss Frank 2011 even if they disagree with that paper, as it forms part of the relevant literature.
In the section titled “Uncertainty Analysis,” the authors state on page 20:

In general, it is impossible to directly quantify structural uncertainties, and so they are not a factor in our standard uncertainty model. However, one may be able to identify model limitations by drawing comparisons between the results of the Berkeley Average and the results of other groups (but not if they all have similar faults). (comment added)

The lack of controls in climate science cannot be emphasized enough. Comparing (potentially bad) data to other (potentially bad) data may not result in conclusions that are as robust as they seem.
On page 31, the authors state:

In previous work, it has been argued that instrumentation changes may have led to an artificial warm bias in the early 1900s (Folland et al. 2001, Parker 1994). To the degree that our reconstruction from that era is systematically lower than prior work (Figure 8), it could be that our methods are more resistant to biases due to those instrumental changes (but the authors excluded instrumentation changes). (comment added)

Did the authors forget that, on page 16, they stated that they had disregarded instrumentation changes when using the “scalpel”?
In the data file (I downloaded the *.txt files, but I presume the Matlab files are identical), there are approximately 385 stations without valid latitude or longitude values (stations 113227 through 113612), yet these same stations appear to have valid temperature data. It is not clear how the authors used these stations in their methodology, as valid latitudes and longitudes would seem to be a requirement.
_________________________________________________________________________
Anthony: I couldn’t find a link to Frank 2011. Would it be possible to include it, if you have it?

October 20, 2011 11:26 pm

FleshNotMachine says:
“Please specify exactly what data you claim was fabricated.”
Years of temperature data were FABRICATED:
http://img19.imageshack.us/img19/555/it6i9ciu3h.jpg

Richard G
October 20, 2011 11:30 pm

steven mosher says:
October 20, 2011 at 2:37 pm
“So what are the GOOD arguments within the science?
C02 warms the planet, the question is how much. That’s the real debate. join it”
________________________________
Correction: C02 FEEDS the planet, the question is how much. That’s the real debate. join it.
That’s better. Feed me Seymore.

P. Solar
October 20, 2011 11:37 pm

this will probably never get noticed in all the hubub and 250+ posts but the real problem with their choice of period is the old trough to peak trick.
http://tinypic.com/r/10za9zq/7
Here are the 50 year trends in global temp, periods starting in the year shown. Pick your start year according to the result you want to show. The innocent and seemingly objective choice of “the last 50 years” is precisely the trough to peak of the dominant variation in the record. What they have measured with such care has more to do with natural cycles than the current non cyclic trend.
What Joe public will take home from all this is the OMG! one degree is 50 years, it’s worse than we thought. I mean, that’s 2C per century !!!
Anthony, It’s not just compatibility with your paper that is at issue here. Whatever the merits of their stats methods, the choice of period is far from objective.

October 20, 2011 11:52 pm

FleshNotMachine says:
October 20, 2011 at 9:10 pm
“It’s BEST’s project — they can do it for whatever period they want, and their results apply accordingly. If anyone else, like you or Anthony, wants a different time period, they should do it themselves and publish their own paper. BEST already provides a framework to start from.”
Nope! Anthony’s surfacestations project provided the framework, and the challenge to the falsified temp records. BEST is a response to that, so it’s up to Muller to apply the appropriate time period and not resort to cheating.

FleshNotMachine
Reply to  Slacko
October 21, 2011 12:22 am

Slacko commented
Nope! Anthony’s surfacestations project provided the framework, and the challenge to the falsified temp records. BEST is a response to that, so it’s up to Muller to apply the appropriate time period and not resort to cheating.
BEST analyzed *more* data than Anthony suggested, not less. On a Venn diagram, Anthony’s dataset of concern is a subset fully contained within the dataset BEST analyzed. How is this an objection, even theoretically?
REPLY: Gosh you are dense. Nobody knows, not you, me, NOAA, BEST, or anyone else what the siting quality of a weather station was 60 years ago, when the surfacestations.org project only guaranteed data 30 years. You can’t make up metadata where there is none, even NOAA’s Menne et al knew not to do this and also used a 30 year period. BEST chose 60 years for their rush to judgement trial by compliant media paper, in contradiction to two peer reviewed papers that used 30 year data. The error BEST made is a fatal one, that should be addressed in peer review, but of course that’s all moot now since they achieved their goal of bloviating all over the world media without that pesky peer review getting in the way.
BTW you’ve changed your previous handle used here in violation of published site policy, so in the troll bin you go.
-Anthony

gbaikie
October 20, 2011 11:52 pm

“Convincing said teenager that mum might be right for once was futile, hints and suggestion welcome.”
Tell kid to study the science necessary to show that you are wrong.
Kind of obvious.
It should great motivation for kid to do some study.

kadaka (KD Knoebel)
October 20, 2011 11:54 pm

Question:
In Update #2, coming from a press release, it says:

The Berkeley Earth team has completed the preliminary analysis of the land surface temperature records, and our findings are now available on the Berkeley Earth website, together with the data and our code at
http://www.BerkeleyEarth.org/resources.php.

(Did wordpress try to make that a full URL like it’s done with name-dot-com becoming http://name-dot-com?)
They say that’s where the data and code is found, yet the actual link to that name is:
http://berkeleyearth.us1.list-manage1.com/track/click?u=d8548cad5e5305433c810b0d4&id=69f78b51e4&e=08079d0c8f
The http://www.berkeleyearth.org/resources.php address appears to work. So why the trickery with the press release link?

FleshNotMachine
October 20, 2011 11:59 pm

Smokey commented
-> “Please specify exactly what data you claim was fabricated.”
Years of temperature data was <i.FABRICATED:
http://img19.imageshack.us/img19/555/it6i9ciu3h.jpg

It is far from clear that this excerpt means he fabricated data.
The entire HARRY_READ_ME file is here: http://www.anenglishmanscastle.com/HARRY_READ_ME.txt
(or rather, we assume it’s the file, and it hasn’t been altered, and it is the same as taken from CRU computers).
Scroll down to where “Harry” writes:
“Here, the expected 1990-2003 period is MISSING – so the correlations aren’t so hot! Yet
the WMO codes and station names /locations are identical (or close). What the hell is
supposed to happen here? Oh yeah – there is no ‘supposed’, I can make it up. So I have 🙂 ”
That, and what follows in the file, more likely indicates that the programmer is making up a *method* to handle the missing data correlating the two stations — not that he made up data.
So he decides to give the user of his program one of three choices:
“1. Match them after all.
2. Leave the existing station alone, and discard the update.
3. Give existing station a false code, and make the update the new WMO station.”
Is there any indication or evidence of which of these three choices users of his code made in every instance of missing data?
Or if they made any of them?
Or if his code was ever used at all?
Or if the code was used and the user(s) made a choice other than #2, how this affects the final results.
So there is no evidence anyone “fabricated data” — you have jumped to an unwarranted conclusion.

October 21, 2011 12:46 am

FnM, the meaning of data (observations) in science is established by a falsifiable physical theory, only. Climate models are the representation of the physical theory of climate. Only they, supposing they were falsifiable, can establish the causal meaning of the evidence of recent warming.
However, they are not falsifiable and not capable of doing so. They are artificially tuned to produce 20th century climate; their physical uncertainty limits have never been established; even the discussion of errors deep in the back of the IPCC ARs show climate models cannot resolve any climate process at 10 W/m^2 resolution, much less 1 W/m^2. Therefore, the observation of a warmer climate cannot be assigned to human-produced CO2 or to any other specific cause.
What you call “evidence” of AGW is merely guilt by tendentious assignment.

Jeff Grantham
Reply to  Pat Frank
October 21, 2011 9:48 pm

Pat Frank wrote:
FnM, the meaning of data (observations) in science is established by a falsifiable physical theory, only. Climate models are the representation of the physical theory of climate. Only they, supposing they were falsifiable, can establish the causal meaning of the evidence of recent warming.
This is untrue.
Harries et al Nature 2001 did an analysis of actual data recorded by instruments. They found that less heat is escaping earth from 1979 to 1997, and that the deficit occurs at the wavelengths where CO2 and methane absorb radiation.
Their measurements do not rely on climate models, or involve them in any way.
Why aren’t these measurements evidence of anthropogenic global warming?

1 8 9 10 11 12 15