CRUTEM3 "…code did not adhere to standards one might find in professional software engineering"

Those of us who have looked at GISS and CRU code have been saying this for months. Now John Graham-Cumming has posted a statement with the UK Parliament about the quality and veracity of CRU code that has been posted, saying “they have not released everything”.

I found this line most interesting:

“I have never been a climate change skeptic and until the release of emails from UEA/CRU I had paid little attention to the science surrounding it.”

Here is his statement as can be seen at:

http://www.publications.parliament.uk/pa/cm200910/cmselect/cmsctech/memo/climatedata/uc5502.htm

=================================

Memorandum submitted by John Graham-Cumming (CRU 55)

I am writing at this late juncture regarding this matter because I have now seen that two separate pieces of written evidence to your committee mention me (without using my name) and I feel it is appropriate to provide you with some further information. I am a professional computer programmer who started programming almost 30 years ago. I have a BA in Mathematics and Computation from Oxford University and a DPhil in Computer Security also from Oxford. My entire career has been spent in computer software in the UK, US and France.

I am also a frequent blogger on science topics (my blog was recently named by The Times as one of its top 30 science blogs). Shortly after the release of emails from UEA/CRU I looked at them out of curiosity and found that there was a large amount of software along with the messages. Looking at the software itself I was surprised to see that it was of poor quality. This resulted in my appearance on BBC Newsnight criticizing the quality of the UEA/CRU code in early December 2009 (see http://news.bbc.co.uk/1/hi/programmes/newsnight/8395514.stm).

That appearance and subsequent errors I have found in both the data provided by the Met Office and the code used to process that data are referenced in two submissions. I had not previously planned to submit anything to your committee, as I felt that I had nothing relevant to say, but the two submissions which reference me warrant some clarification directly from me, the source.

I have never been a climate change skeptic and until the release of emails from UEA/CRU I had paid little attention to the science surrounding it.

In the written submission by Professor Hans von Storch and Dr. Myles R. Allen there are three paragraphs that concern me:

“3.1 An allegation aired on BBC’s “Newsnight” that software used in the production of this dataset was unreliable. It emerged on investigation that the neither of the two pieces of software produced in support of this allegation was anything to do with the HadCRUT instrumental temperature record. Newsnight have declined to answer the question of whether they were aware of this at the time their allegations were made.

3.2 A problem identified by an amateur computer analyst with estimates of average climate (not climate trends) affecting less than 1% of the HadCRUT data, mostly in Australasia, and some station identifiers being incorrect. These, it appears, were genuine issues with some of the input data (not analysis software) of HadCRUT which have been acknowledged by the Met Office and corrected. They do not affect trends estimated from the data, and hence have no bearing on conclusions regarding the detection and attribution of external influence on climate.

4. It is possible, of course, that further scrutiny will reveal more serious problems, but given the intensity of the scrutiny to date, we do not think this is particularly likely. The close correspondence between the HadCRUT data and the other two internationally recognised surface temperature datasets suggests that key conclusions, such as the unequivocal warming over the past century, are not sensitive to the analysis procedure.”

I am the ‘computer analyst’ mentioned in 3.2 who found the errors mentioned. I am also the person mentioned in 3.1 who looked at the code on Newsnight.

In paragraph 4 the authors write “It is possible, of course, that further scrutiny will reveal more serious problems, but given the intensity of the scrutiny to date, we do not think this is particularly likely.” This has turned out to be incorrect. On February 7, 2010 I emailed the Met Office to tell them that I believed that I had found a wide ranging problem in the data (and by extension the code used to generate the data) concerning error estimates surrounding the global warming trend. On February 24, 2010 the Met Office confirmed via their press office to Newsnight that I had found a genuine problem with the generation of ‘station errors’ (part of the global warming error estimate).

In the written submission by Sir Edward Acton there are two paragraphs that concern the things I have looked at:

“3.4.7 CRU has been accused of the effective, if not deliberate, falsification of findings through deployment of “substandard” computer programs and documentation. But the criticized computer programs were not used to produce CRUTEM3 data, nor were they written for third-party users. They were written for/by researchers who understand their limitations and who inspect intermediate results to identify and solve errors.

3.4.8 The different computer program used to produce the CRUTEM3 dataset has now been released by the MOHC with the support of CRU.”

My points:

1. Although the code I criticized on Newsnight was not the CRUTEM3 code the fact that the other code written at CRU was of low standard is relevant. My point on Newsnight was that it appeared that the organization writing the code did not adhere to standards one might find in professional software engineering. The code had easily identified bugs, no visible test mechanism, was not apparently under version control and was poorly documented. It would not be surprising to find that other code written at the same organization was of similar quality. And given that I subsequently found a bug in the actual CRUTEM3 code only reinforces my opinion.

2. I would urge the committee to look into whether statement 3.4.8 is accurate. The Met Office has released code for calculating CRUTEM3 but they have not released everything (for example, they have not released the code for ‘station errors’ in which I identified a wide-ranging bug, or the code for generating the error range based on the station coverage), and when they released the code they did not indicate that it was the program normally used for CRUTEM3 (as implied by 3.4.8) but stated “[the code] takes the station data files and makes gridded fields in the same way as used in CRUTEM3.” Whether

3.4.8 is accurate or not probably rests on the interpretation of “in the same way as”. My reading is that this implies that the released code is not the actual code used for CRUTEM3. It would be worrying to discover that 3.4.8 is inaccurate, but I believe it should be clarified.

I rest at your disposition for further information, or to appear personally if necessary.

John Graham-Cumming

March 2010

0 0 votes

Article Rating

162 Comments

Inline Feedbacks

View all comments

RockyRoad

March 4, 2010 8:03 am

The computer code is the most important aspect of climate science and the least available. Why am I not surprised?

Craig Moore

March 4, 2010 8:04 am

“I rest at your disposition for further information, or to appear personally if necessary.”
Careful there. They still have those river level chambers in the bottom of the Tower of London.

Al Gore's Brother

March 4, 2010 8:08 am

hmmm. Interesting…

JonesII

March 4, 2010 8:09 am

This is very important:
it appeared that the organization writing the code did not adhere to standards one might find in professional software engineering
What a big surprise!

Robert M

March 4, 2010 8:12 am

OT, but a good read if you are not a Gore or NYT fan…
http://bigjournalism.com/kashiver/2010/02/28/former-veep-goes-girly-man-has-hissy-fit-in-pages-of-new-york-times/

Mhmm

March 4, 2010 8:13 am

…of course.
It’s slowly coming out. CRU, NASA, MET…collusion at it’s worst form.

John Graham-Cumming

March 4, 2010 8:20 am

Your headline is slightly inaccurate because I don’t say that the code for CRUTEM3 is ‘below standards’, I was referring to other code at CRU.

TerryBixler

March 4, 2010 8:22 am

All this computer code stuff baffles these ‘high level reviewers’. Why would anyone ever use version control when we all know these experts get it right on the first try. A better set of questions is why do they not have some software experts on their review board. No version control poor documentation within the code is criminal, especially with trillions of dollars hanging on every amateur bug in the code. Why has no one asked about version control for the data? Code and data go hand in hand. Amateurs one and all. Yes I have looked at the code and yes I have 45 years as a professional programmer.

Gary

March 4, 2010 8:23 am

Will we be seeing volume II in the Climategate series?

John in L du B

March 4, 2010 8:30 am

So Dr. Graham-Cummings has a BA in Matematics and Computation and a doctorate in Computer Security, has been writing software for 30 years but still only manages, in von Storch and Allen’s opinions, to qualify as an “amateur computer analyst”.
Are they “amature” climate scientists”?

P Gosselin

March 4, 2010 8:31 am

Somewhat OT, more of the same crap – but a different pile:
http://nofrakkingconsensus.blogspot.com/

Michael Ozanne

March 4, 2010 8:31 am

JonesII (08:09:18) :
“This is very important:
it appeared that the organization writing the code did not adhere to standards one might find in professional software engineering
What a big surprise!”
I have to tell you in all candour that it isn’t that straightforward to get professional software engineering outfits to “adhere to standards” …..:-)

hotrod ( Larry L )

March 4, 2010 8:31 am

Is there a UK based software developers professional association, or similar group?
I would think that if so they should be interested in commenting on the quality of the code and limitations of the software. If silent they are by implication endorsing the shoddy coding practices (no version control, poor documentation etc.) in code which governments depend on for major policy decisions.
Same question applies for other national, and international software developer organizations and related specialties.
The professional associations of librarians and archivists, should also be commenting on lack of proper archiving of materials and versions. If someone wanted to write a forensic history of climate model coding practices used in the late 20th century and early 21st century they would be trying to pull information out of a black hole.
Larry

stan

March 4, 2010 8:36 am

As AJ Strata pointed out, you never let the PhDs write the code if the project is important. People get hurt. If it’s important, you get people who actually have a clue what they are doing to write the code.

Jean Parisot

March 4, 2010 8:36 am

“I am a professional computer programmer who started programming almost 30 years ago. I have a BA in Mathematics and Computation from Oxford University and a DPhil in Computer Security also from Oxford. My entire career has been spent in computer software in the UK, US and France.” Amateur, a true professional would work for the Government or a University. /sarc

steve parker

March 4, 2010 8:37 am

This is why climategate is so important.Clever people like this guy are taking an intrest and ripping the warmists to shreds.Wonderful!

CodeTech

March 4, 2010 8:42 am

So… to be a “climatologist” one requires knowledge of:
1. Computer programming
2. Chemistry
3. Physics
4. Statistics
However, failure to have knowledge of or skills in some or any of these disciplines is easily overlooked if you have a “green” attitude and mindset, and are capable of writing suitably alarming predictions. Also the ability to procure grants and scare children is a genuine asset.

Steve Koch

March 4, 2010 8:43 am

Why not bring in professional software auditors to do a thorough review of the CRU software methodology and conformance to that methodology? If the CRU software is not even under version control, that pathetic beyond belief.

ShrNfr

March 4, 2010 8:44 am

To the extent that the code was written by grad students, I would not expect professional quality code. You should see some of the code written by Professors sometime too. My wife works processing Voyager data at MIT among other projects. If you want a strange and obscure code written over a couple of decades, then go look at that set of programming. But the work is open and available in the scientific tradition.
But the code must be available to see if it is valid. It can be written in a way that is not up to “current standards”, but implements the correct stuff.
I give CRU a little slack here bad programming style. But none for their failure to release it so others can see what they did.

Henry chance

March 4, 2010 8:45 am

Surprise.
I read the code and also saw problems. What shall we do? Jones really doesn’t want this out either.
“It is standard for scientists to manipulate and hide their code”

rbateman

March 4, 2010 8:45 am

Now I have to wonder if there is software at USHCN or GHCN that stomps over real data and punches artificial holes in it where no holes should be.
Or maybe it is scrambling the data, or it could be the overuse of tape media that left corruption in the stored data.
I’d like to get with you, E.M.Smith, if you are about. I have some very interesting anomalies in the data that you’d be good at figuring out the pattern to.

Alexander

March 4, 2010 8:47 am

Professor Acton seems to be quite economical with verifiable and accurate statements of fact! He is quite adept at digging deeper holes, though.

mpaul

March 4, 2010 8:49 am

I suspect John will get a very polite reply, something along the lines of: “While the subject matter at hand is of the utmost importance to the very survival of the human race, we find the your submission was post marked after the final date for submissions and therefore cannot be considered.”

Richard Drury

March 4, 2010 8:49 am

“The computer code is the most important aspect of climate science and the least available.”
It could be worse. What if the code that we have already is all that is available?

Al Gore Rhythm

March 4, 2010 8:54 am

I invented teh Internets and teh computer programs mathematical models as well.
Fiddle the figures
In tune with Al Gore
Now you got
The Al Gore Rhythm