The “Statisticians: ‘Global Cooling’ a Myth” story

By William M. Briggs, professional statistician

Your statistical model!

J’accuse! A statistician may prove anything with his nefarious methods. He may even say a negative number is positive! You cannot trust anything he says.”

Sigh. Unfortunately, this oft-hurled charge is all too true. I and my fellow statisticians must bear its sad burden, knowing it is caused by our more zealous brethren (and sisthren). But, you know, it really isn’t their fault, for they are victims of loving not wisely but too well their own creations.

First, a fact. It is true that, based on the observed satellite data, average global temperatures since about 1998 have not continued the rough year-by-year increase that had been noticed in the decade or so before that date. The temperatures since about 1998 have increased in some years, but more often have they decreased. For example, last year was cooler than the year before last. These statements, barring unknown errors in the measurement of that data, are taken as true by everybody, even statisticians.

Th AP gave this data—concealing its source—to “several independent statisticians” who said they “found no true temperature declines over time” (link)

How can this be? Why would a statistician say that the observed cooling is not “scientifically legitimate”; and why would another state that noticing the cooling “is a case of ‘people coming at the data with preconceived notions’”?

Are these statisticians, since they are concluding the opposite of what has been observed, insane? This is impossible: statisticians are highly lucid individuals, its male members exceedingly handsome and charming. Perhaps they are rabid environmentalists who care nothing for truth? No, because none of them knew the source of the data they were analyzing. What can account for this preposterous situation!

Love. The keen pleasures of their own handiwork. That is, the adoration of lovingly crafted models.

Let me teach you to be a classical statistician. Go to your favorite climate site and download a time series picture of the satellite-derived temperature (so that we have no complications from mixing of different data sources); any will do. Here’s one from our pal Anthony Watts.

Now fetch a ruler—a straight edge—preferably one with which you have an emotional attachment. Perhaps the one your daughter used in kindergarten. The only proviso is that you must love the ruler.

Place the ruler on the temperature plot and orient it along the data so that it most pleases your eye. Grab a pencil and draw a line along its edge. Then, if you can, erase all the original temperature points so that all you are left with is the line you drew.

If a reporter calls and asks if the temperature was warmer or colder last year, do not use the original data, which of course you cannot since you erased it, but use instead your line. According to that very objective line the temperature has obviously increased. Insist on the scientificity of that line—say that according to its sophisticated inner-methodology, the pronouncement must be that the temperature has gone up! Even though, in fact, it has gone down.

Don’t laugh yet, dear ones. That analogy is too close to the truth. The only twist is that statisticians don’t use a ruler to draw their lines—some use a hockey stick. Just kidding! (Now you can laugh.) Instead, they use the mathematical equivalent of rulers and other flexible lines.

Your ruler is a model Statisticians are taught—their entire training stresses—that data isn’t data until it is modeled. Those temperatures don’t attain significance until a model can be laid over the top of them. Further, it is our credo to, in the end, ignore the data and talk solely of the model and its properties. We love models!

All this would be OK, except for one fact that is always forgotten. For any set of data, there are always an infinite number of possible models. Which is the correct one? Which indeed!

Many of these models will say the temperature has gone down, just as others will say that it has gone up. The AP statisticians used models most familiar to them; like “moving averages of about 10 years” (moving average is the most used method of replacing actual data with a model in time series); or “trend” models, which are distinct cousins to rulers.

Since we are free to choose from an infinite bag, all of our models are suspect and should not be trusted until they have proven their worth by skillfully predicting data that has not yet been seen. None of the models in the AP study have done so. Even stronger, since they said temperatures were higher when they were in fact lower, they must predict higher temperatures in the coming years, a forecast which few are making.

We are too comfortable with this old way of doing things. We really can prove anything we want with careful choice of models.

0 0 votes
Article Rating

Discover more from Watts Up With That?

Subscribe to get the latest posts sent to your email.

132 Comments
Inline Feedbacks
View all comments
Tenuc
October 28, 2009 9:39 am

Thanks for an enjoyable read William, and good to see the truth coming from the horses mouth… 🙂
However, it’s worse than that. Temperature is the outcome of a set of interlinked dynamic chaotic processes, where trends mean nothing. Global average temperature is an even more ridiculous and meaningless measure, as only the total energy balance sheet of Earth’s climate system that has any meaning. Unfortunately we have no method of accurately measuring that yet, nor will we be able to do it for the foreseeable future.
Currently all predictive climate science is a sham.

Rog
October 28, 2009 9:42 am

Come on. You guys are reading too much into this. We all know that 69.43% of all statistics are made up on the spot. *;)

October 28, 2009 9:43 am

Psychologists did a study of statisticians and found that their attitude, on average, was mean.

Tenuc
October 28, 2009 9:44 am

OT
For those following the Science Museums support Copenhagen pole, the results have changed drastically yet again.
* 764 counted in so far
* 5223 counted out so far
Curiouser and curiouser!!!
Poll is open until December – link below if you want to vote:-
http://www.sciencemuseum.org.uk/proveit.aspx

October 28, 2009 9:52 am

Outstanding !!!!!

stan
October 28, 2009 9:53 am

Good stuff Briggs. This exercise by the AP appears to be just another version of the bogus Rahmstorf exercise of a few years ago (“it’s even worse than we thought!”). The real fun is to contemplate what the AP is thinking by going to the statisticians in the first place. Apparently, regular people are not capable of looking at two numbers and discerning which is the higher number. Only with a doctorate in statistics and a resume replete with publications is it possible to compare numbers.
In kindergarten, children are asked to identify which number is larger. Most handle the task with ease. Apparently, none of those who managed the task are employed by the AP.

Philip T. Downman
October 28, 2009 9:53 am

Hmm..don’t get nervous now. Cooling might not be so impressive, there is a big sunspot, arctic temperature refuse to dip, ice extent threatens to lag behind.
Things don’t go the climate sceptics way in the short perspective.
It can’t do every moment.
Monty Python jokes and even raljant and witty articles like the above seems to me a sign of desperation.
There is no need yet. Just wait and see and save the jokes till AGW is indisputably discredited.

Chris
October 28, 2009 9:56 am

As I have said before, borenstein is the worst. It’s not what he writes, but what he leaves out. In essense, he only writes half the story. Of course, he knows what he is doing, thus the reason for him being the worst newspaper writer out there today.

October 28, 2009 9:57 am

The AP article makes a rather stunning error of terminology in it’s headline. “Statisticians Reject Global Cooling”. This implies that the data was given to them, not without any information, as Borenstein claims but with the idea given to them that they should test it for negative trends as a null hypothesis. Worse than that, it would be impossible, if they were doing things right, for them to have “rejected” cooling in the last twelve years-that’s what RSS UAH and Hadley show!-this is a terminological inexactitude-the statisticians actually failed to reject warming. Why didn’t anyone think to test against the IPCC projections instead? .2 degrees per decade is probably easy to reject after a dozen years with none.

October 28, 2009 10:00 am

I wonder what these statisticians would have done if for each global annual average they were also given a calculated estimate of varience or if they were given global monthly averages with varience? Also, since they assumed the deviation from the streight line based on ten points was random error ( which it is not) what do the confidence limits on a true line look like and with what confidence can they declare either warming or cooling?

October 28, 2009 10:00 am

When I was in college, I had a stats class and we were taught how to make the stats say what we want them to say. Here is an example of what I mean.
The population of City A grew to 200% of the original. The population of City B grew to 105%. Without any more knowledge, which do you think gained the most people? However, what if the number of people in City A was 10 and grew to 20 but the number of people in City B was 100,000 and grew to 105,000.
Lawyers use this kind of stuff to manipulate juries. You change the scope or you use selective data to get the desired impression. For instance, in the above example, I can chose varying years to compare the two cities and only choose the ones that give the impression that I desire. It is very easy to make stats mislead, even when it is completely true. You better believe there are statisticians out there who are paid to make the numbers mislead.

Adam Gallon
October 28, 2009 10:03 am

Very good. What are the good Statistician’s views on Rahmstorf’s triangular filtering and other “innovative” methods used to “prove” the full horrors of AGW?

Reed Coray
October 28, 2009 10:09 am

An enjoyable read.

jorgekafkazar
October 28, 2009 10:12 am

Sean (09:27:25) : “What’s the best kind of ruler to use when you want to draw a roller coaster?”
A UC Twit of the Century nutter. Long live the Queen!

Bernie
October 28, 2009 10:12 am

Matt:
My guess is that it is Borenstein that really needs to read this clearly stated explanation. But perhaps he already knew exactly what the statisticians would have to say given his poorly articulated charge.
I do think it would be helpful to clarify that most good looking statisticians would be loathe to undertake such a task without knowing what the data represented at since the model chosen should have some relationship to the physical, social or behavioral processes represented by the data.

MartinGAtkins
October 28, 2009 10:13 am

Just a reminder to you folks. The data given to the researchers was with the satellite data removed.
However, data from the National Oceanic and Atmospheric Administration and NASA show 2005 has topped 1998. Published peer-reviewed scientific research generally cites temperatures measured by ground sensors, which are from NOAA, NASA and the British, more than the satellite data.
So they doctored the data especially for the article to get the result they wanted before Copenhagen. This is the data used.
ftp://ftp.ncdc.noaa.gov/pub/data/anomalies/monthly.land_ocean.90S.90N.df_1901-2000mean.dat
Obviously this:-
http://wattsupwiththat.files.wordpress.com/2009/10/junkyard_mmts_org.jpg
Is superior to this:-
http://wattsupwiththat.files.wordpress.com/2008/03/noaa-n_satellite.jpg

ralphzillo
October 28, 2009 10:13 am

There’s quite a few models that I would like to draw…

Symon
October 28, 2009 10:24 am

All cousins are distinct, but some are distant?

Gene Nemetz
October 28, 2009 10:25 am

Gary (08:47:51) :
“Seth Borenstein told me he got the data that he gave to the statisticians from Dr. Christy. Wouldn’t Dr. Christy be able to tell us exactly what it was?”
Seth Borenstein used NOAA and NASA data for his conclusion. He didn’t use all data sets from around the world. He did mention one set, but didn’t use that set, that shows cooling in the earth, to make the conclusion of his article.
The NASA set, i.e., GISTemp, is the data that James Hansen distributes. NOAA data is government data. There are problems with both entities.

TJA
October 28, 2009 10:28 am

“What’s the best kind of ruler to use when you want to draw a roller coaster?”
A mobius ruler

DonK31
October 28, 2009 10:28 am

OT: For the sake of argument, the latest “Prove It” numbers from the British Science Museum are 767 in and 5239 out; More than 87% out.
I think that I can safely say than they won’t post another online poll.

Gene Nemetz
October 28, 2009 10:29 am

Temperatures rose in the 20th Century.
Stamp prices rose in the 20th Century.
Stamps caused temperatures to rise.
FedEx and UPS are doing fine. The problem comes from the Post Office.

Gene Nemetz
October 28, 2009 10:31 am

ralphzillo (10:13:12) :
I’ve dated a few. They’re not like people think. 🙁
But ya, great to look at.

Mark Fawcett
October 28, 2009 10:35 am

Great article.
Sorry for the OT – what’s happening back at the science museum “prove it” survey? It’s now (5:35 PM GMT) reading 771 in, 5249 out?
Cheers
Mark

Jeff
October 28, 2009 10:37 am

Here’s a question that will immediately reveal my complete lack of knowledge of anything in climate science. Why do we use the temperature anomaly? Why not the man global temp? Based on the above graph, if I understand what is being graphed, it is indeed continuing to warm, but at a much slower rate than in the past. The 0.1 degree anomaly means that we are still higher than the average over the past x years, right? Or am I missing something?
Of course, if we just looked at temperature it would be easier. That’s one of my complaints about this whole field of climate science. We build models based on data that is not direct experimental data, but either derived from models of proxies or based on models of actual data with n layers of indirection before we get to direct measurement.