Distribution analysis suggests GISS final temperature data is hand edited – or not

UPDATE: As I originally mentioned at the end of this post, I thought we should “give the benefit of the doubt” to GISS as there may be a perfectly rational explanation. Steve McIntyre indicates that he has done an analysis also and doubts the other analyses:

I disagree with both Luboš and David and don’t see anything remarkable in the distribution of digits.

I tend to trust Steve’s intuition and analysis skills,as his track record has been excellent. So at this point we don’t know what is the root cause or even if there is any human touch to the data. But as Lubos said on CA “there’s still an unexplained effect in the game”.

I’m sure it will get much attention as the results shake out.

UPDATE2: David Stockwell writes in comments here:

Hi,

I am gratified with the interest in this, very preliminary analysis. There’s a few points from the comments above.

1. False positives are possible, for a number of reasons.

2. Even though data are subjected to arithmetric operations, distortions in digit frequency at an earlier stage can still be observed.

3. The web site is still in development.

4. One of the deviant periods in GISS seems to be around 1940, the same as the ‘warmest year in the century’ and the ‘SST bucket collection’ issues.

5. Even if in the worst case there was manipulation, it wouldn’t affect AGW science much. The effect would be small. Its about something else. Take the Madoff fund. Even though investors knew the results were managed, they still invested because the payouts were real (for a while).

6. To my knowledge, noone has succeeded in exactly replicating the GISS data.

7. I picked that file as it is the most used – global land and ocean. I haven’t done an extensive search of files as I am still testing the site.

8. Lubos relicated this study more carefully, using only the monthly series and got the same result.

9. Benfords law (on the first digit) has a logarithmic distribution, and really only applies to data across many orders of magnitude. Measurement data that often has a constant first digit doesn’t work, although the second digit seems to. I don’t see why last digit wouldn’t work, and should approach a uniform distribution according to the Benford’s postulate.

That’s all for the moment. Thanks again.


This morning I received an email outlining some work that David Stockwell has done in some checking of the GISS global Land-Ocean temperature dataset:

Detecting ‘massaging’ of data by human hands is an area of statistical analysis I have been working on for some time, and devoted one chapter of my book, Niche Modeling, to its application to environmental data sets.

The WikiChecks web site now incorporates a script for doing a Benford’s analysis of digit frequency, sometimes used in numerical analysis of tax and other financial data.

The WikiChecks Site Says:

‘Managing’ or ‘massaging’ financial or other results can be a very serious deception. It ranges from rounding numbers up or down, to total fabrication. This system will detect the non-random frequency of digits associated with human intervention in natural number frequency.

Stockwell runs a test on GISS and writes:

One of the main sources of global warming information, the GISS data set from NASA showed significant management, particularly a deficiency of zeros and ones. Interestingly the moving window mode of the algorithm identified two years, 1940 and 1968 (see here).

You can actually run this test yourself, visit the WikiChecks web site, and paste the URL for the GISS dataset

http://data.giss.nasa.gov/gistemp/tabledata/GLB.Ts+dSST.txt

into it and press submit. Here is what you get as output from WikiChecks:

GISS

Frequency of each final digit: observed vs. expected

0 1 2 3 4 5 6 7 8 9 Totals
Observed 298 292 276 266 239 265 257 228 249 239 2609
Expected 260 260 260 260 260 260 260 260 260 260 2609
Variance 5.13 3.59 0.82 0.08 1.76 0.05 0.04 4.02 0.50 1.76 17.75
Significant * . *
Statistic DF Obtained Prob Critical
Chi Square 9 17.75 <0.05 16.92
RESULT: Significant management detected. Significant variation in digit 0: (Pr<0.05) indicates rounding up or down. Significant variation in digit 1: (Pr<0.1) indicates management. Significant variation in digit 7: (Pr<0.05) indicates management.

Stockwell writes of the results:

The chi-square test is prone to produce false positives for small samples. Also, there are a number of innocent reasons that digit frequency may diverge from expected. However, the tests are very sensitive. Even if arithmetic operations are performed on data after the manipulations, the ‘fingerprint’ of human intervention can remain.

I also ran it on the UAH data and RSS data and it flagged similar issues, though with different deviation scores. Stockwell did the same and writes:

The results, listed from lowest deviation to highest are listed below.

RSS – Pr<1

GISS – Pr<0.05

CRU – Pr<0.01

UAH – Pr<0.001

Numbers such as missing values in the UAH data (-99.990) may have caused its high deviation. I don’t know about the others.

Not being familiar with this mathematical technique, there was little I could do to confirm or refute the findings, so I let it pass until I could get word of replication from some other source.

It didn’t take long. About two hours later,  Lubos Motl, of the Reference Frame posted his results obtained independently via another method when he ran some checks of his own:

David Stockwell has analyzed the frequency of the final digits in the temperature data by NASA’s GISS led by James Hansen, and he claims that the unequal distribution of the individual digits strongly suggests that the data have been modified by a human hand.

With Mathematica 7, such hypotheses take a few minutes to be tested. And remarkably enough, I must confirm Stockwell’s bold assertion.

But that’s not all, Lubos goes on to say:

Using the IPCC terminology for probabilities, it is virtually certain (more than 99.5%) that Hansen’s data have been tempered with.

To be fair, Lubos runs his test on UAH data as well:

It might be a good idea to audit our friends at UAH MSU where Stockwell seems to see an even stronger signal.

In plain English, I don’t see any evidence of man-made interventions into the climate in the UAH MSU data. Unlike Hansen, Christy and Spencer don’t seem to cheat, at least not in a visible way, while the GISS data, at least their final digits, seem to be of anthropogenic origin.

Steve McIntyre offered an explanation in the way rounding occurs when converting from Fahrenheit to Centigrade, but Lubos can’t seem to replicate the same results he gets from the GISS data:

Steve McIntyre has immediately offered an alternative explanation of the non-uniformity of the GISS final digits: rounding of figures calculated from other units of temperature. Indeed, I confirmed that this is an issue that can also generate a non-uniformity, up to 2:1 in the frequency of various digits, and you may have already downloaded an updated GISS notebook that discusses this issue.

I can’t get 4,7 underrepresented but there may exist a combination of two roundings that generates this effect. If this explanation is correct, it is a result of much less unethical approach of GISS than the explanation above. Nevertheless, it is still evidence of improper rounding.

Pretty strong stuff, but given the divergence of the GISS signal with other datasets, unsurprising.  I wonder if it isn’t some artifact of the GISS Homogenization process for surface temperature data, which I view as flawed in its application.

But let’s give the benefit of the doubt here. I want to see what GISS has to say about it, there may be a perfectly rational explanation that can be applied that will demonstrate that these statistical accusations are without merit. I’m sure they will post something on RC soon.

Stay tuned.

Get notified when a new post is published.
Subscribe today!
0 0 votes
Article Rating
161 Comments
Inline Feedbacks
View all comments
Eric Chieflion
January 14, 2009 7:24 pm

I took enough classes in Statistics and Probability in college and graduate school to know that I would never be competent at Statistics and that if I tried, I’d get my tail feathers shot off.
The bottom line is that if Dr. Hansen is going to publish statistical data used to influence policy making, his processes must be transparent. Since that is apparently lacking, due diligence now appears to be required.
That due diligence should not only include an analysis of his statistical methods but also a thermodynamic review of the data collection process.
If he truly believes that there is an APGW signal in the data, then as a scientist, he should initiate such a review. I realize that he may be out on a limb having been running a set of “off the cuff” arguments for many years, but if the signal is there, then a rigorous analysis and – if necessary – correction to his methodologies will do more for his cause then denying his methodologies have a problem.
I was always a little skeptical of APGW but by early last year became alarmed and started researching the subject. I was not impressed with the “pro” internet sites that argued that the science was settled or attacked skeptics using emotional “grabbers” as opposed to scientific counter arguments. I was, however, impressed by sites such as this that disseminated large numbers of counter arguments even though I questioned many of those counter arguments as well.
Congratulations Anthony on your win, and by the way, I have a son at Chico State who managed to get an ‘A’ from an APGW believing professor despite stating contrary beliefs in the first week of class!

evanjones
Editor
January 14, 2009 7:38 pm

Pamela:
Get both.
(Beat it out of the squeeze.)

crosspatch
January 14, 2009 7:41 pm

Smokey: What you are looking for is GHCN-D data which is archived at ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/daily/
And here are the stations he uses: http://data.giss.nasa.gov/gistemp/station_data/station_list.txt
The source code is here: http://data.giss.nasa.gov/gistemp/sources/GISTEMP_sources.tar.gz
(cover your eyes and don’t look at the code immediately after eating)
Lots of clues on this thread from Climate Audit from about a year ago:
http://www.climateaudit.org/?p=2746
Good luck!

January 14, 2009 7:44 pm

Pamela,
Get the mary janes, if that is anything like choosing between Ginger and Mary Ann. Mary Ann every time.
Roger

crosspatch
January 14, 2009 7:48 pm

Oh, smokey, look in the “all” subdirectory of the station_data once you have the station list.
Interesting thing is that station data tends to get spotty sometimes and must be calculated for fills. This is even though complete data is available elsewhere. This happened at one Siberian station when they were at record cold this year. Chances are when a value must be calculated from previous averages or surrounding stations, it is probably going to be a much warmer value than a record cold value would be.
One thing I am interested in is the filling of these values that are missing in the NOAA data and see if they impact the GISS grid temps enough to make a difference. I would also be interested to know if stations experience more data “drop outs” in winter than in summer.

van
January 14, 2009 8:22 pm

Isn’t Hansen’s work publicly funded? How could he avoid FOIA requests for ALL of his methodology?

giovanniworld
January 14, 2009 8:31 pm

Has anyone confronted Gavin with this info yet? If so, what is his latest song and dance number?
Gio-
http://giovanniworld.wordpress.com/2009/01/14/the-jews-are-our-dogs/

Pamela Gray
January 14, 2009 8:40 pm

Well, leave it to a skeptic to go against the consensus. I want with the racy one that rides high in the back. Must have been the wine. Or the chocolate. Or both.

old construction worker
January 14, 2009 8:43 pm

Pamela
I agree with evanjones, get both. I bet you have picked out an ensemble for both pairs.

Pamela Gray
January 14, 2009 9:02 pm

I should stop trying ti type afyter a nicr glass of gnarlay head zingfandel windee. I wanted them so I went with them.
And now back to science.

Nylo
January 14, 2009 9:37 pm

If you want to talk about data manipulation in GISSTEMP, how about this?
The updated GISS data including December temperatures, apart from the usual small changes of some past but recent (i.e. less than 3 years old) temperature changes, shows what I want to believe is an unusual data handling of really, really old temperatures (>20 years)
The following monthly temperatures have been COOLED by 0.01ºC:
10/1901, 01/1902, 04/1902, 02/1911, 09/1913, 10/1916, 09/1917, 09/1921, 07/1923, 09/1924, 10/1926, 12/1928, 12/1935, 09/1944, 04/1948, 01/1951, 07/1953, 11/1983 and 11/1985. Data before 1900 has also changed but I didn’t list the changes.
The following monthly temperatures have been WARMED by 0.01ºC:
09/1977 and 12/1985.
Do you see anything especial about the distribution of these changes? You may think that there is an overwhelming interest in cooling the past. I do too.
Additionally, some seasonal averages have changes without any of their monthly averages apparently changing: summer/1901, fall/1910, fall/1912, fall/1990, spring/1991, which suggests that GISS works with many more digits than those published, and that many more monthly temperatures have temperature changes that cannot be seen because of not affecting the first 2 significant digits. The same happens with a year, 1946, which changes its yearly December-to-November average without any of the monthly temperatures apparently changing. And yes, you correctly gessed the direction of all the changes: they cool at the beginning of the 20th century and warm towards its end.
The effect that these only 0.01ºC step changes in only a handful of months has on the global temperature trend WARMS the trend. The increase is really, really little, affecting only the third significant digit, HOWEVER, it is of a similar ammount than the COOLING effect that a cold 12/08 would normally have on the trend. This suggests that whenever we have a cold month, GISS can manage to mantain the warming trend just by adjusting 20 very old and separate monthly data by just 0.01C expecting that few or no people would notice.
After seeing this, I am personally going to keep track of any changes that GISS introduces in its very old data. Whenever they publish for a month, I will not only download the recent months, but all the list, and will keep track of all the versions. I am also going to do a few calculations with what I learn. Does anybody have previous full versions of GISS monthly data stored anywhere?
CU.

Justin Sane
January 14, 2009 10:24 pm

If there’s one thing I’ve learned well, it’s that perception is everything, and in AGW my perception is a conflict of interest when one person edits missing data, manipulates the raw data, and professes that he’s just the messenger, my spider senses start to tingle. How is it that one person can control the whole process and not not provide/share that data manipulation formula with anyone else? Who’s data is it anyway, it belongs to the taxpayers that’s who — this isn’t a national secret he’s keeping. I also thought that real scientists enjoyed having other scientists critique their work and look for errors. Take my word, I’m not a crook!

crosspatch
January 14, 2009 11:23 pm

“I am personally going to keep track of any changes that GISS introduces in its very old data. ”
The old data always changes. Here’s why:
If a value is missing in a month, a fill value is calculated. That fill value is calculated from averages over time. If a value for this month is warmer than average, when the new GISSTEMP is calculated, a new warmer “average” value will be calculated and plugged into that data value in the past. But there is a “hinge” point. Data earlier than that date are adjusted downwards and data after that date are adjusted warmer. So you make warming stronger by making the past cooler in addition to “adjusting” the more modern values upward. It has long been a complaint of people looking at the data and never has gotten any kind of answer other than a brush off.
So if you have a “cold” month, it can adjust months going back in time and if you have a warm month the same thing happens. The problem is that the data set Hansen uses seems to have a large number of “missing values” where the data from the same station from other sources do not have the missing values. But the most annoying part is that the past keeps moving around with Hansen’s method. Temperatures today affect temperatures long ago. It’s rather dumb, really. Just pick a temperature that you think the past was and leave it alone.

Pierre Gosselin
January 15, 2009 12:42 am

GISS most probably has its thumbs on the temperature scales. But I think this leads to a difference of only few hundreths of a degree. The big issue remains SITING. The data is measured at the wrong places. That’s the big source of errors.

Nylo
January 15, 2009 1:06 am

Thanks a lot crosspath for the explanation.
It is interesting to see that, as long as we are above the average (not very difficult), the warming trend is going to look bigger and bigger every month because we will think that the past was cooler and cooler!
I have been making numbers with an excel sheet. The effect which these temperature changes in the past has over the 1900-2008 warming trend is equivalent to not having changed the past values and having had a December 2008 temperature anomaly of +0.59 instead of +0.45. This gives an idea of the order of magnitude of the FRAUD we are just watching.
Best regards.

January 15, 2009 1:24 am

Out of the world of conjecture, into the world of evidence.
Here are some consecutive figures from a weather station, described as “raw” data. ALL ARE A DECADE OR MORE AFTER THE CHANGE FROM DEG F TO DEG C.
Tmax on 1st Tmax on 1st
Year Month Day Tmax day of year day of month
1976 1 1 29.6 29.6
1976 2 1 23.1
1976 3 1 24.7
1976 4 1 19.6
1976 5 1 23.8
1976 6 1 16 16
1976 7 1 18.8
1976 8 1 19.2
1976 9 1 17.2
1976 10 1 23.2
1976 11 1 16.1
1976 12 1 22.5
1977 1 1 27.9 27.9
1977 2 1 25 25
1977 3 1 38.6
1977 4 1 21.3
1977 5 1 23.2
1977 6 1 20.8
1977 7 1 22.6
1977 8 1 18 18
1977 9 1 14.8
1977 10 1 17.7
1977 11 1 18.8
1977 12 1 23.1
1978 1 1 25 25 25
1978 2 1 34 34
1978 3 1 25.7
1978 4 1 28.4
1978 5 1 25.6
1978 6 1 20.8
1978 7 1 16.5
1978 8 1 15.1
1978 9 1 18 18
1978 10 1 20.5
1978 11 1 25.6
1978 12 1 23.2
1979 1 1 25.9 25.9
1979 2 1 21.6
1979 3 1 23.6
1979 4 1 24.3
1979 5 1 17.3
1979 6 1 20.2
1979 7 1 14.3
1979 8 1 16 16
1979 9 1 15.2
1979 10 1 20.2
1979 11 1 17.6
1979 12 1 26.7
1980 1 1 27.7 27.7
1980 2 1 23 23
1980 3 1 24.2
1980 4 1 23.8
1980 5 1 23.3
1980 6 1 22.7
1980 7 1 17.4
1980 8 1 16.4
1980 9 1 16 16
1980 10 1 21.7
1980 11 1 26.5
1980 12 1 20.5
1981 1 1 24 24 24
1981 2 1 26.4
1981 3 1 20.2
1981 4 1 21.1
1981 5 1 21.9
1981 6 1 17.3
1981 7 1 13.2
1981 8 1 16.8
1981 9 1 20.1
1981 10 1 16.3
1981 11 1 31.3
1981 12 1 26.1
1982 1 1 21 21 21
1982 2 1 35.8
1982 3 1 23.5
1982 4 1 19.8
1982 5 1 21.6
1982 6 1 18.8
1982 7 1 10.9
1982 8 1 23.2
1982 9 1 17.7
1982 10 1 27.3
1982 11 1 35 35
1982 12 1 22.3
1983 1 1 24 24 24
1983 2 1 22.4
1983 3 1 23.4
1983 4 1 25.9
1983 5 1 25.5
1983 6 1 26.3
1983 7 1 17.2
1983 8 1 18.6
1983 9 1 21.4
1983 10 1 19.3
1983 11 1 30.6
1983 12 1 15.8
1984 1 1 23.7 23.7
1984 2 1 38.6
1984 3 1 31.6
1984 4 1 36.8
1984 5 1 21.5
1984 6 1 21.9
1984 7 1 15.7
1984 8 1 22.1
1984 9 1 18.4
1984 10 1 16.5
1984 11 1 21.2
1984 12 1 20.7
CONCLUSIONS count 108 9 13
4 whole nos 13 whole nos
in 9 YRS in 108 months
EXPECT <1 EXPECT 10.8
ends padded, not cherry picked

Mark
January 15, 2009 1:28 am

I pasted the mean monthly Mauna Loa co2 record into the Wikicheck and got the following result:
RESULT: Extremely Significant management detected.
Interesting!

January 15, 2009 1:55 am

Pamela said
“You guys sometimes make me giggle. You are quibbling (and I do sometimes as well because when I haven’t had a glass of red wine and a bit of chocolate, accuracy is important) over whether or not winters were actually .02 degrees colder than the official data claims.
Its obviously a boy thing. Mind you what makes me laugh rather than giggle is when everyone parses ‘global temperatures to 1850’ to fractions of a degree as if these figures had the slightest touch with reality. The trouble is that those doing the parsing the most are the IPCC who are influencing the economy of the world. Could you distract them by taking them to help choose your shoes?
TonyB

hotrod
January 15, 2009 1:56 am

Since I am somewhat new to the forum, please excuse me if this is redundant!
If the issue is that Jim Hansen is not releasing key information about how or why he does things, and he works for NASA a federally funded organization, wouldn’t a freedom of information request compel NASA to release the relevant information?
Likewise for those issues that drive public policy, I would think that a freedom of information request to release the methodology of the data that formulation of new public policy depends on would either compel release of the information, or provide grounds that that data/methodology not be used as the basis of new policy until such info is released for public review.
Larry

Nylo
January 15, 2009 2:50 am

Anyway, what kind of “missing values” in 1900 are being calculated from averages which include data from 2008? Are we talking of specific days in specific stations, or monthly averages of specific stations, or monthly averages of entire regions, or global monthly averages? Obviously all of them will be changed, but where does the problem start?
Best regards.

January 15, 2009 3:02 am

So there is not a full resolve to the questions regarding the GISS data yet. Anthony is leaning toward Steve McIntyre’s analysis of the data indicating that nothing is really askew. There is, however, a larger dynamic present in this situation. The openness and sharing of methodology, software used, formula’s, etc. It is the ethic which those running the different analyses in this matter continually demonstrate in all of their work. They seek the proper solution, the truth, even if the truth means their initial approach is ultimately revealed to contain error of one nature or another. Their comments and work were and are available for all to see. There to be challenged. That is what science is supposed to be about.
Man’s understanding of the climate would be much farther along if that same openness existed with all scientists involved in climate research. Science realizes no true gains when one small group is permitted to claim the final and proper results and do so without openness of data, methodology, codes, etc. Essentially, some feel their work should go unchallenged and they, to a large extent, are permitted to get away with it. That is not good science and they damage science in doing so. It is a shame that they lack the integrity, ethic, and courage of those participating here in the GISS data issue.

January 15, 2009 3:55 am

Pamela Gray:

…I am more concerned about whether or not to buy the shoes with a racy strap that angles over my ankle and rides high in the back, or the mary janes that have two cute little straps that cross in front.

Solution: WWID? What Would Imelda Do? The answer is obvious. She would buy both pairs.

January 15, 2009 3:58 am

Actually there should be a slight bias to certain ending digits due to the instrumentation. Digital equipment would have a random tenth degree digit, but analog equipment typically is read to the nearest 1/4 degree.

Katherine
January 15, 2009 4:43 am

Smokey wrote:
Solution: WWID? What Would Imelda Do? The answer is obvious. She would buy both pairs.
LOL Actually, a lot of Imelda’s shoes were gifts from the manufacturers for bragging rights–free promo. Kind of like all the freebies that are given away to actors/actresses attending the Oscars?

Editor
January 15, 2009 5:40 am

John Philip (16:07:46) :

I confidently predict that this will turn out to be a non-story. As our host is a meteorologist and interested in matters climatic, and on the day that WUWT was awarded Best Science Blog, can we expect some comment on the news that the American Meteorological Society has honoured the custodian of the GISS dataset with its highest commendation, the href=”http://www.nasa.gov/centers/goddard/news/topstory/2009/hansen_ams.html”> Carl-Gustaf Rossby Research Medal.
Newsworthy, surely?

Well, I’m sure Joe D’Aleo would just point out that this is another example of the AMS board having been taken over by people better at playing politics than most meteorologists.
I didm’t record their comments at a Climate Task Force meeting (see http://wermenh.com/climate/index.html ), but Joe and Fred Ward (retired TV Met from Boston) had rather harsh words about the AMS position statement on global warming. It was passed over the objections of several board members and the rank and file membership had no say.
I commented on that earlier, see http://wattsupwiththat.com/2008/12/03/cleveland-area-tv-meteorologists-disagree-with-prevailing-attitude-about-climate-change/#comment-61147
Joe would likely call it another example of the AGW community patting each other on the back.