Guest post by John Goetz
Cross posted from Climate Audit
Earlier this year I did a post on the amount of estimation done to the GHCN temperature record by GISS before generating zonal and global averages. A graphic I posted compared the amount of real temperature data with the amount of estimation over time. To read the graphic, consider 2000 as an example. As of February 7, 2008 there were 3159 station records in the GHCN data with an entry for the year 2000. Of those station records, 62% were complete and an annual average could be fully calculated. Another 29% were incomplete, but contained enough monthly data that the GISS estimation method kicked in. The final 9% were so incomplete that no estimation could be done.
What I did not explore at the time and would like to look more closely here is the accuracy of the estimation. One would hope with so much infilling going on that the accuracy would be rather high (I will leave the determination of “high accuracy” for a later time). Because I did not have real data to compare with the GISS estimations, I took another approach. I used the GISS method to estimate real temperature data as if that data were missing.
Recall that GISS never explicitly estimates missing monthly temperatures. What they do is estimate seasonal averages when one monthly temperature is missing but the other two are present. Similarly, an annual temperature can be estimated when one seasonal value is missing but the other three are present. Using this methodology GISS can estimate an annual temperature when as many as six monthly values are missing.
While no explicit monthly estimate is recorded by GISS, it certainly can be derived from the seasonal estimate. I have shown several times a one-line equation that exactly reproduces the GISS seasonal estimate. Leaving a subsequent derivation as an exercise for the reader, the implied monthly estimate can be found from that equation and is expressed as follows:
where the average values for A, B, and C are calculated from all valid entries for the given month in a particular station record.
Now to test the estimation accuracy. In Connecticut, December 2006 was warmer than normal, but February 2007 was colder than normal. Looking at the records for Hartford, CT, we see the following monthly and seasonal temperatures:
Dec 2006: 3.3
Jan 2007: -0.3
Feb 2007: -4.6
DJF: -0.5
If the December 2006 record were missing from Hartford, GISS would estimate a value of -0.7 C, which would yield a seasonal average of -1.9 C. Similarly, if February 2007 were missing, GISS would estimate it at 1.7 C and produce a seasonal average of 1.6 C. That’s a 4.0 degree miss for Dec, a 6.3 degree miss for February, and a 3.5 degree swing at the seasonal level.
The winter of 06-07 in Connecticut was a bit of an oddball. I really wanted to know what the typical error looked like. To do that, I performed the same calculation on all GHCN v2.mean records.
A real monthly value can be compared against its GISS estimate only when all three monthly values in the season are available. In my copy of GHCN v2.mean, there are approximately 6.25 million monthly values that meet that requirement. I went through each of the monthly values and simulated a GISS estimate, and from that estimate I subtracted the actual value to produce a delta temperature. A positive delta means that GISS would over-estimate the temperature and a negative delta means GISS would under-estimate the temperature.
Following is a histogram of the delta values collected. The x-axis is the value of the delta in degrees C. The y-axis is the percentage of records that had the specified delta value.
The fact that the simulation histogram looks like a normal distribution should not be surprising. This comes about because I need all three months in a season in order to simulate an estimate and a resulting delta. Recall that in the Hartford example above a large delta for December was followed by a similarly large delta for February, but of the opposite sign. Given the enormous sample size, the small differences in magnitude eventually even out.
The above distribution tells us the probability that the GISS estimate will miss the actual value by a specific amount. Zooming in on the distribution, we see GISS should get it exactly right just over 3% of the time:
Following is a table of absolute values and their corresponding probabilities, through a delta value of 2.9 degrees:
Referring to the table, the probability GISS will create an estimate within 0.4C of the actual value is 26.7%. A value between 0.5 C and 0.9 C has a 22.2% probability of occuring. Similarly, 1.0 C to 1.9 C is 26.5%, and 2.0 C to 2.9 C is 12.7%. There is about a 12% probability that the GISS estimate will be off by 3.0 C or more.
Note that the estimation method as it stands does not introduce a bias into the station record. But it does introduce a sizable uncertainty.
Discover more from Watts Up With That?
Subscribe to get the latest posts sent to your email.
I’m not sure that some of the readers understand what John did here. He is using actual data to estimate how much likely error there is in the estimated monthly temperatures used by GISS. We can’t know how wrong the actual guesses were. We can’t.
First, he showed that the averaging method GISS uses can produce a guesstimate that is significantly off. Look at a known Dec value and a known Feb value. Use the GISS method to guesstimate what Jan value would be (if it were not available, this guesstimate would be included in the GISS data).
But John only used data where we know Jan’s actual value. How does the hypothetical guesstimate (using GISS methodology) stack up with reality?
Answer — not good.
John Goetz; thanks, that makes sense. So would an even more useful measure of accuracy be a sampling distribution for a single month, rather than a single station? Or you could estimate the standard errors for a month through monte carlo analysis?
Can somebody post a URL where I can get my hands on the actual station temperature data, not the gridded data? And the 1951..1980 normals that GISS compares against?
This I found interesting when comparing warmest and coolest averages per month for a couple stations here in MN against what GISS has for those months.
The GISS figure is to the right of the year with the difference in ().
Pine River Dam,MN(216547)
Jan: 23.7F/-4.6C (2006) -4.5 (+0.1)
-10.7F/-23.7C (1912) -25.4 (-1.7)
Feb: 28.5F/-1.9C (1998) -1.9 (0.0)
-7.3F/-21.8C (1936) -23.1 (-1.3)
Mar: 37.8F/3.2C (1910) 2.9 (-0.3)
10.2F/-12.1C (1899) -12.5 (-0.4)
Apr: 51.9F/11.1C (1915) 10.9 (-0.2)
31.0F/-0.6C (1950) -0.7 (-0.1)
May: 65.2F/18.4C (1977) 17.9 (-0.5)
43.1F/6.2C (1907) 5.8 (-0.4)
Jun: 70.9F/21.6C (1933) 21.8 (+0.2)
57.2F/14.0C (1945) 14.1 (+0.1)
Jul: 75.1F/23.9C (1916) 23.9 (0.0)
61.7F/16.5C (1992) 16.5 (0.0)
Aug: 74.1F/23.4C (1983) 23.4 (0.0)
60.4F/15.8C (1927) 15.9 (+0.1)
Sep: 62.9F/17.2C (1906) 17.0 (-0.2)
50.1F/10.1C (1965) 9.6 (-0.5)
Oct: 56.6F/13.7C (1963) 12.9 (-0.8)
30.2F/-1.0C (1925) -1.5 (-0.5)
Nov: 40.9F/4.9C (2001) 4.9 (0.0)
17.4F/-8.1C (1911) -8.4 (-0.3)
Dec: 25.4F/-3.7C (1931) -5.2 (-1.5)
1.3F/-17.1C (1927) -18.6 (-1.5)
Obviously GISS cooled the past a bit, but they did it by cooling the colder months of the year and recent data stays about the same. Lets do one more.
Cloquet,MN(211630)
Jan: 24.7F/-4.1C (2006) -4.1(0.0)
-8.7F/-22.6C (1912) -22.7 (-0.1)
Feb: 29.7F/-1.3C (1998) -1.3 (0.0)
-4.4F/-20.2C (1936) -20.3 (-0.1)
Mar: 35.0F/1.7C (2000) 1.7 (0.0)
14.3F/-9.8C (1923) -9.8 (0.0)
Apr: 48.1F/8.9C (1987) 8.9 (0.0)
30.7F/-0.7C (1950) -0.7 (0.0)
May: 59.6F/15.3C (1977) 15.3 (0.0)
45.4F/7.4C (1915) 8.0 (+0.6)
Jun: 67.7F/19.8C (1933) 19.8 (0.0)
54.3F/12.4C (1915) 13.1 (+0.7)
Jul: 72.3F/22.4C (1921) 23.2 (+0.8)
60.0F/15.6C (1915) 16.3 (+0.7)
Aug: 70.1F/21.2C (1983) 21.1 (-0.1)
57.8F/14.3C (1912) 15.1 (+0.8)
Sep: 61.6F/16.4C (2004) 16.4 (0.0)
48.2F/9.0C (1918) 9.5 (+0.5)
Oct: 54.6F/12.6C (1963) 12.5 (-0.1)
33.1f/0.6C (1917) 1.1 (+0.5)
Nov: 41.1F/5.1C (2001) 5.1 (0.0)
19.8F/-6.8C (1911) -6.3 (+0.5)
Dec: 26.2F/-3.2C (1913) -3.3 (-0.1)
1.6F/-16.9C (1983) -16.9 (0.0)
We see with this station that it was warmed in the past and that warming was done mostly in the warmer months. I know this station was warmed because you can easily compare the chart listed with GISS and the chart posted with the survey done at surfacestations.org. before the 2007 adjustments. The Y-axis of the charts are 1degC different(warmer after the adjustments).
On these two stations, some years are there more than once for each station monthly record. For example, 1915(May,Jun,Jul) in Cloquet was cold but adjusted up pretty good. 1912(Jan) and 1912(Aug) are adjusted different. A slight tick down in Jan, but up in Aug.
Be nice to compare all months in a station record to what GISS has, but this gives some idea. The pros are going bald over trying to figure what they’ve done.
Warmest and coolest averages for each month came from here.
http://mrcc.sws.uiuc.edu/INTERACT/mwclimate_data_calendars_1.jsp