I’m happy to present this essay created from both sides of the aisle, courtesy of the two gentlemen below. Be sure to see the conclusion. I present their essay below with only a few small edits for spelling, format, and readability. Plus an image, a snapshot of global temperatures. – Anthony

By Zeke Hausfather and Steven Mosher
There are a variety of questions that people have about the calculation of a global temperature index. Questions that range from the selection of data and the adjustments made to data, to the actual calculation of the average. For some there is even a question about whether the measure makes any sense or not. It’s not possible to address all these questions in one short piece, but some of them can be addressed and reasonably settled. In particular we are in a position to answer the question about potential biases in the selection of data and biases in how that data is averaged.
To move the discussion onto the important matters of adjustments to data or, for example, UHI issues in the source data it is important to move forward on some answerable questions. Namely, do the methods for averaging data, the methods of the GISS, CRU and NCDC bias the result? There are a variety of methods for averaging spatial data, do the methods selected and implemented by the big three bias the result?
There has been a trend of late among climate bloggers on both sides of the divide to develop their own global temperature reconstructions. These have ranged from simple land reconstructions using GHCN data
(either v2.mean unadjusted data or v2.mean_adj data) to full land/ocean reconstructions and experiments with alternative datasets (GSOD , WDSSC , ISH ).
Bloggers and researchers who have developed reconstructions so far this year include:
Steven Mosher
And, just recently, the Muir Russell report
What is interesting is that the results from all these reconstructions are quite similar, despite differences in methodologies and source data. All are also quite comparable to the “big three” published global land temperature indices: NCDC , GISTemp , and CRUTEM .
[Fig 1]
The task of calculating global land temperatures is actually relatively simple, and the differences between reconstructions can be distilled down to a small number of choices:
1. Choose a land temperature series.
Ones analyzed so far include GHCN (raw and adjusted), WMSSC , GISS Step 0, ISH , GSOD , and USHCN (raw, time-of-observation adjusted, and F52 fully adjusted). Most reconstructions to date have chosen to focus on raw datasets, and all give similar results.
[Fig 2]
It’s worth noting that most of these datasets have some overlap. GHCN and WMSSC both include many (but not all) of the same stations. GISS Step 0 includes all GHCN stations in addition to USHCN stations and a selection of stations from Antartica. ISH and GSOD have quite a bit of overlap, and include hourly/daily data from a number of GHCN stations (though they have many, many more station records than GHCN in the last 30 years).
2. Choosing a station combination method and a normalization method.
GHCN in particular contains a number of duplicate records (dups) and multiple station records (imods) associated with a single wmo_id. Records can be combined at a single location and/or grid cell and converted into anomalies through the Reference Station Method (RSM), the Common Anomalies Method (CAM), and First Differences Method (FDM), or the Least Squares Method (LSM) developed by Tamino and Roman M . Depending on the method chosen, you may be able to use more stations with short records, or end up discarding station records that do not have coverage in a chosen baseline period. Different reconstructions have mainly made use of CAM (Zeke, Mosher, NCDC) or LSM (Chad, Jeff Id/Roman M, Nick Stokes, Tamino). The choice between the two does not appear to have a significant effect on results, though more work could be done using the same model and varying only the combination method.
[Fig 3]
3. Choosing an anomaly period.
The choice of the anomaly period is particularly important for reconstructions using CAM, as it will determine the amount of usable records. The anomaly period can also result in odd behavior of anomalies if it is too short, but in general the choice makes little difference to the results. In the figure that follows Mosher shows the difference between picking an anomaly period like CRU does, 1961-1990, and picking an anomaly period that maximizes the number monthly reports in a 30 year period. The period that maximizes the number of monthly reports over a 30 year period turns out to be 1952-1983. 1953-82 (Mosher). No other 30 year period in GHCN has more station reports. This refinement, however, has no appreciable impact.
[Fig 4]
4. Gridding methods.
Most global reconstructions use 5×5 grid cells to ensure good spatial coverage of the globe. GISTemp uses a rather different method of equal-size grid cells. However, the choice between the two methods does not seem to make a large difference, as GISTemp’s land record can be reasonably well-replicated using 5×5 grid cells. Smaller resolution grid cells can improve regional anomalies, but will often result in spatial bias in the results, as there will be large missing areas during periods when or in locations when station coverage is limited. For the most part, the choice is not that important, unless you choose extremely large or small gridcells. In the figure that follows Mosher shows that selecting a smaller grid does not impact the global average or the trend over time. In his implementation there is no averaging or extrapolation over missing grid cells. All the stations within a grid cell are averaged and then the entire globe is averaged. Missing cells are not imputed with any values.
[Fig 5]
5. Using a land mask.
Some reconstructions (Chad, Mosh, Zeke, NCDC) use a land mask to weight each grid cell by its respective land area. The land mask determines how much of a given cell ( say 5×5) is actually land. A cell on a coast, thus, could have only a portion of land in it. The land mask corrects for this. The percent of land in a cell is constructed from a 1 km by 1 km dataset. The net effect of land masking is to increase the trend, especially in the last decade. This factor is the main reason why recent reconstructions by Jeff Id/Roman M and Nick Stokes are a bit lower than those by Chad, Mosh, and Zeke.
[Fig 6]
6. Zonal weighting.
Some reconstructions (GISTemp, CRUTEM) do not simply calculate the land anomaly as the size-weighted average of all grid cells covered. Rather, they calculate anomalies for different regions of the globe (each hemisphere for CRUTEM, 90°N to 23.6°N, 23.6°N to 23.6°S and 23.6°S to 90°S for GISTemp) and create a global land temp as the weighted average of each zone (weightings 0.3, 0.4 and 0.3, respectively for GISTemp, 0.68 × NH + 0.32 × SH for CRUTEM). In both cases, this zonal weighting results in a lower land temp record, as it gives a larger weight to the slower warming Southern Hemisphere.
[Fig 7]
These steps will get you a reasonably good global land record. For more technical details, look at any of the many http://noconsensus.wordpress.com/2010/03/25/thermal-hammer-part-deux/different http://residualanalysis.blogspot.com/2010/03/ghcn-processor-11.html models http://rankexploits.com/musings/2010/a-simple-model-for-spatially-weighted-temp-analysis/ that have been publicly http://drop.io/treesfortheforest released http://moyhu.blogspot.com/2010/04/v14-with-maps-conjugate-gradients.html
].
7. Adding in ocean temperatures.
The major decisions involved in turning a land reconstruction into a land/ocean reconstruction are choosing a SST series (HadSST2, HadISST/Reynolds, and ERSST have been explored http://rankexploits.com/musings/2010/replication/ so far), gridding and anomalizing the series chosen, and creating a combined land-ocean temp record as a weighted combination of the two. This is generally done by: global temp = 0.708 × ocean temp + 0.292 × land temp.
[Fig 8]
8. Interpolation.
Most reconstructions only cover 5×5 grid cells with one or more station for any given month. This means that any areas without station coverage for any given month are implicitly assumed to have the global mean temperature. This is arguably problematic, as high-latitude regions tend to have the poorest coverage and are generally warming faster than the global average.
GISTemp takes a somewhat different approach, assigning a temperature anomaly to all missing grid boxes located within 1200 km of one or more stations that do have defined temperature anomalies. They rationalize this based on the fact that “temperature anomaly patterns tend to be large scale, especially at middle and high latitudes.” Because GISTemp excludes SST readings from areas with sea ice cover, this leads to the extrapolation of land anomalies to ocean areas, particularly in the Arctic. The net effects of interpolation on the resulting GISTemp record is small but not insignificant, particularly in recent years. Indeed, the effect of interpolation is the main reason why GISTemp shows somewhat different trends from HadCRUT and NCDC over the past decade.
[Fig 9]
9. Conclusion
As noted above there are many questions about the calculation of a global temperature index. However, some of those questions can be fairly answered and have been fairly answered by a variety of experienced citizen researchers from all sides of the debate. The approaches used by GISS and CRU and NCDC do not bias the result in any way that would erase the warming we have seen since 1880. To be sure there are minor differences that depend upon the exact choices one makes, choices of ocean data sets, land data sets, rules for including stations, rules for gridding, area weighting approaches, but all of these differences are minor when compared to the warming we see.
That suggests a turn in the discussion to the matters which have not been as thoroughly investigated by independent citizen researchers on all sides:
A turn to the question of data adjustments and a turn to the question of metadata accuracy and finally a turn to the question about UHI. Now, however, the community on all sides of the debate has a set of tools to address these questions.









Anthony,
Ron did all the heavy lifting on the GSOD data. The daily temp values are a bit hard to work with (and he posted the scripts for them here: http://rhinohide.wordpress.com/2010/06/26/gsod-to-ghcn-round-2/ ), but Ron turned them into monthly means available here:
http://rhinohide.cx/co2/gsod/data/201006/my.gsod.mean
With a station inventory file here:
http://rhinohide.cx/co2/gsod/data/201006/my.gsod.inv
(Note that this data should not be used for commercial purposes per its redistribution restrictions: http://www.ncdc.noaa.gov/cgi-bin/res40.pl?page=gsod.html )
REPLY: Thanks much -A
To supplement my last comment a tad, you can find the unprocessed GSOD data here:
ftp://ftp.ncdc.noaa.gov/pub/data/gsod/
And the metadata file here:
ftp://ftp.ncdc.noaa.gov/pub/data/gsod/ish-history.txt (warning: large file)
Nick Stokes says:
July 13, 2010 at 6:14 pm
Bill Illis,
“2. What adjustments are done to have GISTemp and Hadcrut3 higher (and lower) than the reconstruction numbers.”
Gistemp code is available. But the message from the linked articles is that the adjusted GISS and HADcrut are not noticeably different from the raw GHCN results.”
GISTemp and Hadcrut3 are quite different. I quoted the numbers above.
“Land temperatures in the GHCN dataset (according to the reconstructions) has increased about 0.9C and the Land/Ocean temperature series has increased about 0.55C since 1900.
GISTemp is 0.70C (Land) and 0.64C (Land and Ocean) since 1900 and Hadcrut3 at 0.966C (Land) and 0.703C (Land and Ocean) since 1900.”
Maybe when they are plotted on chart with a certain resolution, they look like they come close to matching up, but they don’t. 0.1C and 0.2C’s do matter in this business. Temps are already only 50% of that predicted, 40% is now in the redo category.
Ahh, sorry, I misread (its been a long day…). You want the list of stations with 100+ year records. Its here: http://drop.io/0yhqyon/asset/long-lived-ghcn-stations-txt
REPLY: Yes. Thanks for picking that up. I was perusing the links you gave and ??? were sprouting – A
Bill Illis,
GISTemp Land isn’t actually calculating land temperature. Rather, its attempting to calculate global temperature with only land records. So its not strictly comparable to other records (though you can replicate it with the correct zonal weighting and no land mask).
We had a fun time looking into the Great GISTemp Mystery over here awhile back: http://rankexploits.com/musings/2010/the-great-gistemp-mystery/
If I read the graphs correctly, global temperature over land is up .8 degrees C in 110 years, but if you put in the oceans, then the global temperature is up only .4 degrees.
I realize that there are issues in the quality & consistency of reading SST; however, I believe the following is correct:
So the skeptic hypothesis is that UHI and siting issues have biased the land temperature upwards.
And the believer’s hypothesis is that air over land is drier than air over water, and therefore we will see land temperatures rise faster than SSTs.
One of the points of a thread like this, is that the ‘game’ doesn’t stop when someone raises an objection. Unsure of the reliability of CRUTEM? Go out and build your own temperature reconstruction! Don’t understand how GISTEMP does station combinations? Rewrite their method in your own code! Unsure what the effect of the ‘dying of the thermometers’? Use a data set that doesn’t have the station loss. Object to the UHI in the analysis? Explore the urban effect with your own tools.
Science doesn’t stop when a reasonable question is raised. Indeed, that is precisely the point at which science begins! … but only if you are willing to do the follow-up work required. 🙂
Re: Steven Mosher says:
July 13, 2010 at 3:43 pm
>>> The data I used was “uncorrected” GHCN.
Color me confused about the temperature record. I had thought there was some dispute about the temperatures in the 1940’s relative to modern times. That earlier records had indicated higher temperatures for the pre 1970 period than does the modern record. Am I incorrect in this?
Civilly yours,
EC
Assume, as a given, that figure 1 depicts global temperatures. There was no warming or cooling, net, between 1900 and the mid 1970s. All of the net warming was between late 1970s and about 2005.
Why?
Anthony,
Here is a newer graph with all GHCN v2.mean stations, only stations with > 100 year records, and only rural (via GRUMP) stations with > 100 year records. It gets noisier as the number of stations available decreases, but the trends don’t change too much.
http://i81.photobucket.com/albums/j237/hausfath/Picture18-4.png
And here are all the wmo_ids for the long-lived rural stations:
http://drop.io/0yhqyon/asset/long-lived-rural-ghcn-stations-txt
Mosh should be able to run this analysis with his code as well, or an analysis with any particular subset of stations you choose.
EC: The 1930s and 40s were about the same temp as now for the United States (CONUS).
You can see some US-v-World with GHCN-v-USHCN in this post by Zeke:
http://rankexploits.com/musings/2010/a-simple-model-for-spatially-weighted-temp-analysis/
I wonder how long before I get a friendly ‘cease and desist’ letter and I, just like Dr Jones before me, will have to pull down my data and tell me people that I can’t give them access and that they should just download their own?! 😆
REPLY: If that ever happens, I’ll be the first to step up for defending your right to use and publish it. – Anthony
Wonder why they didn’t include Dr. Spencer’s computations??
Oh yeah, he used a data set that did not have the bias built in through poorly formed homogenisation adjustments!!
HAHAHAHAHAHAHAHAHAHAHAHAHAHA
Zeke and Steve —
Thanks for the very informative article.
It’s not particularly important, but figure 2 shows a large divergence in the GSOD dataset around 1969. Any idea of its cause?
On a different topic, Bill Illis (July 13, 2010 at 5:27 pm) says: “Land temperatures in the GHCN dataset has increased about 0.9C and the Land/Ocean temperature series has increased about 0.55C since 1900.” Using a 30/70 area split for land/ocean, the implied gain in ocean temps since 1900 is about 0.4 C. [I’m not sure that sentence is clear, so I’ll write it algebraically as “average land/ocean change = land_area_fraction * land_change + ocean_area_fraction * ocean_change.” Using the above information, 0.55 C = 0.3 * 0.9 C + 0.7 * ocean_change;
solving gives ocean_change = 0.4 C.]
Visually comparing figures 1 & 8, it appears that in the 1915-1940 warming period, land and ocean warmed at similar rates, but during the more recent period 1970-2000 land warmed considerably faster than ocean. Obviously you have more precise information about the land-ocean difference — can you draw any conclusions from that data?
Ec,
This may give you a sense of the variation in temps between countries/regions: http://i81.photobucket.com/albums/j237/hausfath/Picture155.png
“Zeke Hausfather says:
July 13, 2010 at 7:45 pm
Anthony,
Here is a newer graph with all GHCN v2.mean stations, only stations with > 100 year records, and only rural (via GRUMP) stations with > 100 year records. It gets noisier as the number of stations available decreases, but the trends don’t change too much.
http://i81.photobucket.com/albums/j237/hausfath/Picture18-4.png
”
I understand that the lines on the chart look similar. But let’s just turn the chart into numbers here.
GHCN rural long-lived: +0.45C 1900 to 2009
GHCN long-lived: +0.85C
GNCN V2: +0.70C
This is enough difference to start drawing conclusions about the differences rather than concluding they are the same.
kuhnkat,
You mean this one? http://www.drroyspencer.com/wp-content/uploads/ISH-vs-CRUTem3NH-1986-thru-20091.jpg
The only reason we didn’t include a temp reconstruction by Dr. Spencer is because he hasn’t released the data yet, just figures. I emailed him awhile back requesting global monthly means from his ISH implementation, but unfortunately didn’t receive a response.
If you are referring to Spencer’s U.S. graph, well, we don’t have any U.S.-only charts in this post. I’d warrant it would be pretty close to GHCN v2.mean or USHCN raw for CONUS, however. I took a stab at replicating it using GHCN back in the day: http://rankexploits.com/musings/2010/effect-of-dropping-station-data/#comment-35519
Kuhnkat, GSOD is the daily summary of the same data set (ISH aka ISD) that Dr Spencer used.
The ISH data is only freely accessible in bulk from known US domains (.edu, .gov, .mil, and such). It is also much larger than the GSOD data (~ 300gb for everything). I’m working on getting access to it. Alternately, if you would like to donate $2000 US so that I can purchase the whole set on CD, I would be happy to accept. 😀
REPLY: if you can give me an exact citation for this, I can see if my friend Jim Goodridge, former State Climatologist, has it -Anthony
Bill Illis,
I’m not sure where you are getting those slopes from, but they are much more similar than that.
1900-2009 (degrees C per decade)
All stations: 0.072
Long-lived: 0.086
Long-lived rural: 0.071
1960-2009 (degrees C per decade)
All stations: 0.222
Long-lived: 0.246
Long-lived rural: 0.238
Spreadsheet with data outputs and slope calculations:
http://drop.io/0yhqyon/asset/ghcn-longevity-urbanity-xls
All the raw data sets show 1 degree per century. Satellite data since 1978 shows 1.2 degrees per century. Tony B’s long history stations show .26 degrees per century but with periods of similar rate of increases. I am ready to throw UHI under the bus and concede that the last thirty years fulfills the expectations of AGW supporters. However they will have to fight the long term trend of .26 degrees per century. I look for cooling to bring the trend back down.
ISH / ISD (same thing, two names)
http://ols.nndc.noaa.gov/plolstore/plsql/olstore.prodspecific?prodnum=C00532-TAP-A0001
http://www.ncdc.noaa.gov/oa/nndc/freeaccess.html
http://www.ncdc.noaa.gov/oa/climate/isd/index.php
Note: There are several references to COOP data in those pages. The ISH/ISD data is NOT the same as COOP. It is the ISH/ISD that I am most interested in.
If there is a chance that we can arrange a data transfer, you can contact me at my email listed in the posting headers. And thanks for looking into it!
It is true that the climate community has managed to focus the world’s attention to the temperature anomalies, and by sleight of hand taken the attention from the temperatures themselves and how badly the models reproduce them.
I am with George Smith on this .
It is good that we see that the anomalies calculated in different ways agree within the magnitude of the change seen, so as to be able to say : there has been an x change in anomaly +/- y systematic from different methods.
But it is like magic tricks with cards.
Lets see what we are talking about. We are talking about an excess of retained energy on the continuous input output flux coming from the sun and radiating to space. i.e. energy.
What does the anomaly tell us about energy input output except “there is change”?
It is absolutely impossible to calculate energies radiated by using the anomaly map.
It would be impossible with the temperature map too, unless one had the gray body constants and radiation curves of the map.
In addition, a 1C anomaly in a region that has an average temperature of 273K has a completely different physical manifestation than a 1C anomaly where the average temperature is 288K.
I am also with George on UHI, that in an ideal temperature measurement it should be integrated in. Do we correct for rocky mountains? Deserts? Energy is energy and is what is important for life on earth.
Now of course, in the way the anomalies are being used UHI is important. It is the anomalies that are really irrelevant except in the sense “here there be tigers”.
@jorge says:
July 13, 2010 at 6:35 pm
I agree fully with your comments in that post.
Now, we seem here above to have substantiated that the output of an imperfect (and apparently, increasingly unreliable) temperature data collection system can be screwed with one way or another and still come up with, more or less, the same results in assessing the “global temperature”.
At least we have this from the authors to tease us: “…Our point was this: concerns about bias in the methods of GISS, CRU and NCDC can be put to rest. The big issue HAS ALWAYS been the adjustments and the metadata.
That is the next topic for discussion, for serious discussion that is….”
and further (in comments):
“…A turn to the question of data adjustments and a turn to the question of metadata accuracy and finally a turn to the question about UHI. Now, however, the community on all sides of the debate has a set of tools to address these questions….”
No kidding, Dick Tracy. I’ll wait for that next installment. Still, after looking at all those graphs above, I’m left with the feeling that my grandparents and John Steinbeck were a bunch of whiners. Why, the Thirties were positively chilly! They told me they sweated buckets. So, did they lie to me?
Ron Broberg says:
July 13, 2010 at 7:48 pm
>>> EC: The 1930s and 40s were about the same temp as now for the United States
That’s what’s bothered me. How can we have AGW that exempts the US — where we have some of the best temperature record. It just seems… convenient.
Who said “global warming only happens were nobody lives” — can’t remember.
EC.
David A. Evans says:
July 13, 2010 at 3:11 pm (Edit)
How RAW is RAW?
*********************
That’s the next question. But what does raw mean?