Much Ado About Very Little
Guest post by Zeke Hausfather and Steve Mosher
E.M. Smith has claimed (see full post here: Summary Report on v1 vs v3 GHCN ) to find numerous differences between GHCN version 1 and version 3, differences that, in his words, constitute “a degree of shift of the input data of roughly the same order of scale as the reputed Global Warming”. His analysis is flawed, however, as the raw data in GHCN v1 and v3 are nearly identical, and trends in the globally gridded raw data for both are effectively the same as those found in the published NCDC and GISTemp land records.
Figure 1: Comparison of station-months of data over time between GHCN v1 and GHCN v3.
First, a little background on the Global Historical Climatology Network (GHCN). GHCN was created in the late 1980s after a large effort by the World Meteorological Organization (WMO) to collect all available temperature data from member countries. Many of these were in the form of logbooks or other non-digital records (this being the 1980s), and many man-hours were required to process them into a digital form.
Meanwhile, the WMO set up a process to automate the submission of data going forward, setting up a network of around 1,200 geographically distributed stations that would provide monthly updates via CLIMAT reports. Periodically NCDC undertakes efforts to collect more historical monthly data not submitted via CLIMAT reports, and more recently has set up a daily product with automated updates from tens of thousands of stations (GHCN-Daily). This structure of GHCN as a periodically updated retroactive compilation with a subset of automatically reporting stations has in the past led to some confusion over “station die-offs”.
GHCN has gone through three major iterations. V1 was released in 1992 and included around 6,000 stations with only mean temperatures available and no adjustments or homogenization. Version 2 was released in 1997 and added in a number of new stations, minimum and maximum temperatures, and manually homogenized data. V3 was released last year and added many new stations (both in the distant past and post-1992, where Version 2 showed a sharp drop-off in available records), and switched the homogenization process to the Menne and Williams Pairwise Homogenization Algorithm (PHA) previously used in USHCN. Figure 1, above, shows the number of stations records available for each month in GHCN v1 and v3.
We can perform a number of tests to see if GHCN v1 and 3 differ. The simplest one is to compare the observations in both data files for the same stations. This is somewhat complicated by the fact that station identity numbers have changed since v1 and v3, and we have been unable to locate translation between the two. We can, however, match stations between the two sets using their latitude and longitude coordinates. This gives us 1,267,763 station-months of data whose stations match between the two sets with a precision of two decimal places.
When we calculate the difference between the two sets and plot the distribution, we get Figure 2, below:
Figure 2: Difference between GHCN v1 and GHCN v3 records matched by station lat/lon.
The vast majority of observations are identical between GHCN v1 and v3. If we exclude identical observations and just look at the distribution of non-zero differences, we get Figure 3:
Figure 3: Difference between GHCN v1 and GHCN v3 records matched by station lat/lon, excluding cases of zero difference.
This shows that while the raw data in GHCN v1 and v3 is not identical (at least via this method of station matching), there is little bias in the mean. Differences between the two might be explained by the resolution of duplicate measurements in the same location (called imods in GHCN version 2), by updates to the data from various national MET offices, or by refinements in station lat/lon over time.
Another way to test if GHCN v1 and GHCN v3 differ is to convert the data of each into anomalies (with baseline years of 1960-1989 chosen to maximize overlap in the common anomaly period), assign each to a 5 by 5 lat/lon grid cell, average anomalies in each grid cell, and create a land-area weighted global temperature estimate. This is similar to the method that NCDC uses in their reconstruction.
Figure 4: Comparison of GHCN v1 and GHCN v3 spatially gridded anomalies. Note that GHCN v1 ends in 1990 because that is the last year of available data.
When we do this for both GHCN v1 and GHCN v3 raw data, we get the figure above. While we would expect some differences simply because GHCN v3 includes a number of stations not included in GHCN v1, the similarities are pretty remarkable. Over the century scale the trends in the two are nearly identical. This differs significantly from the picture painted by E.M. Smith; indeed, instead of the shift in input data being equivalent to 50% of the trend, as he suggests, we see that differences amount to a mere 1.5% difference in trend.
Now, astute skeptics might agree with me that the raw data files are, if not identical, overwhelmingly similar but point out that there is one difference I did not address: GHCN v1 had only raw data with no adjustments, while GHCN v3 has both adjusted and raw versions. Perhaps the warming the E.M. Smith attributed to changes in input data might in fact be due to changes in adjustment method?
This is not the case, as GHCN v3 adjustments have little impact on the global-scale trend vis-à-vis the raw data. We can see this in Figure 5 below, where both GHCN v1 and GHCN v3 are compared to published NCDC and GISTemp land records:
Figure 5: Comparison of GHCN v1 and GHCN v3 spatially gridded anomalies with NCDC and GISTemp published land reconstructions.
If we look at the trends over the 1880-1990 period, we find that both GHCN v1 and GHCN v3 are quite similar, and lie between the trends shown in GISTemp and NCDC records.
1880-1990 trends
GHCN v1 raw: 0.04845 C (0.03661 to 0.06024)
GHCN v3 raw: 0.04919 C (0.03737 to 0.06100)
NCDC adjusted: 0.05394 C (0.04418 to 0.06370)
GISTemp adjusted: 0.04676 C (0.03620 to 0.05731)
This analysis should make it abundantly clear that the change in raw input data (if any) between GHCN version 1 and GHCN version 3 had little to no effect on global temperature trends. The exact cause of Smith’s mistaken conclusion is unknown; however, a review of his code does indicate a few areas that seem problematic. They are:
1. An apparent reliance on station Ids to match stations. Station Ids can differ between versions of GHCN.
2. Use of First Differences. Smith uses first differences, however he has made idiosyncratic changes to the method, especially in cases where there are temporal lacuna in the data. The method which used to be used by NCDC has known issues and biases – detailed by Jeff Id. Smith’s implementation and his method of handling gaps in the data is unproven and may be the cause.
3. It’s unclear from the code which version of GHCN V3 that Smith used.
STATA code and data used in creating the figures in this post can be found here: https://www.dropbox.com/sh/b9rz83cu7ds9lq8/IKUGoHk5qc
Playing around with it is strongly encouraged for those interested.
Smokey, are you aware of any work comparing the response time of modern electronic thermometers to traditional mercury max/min thermometers? I suspect there a built in high bias in the modern record due to quicker response of electronic thermometers.
Carrick
I am thinking over my comments and trying to see how “sleaze” is an appropriate description for any of them. It could be true that I could have worded two things differently. But even in their current wording they are not “sleaze”. You really need to do a reassessment of your opinion of people in the world that don’t agree with you.
steven mosher
“2. even with these errors left in your get roughly the same answer”
“Roughly” does not count. Roughly leaves too much play in the data. Roughly here and roughly there makes for useless data. Again, “manmade global warming” is based on 1/10ths of a degree which is roughly the same as there being no “manmade global warming”
Carrick says:
June 24, 2012 at 7:40 am
Good question, Carrick. For me, in general a request for a citation to a claimed fact should always be answered, regardless of the questioner. It is important for a couple of reasons. First, it is an issue of transparency, traceability, and accountability, which is why there is an entire section of references at the end of every scientific paper.
But more to the point, the questioner is likely not the only one who is interested in a request for a citation. For example, in this case I’d like to know why Steven made the claim that he made about the time of observation adjustment, and I suspect I’m not the only one.
So in answer to your question, “How much engaging in [my] opinion is required with people who have such strong confirmation biases …”, if what they are asking for is a simple citation, in general that is a valid scientific question no matter who is asking it, and it should be answered.
w.
“[Moderator’s Note: Rare agreement with Carrick: some people (probably not moderators) have lives outside of commenting on WUWT. Carrick, good luck with the tiling! If I try that myself, I now know where to go for advice. -REP]
I would hope you are not including his “a completely dishonest sleaze” as part of that rare agreement.
[REPLY: Excellent point. No. It would be a very good thing if all commenters, including Carrick, would refrain from personalizing things. -REP]
Carrick says:
June 24, 2012 at 3:22 pm
Carrick, maybe I’m not following your point, but if you have evidence that “phi” has been given this same information before, please bring it forward. Absent such evidence, Steven should have just posted a reference to support his claim and moved on.
And no, there’s no reason to do that more than once.
w.
How many requests for citations do they need to answer? Well … about as many as the number of uncited claims that they make. In this particular example, Steven Mosher claimed that the TOBS adjustments didn’t just apply to the US. Citing his authority for this would have been a trivial thing to do, but instead he wants to play “Go Fish” …
#############################
using google and doing some reading is all it would take. I’ve explained this and cited it on several occasions. I get tired of doing other peoples work. I get tired of lazy people who are not interested in the truth, who expect others to do their work for them.
A SIMPLE GOOGLE ON TIME OF OBSERVATION BIAS will get you the mere
beginnings of the literature on this. Then read the papers because the papers have bibilographies going back decades
But here goes yet again. The US is SOMEWHAT Unique in this matter because unlike other countries we had no standard observation time, However, we are not entirely alone in this regard. There are several other countries who have the same issue. A few follow.
Japan
http://sciencelinks.jp/j-east/display.php?id=000020000500A0108818
Or if any of you started to look at the sources of Crutem4 ( none of you have ) you would have found
this right away
Canada:
http://www.ec.gc.ca/dccha-ahccd/default.asp?lang=en&n=70E82601-1
This website provides monthly, seasonal and annual means of the daily maximum, minimum and mean temperatures from the Second Generation of Homogenized Temperature datasets which now replace the first generation datasets.
A First Generation of Homogenized Temperature datasets were originally prepared for climate trends analysis in Canada. Non-climatic shifts were identified in the annual means of the daily maximum and minimum temperatures using a technique based on regression models (Vincent, 1998). The shifts were mainly due to the relocation of the station, changes in observing practices and automation (Vincent and Gullett, 1999). Adjustments for the identified shifts were applied to monthly and daily maximum and minimum temperatures (Vincent et al. 2002). Observations from nearby stations were sometimes combined to create long time series that are useful for climate change studies.
The Second Generation of Homogenized Temperature datasets were recently prepared to provide a better spatial and temporal representation of the climate trends in Canada. In this new version, the list of stations was revised to include stations with long-term temperature observations covering as many of the recent years as possible. New adjustments were applied to the daily minimum temperatures in order to address the bias due to a change in observing time (Vincent et al. 2009). Techniques based on regression models were used to detect non-climatic shifts in temperature monthly series (Wang et al. 2007; Vincent 1998). A new procedure based on a Quantile-Matching (QM) algorithm was applied to derive adjustments (Vincent et al., 2012; Wang et al. 2010).
Want More?
Well, there is also Australia which was posted here. They changed observation time in
1964 and in there latest product I believe they made a few site specific adjustments for TOBS
See section 8 here
http://cawcr.gov.au/publications/technicalreports/CTR_049.pdf
And Norway.
Nordli, P.Ø. 1997. Adjustments of Norwegian monthly means of daily minimum temperature.
KLIMA Report 6/97, Norwegian Meteorological Institute, Oslo
And almost every long temperature series ( see the 4 long european stations) has TOB adjustments
Willis Eschenbach
” if what they are asking for is a simple citation, in general that is a valid scientific question no matter who is asking it, and it should be answered.”
What you are saying is pretty simple to understand.
I could speculate as to why someone would not want to give a cite reference: Maybe there isn’t one. Or maybe the reference has been called into question before. Or maybe those being asked aren’t understanding what is being requested. Or maybe they are expecting you, and everyone else, to just believe them without question. Or, maybe they don’t understand the simple scientific procedure that is so obvious to you, and me, and most people looking for truth.
But even if any of my guesses aren’t correct there still is no call for the name calling and character assassination that Carrick is engaging in.
Amino Acids in Meteorites says:
June 24, 2012 at 6:16 pm (Edit)
steven mosher
“2. even with these errors left in your get roughly the same answer”
“Roughly” does not count. Roughly leaves too much play in the data. Roughly here and roughly there makes for useless data. Again, “manmade global warming” is based on 1/10ths of a degree which is roughly the same as there being no “manmade global warming”
###########################
I will give you an example. In GHCN daily there are two measurements with 15000C
guess what happens when you average that with thousands of other measurements
and take the trend over 150 years.
nothing. nothing if you leave it in, even less when you take that outlier out
So, if the fact that there are some bad “raw” data makes you want to dump the whole
dataset, then almost no knowledge is possible.
steven mosher said:
So, if the fact that there are some bad “raw” data makes you want to dump the whole dataset, then almost no knowledge is possible.
So why not just drop the bad data, instead of making adjustments? You still never explained how you know what to adjust TO. So far, I haven’t heard any justification for making adjustments.
DR says:
June 24, 2012 at 5:43 pm (Edit)
3. This data is not used to build models. Models are built from first principles, not data
Actually, very little is built from first principles.
####################################
Model E code is online. I started reading it back in 2007. Its 100K Loc. Suggest you
get started.
If you want an introduction about models
start here. Simple enough. two types of models
http://www.newton.ac.uk/programmes/CLP/seminars/082310001.html
read more; comment less,
“[REPLY: Excellent point. No. It would be a very good thing if all commenters, including Carrick, would refrain from personalizing things. -REP]”
Thanks.
I hope none of what I have said in this thread was in bad taste. As I think over a couple of my comments I see I could have worded a couple things differently. How I put them could easily be construed in a way I didn’t mean. Lesson one: don’t be in a hurry to write your comment on the internet!
David, did you correlate it with ENSO geographic location? The oscillations we so often talk about are not global, they are regional. If station dropout was also regional, we have potential artifact.
steven mosher
I don’t think EMSmiths work in this case is about “raw” data.
Willis.
I have pointed phi to the sources before. He refuses to read them or to acknowledge anything
Here. a month ago
http://rankexploits.com/musings/2012/a-surprising-validation-of-ushcn-adjustments/#comment-95737
where you will find the reference to japan as the first link.
But he is not interested in looking at the actual data, actual papers, actual code.
he is not interested in the fact that the skeptic John daly and Jerry B looked into the TOBS matter themselves. he is only interested in derailing the conversation.
Jimmy Haigh says:
June 24, 2012 at 7:01 am (Edit)
Mosher and Zeke? Crickets…
##############
no projects.
JT says:
June 23, 2012 at 11:52 am (Edit)
@Mosher
@Smith
question: if you took each raw temperature measurement and plotted it against the time when the measurement was made from the earliest known temperature measurement to the latest known temperature measurement so as to create a complete scatterplot of available raw data, what would it look like?
##############################
Zeke uploaded the data. Have at it.
steven mosher says:
June 24, 2012 at 6:39 pm
DR says:
June 24, 2012 at 5:43 pm (Edit)
3. This data is not used to build models. Models are built from first principles, not data
—————————————————
“Bauer et al. used a large aerosol effect and still needed a large deforestation warming to bring her results in line with the Mann et al. reconstruction (in fact, it was done specifically for that reason)”
http://di2.nu/foia/foia2011/mail/1891.txt
Pamela Gray says:
June 23, 2012 at 8:38 am (Edit)
Steven, a significant portion of your time as a scientist should be spent explaining what you have said. To refuse to do so by telling others to figure it out for themselves seems a bit juvenile and overly dressed in “ex-spurt” clothes. If questions come your way, kindly doing your best to answer them seems the better approach, especially with such a mixed audience of arm-chair enthusiasts such as myself and truly learned contributors. I for one appreciate your post. Don’t spoil it.
############################
Pamela. I am not a scientist. I create tools for others to investigate data that has been made freely available. I provide answers and help to anyone who shares my commitment to the following
1. use your real name when you post
2. post your data
3. post your code
I provide those tools so that people can see for themselves and do their own damn work.
I routinely get requests to “do this chart” or “do that chart” Guess what? I’m not your chart monkey. Im not your Phd director , read some damn papers, use google scholar, read bibliographies. read more, comment less. answer questions for yourself UNLESS YOU CANT.
I give you all the tools you need to answer the questions you have. If I spend time showing you the library or coding up a chart for you, that LESS TIME I have to build a tool for a guy who wants to do his own work or who wants to work with my software.
I don’t think EMSmiths work in this case is about “raw” data.
############
since GHCN is unadjusted and and he claims to use GHCNv3 unadjusted, we can probably agree this is about unadjusted.
sunshine
:”He has no interest in the third of stations even Mueller admitted were cooling.”
This is a very crude misunderstanding of a bad chart.
What’s more, sunshine knows this is wrong since I spent a considerable amount of time looking at this issue. A point which Carrick can attest to since he gave me some really interesting ideas.
However, remember this is sunshine. he posts data that is estimated and does not disclose that fact.
This issue related to ENSO parameters possibly conflagrating with station dropout as well as station degradation has to do with what we are experiencing in NE Oregon. Every year for the past 7 years, we have experienced much cooler Spring and Summer temperatures according to our ranch thermometers, corr0borated by late starts on much of the agricultural products, and decreased insect populations ubiquatious to our area. We have also experienced a massive surge in cold water loving sports fish.
Of interesting note, one of the local extreme NE Oregon weather stations that has a long term record has undergone significant deterioration and has not recorded this colding, while others not so deteriorated in nearby counties have.
NE Oregon is like the canary in the coalmine, being situated in a geographic area highly sensitive to ENSO oscillations. So much so that for decades peas will disappear from our fields then surge back, simply because peas are what we can, or cannot, grow under the highly sensitive conditions we are facing. Salmon and steelhead make their way up our alpine rivers in similar decadal oscillating fashion, falling to barely countable numbers year after year, then rising to such numbers that fishing season stretches for weeks and weeks year after year. And we are not the only canary. ENSO patterns result in other highly sensitive geographic areas across the US.
If station dropout concentrated itself to these ENSO sensitive geographic areas, it could have quite an affect on anomalies I would think. I think there are socially related reasons for this. Unstable climate areas tend to not encourage population growth and station dropout may be more frequent in areas not described as population centers.
dp says:
June 23, 2012 at 9:25 am (Edit)
Start over and show where Smith’s error is. All you’ve shown is you have arrived at a different result with different methods. No surprise. I could do a third and be different again. It would not show that Smith or you are right (or wrong).
##################################
1. to do that with certainty would require EM to post his data and code in a turnkey fashion
like steve mcintyre, zeke, nick stokes, willis, and I do.
2. The flaws in First differences, established by skeptic Jeff Id, should be enough to
give you pause. My question to you is why didnt you spot that he had used
a discredited method? was it the fact that you liked the answer that made you
drop your skepticism? was it the fact that he called it “peer reviewed” that
bamboozled you?
“Steven Mosher claimed that most adjustments came from Tobs not only in US. TObs adjustments are generally made in all countries but they are weak and the problem is usually totally different. In this regard, US is in fact a special case. Very curious.”
The US is not a special case. Its probably the largest case, but the problem exists in other countries as well: Japan, Canada, Norway, perhaps a bit in Australia, and in all the extra long series that go back to 1800 and beyond. Thankfully with the Berkeley approach we dont have to explicitly do this adjustment.
sunshine:
“Mosher clearly has stated numerous times he believes if CO2 has increased it must have warmed the earth. Therefore he works hard to find some magical formula that proves crap data proves the earth is warming.”
Wrong. Basic physics tell us that C02 is a green house gas. If C02 increases, then, all other things being equal, the earth will warm. How much? that is the really hard question.
The temperature record has little to say out this in the short term.
is the earth warming? Yes, there was a little Ice Age. I don’t know anyone who argues that
it is cooler now than in the LIA. but go ahead, fire away with your best data.
Pamela Gray says:
June 24, 2012 at 6:41 pm
David, did you correlate it with ENSO geographic location?
>>>>>>>>
No, I didn’t. (I was just beginning my climate self education and had no idea what ENSO was back then). I looked at it strictly from the perspective of any major grouping of drop outs (or the reverse earlier in the record).