The Smoking Gun At Darwin Zero

by Willis Eschenbach

People keep saying “Yes, the Climategate scientists behaved badly. But that doesn’t mean the data is bad. That doesn’t mean the earth is not warming.”

Darwin Airport – by Dominic Perrin via Panoramio

Let me start with the second objection first. The earth has generally been warming since the Little Ice Age, around 1650. There is general agreement that the earth has warmed since then. See e.g. Akasofu . Climategate doesn’t affect that.

The second question, the integrity of the data, is different. People say “Yes, they destroyed emails, and hid from Freedom of information Acts, and messed with proxies, and fought to keep other scientists’ papers out of the journals … but that doesn’t affect the data, the data is still good.” Which sounds reasonable.

There are three main global temperature datasets. One is at the CRU, Climate Research Unit of the University of East Anglia, where we’ve been trying to get access to the raw numbers. One is at NOAA/GHCN, the Global Historical Climate Network. The final one is at NASA/GISS, the Goddard Institute for Space Studies. The three groups take raw data, and they “homogenize” it to remove things like when a station was moved to a warmer location and there’s a 2C jump in the temperature. The three global temperature records are usually called CRU, GISS, and GHCN. Both GISS and CRU, however, get almost all of their raw data from GHCN. All three produce very similar global historical temperature records from the raw data.

So I’m still on my multi-year quest to understand the climate data. You never know where this data chase will lead. This time, it has ended me up in Australia. I got to thinking about Professor Wibjorn Karlen’s statement about Australia that I quoted here:

Another example is Australia. NASA [GHCN] only presents 3 stations covering the period 1897-1992. What kind of data is the IPCC Australia diagram based on?

If any trend it is a slight cooling. However, if a shorter period (1949-2005) is used, the temperature has increased substantially. The Australians have many stations and have published more detailed maps of changes and trends.

The folks at CRU told Wibjorn that he was just plain wrong. Here’s what they said is right, the record that Wibjorn was talking about, Fig. 9.12 in the UN IPCC Fourth Assessment Report, showing Northern Australia:

Figure 1. Temperature trends and model results in Northern Australia. Black line is observations (From Fig. 9.12 from the UN IPCC Fourth Annual Report). Covers the area from 110E to 155E, and from 30S to 11S. Based on the CRU land temperature.) Data from the CRU.

One of the things that was revealed in the released CRU emails is that the CRU basically uses the Global Historical Climate Network (GHCN) dataset for its raw data. So I looked at the GHCN dataset. There, I find three stations in North Australia as Wibjorn had said, and nine stations in all of Australia, that cover the period 1900-2000. Here is the average of the GHCN unadjusted data for those three Northern stations, from AIS:

Figure 2. GHCN Raw Data, All 100-yr stations in IPCC area above.

So once again Wibjorn is correct, this looks nothing like the corresponding IPCC temperature record for Australia. But it’s too soon to tell. Professor Karlen is only showing 3 stations. Three is not a lot of stations, but that’s all of the century-long Australian records we have in the IPCC specified region. OK, we’ve seen the longest stations record, so lets throw more records into the mix. Here’s every station in the UN IPCC specified region which contains temperature records that extend up to the year 2000 no matter when they started, which is 30 stations.

Figure 3. GHCN Raw Data, All stations extending to 2000 in IPCC area above.

Still no similarity with IPCC. So I looked at every station in the area. That’s 222 stations. Here’s that result:

Figure 4. GHCN Raw Data, All stations extending to 2000 in IPCC area above.

So you can see why Wibjorn was concerned. This looks nothing like the UN IPCC data, which came from the CRU, which was based on the GHCN data. Why the difference?

The answer is, these graphs all use the raw GHCN data. But the IPCC uses the “adjusted” data. GHCN adjusts the data to remove what it calls “inhomogeneities”. So on a whim I thought I’d take a look at the first station on the list, Darwin Airport, so I could see what an inhomogeneity might look like when it was at home. And I could find out how large the GHCN adjustment for Darwin inhomogeneities was.

First, what is an “inhomogeneity”? I can do no better than quote from GHCN:

Most long-term climate stations have undergone changes that make a time series of their observations inhomogeneous. There are many causes for the discontinuities, including changes in instruments, shelters, the environment around the shelter, the location of the station, the time of observation, and the method used to calculate mean temperature. Often several of these occur at the same time, as is often the case with the introduction of automatic weather stations that is occurring in many parts of the world. Before one can reliably use such climate data for analysis of longterm climate change, adjustments are needed to compensate for the nonclimatic discontinuities.

That makes sense. The raw data will have jumps from station moves and the like. We don’t want to think it’s warming just because the thermometer was moved to a warmer location. Unpleasant as it may seem, we have to adjust for those as best we can.

I always like to start with the rawest data, so I can understand the adjustments. At Darwin there are five separate individual station records that are combined to make up the final Darwin record. These are the individual records of stations in the area, which are numbered from zero to four:

Figure 5. Five individual temperature records for Darwin, plus station count (green line). This raw data is downloaded from GISS, but GISS use the GHCN raw data as the starting point for their analysis.

Darwin does have a few advantages over other stations with multiple records. There is a continuous record from 1941 to the present (Station 1). There is also a continuous record covering a century. finally, the stations are in very close agreement over the entire period of the record. In fact, where there are multiple stations in operation they are so close that you can’t see the records behind Station Zero.

This is an ideal station, because it also illustrates many of the problems with the raw temperature station data.

  • There is no one record that covers the whole period.
  • The shortest record is only nine years long.
  • There are gaps of a month and more in almost all of the records.
  • It looks like there are problems with the data at around 1941.
  • Most of the datasets are missing months.
  • For most of the period there are few nearby stations.
  • There is no one year covered by all five records.
  • The temperature dropped over a six year period, from a high in 1936 to a low in 1941. The station did move in 1941 … but what happened in the previous six years?

In resolving station records, it’s a judgment call. First off, you have to decide if what you are looking at needs any changes at all. In Darwin’s case, it’s a close call. The record seems to be screwed up around 1941, but not in the year of the move.

Also, although the 1941 temperature shift seems large, I see a similar sized shift from 1992 to 1999. Looking at the whole picture, I think I’d vote to leave it as it is, that’s always the best option when you don’t have other evidence. First do no harm.

However, there’s a case to be made for adjusting it, particularly given the 1941 station move. If I decided to adjust Darwin, I’d do it like this:

Figure 6 A possible adjustment for Darwin. Black line shows the total amount of the adjustment, on the right scale, and shows the timing of the change.

I shifted the pre-1941 data down by about 0.6C. We end up with little change end to end in my “adjusted” data (shown in red), it’s neither warming nor cooling. However, it reduces the apparent cooling in the raw data. Post-1941, where the other records overlap, they are very close, so I wouldn’t adjust them in any way. Why should we adjust those, they all show exactly the same thing.

OK, so that’s how I’d homogenize the data if I had to, but I vote against adjusting it at all. It only changes one station record (Darwin Zero), and the rest are left untouched.

Then I went to look at what happens when the GHCN removes the “in-homogeneities” to “adjust” the data. Of the five raw datasets, the GHCN discards two, likely because they are short and duplicate existing longer records. The three remaining records are first “homogenized” and then averaged to give the “GHCN Adjusted” temperature record for Darwin.

To my great surprise, here’s what I found. To explain the full effect, I am showing this with both datasets starting at the same point (rather than ending at the same point as they are often shown).

Figure 7. GHCN homogeneity adjustments to Darwin Airport combined record

YIKES! Before getting homogenized, temperatures in Darwin were falling at 0.7 Celcius per century … but after the homogenization, they were warming at 1.2 Celcius per century. And the adjustment that they made was over two degrees per century … when those guys “adjust”, they don’t mess around. And the adjustment is an odd shape, with the adjustment first going stepwise, then climbing roughly to stop at 2.4C.

Of course, that led me to look at exactly how the GHCN “adjusts” the temperature data. Here’s what they say

GHCN temperature data include two different datasets: the original data and a homogeneity- adjusted dataset. All homogeneity testing was done on annual time series. The homogeneity- adjustment technique used two steps.

The first step was creating a homogeneous reference series for each station (Peterson and Easterling 1994). Building a completely homogeneous reference series using data with unknown inhomogeneities may be impossible, but we used several techniques to minimize any potential inhomogeneities in the reference series.

In creating each year’s first difference reference series, we used the five most highly correlated neighboring stations that had enough data to accurately model the candidate station.

The final technique we used to minimize inhomogeneities in the reference series used the mean of the central three values (of the five neighboring station values) to create the first difference reference series.

Fair enough, that all sounds good. They pick five neighboring stations, and average them. Then they compare the average to the station in question. If it looks wonky compared to the average of the reference five, they check any historical records for changes, and if necessary, they homogenize the poor data mercilessly. I have some problems with what they do to homogenize it, but that’s how they identify the inhomogeneous stations.

OK … but given the scarcity of stations in Australia, I wondered how they would find five “neighboring stations” in 1941 …

So I looked it up. The nearest station that covers the year 1941 is 500 km away from Darwin. Not only is it 500 km away, it is the only station within 750 km of Darwin that covers the 1941 time period. (It’s also a pub, Daly Waters Pub to be exact, but hey, it’s Australia, good on ya.) So there simply aren’t five stations to make a “reference series” out of to check the 1936-1941 drop at Darwin.

Intrigued by the curious shape of the average of the homogenized Darwin records, I then went to see how they had homogenized each of the individual station records. What made up that strange average shown in Fig. 7? I started at zero with the earliest record. Here is Station Zero at Darwin, showing the raw and the homogenized versions.

Figure 8 Darwin Zero Homogeneity Adjustments. Black line shows amount and timing of adjustments.

Yikes again, double yikes! What on earth justifies that adjustment? How can they do that? We have five different records covering Darwin from 1941 on. They all agree almost exactly. Why adjust them at all? They’ve just added a huge artificial totally imaginary trend to the last half of the raw data! Now it looks like the IPCC diagram in Figure 1, all right … but a six degree per century trend? And in the shape of a regular stepped pyramid climbing to heaven? What’s up with that?

Those, dear friends, are the clumsy fingerprints of someone messing with the data Egyptian style … they are indisputable evidence that the “homogenized” data has been changed to fit someone’s preconceptions about whether the earth is warming.

One thing is clear from this. People who say that “Climategate was only about scientists behaving badly, but the data is OK” are wrong. At least one part of the data is bad, too. The Smoking Gun for that statement is at Darwin Zero.

So once again, I’m left with an unsolved mystery. How and why did the GHCN “adjust” Darwin’s historical temperature to show radical warming? Why did they adjust it stepwise? Do Phil Jones and the CRU folks use the “adjusted” or the raw GHCN dataset? My guess is the adjusted one since it shows warming, but of course we still don’t know … because despite all of this, the CRU still hasn’t released the list of data that they actually use, just the station list.

Another odd fact, the GHCN adjusted Station 1 to match Darwin Zero’s strange adjustment, but they left Station 2 (which covers much of the same period, and as per Fig. 5 is in excellent agreement with Station Zero and Station 1) totally untouched. They only homogenized two of the three. Then they averaged them.

That way, you get an average that looks kinda real, I guess, it “hides the decline”.

Oh, and for what it’s worth, care to know the way that GISS deals with this problem? Well, they only use the Darwin data after 1963, a fine way of neatly avoiding the question … and also a fine way to throw away all of the inconveniently colder data prior to 1941. It’s likely a better choice than the GHCN monstrosity, but it’s a hard one to justify.

Now, I want to be clear here. The blatantly bogus GHCN adjustment for this one station does NOT mean that the earth is not warming. It also does NOT mean that the three records (CRU, GISS, and GHCN) are generally wrong either. This may be an isolated incident, we don’t know. But every time the data gets revised and homogenized, the trends keep increasing. Now GISS does their own adjustments. However, as they keep telling us, they get the same answer as GHCN gets … which makes their numbers suspicious as well.

And CRU? Who knows what they use? We’re still waiting on that one, no data yet …

What this does show is that there is at least one temperature station where the trend has been artificially increased to give a false warming where the raw data shows cooling. In addition, the average raw data for Northern Australia is quite different from the adjusted, so there must be a number of … mmm … let me say “interesting” adjustments in Northern Australia other than just Darwin.

And with the Latin saying “Falsus in unum, falsus in omis” (false in one, false in all) as our guide, until all of the station “adjustments” are examined, adjustments of CRU, GHCN, and GISS alike, we can’t trust anyone using homogenized numbers.

Regards to all, keep fighting the good fight,

w.

FURTHER READING:

My previous post on this subject.

The late and much missed John Daly, irrepressible as always.

More on Darwin history, it wasn’t Stevenson Screens.

NOTE: Figures 7 and 8 updated to fix a typo in the titles. 8:30PM PST 12/8 – Anthony

4.7 3 votes
Article Rating

Discover more from Watts Up With That?

Subscribe to get the latest posts sent to your email.

909 Comments
Inline Feedbacks
View all comments
carrot eater
December 16, 2009 5:09 am

Geoff Sherrington (03:18:23) :
I don’t know what you’re going on about. It is absolutely without a doubt that the GHCN does not use the Australian ‘high quality’ homogenised reference data that you’ve been talking about.
Other adjustments, like going back and finding new data, finding misrecorded or mislabeled data, and so on, are perfectly fine, and will happen continuously.

3x2
December 16, 2009 7:52 am

Probably a dead thread by now but …
Concerning the adjustments made for moves and equipment changes at Darwin, is the maximum adjustment (2°C (+/- doesn’t matter)) even possible?
(from v2.mean) If we take the january mean (over the whole record no adjustments) as around 28 and the july mean as 25 we have a difference of 3°C over the whole year. An adjustment of (v2.mean_adj) 2°C within that band seems a little excessive without some kind of major problem in the equipment or the site.

carrot eater
December 16, 2009 10:42 am

3×2 (07:52:00) :
I still check about once a day the thread, so I’ll see what you say.
We have seen that Darwin is atypical in how large the adjustment is, but yes, it is still a curiosity as to why that came about. This will take some time, gathering the neighboring stations, doing correlation checks, and so on. I don’t have the time for this now, but hopefully somebody else does.
The time of most interest here (due to the big stepladder) is 1940 to 2000.

carrot eater
December 16, 2009 10:50 am

Oh, and another remaining point of some interest is the treatment of duplicate records (series 0 to 4, where they overlap, are largely duplicate), how each one is homogenised by GHCN, and how they’re finally pasted together (In Willis Fig 7, you can see the overall result has much more moderate adjustment).
I’ve finally found what I think is the raw data on the ABoM page, and it is a better way to get it than GHCN or GISS because it explicitly separates the station moves – there are separate files for Airport and PO.

December 16, 2009 12:46 pm

carrot eater (05:03:28) :
“Although we should be a bit careful; the data being analysed here are land-only, and we’re comparing it to land+ocean. Without looking, I’m not sure how big a deal that is, but the oceans do slow things down.
“Isn’t a +0.0172 C/decade bias then a significant 26% of the supposed warming – reducing the unbiased warming to 0.048 C/decade?”
You’re a priori assuming that any net effect due to homogenisation is bad?
“And doesn’t this have a significant implication for an inferred CO2 sensitivity?”
Zero implication.”
No, I am not priori assuming that any net effect due to homogenisation is bad. What I am saying is this:
(1) So now it is out in the open. We now objectively know homogenisation introduces a positive bias. It has a positive sign. We can see it is NOT a trivial number. We saw plenty of lead-up DENIAL by the warmists about that!
(2) At the end of the day that ADDED +0.017 C/decade bias is a scientific judgement call. It’s peer review should have been a transparent part of the IPCC process. It was not. Ergo: it should have been subject to a transparent international QC/QA process by IPCC. It was not.
(3) You appear to be oblivious of these well known aspects of ‘consensual’ reality:
http://www.globalwarmingart.com/wiki/File:Climate_Change_Attribution_png
Zero implication for CO2 sensitivity my ass.
You’ve been eating far too many carrots, Bugs.

Geoff Sherrington
December 16, 2009 5:50 pm

carrot eater (10:42:31) :
Darwin is not atypical in Australia in having a large GISS adjustment to a long set. There are other 100-year records of similar shape. Here is Broome, which is 1,100 km s-w of Darwin, showing the anomaly graphs of Giss unadjusted & adjusted, taken from KNMI in Dec_09. (The BOM online data show an essentially horizontal line over the period from 1940 that I have worked on, but not finished yet) for both Tmax and Tmin.
http://i260.photobucket.com/albums/ii14/sherro_2008/BroomeGISSunadjusted.jpg?t=1261014137
http://i260.photobucket.com/albums/ii14/sherro_2008/BroomeGiss-1.jpg?t=1261014172
Quite a difference generated from a flat response, eh?
…………………………………………..
We seem to be misunderstanding each other a little. I have not mentioned the Reference Climate Station set in this discussion. My interest are broader and in short sentence form can be expressed as:
1. Australia’s BOM sends data to USA & British bodies for incorporation in global sets. Which authorities are primary recipients?
2. How do you know that these are the necessary and sufficient bodies for the purposes of this thread? Can you confirm it independently? e.g. does NOAA and GHCN both get data or does it go to WMO?
3. Which geographic set of Australian information is currently sent? Is if the RCS set, or a different one?
4. Are the data aggregated into monthly before sending?
5. Are the data as currenly used by the USA folks available in a file that can be accessed today, or have there been a number of files, some of which might have been updated in Australia – or not updated?
6. What is a site for all Australian data so far digitised, in primordial, raw form?
I do not know where you work so I do not know if you can answer these. Maybe Nick Stokes (ditto).

December 16, 2009 9:51 pm

Sigh, in short….a quality control/quality assurance problem.

Geoff Sherrington
December 17, 2009 12:19 am

If you can work out the formal meaning of “correlation %” then here are the curves for Australia by Della-Marta et al. If the Y-axis is correlation coefficient x 100, then you will see very few correlations above 0.8. What are the implications of this for Australia? Easy. Just lower the hurdle to 0.5 or something that gives you more data points to work with. The science is settled.
http://i260.photobucket.com/albums/ii14/sherro_2008/d-m05.jpg?t=1261037758

Gavin Andrews
December 17, 2009 6:58 am

Willis,
That’s an interesting article, and you’ve obviously put a fair amount of work into it, however I believe you’ve missed a fairly vital factor that I’d expect to be largely responsible for the amplification of the rise in temperatures post 1941 in the homogonised GHCN temperature records.
Darwin station is surrounded on 3 sides by the ocean, and is around 1-2km from the Ocean. The ocean acts as a major moderating factor of climate, so any stations located close to the ocean will show much more moderate increases in temperature than landlocked stations. If the darwin data was being used solely to demonstrate temperature change in Darwin itself, then this would not be a factor, and there would be no justification for adjusting for it.
However, the GHCN dataset is not used for this purpose, it is used to extrapolate temperature change across the entire region, and must therefore take the proximity to the coast into account and adjust the figures accordingly. In order for the Darwin dataset to best represent the entire region any change would need to be amplified to represent the average change across the region, most of which is landlocked, and only a small percentage of which is located close to the coast.
It will be this adjustment to the actual station data to allow a coastal stations readings to better represent the average for the area that is causing the amplification of the warming trend post 1941 that you’re finding hard to explain.
I can see why it would look odd at first glance, but if you think about it, it would be much worse to attempt to represent the average temperature of an entire mostly landlocked region based on uncorrected data from a coastal station that’s not at all representative of the average geography of the region.
If you’re wondering how they’d come up with the correction factors to use, essentially they’d take data from several areas of the world where there were stations at relatively regular intervals moving inland from the coast, produce an average graph of the difference in temperature change related to distance from the coast, and use this to calculate a correction factor to apply in areas where coastal stations are the only available stations based on the distance a station was from the coast compared to the average distance from the coast of the surrounding area. They’d also do the a similar thing to adjust for height above sea level, topography, vegetation and population density.
I’ll have a dig around and see if I can find any papers to support what I’m saying, but this how I remember the process used being taught to me when I was studying it shortly after the GHCN dataset was first released.
Granted it’s not a perfect method of estimating climate change across australia, but until someone invents a time machine that allows scientists to establish extra long term inland climate monitoring stations across areas where the network is sparse, it’s the best estimate that can be produced, and we really have to live with that rather than demanding perfection when this is an impossible to achieve.

Gavin Andrews
December 17, 2009 9:24 pm

Willis,
It looks like I was wrong about the methodology used, but at least partially right about the overall reason for the amplification of the rise over the period from the 1940’s onwards. In working this out though I’ve also sussed out a major flaw to your article…
Basically as far as I can work out, as additional air temp monitoring stations data comes on line from stations surrounding Darwin they use the data from these stations to create a reference series, which they then use to homogonise the Darwin Airport data. Via the BOM website, I’ve checked the data for most of the surrounding sites, and found 6 that seem to meet the criteria quoted in your article. 5 out of the 6 surrounding sites are located inland away from the coast, and as I predicted, all 5 show a significantly bigger warming signature in the parts of the 1940’s-1990’s than either Darwin or the other coastal site.
Presumably as outlined in the GHCN documents you link to, the homogonised data from these sites (and possibly others I’ve missed) would then have been used to produce a reference series which would have been used to adjust the Darwin series, to produce a homogonised series with a significantly higher rate of increase from the 1950s-1990’s.
Pre 1950ish, I don’t think there’s enough other local stations to be able to homogonise the data using this method, but there is enough meta data for the location to enable the data analyst to manually adjust the data, apparently due to tree shading the Post Office site in the 1930’s, and the move to the Airport in 1942. (Butterworth)
Now I’ll come to the major flaw in your article…
You state that there are 5 individual station records that combine to form the Darwin record, and then at the end of the article use a graph from the Darwin Zero dataset as the conclusion to the article. The GHCN dataset derives it’s data from the BOM monitoring stations, yet on the full list of all monitoring stations every operating at all in Australia, there are only 4 stations listed for Darwin, these being
Darwin Airport
Darwin Airport Comparison
Darwin Post Office
Darwin Regional Office
Darwin Zero is not an actual monitoring site then, and I’m 99% certain that Darwin Zero is actually just the name given to the file for the reference dataset of the average temperatures of the surrounding sites, running alongside the unajusted average data for Darwin. This also explains why this dataset ends around 1993, which ties in roughly with the end date on the original GHCN datasets being worked up.
After this point (1993) on the full graph as well, it’s notable that the correction factor remains constant, which is consistant with GHCN having stopped actively doing the comparisons beyond this point, and simply using the last correction factor produced from the original data analysis. This would corroborate the idea that the Darwin Zero data is the reference dataset.
Bottom line, as far as I can see the homogonisation has been done in the way they describe where the data is available, and they’ve reverted to using the metadata to make some justified adjustments prior to that point, which is exactly the way it should be done.
The main point that really needs to be understand in all this though is that this dataset is not aimed at producing the most accurate data for temperature change in Darwin city, it’s aimed at using Darwin’s temperature data and the temperature data from surrounding stations to produce the most accurate estimate for average air temperature change across the Darwin region as part of a global dataset that’s used to estimate global temperature change.
here’s the full list of australian monitoring stations ftp://ftp.bom.gov.au/anon2/home/ncc/metadata/lists_by_element/alpha/alphaAUS_3.txt
and here are the 6 stations that I’ve found that I believe meet the criteria set for being part of the reference series
http://www.bom.gov.au/jsp/ncc/cdio/weatherData/av?p_display_type=dataGraph&p_stn_num=014633&p_nccObsCode=36&p_month=13
http://www.bom.gov.au/jsp/ncc/cdio/weatherData/av?p_display_type=dataGraph&p_stn_num=015131&p_nccObsCode=36&p_month=13
http://www.bom.gov.au/jsp/ncc/cdio/weatherData/av?p_display_type=dataGraph&p_stn_num=014840&p_nccObsCode=36&p_month=13
http://www.bom.gov.au/jsp/ncc/cdio/weatherData/av?p_display_type=dataGraph&p_stn_num=014612&p_nccObsCode=36&p_month=13
http://www.bom.gov.au/jsp/ncc/cdio/weatherData/av?p_display_type=dataGraph&p_stn_num=014902&p_nccObsCode=36&p_month=13
http://www.bom.gov.au/jsp/ncc/cdio/weatherData/av?p_display_type=dataGraph&p_stn_num=014401&p_nccObsCode=36&p_month=13

December 17, 2009 9:55 pm

Here is what seems a thoughtful analysis of the Darwin issue from the other side. I’m not qualified to judge its merits:
http://www.economist.com/blogs/democracyinamerica/2009/12/trust_scientists

Geoff Sherrington
December 17, 2009 10:50 pm

Gavin,
It would help if you indicated where you are guessing, where you have documentation and where you know you are right.
Darwin is not surrounded on 3 sides by sea. There is a small tidal creek to the S-E, a few km long, but one can hardly count that in. On a scale of say 500 km, there is marginally more sea off Darwin than there is land to the interior.
There are many, many places where temperatures have been taken within 300 km of Darwin. Oenpelli, Port Keats, Pine Creek, Daly Waters, Katherine are some, with establishment dates going back to the 1870s. The question is, which of these, if any, are used for “adjustment” of Darwin, and by whom, and for how long, and for what purpose?
Darwin Zero I’m not sure of, but the 1993 date seems to be when observations ceased at the original Post Office site 014016. The move to the Airport 014015 was in 1940. There is an overlap period of 7 years documented between these 2, ending in 1993.
It might also be relevany that about June 1990, there was a change from daily to half-hourly recording. It is likely that this resulted in a step change, but it might have been spliced and tapered over several years.
Willis has shown above that there are really poor correlation coefficients between Darwin and other major bases. This would make them non-starters for adjustment.
Now, Gavin, you write “The ocean acts as a major moderating factor of climate, so any stations located close to the ocean will show much more moderate increases in temperature than landlocked stations.” If this statement was correct – and it is not – what conditions prevailed 100 years ago, 200 years ago, 1000 years ago? The sea, if it moderates, can do so only in a period of long change. If you take the last 100 years and compare the slope of Darwin’s temperature (about 1.2 deg C cooling) with that of say Alice Springs, (0.8 deg C warming), you will realise that this differncing cannot go on forever, or it would have snowed in Darwin a few centuries ago. All you have done id to poiunt to the need for long records, which we know anyhow.
Darwin Regional Office is now in northerly suburbia at 13 Scaturchio St Casuarina. It is used more for other purposes than taking temperatures at ground level.
Willis was basically correct. I could argue a few minor points with him, but it’s best to read again and learn from him.
The “adjusted” data that used to be put out by Giss and was adopted by KNMI is just dreamin’ . It has no basis in physics, mathematics or reality.
Finally, the primary purpose of the weather station 014015 is for aviation. It can be quite tricky landing at Darwin, sometimes with a severe inversion a few m above ground. That is one reason for the unusually long airstrip 29 of 3.35 km. Pilots want to know the temperature as they are about to land, not some complicated adjustment of it.
So have a think about what Willis and Steve and I have written and then come back with info that you are certain about.

Willis Eschenbach
December 17, 2009 11:55 pm

Don Dixon (21:55:21) :

Here is what seems a thoughtful analysis of the Darwin issue from the other side. I’m not qualified to judge its merits:

Don, that is not a “thoughtful analysis”. It is a farrago of lies, ad hominem attacks, and misunderstandings. See my response at “Willis: Reply to the Economist”.

Willis Eschenbach
December 18, 2009 12:13 am

Gavin Andrews (21:24:15) :

Willis,
It looks like I was wrong about the methodology used, but at least partially right about the overall reason for the amplification of the rise over the period from the 1940’s onwards. In working this out though I’ve also sussed out a major flaw to your article…
Basically as far as I can work out, as additional air temp monitoring stations data comes on line from stations surrounding Darwin they use the data from these stations to create a reference series, which they then use to homogonise the Darwin Airport data. Via the BOM website, I’ve checked the data for most of the surrounding sites, and found 6 that seem to meet the criteria quoted in your article. 5 out of the 6 surrounding sites are located inland away from the coast, and as I predicted, all 5 show a significantly bigger warming signature in the parts of the 1940’s-1990’s than either Darwin or the other coastal site.
Presumably as outlined in the GHCN documents you link to, the homogonised data from these sites (and possibly others I’ve missed) would then have been used to produce a reference series which would have been used to adjust the Darwin series, to produce a homogonised series with a significantly higher rate of increase from the 1950s-1990’s.

You can’t just claim that the sites “seem to meet the criteria quoted in [my] article.” The fact is that they don’t. The earliest of these starts in 1941, the second earliest in 1965. Thus they are useless for the 1920, 1930, and 1950 adjustments.

Pre 1950ish, I don’t think there’s enough other local stations to be able to homogonise the data using this method, but there is enough meta data for the location to enable the data analyst to manually adjust the data, apparently due to tree shading the Post Office site in the 1930’s, and the move to the Airport in 1942. (Butterworth)

Say what? The GHCN specifically says if there’s not enough other local stations to homogenize a given station, they don’t use the station. In other words … you’re just making it up as you go along.

Now I’ll come to the major flaw in your article…
You state that there are 5 individual station records that combine to form the Darwin record, and then at the end of the article use a graph from the Darwin Zero dataset as the conclusion to the article. The GHCN dataset derives it’s data from the BOM monitoring stations, yet on the full list of all monitoring stations every operating at all in Australia, there are only 4 stations listed for Darwin, these being
Darwin Airport
Darwin Airport Comparison
Darwin Post Office
Darwin Regional Office
Darwin Zero is not an actual monitoring site then, and I’m 99% certain that Darwin Zero is actually just the name given to the file for the reference dataset of the average temperatures of the surrounding sites, running alongside the unajusted average data for Darwin. This also explains why this dataset ends around 1993, which ties in roughly with the end date on the original GHCN datasets being worked up.

If this were true, if Darwin Zero were just an average of the other four files, then why does it end in 1991? (NB, it does not end in 1993 as you claim.) The other four records end in 1980, 1990, 1994, and 2009. Why would an average of those four records end in 1993?
In addition, look at the data. For a number of the years, Darwin Zero is NOT the average of the other data. Investigate 1952, for one of many examples.
Gavin, I appreciate your enthusiasm, but you need to read things more carefully. You also need to think your ideas out to the end, and actually test them. If you had averaged the other data, you would have seen at once that Darwin Zero simply is not the average of the others.
Best regards,
w.

Geoff Sherrington
December 18, 2009 3:16 am

Gavin,
You mention “and the move to the Airport in 1942. (Butterworth)”
Actually, Butterworth is in Malaysia and has essentially nought to do with the argument.
How much are you making up as you go along?
We are not setting out to show that these raw data are invalid unless they are adjusted. We don’t think that flat data need to be given a warming trend.
To the contrary, we are asking why plausible data need to be adjusted, especially to the synthetic degree noted by Willis.

Geoff Sherrington
December 18, 2009 3:58 am

Don Dixon (21:55:21) :
If I was a school master marking the article you quote, I’d give about 2/10. It is absolutley riddled with wrong statements, unjustified inferences, misquotes, etc. It is NOT science at work. Its author admits to not understanding correlation coefficients.
Blair Trewin would need to answer the question of why station shifts around Darwin Airport grounds resulted in an upward adjustment. In the comparison with the old site from 1967 to 1973, the respective means are near enough to identical at the 2 sites and they are more than 4 km apart. So why does it get hotter each time you move the site around within the confines of the airport? Why is there a need to make a step adjustment to the mean temp at the time of the shift to the airport in 1940, if the stations have identical means over a later 7 year daily test period?
Repeating again, Darwin is NOT surrounded on 3 sides with water, to a degree that the physics of latent heat of evaporation and the likes would be likely to make a detectable T difference (Caveat: I have not done the calculations as they do not warrant the time, so that statement is intuitive and not science based). Also, if there is land 9 km WEST of the old met station. The prevailing winds blow very little of the time exactly along the path of the mouth of Elizabeth River and it is a bit of a red herring – especially when the tide is out, for there is a tidal range in Darwin Harbour in the vicinity of the old Met Office of some 3-5 m or so. Yes, there is a limited amount of water to the south, but the south is a short line on the wind rose.
Also, the present site of the weather station at the airport is just over the road from one of our former company offices. I used to sit there watching the B52s and KC-135 tankers blasting heat as they taxied for takeoff just 350 m away from where the station is now. These days there are jumbo jets, bad scenario, pilots need to push up the power on taxi to turn sharp left heavily loaded, with wash heading towards the station.
Why do people write about subjects about which they know so little?

Street
December 18, 2009 5:16 am

Gavin Wrote:
“Darwin Zero is not an actual monitoring site then, and I’m 99% certain that Darwin Zero is actually just the name given to the file for the reference dataset of the average temperatures of the surrounding sites”
This part of your theory is interesting. We’ve seen occasions where the infilling process has kept creating measurements for years after a station has closed down. If Darwin Zero was created in the database as a new station in 1993, might the GHCN have infilled the entire measurement history by accident?
It would have to be an accident though. The process you describe goes against everything I’ve read on the adjustment process. What you are describing, creating a temperature series that reflects the region, is the gridding process.

Geoff Sherrington
December 18, 2009 5:21 am

Geoff Sherrington (22:50:46) :
To self – correction – the Darwin between site comparison ended 1973, not 1993. If 1993 was also the wrong date for the end of Darwin 0 and 1990 was correct, than that is the year when half-hourly readings commenced as per the brief public meta data sheet.

carrot eater
December 18, 2009 6:21 am

Steve Short (12:46:24) :
I think you’re mistaken on a number of points. Nothing is out in the open now, that wasn’t before. The analysis of gg was absolutely simple, and not news to anybody who follows the issue. But I still maintain that the effect is minor on a large scale (more below).
From your comment, one would think that nobody has tested and refined the adjustment methods before; that nobody has looked to see if they are giving reasonable results. You say it hasn’t been a ‘transparent part’ of the review process – I don’t know how you can just say that. Just because *you* don’t know of the work, doesn’t mean it hasn’t happened, or been published.
In order to finally assess what effect homogenisation actually has, you need to go a step further and grid the data and find spatial averages for whichever region. Like that plot people love to show for the US data. Do you think that plot would be on the NOAA page if they were somehow trying to hide it? No, but you can see there the effect of time of observation bias and automated weather station adjustments.
Or like the paper I keep citing from Peterson from way back in… 1995, (““The effect of artificial discontinuities on recent trends in minimum and maximum temperatures””) where he showed the difference between raw and homogenised GHCN for Northern Hemisphere, and then parts of China and the US. I don’t see how you can say that these things aren’t being looked at.
As for computing climate sensitivity: It isn’t done based on curve fits to the last century, though hindcasting is something of a test of the model that you’ve got. But even the graph that you show there would not change much, if you arbitrarily decided that all GHCN adjustments are bad and that all data should just be used, as is, regardless of how obviously bad it is, unadjusted. I’m basing that off the figures in Peterson’s paper, that show the effect of GHCN homogenisation. (which homogenisation, by the way, isn’t used by most others)

carrot eater
December 18, 2009 6:36 am

Geoff Sherrington (17:50:21) :
So gg has shown that Darwin is atypical, in general. You then say that it’s typical for Australia, by picking two examples. Now that’s a much less interesting question to me; but in any case, you’re still going about it the wrong way. Don’t pick me two stations. Use gg’s code to show me all of Australia. The station numbers have a country code in them, so the modification wouldn’t be hard. Of course, the spatial average would be better, but that would take rather more work.
“I have not mentioned the Reference Climate Station set in this discussion. ”
I’m pretty sure you have. You quoted directly from a file that listed the adjustments that were made for the purpose of that reference set – ‘high quality’, or whatever they call it.
“My interest are broader and in short sentence form can be expressed as:”
You basically want to know who mails exactly what to whom, when and how. Not something that interests me. The monthly max/min data are on the ABoM web page. Why not just compare it to whatever is on the GHCN, marked as raw? If there’s a difference, we can then try to figure it out. Further, the NOAA has a list of data sources, so you can see where they’re getting things from. As it happens, they get the same data from multiple sources sometimes, resulting in duplicates.
Finally, it’s my understanding that NOAA receives the monthly data. They mention that there are sometimes issues when different countries compute the mean in different ways. I don’t know if some country goes above and beyond and sends daily or hourly data, as well, or if the NOAA would even have the capacity to keep all that, if they wanted it.

carrot eater
December 18, 2009 6:56 am

Willis Eschenbach (21:32:14) :
The GHCN starts with raw data (inasumuch as monthly means are raw). It does not accept data unless the raw form is available. Now if the sending country messes up or lies, and sends something somewhat adjusted instead, that’s a different matter.
There might be some confusion here on what we all mean by ‘adjustment’. If there are obvious errors in the daily data (like a day where the temp was recorded as -5843 C), I think it’s reasonable for the nation’s weather service to leave that out before computing the monthly means. That’s quality control, not homogenisation.
“. There’s an adjustment in 1930, and 1950, but none around 1940. The Aussies, on the other hand, make huge adjustments around 1940. Make what you will of that.”
I noticed that pretty much right away. It isn’t that surprising. I wouldn’t expect the GHCN homogenisations to be timed exactly the same as those from somebody working with the metadata. The question is in the overall effect. That said, the GHCN adjustments for the record in “station 0” are stronger in overall effect, as well, as compared to the ABoM. Then again, the composite for Darwin looks reasonable again; just station 0 taken in isolation looks a bit weird.
So it may well be that the GHCN algorithm spit out somewhat unreasonable results for record 0.
“While the homogenization step may root out errors, it is also true that nearby stations may inherently not be homogeneous.”
If you read the literature, everybody is painfully aware of that. There’s no such thing a perfect reference network, because there’s no such thing as a station known to be perfect. I’m just saying that a total lack of correlation with anything could be an indication of messed up data, in case the QC step missed it.
“Why didn’t I do it your way? Lack of time. ”
I don’t think that’s valid. If you don’t have time to add some substance, then you don’t have time to make accusations of fraud or smoking gun. I’d suggest you could have just made a post saying, “this looks odd to me, but I haven’t put the work in yet.”
“All of the adjustments are of equal importance when we are trying to decide if GHCN did what they claimed to do. They didn’t, and it happens to be easiest to prove that using the earlier adjustments.”
When I have time, I’ll have to look at your findings for the early times. Or rather, just do it myself from scratch. I won’t have time for such a thing anytime soon, though. But I still maintain that your post would have received no particular interest if it were only the 1920s/1930s we were talking about.

carrot eater
December 18, 2009 7:01 am

I don’t agree with Gavin’s interpretation, but he comes across a point that keeps getting lost –
The multiple records that Willis took to be independent measurements simply aren’t. Where they overlap, they are largely duplicates.