Darwin Zero Before and After

Guest Post by Willis Eschenbach

Recapping the story begun at WUWT here and continued at WUWT here, data from the temperature station Darwin Zero in northern Australia was found to be radically adjusted and showing huge warming (red line, adjusted temperature) compared to the unadjusted data (blue line). The unadjusted data showed that Darwin Zero was actually cooling over the period of the record. Here is the adjustment to Darwin Zero:

Figure 1. The GHCN adjustments to the Darwin Zero temperature record.

Many people have written in with questions about my analysis. I thank everyone for their interest. I’m answering them as fast as I can. I cannot answer them all, so I am trying to pick the relevant ones. This post is to answer a few.

• First, there has been some confusion about the data. I am using solely GHCN numbers and methods. They will not match the GISS or the CRU or the HadCRUT numbers.

• Next, some people have said that these are not separate temperature stations. However, GHCN adjusts them and uses them as separate temperature stations, so you’ll have to take that question up with GHCN.

• Next, a number of people have claimed that the reason for the Darwin adjustment was that it is simply the result of the standard homogenization done by GHCN based on comparison with other neighboring station records. This homogenization procedure is described here (PDF).

While it sounds plausible that Darwin was adjusted as the GHCN claims, if that were the case the GHCN algorithm would have adjusted all five of the Darwin records in the same way. Instead they have adjusted them differently (see below). This argues strongly that they were not done by the listed GHCN homogenization process. Any process that changed one of them would change all of them in the same way, as they are nearly identical.

• Next, there are no “neighboring records” for a number of the Darwin adjustments simply because in the early part of the century there were no suitable neighboring stations. It’s not enough to have a random reference station somewhere a thousand km away from Darwin in the middle of the desert. You can’t adjust Darwin based on that. The GHCN homogenization method requires five well correlated neighboring “reference stations” to work.

From the reference cited above:

“In creating each year’s first difference reference series, we used the five most highly correlated neighboring stations that had enough data to accurately model the candidate station.”

and  “Also, not all stations could be adjusted. Remote stations for which we could not produce an adequate reference series (the correlation between first-difference station time series and its reference time series must be 0.80 or greater) were not adjusted.”

As I mentioned in my original article, the hard part is not to find five neighboring stations, particularly if you consider a station 1,500 km away as “neighboring”. The hard part is to find similar stations within that distance. We need those stations whose first difference has an 0.80 correlation with the Darwin station first difference.

(A “first difference” is a list of the changes from year to year of the data. For example, if the data is “31, 32, 33, 35, 34”, the first differences are “1, 1, 2, -1”. It is often useful to examine first differences rather than the actual data. See Peterson (PDF) for a discussion of the use of the “first-difference method” in climate science.)

Accordingly, I’ve been looking at the candidate stations. For the 1920 adjustment we need stations starting in 1915 or earlier. Here are all of the candidate stations within 1,500 km of Darwin that start in 1915 or before, along with the correlation of their first difference with the Darwin first difference:

WYNDHAM_(WYNDHAM_PORT) = -0.14

DERBY = -0.10

BURKETOWN = -0.40

CAMOOWEAL = -0.21

NORMANTON = 0.35

DONORS_HILL = 0.35

MT_ISA_AIRPORT = -0.20

ALICE_SPRINGS = 0.06

COEN_(POST_OFFICE) = -0.01

CROYDON = -0.23

CLONCURRY = -0.2

MUSGRAVE_STATION = -0.43

FAIRVIEW = -0.29

As you can see, not one of them is even remotely like Darwin. None of them are adequate for inclusion in a “first-difference reference time series” according to the GHCN. The Economist excoriated me for not including Wyndham in the “neighboring stations” (I had overlooked it in the list). However, the problem is that even if we include Wyndham, Derby, and every other station out to 1,500 km, we still don’t have a single station with a high enough correlation to use the GHCN method for the 1920 adjustment.

Now I suppose you could argue that you can adjust 1920 Darwin records based on stations 2,000 km away, but even 1,500 km seems too far away to do a reliable job. So while it is theoretically possible that the GHCN described method was used on Darwin, you’ll be a long, long ways from Darwin before you find your five candidates.

• Next, the GHCN does use a good method to detect inhomogeneities. Here’s their description of their method.

To look for such a change point, a simple linear regression was fitted to the part of the difference series before the year being tested and another after the year being tested. This test is repeated for all years of the time series (with a minimum of 5 yr in each section), and the year with the lowest residual sum of the squares was considered the year with a potential discontinuity.

This is a valid method, so I applied it to the Darwin data itself. Here’s that result:

Figure 2. Possible inhomogeneities in the Darwin Zero record, as indicated by the GHCN algorithm.

As you can see by the upper thin red line, the method indicates a possible discontinuity centered at 1939. However, once that discontinuity is removed, the rest of the record does not indicate any discontinuity (thick red line). By contrast, the GHCN adjusted data (see Fig. 1 above) do not find any discontinuity in 1941. Instead, they claim that there are discontinuities around 1920, 1930, 1950, 1960, and 1980 … doubtful.

• Finally, the main recurring question is, why do I think the adjustments were made manually rather than by the procedure described by the GHCN? There are a number of totally independent lines of evidence that all lead to my conclusion:

1. It is highly improbability that a station would suddenly start warming at 6 C per century for fifty years, no matter what legitimate adjustment method were used (see Fig. 1).

2. There are no neighboring stations that are sufficiently similar to the Darwin station to be used in the listed GHCN homogenization procedure (see above).

3. The Darwin Zero raw data does not contain visible inhomogeneities (as determined by the GHCN’s own algorithm) other than the 1936-1941 drop (see Fig. 2).

4. There are a number of adjustments to individual years. The listed GHCN method does not make individual year adjustments (see Fig. 1).

5. The “Before” and “After” pictures of the adjustment don’t make any sense at all. Here are those pictures:

Figure 3. Darwin station data before and after GHCN adjustments. Upper panel shows unadjusted Darwin data, lower panel shows the same data after adjustments.

Before the adjustments we had the station Darwin Zero (blue line line with diamonds), along with four other nearby temperature records from Darwin. They all agreed with each other quite closely. Hardly a whisper of dissent among them, only small differences.

While GHCN were making the adjustment, two stations (Unadj 3 and 4, green and purple) vanished. I don’t know why. GHCN says they don’t use records under 20 years in length, which applies to Darwin 4, but Darwin 3 is twenty years in length. In any case, after removing those two series, the remaining three temperature records were then adjusted into submission.

In the “after” picture, Darwin Zero looks like it was adjusted with Sildenafil. Darwin 2 gets bent down almost to match Darwin Zero. Strangely, Darwin 1 is mostly untouched. It loses the low 1967 temperature, which seems odd, and the central section is moved up a little.

Call me crazy, but from where I stand, that looks like an un-adjustment of the data. They take five very similar datasets, throw two away, wrench the remainder apart, and then average them to get back to the “adjusted” value? Seems to me you’d be better off picking any one of the originals, because they all agree with each other.

The reason you adjust is because records don’t agree, not to make them disagree. And in particular, if you apply an adjustment algorithm to nearly identical datasets, the results should be nearly identical as well.

So that’s why I don’t believe the Darwin records were adjusted in the way that GHCN claims. I’m happy to be proven wrong, and I hope that someone from the GHCN shows up to post whatever method that they actually used, the method that could produce such an unusual result.

Until someone can point out that mystery method, however, I maintain that the Darwin Zero record was adjusted manually, and that it is not a coincidence that it shows (highly improbable) warming.


Sponsored IT training links:

Want to pass HP0-J33 at first try? Gets certified 000-377 study material including 199-01 dumps to pass real exam on time.


Get notified when a new post is published.
Subscribe today!
0 0 votes
Article Rating
303 Comments
Inline Feedbacks
View all comments
Gail Combs
December 21, 2009 7:56 am

Ox-FordPrefect (04:46:08) :
Is there a record of the number of corrections for inhomogeneity and what direction these are in? An assessment of the formal guidelines used should be revealing – are there more reasons to adjust data upwards than down?
As others above stated, you can have a zero net adjustment and cause a large change in the graphs. Just adjust a reading before 1950 DOWN by the same amount you adjust a reading after 1950 UP. The net adjustment is zero but you lowered the first half of the graph and raised the second giving a flat line graph of raw data a nice upward trend so you can yell GLOBAL WARMING.

beng
December 21, 2009 8:09 am

*******
21 12 2009
Geoff Sherrington (02:58:12) :
The weakesses with this study are several fold. One is evident from a station in an older city with UHI. In Melbourne for example, various comparisons with other sites show it likely that UHI ‘maxed out’ at Melbourne Central some time before 1950. That is, from humble beginnings, the temperature from UHI rose steadily to (say) the 1930s and then remained about constant because nothing was explainable or available to push the UHI much higher. So Melbourne would be one of the stations in an 80-year plot that whould be near the median, despite having UHI (from studies already referenced) of up to 10 deg C.
*******
Geoff, I don’t think UHI effects will “max out” in this way in most situations. Even if an urban area becomes static, if the surrounding suburban areas continue to build up, this would still increase (admittedly by a smaller amount) the UHI effect in the center area by increasing the temps of incoming air toward it. One exception might be under conditions of no wind or no surrounding growth — then the effect might max out.
So, tho I agree the UHI in the urban center would increase at a lesser rate (how much, I don’t know) once it’s fairly static, I don’t think it would max out if surrounding areas continue to build up.

KevinM
December 21, 2009 8:11 am

Nick Stokes:
Prove it! somebody show me one plot from one station with data that is downloadable and verifiable and shows the opposite pattern.
Get the data Nick, if it exists.

amicus curiae
December 21, 2009 8:16 am

well its 14C outside a few days off Xmas in aus..hmm
it could be expected to be 25. instead I am heating wheat bags for arthritic aches from the cold..
and if Mayon goes boom..I guess I wont be bothering to plant melons for a few summers

Entrecuesto
December 21, 2009 8:29 am

Nick Stokes
I find your histograms very interesting. Especially the unadjusted and
the adjusted ones at the bottom of the page. Could you please also indicate
in your blog the mean, mode and median of both histograms?. Or even better,
could you subtract both histograms so we can see what we are left with?

Bruce
December 21, 2009 8:34 am

Nick Stokes, isn’t the reality that the cooling adjustments were done to old records and warming adjustments done to newer records creating the illusion of warming when there was none?

KeithGuy
December 21, 2009 8:36 am

When considering the possibility for bias in the GHCN temperature reconstruction, which would not affect the overall temperature balance
JJ (07:49:00) described it clearly earlier:
“These data have not only magnitude, but a spatial and temporal component. That the adjustments have a nearly equal distribution in magnitude does not demonstrate that they have an equal temporal distribution, or an equal spatial distribution.”
I understand the importance of considering the temporal component, which is dealt with well in this article:
http://statpad.wordpress.com/2009/12/12/ghcn-and-adjustment-trends/
Am I right to assume the possibility of the following bias:
Is there another important possibility of bias? Since one remote station has more weight attached to it than a number of stations grouped together. A large adjustment upwards on the remote station can be ‘averaged out’ by several small movements down on the grouped station but the effect it has when creating the temperature profile for a geographical area is significantly larger (I’m assuming that GHCN use a method similar to the grid system used by GISS to provide a global temperature metric.)
Could this be the motivation for adjusting Darwin’s temperature history with a view to the creation of a warming profile?

3x2
December 21, 2009 8:47 am

(somebody mentioned DB earlier in the thread)
I don’t have Dr Bellamy’s e-mail address so I will post this message here.
Dr David Bellamy
Bedburn, Co Durham
Dr. Bellamy,
Stick to your guns.
I (only slightly younger) have very fond memories of your work. You have always been an excellent bridge between “hard science” and the “public”. Opening up worlds that, without you pointing them out, most of the “public” would just never see or care about. Now they do, thanks to you.
It is a real travesty that the current set of “bong heads” over at the BBC do not recognise that you have done more for the real “environment”, at least in the UK, than most any individual has. Don’t worry though, you have many allies.
Can’t help you much with the science but if you need me to go ’round the BBC and kick (Dr. M boots wise) some sense into one of their current “environment workshops” and flush their Jamaican weed down the nearest multi-cultural, multi-sexual WC- just give me a call.
Like I say – Stick to your guns.
(In the mean time why not post your views over here at WUWT?)

Dave F
December 21, 2009 8:49 am

Nick Stokes (04:09:42) :
And the slope is about equal to the 0.0175 C/decade of GG.
Your point should be relatively simple to prove visually. Plot the adjusted data next to the unadjusted data and the trend should be .0175 C/dec different.

Larry K
December 21, 2009 8:55 am

— Nick Stokes wrote : If the adjustment moves trends up and down almost equally, they won’t reinforce anyone’s preconception. —
This is simply not the case as (Michael R 00:31:00) post showed. Just because the overall averaged temperature adjustment = 0.0C, it doesn’t mean the temperature data wasn’t skewed to make it look like there is a warming trend. When the adjustments were made makes all the difference in the world.
For example, if one averaged a 20-year period like 1921 – 1940 downward -1.0C, and then averaged the 20-year period after that, 1941 – 1960 upward +1.0C. While the overall average adjustment = 0.0C, one has skewed the temperature data and introduced a 2.0C warming trend into the data for the overall 40 year period. If the 40 year temperature data for 1921- 1960 previously showed a flat trend, it now shows a 2.0C warming trend where there was none before.
(moderator please delete my previous post, it got messed up due to formatting)

Gail Combs
December 21, 2009 9:05 am

Let me see if I have this correct. To determine the global temperature, climate scientists have cut the earth up into grids. They then use mysterious mathematical manipulations to prepare “raw data” so a temperature can be assigned to each grid and use that temperature to compute the global temp.
GHCN makes adjustments to the raw Darwin data according to these mysterious mathematical manipulation techniques presumably to prepare it for use in the global grids and then reports this as the temperature for the Darwin location instead of using the raw data.
Nick Stokes is defending the adjustments made to the Darwin data by saying they were made according to the mysterious mathematical manipulation techniques of GHCN and therefore were justified while Willis Eschenbach has analyzed the data set for Darwin and see no justification for any adjustments to be made at all. Nick Stokes then states the mysterious mathematical manipulation techniques called GHCN homogenization “corrects for discrete events that it detects by time series analysis. Things like station moves, equipment changes, MMTS introduction.” (So I guess I was wrong it is not adjusted so it helps represent a grid area)
But if that is the case and if homogenization is supposed to detect and correct for “discrete events” then why does fig 3 raw data look completely reasonable and figure 3 adjusted data look like a lot of noise has been added to the data and not filtered out? If there were “discrete events” requiring “adjustment I would expect the graphs to be reversed.

joe
December 21, 2009 9:07 am

There is an error in the picture:
The left and the right labels are not equal.

Billy
December 21, 2009 9:14 am

Can’t this issue be easily resolved by just looking at some tree ring, mussel shell and sediment proxies in order to determine what the true temperature was at Darwin Zero over the last 50 years? I mean according to the chart at the top of this page,
http://www.usatoday.com/weather/climate/2006-06-01-wine-warming_x.htm
proxies can measure temps from 2000 years ago to the nearest 1/10000th of a degree whereas modern day thermometers can only measure today’s temps to the nearest 1/1000th of a degree. And you can believe it because it’s “based on scientific analysis”.
Heck, I don’t even know why we bother with thermometers and billion dollar satellites at all.

Jim
December 21, 2009 9:15 am

I’ve seen this same “histogram of adjustments” approach on other blogs. Do these people have a play book or do they just copy each other? It’s time to call a spade a spade. This one is pure BS meant to divert the discussion from more fruitful paths. I think the best idea is to just ignore them.

Marc
December 21, 2009 9:16 am

When I look at the homogenization technic, some questions come to mind.
Why do they use yearly temperatures?
If the temperature stepped at some point, it should be possible to see it in the day by day temperature.
Have they proven that isolated hot spot are equally likely than isolated cold spot on the planet?
If isolated cold spot are more likely on the planet, the technic could generate a false warming signal.

DR
December 21, 2009 9:21 am

“The sum of anecdotes is not data.” -Roger Brinner
One could only imagine if we who work in metrology (not to be confused with the other -ology) applied the techniques for measurement of temperature to that of measuring engine components, how various engine parts would fit together.
Sorry Nick Stokes and other GHCN apologists, but from a metrology perspective, derived temperature values can’t even be considered data. It is a complete joke and goes against every principle in basic metrology concepts.

bill
December 21, 2009 9:28 am
who cares
December 21, 2009 9:44 am

Haven’t checked all the comments, but maybe this is useful.
I have found, but misplaced, a study in google scholar where there was an analysys of the impact of changing from averaging max and min temperature (probably from mercury and alcohol thermomethers) to using regular timed temperatures. Nevertheless that change was in 1995, so it cannot be used to justify previous adjustments.
On the other hand on the thread about the economist I left a link to a study on the adjustments to radiosondes and how the first attempts on purely manual adjustments “failled” I think you will find many interetsing tricks of the trade from the usual suspects.
best regards

Bill Illis
December 21, 2009 9:44 am

Other sources using this data confirm Willis charts.
The raw data from GISS.
http://data.giss.nasa.gov/cgi-bin/gistemp/gistemp_station.py?id=501941200000&data_set=0&num_neighbors=1
(I can’t be sure but there is long enough overlap between individual station locations in this chart that no adjustments should have been made.)
And then the ClimateExplorer lets one look at and download the monthly adjusted GHCN data for all/most of the stations in the network.
Here are the individual stations (4771 with a handy Get Data link where you can also download the monthly GHCN data) .
http://climexp.knmi.nl/allstations.cgi?someone@somewhere+temperature+12
Here is the adjusted monthly data for Darwin (+2.0C increase in the trend compared to GISS’ unhomogenized data).
http://climexp.knmi.nl/gettemp.cgi?someone@somewhere+94120+DARWIN_AIRPOR+

bill
December 21, 2009 9:58 am

DR (09:21:43) :
…from a metrology perspective, derived temperature values can’t even be considered data. It is a complete joke and goes against every principle in basic metrology concepts.
Many say you must adjust for UHI
It is obvious you must correct for instrument change
It is obvious you must correct for location change
To me there is an obvious error in max/min/average due to time of observation. Using a continuous reading electronic thermometer max and min can be “easily” determined. A once/twice daily reading is wrong perhaps 3 consecutive days
So should adjustments be made?
The record we have is all there is – warts and all.
There is no way to improve this. For future a few millions spent on new equipment re-sited to ideal locations (some of these MUST be in cities and near roads as these are a valid part of the environment).
To get a VALID check on global temperatures will take at least another 150 years and more. If there is a problem with AGW then you have consigned large populated areas to the scrap heap.
What is the way you suggest that we take?
Just for info (I’m sure you will shout INVALID) here are a few UK places and de Bilt. No adjustments made to individual UK stations This shows the typical warming response.
http://img410.imageshack.us/img410/8996/ukspaghetti.jpg

Bill Illis
December 21, 2009 10:01 am

You can also get the annual GHCN adjusted data for Darwin
At the link (Click on “Make Time Series” at the bottom of the page and the annual data will be shown).
http://climexp.knmi.nl/gettemp.cgi?someone@somewhere+94120+DARWIN_AIRPOR+

pat
December 21, 2009 10:20 am

It is extraordinary bad science to do any adjustment to thermometer readings. If there is a noted environmental or instumentational alteration to the readings, such as the construction of nearby blacktop or the reorientation of the thermometer, the same should be noted with an asterisk and a footer.

lowercasefred
December 21, 2009 10:35 am

To Nick Stokes:
I would also appreciate your pointing out stations with downward adjustments equal to or greater than that of Darwin. I’ll bet Anthony would arrange for the graphics to be shown if you cannot provide a link.

geo
December 21, 2009 10:49 am

As somebody formally asked whoever is in charge of GHCN?
I still think I want the first question answered “how did this actually happen?” before I move on to any discussion of “was the result an intentional inflation?”.
I still have this feeling -possibly entirely incorrect- that there exists a significant chance this kind of thing (we’re talking about how many thousands of stations in GHCN?) is much more likely to be done by herds of grad students working from scripts they barely understand and with no authority, opportunity, nor confidence in their own judgements to challenge results that their older, more experienced, more confident colleagues would immediately discard as an unacceptable result.

Mesa
December 21, 2009 10:53 am

The number of negative correlations on an annual basis is pretty surprising…
Here is a significance test for correlations.
http://faculty.vassar.edu/lowry/rsig.html
For 100 data points correlations of absolute value < .2 are not significantly different from zero.
It's possible that Australia has a particularly low spatial correlation of temperature changes compared to other regions?

1 4 5 6 7 8 13