Comparing USCRN and nClimDiv to USHCN

Anthony Watts

Admin

November 6, 2020 6:50 am

NOAA dropped USHCN because I and the volunteers that photographed hundreds of noncompliant stations turned it into a PR train wreck for them.

They quietly closed a number of embarrassing stations with no public notice, such as Marysville CA, Tucson AZ, and Ardmore OK.

Bottom line – siting makes a difference.

https://wattsupwiththat.com/2015/12/17/press-release-agu15-the-quality-of-temperature-station-siting-matters-for-temperature-trends/

0

Andy

Editor

Reply to Anthony Watts

November 6, 2020 7:35 am

Thanks Anthony

0

Pat Frank

Reply to Andy

November 6, 2020 9:47 am

Siting quality doesn’t remove the systematic error due to non-aspirated shields.

USHCN measurement uncertainty is at least ±0.5 C over the entire 1880-2010 range because the shields were and are not aspirated. That ±0.5 C uncertainty does not average away with large data sets.

The 1981-2010 normal uses USHCN measurements, which have the same ±0.5 C uncertainty.

Systematic measurement error is small in the USCRN network because the shields are aspirated. The USCRN overall uncertainty is reduced to about ±0.05 C – the manufacturer’s calibration uncertainty, which does not average away either.

However, when the USHCN 1981-2010 normal is subtracted from USCRN temperatures, the ±0.5 C USHCN normal uncertainty propagates into the USCRN anomalies.

The USCRN anomalies then are subject to a ±0.5 C uncertainty that their temperature measurements did not have.
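A rough numerical sketch of the propagation described above, assuming the sensor and normal uncertainties are independent and combine in quadrature (root-sum-square). The function name is illustrative only, and the two input values are the ones quoted in this comment:

```python
import math

def anomaly_uncertainty(u_measurement, u_normal):
    """Combine measurement and normal (baseline) uncertainties in quadrature."""
    return math.sqrt(u_measurement**2 + u_normal**2)

# Values quoted in the comment: USCRN sensor ±0.05 C, USHCN-based normal ±0.5 C.
u_anomaly = anomaly_uncertainty(0.05, 0.5)
print(f"anomaly uncertainty: ±{u_anomaly:.3f} C")
```

Under that assumption the larger term dominates, so the combined figure stays close to the uncertainty of the normal.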

When the normal period eventually becomes 2011-2040 and is fully USCRN, the normal uncertainty will become (should be) about ±0.05 C.

However, the lower limit of uncertainty in the 1880-2005 temperature record will always be at least ±0.5 C because the individual measurements themselves are no more accurate than that.

If the USCRN anomaly record is grafted onto the 1880-2005 USHCN record, the USHCN uncertainty enters with the USHCN record. The extended record will then have an implicit uncertainty of ±0.5 C with respect to the 1880-2005 time-range, courtesy of the grafted USHCN record.

The whole air temperature record is a study in incompetence.

The people at GISS, NOAA, UKMet, and BEST pay no price for their incompetence. They keep their jobs, the science societies remain silent, the money keeps coming in, the media continue to pull their collective forelocks, and the climate nutters keep up their vociferous support.

So, an air temperature record thoroughly corrupted with false precision gets blandly reiterated as the various groups of compilers determinedly reject the hard judgment of measurement science.

0

windlord-sun

Reply to Pat Frank

November 6, 2020 12:02 pm

“The whole air temperature record is a study in incompetence.”

Do you include the thousands of dedicated and serious people who kept the record for a century prior to 1989? Were they deeply incompetent? They made 50 million recordings on TMAX alone. All garbage?

I don’t think so.

If your goal is “ISO-9000 level accuracy at every station is required to determine the average temperature of the earth” then fine. However, that does not answer the question “Do the individual plots of each station one at a time show any abnormal warming at the location using the ‘less than perfect’ instruments and protocols?”

The plot of a station that was consistently low or high over a century is more valuable than a hyper-modern station that only goes back a decade or two, let alone a model temp reconstruction.

0

Ian W

Reply to windlord-sun

November 6, 2020 1:34 pm

“Do you include the thousands of dedicated and serious people who kept the record for a century prior to 1989? Were they deeply incompetent? They made 50 million recordings on TMAX alone. All garbage?”

The records being kept were weather records; the required accuracy and precision was what was achievable then, reading values while leaning into a Stevenson screen at night with sleet being blown down your neck. In the Southern Hemisphere there were very few observers, fewer than 100 until 1900, and most of those were in Australia. The observations are being misused by today’s climate ‘scientists’ as if they were AWOS with a claimed precision and accuracy that was unachievable, and most observations are invented. An analogy would be comparing a pace stick to a micrometer: the errors from a century ago are a lot larger than the accuracy claimed by today’s climate ‘scientists’, who are in any case measuring the wrong variable (as I have said several times). Temperature of a volume of atmosphere can be varied as defined in the gas laws and by the enthalpy of the volume being tested due to the latent heat of the varied water content.

It really doesn’t matter how many observation sites you have – if you are measuring the incorrect variables then the output is meaningless. This is not the fault of the Met Observers a century ago; it is the fault of today’s climate ‘scientists’ misusing the century old observations. It is probable that for many sites the humidity is available and the energy content of the air in kilojoules per kilogram could be calculated. However, the required coverage of observation stations particularly in the southern hemisphere was not available until probably 1945.

0

Tim Gorman

Reply to windlord-sun

November 6, 2020 2:45 pm

““Do the individual plots of each station one at a time show any abnormal warming at the location using the ‘less than perfect’ instruments and protocols?””

How do you know? If they have a +/- 0.5C uncertainty, then until the trend exceeds that interval you can’t tell if there is any warming at all! E.g., if a reading is 20C +/- 0.5C, then until the temperature readings exceed 21C you can’t tell.
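The 20C versus 21C threshold in this comment can be checked with a small sketch. It assumes each reading carries the same +/- 0.5C interval and treats two readings as distinguishable only when their intervals no longer overlap; the function name is made up for illustration:

```python
def intervals_overlap(a, b, u):
    """True if a ± u and b ± u overlap, i.e. the two readings cannot be told apart."""
    return abs(a - b) <= 2 * u

baseline = 20.0  # reading from the comment, in C
u = 0.5          # stated uncertainty, in C

for later in (20.4, 21.0, 21.1):
    status = "indistinguishable" if intervals_overlap(baseline, later, u) else "distinguishable"
    print(later, status)
```

Only the reading above 21C falls outside the overlap, which matches the figure given in the comment.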

0

windlord-sun

Reply to windlord-sun

November 6, 2020 5:02 pm

@Ian W

“really doesn’t matter how many observation sites you have – if you are measuring the incorrect variables then the output is meaningless.”

That is exactly 180 degrees wrong.

Here is the correction: It really doesn’t matter how accurate-in-the-perfection-of-the-abstract the cumulative ‘average’ is measured to for any one given station; it matters that you have many observation sites over a long time, each consistent, mostly, in itself. You then can look at 1200 sine curves one by one, and the effect of seeing no abnormal warming in this one, then the next one, then the next one…

And none show abnormal warming.

That is the opposite of meaninglessness.

0

Pat Frank

Reply to windlord-sun

November 6, 2020 5:44 pm

wl-s “Do you include the thousands of dedicated and serious people …, etc., etc.”

I clearly referred to those producing the modern air temperature record, didn’t I?

Your question is a non-sequitur.

If anything, the carelessness of the modern producers is an insult to all the dedicated and serious people who came before.

And regardless of their dedication and seriousness, the temperatures they recorded from non-aspirated shelters were no more accurate than ±0.5 C.

That’s systematic measurement error, not random error. It does not average to zero in large data sets.

0

Pat Frank

Reply to windlord-sun

November 6, 2020 5:48 pm

wl-s, your reply is meaningless.

0

windlord-sun

Reply to windlord-sun

November 6, 2020 7:06 pm

“It does not average to zero in large data sets.”

Did it skew all recordings higher or lower?

0

Tim Gorman

Reply to windlord-sun

November 7, 2020 4:54 am

wl-s,

The issue is *not* individual stations being looked at separately. If that was what the climate scientists were doing then I would agree with you somewhat about following trends. There are two main issues.

1. What is the NOAA “mobile” lab calibrated against when it gets to the remote site? Do they also transport a mobile standards lab with them everywhere? If not, then the NOAA measurements will have their own uncertainty interval, so how will Zumwhere be able to tell how far off their measurement device is?
2. Climate scientists today don’t look at 10,000 individual stations to determine what is going on. They “average” all 10,000 together. Even with stations having +/- 0.05C uncertainty, that uncertainty adds by root sum square. So that overall uncertainty will grow by the sq rt (10^4) * 0.05 = +/- 5C. With an overall uncertainty of 5C how is anyone going to tell if the “average” temperature went up or down by even a degree let alone by tenths or hundredths of a degree? It’s somewhat ironic that the 100 or so stations originally being used which had +/- 0.5C uncertainty also have an uncertainty interval of +/- 5C when averaged together. [sq rt(100) * 0.5 = 5] From an uncertainty viewpoint the new dataset from 10,000 stations doesn’t give you any more of a useful average than the older 100 (or so) stations.
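The two root-sum-square figures quoted in point 2 can be reproduced as follows. This sketch only repeats the arithmetic as stated in the comment (the root-sum-square of N equal uncertainties u is sqrt(N) * u); the helper name is illustrative:

```python
import math

def root_sum_square(uncertainties):
    """Square root of the sum of squared uncertainties."""
    return math.sqrt(sum(u**2 for u in uncertainties))

rss_new = root_sum_square([0.05] * 10_000)  # 10,000 stations at +/- 0.05C
rss_old = root_sum_square([0.5] * 100)      # 100 stations at +/- 0.5C

print(f"10,000 stations: +/- {rss_new:.2f} C")
print(f"100 stations:    +/- {rss_old:.2f} C")
```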

If the climate scientists looked at each individual station and applied a plus or minus mark to each and then counted the pluses and minuses they would have a far better “feel” for what is happening with the climate. But how would they then generate millions (billions?) of dollars to create climate models based on an average temperature with a +/- 5C uncertainty interval?

0

windlord-sun

Reply to windlord-sun

November 7, 2020 7:21 am

“The issue is *not* individual stations being looked at separately. If that was what the climate scientists were doing then I would agree with you somewhat about following trends.”

Wait … even though you agree observing the sine curves one by one is important, you Yield to Orthodoxy? That is a form of Appeal to Authority.

Wouldn’t the correct path be to help falsify “what climate scientists are doing” on the basis it is fallacious?

0

Pat Frank

Reply to windlord-sun

November 7, 2020 1:24 pm

wl-s: “did it skew all recordings higher or lower?”

Here we go again.

Uncertainty is not error, wl-s. Uncertainty is the rms of the errors revealed by a series of calibration measurements.

In the case of meteorological temperature stations, systematic measurement errors arise from wind speed or irradiance. The errors can change magnitude and even sign day-by-day or even hour-by-hour.

The only way to account for such errors is to carry out calibration experiments using well-maintained and well-sited instruments.

The rms calibration uncertainties are then applied to the field station measurements, made using the same types of instruments.

Those (+/-) uncertainties are applied to every single field measurement that enters into the air temperature record. They do not average away. Ever.
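A minimal sketch of the rms calculation described above. The calibration errors are invented numbers chosen only to show errors that change sign and magnitude; they are not taken from any real calibration experiment:

```python
import math

# Hypothetical calibration errors (field sensor minus reference), in C.
# Invented for illustration; sign and magnitude vary from one run to the next.
calibration_errors = [0.6, -0.3, 0.7, -0.5, 0.2, -0.6]

n = len(calibration_errors)
mean_error = sum(calibration_errors) / n
rms_error = math.sqrt(sum(e**2 for e in calibration_errors) / n)

print(f"mean error: {mean_error:+.3f} C")
print(f"rms error:  ±{rms_error:.3f} C")
```

With these made-up values the mean error is close to zero while the rms is about half a degree, which is the distinction between error and uncertainty being drawn here.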

0

Pat Frank

Reply to windlord-sun

November 7, 2020 1:28 pm

wl-s: “Wouldn’t the correct path be to help falsify “what climate scientists are doing” on the basis it is fallacious?”

Done.

Patrick Frank (2019) “Propagation of Error and the Reliability of Global Air Temperature Projections” Frontiers in Earth Science – atmospheres 7(223).

The second paper, ever, to demonstrate that climate models can’t say anything about CO2 emissions.

Patrick Frank (2016) “Systematic Error in Climate Measurements: the global air temperature record” in The Role of Science in the Third Millennium, R. Ragaini ed, 2016, World Scientific: Singapore, pp. 337-351.

The third paper, ever, demonstrating that the global air temperature record is climatologically useless

Patrick Frank (2015) “Negligence, Non-Science, and Consensus Climatology” Energy & Environment 26(3), 391-416.

The first paper, ever, showing that the entire corpus of AGW science contains no science at all.

Patrick Frank (2011) “Imposed and Neglected Uncertainty in the Global Average Surface Air Temperature Index” Energy & Environment 22(4), 407-424.

The second paper, ever, demonstrating that the global air temperature record is climatologically useless

Patrick Frank (2010) “Uncertainty in the Global Average Surface Air Temperature Index: A Representative Lower Limit” Energy & Environment 21(8), 969-989.

The first paper, ever, demonstrating that the global air temperature record is climatologically useless

Patrick Frank (2008) “A Climate of Belief” Skeptic 14(1), 22-30.

The first paper, ever (peer-reviewed and all), to demonstrate that climate models can’t say anything about CO2 emissions.

0

windlord-sun

Reply to windlord-sun

November 7, 2020 9:44 pm

@Patrick Frank

“The second paper, ever, demonstrating that the global air temperature record is climatologically useless — Patrick Frank (2010) “Uncertainty in the Global Average Surface Air Temperature Index: A Representative Lower Limit” Energy & Environment 21(8), 969-989. — The first paper, ever, demonstrating that the global air temperature record is climatologically useless…”

Does not address my claim.
Conflation of raw measurement with reconstructed temp models.
Red herring of “uncertainty.”

Note: I did not click over to your papers. You would be justified to request that I do, since I am questioning the root of your project outright. Nevertheless, I’ll write the short version of my position:

The RAW recordings of USHCN are relatively pure. Almost naïve in their innocence. Just 50 million TMAX recordings from 1200 stations over 120 years made by an army of people who care about science. This is not a model. It is raw data. There is no gridding, homogenization, imagination, etc.

Are these recordings “uncertain?” Who cares? I don’t.

Because:

=========================
A tailor for a gentleman makes three suits a year for his customer from scratch for 60 years, and he notes the measurements. He plots them in Excel. He likes to observe the sine curve in waistline that reveals the “natural” flow generated by the slow fight to maintain the man’s figure, up and down. The tailor and the man exchange ironic insults over this, the gentleman sure the tailor must have been drunk, or the wind was blowing too hard, the decade of the highest WAIST-MAX.

Laughingly, the tailor takes his assistant’s tape in hand and lays it out on the table next to the honorable tape he used for the last twenty years. To his horror, he sees that they do not agree! His tape is shorter per foot by 1/8 inch! Quickly he checks the data and graph at the point he began using the new tape. Sure enough, there is a tiny jerky uptick, hardly noticeable, until you know where to look. He fearfully compares his tape with four other tapes, including one from a friend in another shop. “I am out of calibration,” he wails.

“Still,” says the haughty tailor to his amused customer, “this does not change my claim. You have been getting fatter and thinner on the normal curve that nature intended.”

That night, the tailor had a nightmare. He is a member of a trade organization of his craft. In the dream he sees 1200 of his fellow tailors making the same mistake. Some might even be using a tape that is too long! He wakes up in a cold sweat. “All our measurements are uncertain,” he cries out to the bedroom wall. “And I’m terrified I did not record the waistline down to the 1/32 of an inch. I am going to purchase a laser-driven measuring tape tomorrow!”

His guardian angel soothes him back to sleep and invokes a sweet dream. “Your long slow dedication to record consistently, even with one or more errors in your instrument of measurement, has served you well. There was no abnormal waistline explosion to report to your customers. All your suits fit them. Sleep peacefully.”

===========================

Patrick and anyone who cares to read my posts:
There is one dataset extant that reveals in the purest way the reality of surface temperature over 120+ years. That is the RAW version of USHCN. If you look, one by one or in accumulation, at the sine curves of 1200 stations [800 recently until the redacted 400 are restored] you see normal natural flow. There is no evidence, to your very eyes, of abnormal warming.

http://theearthintime.com

“Uncertainty” is trumped by “consistency.”
The RAW USHCN is far from “useless.” It is the only 14-carat gold we have.

0

Pat Frank

Reply to windlord-sun

November 8, 2020 8:56 am

wl-s, you’ve got no clue.

0

windlord-sun

Reply to windlord-sun

November 8, 2020 12:27 pm

“you’ve got no clue”

Ah, yes, the Appeal to Nothority.

0

Ian W

Reply to windlord-sun

November 9, 2020 5:38 am

@windlord-sun

“Here is the correction: It really doesn’t matter how accurate-in-the-perfection-of-the-abstract the cumulative ‘average’ is measured to for any one given station; it matters that you have many observation sites over a long time, each consistent, mostly, in itself. You then can look at 1200 sine curves one by one, and the effect of seeing no abnormal warming in this one, then the next one, then the next one…”

This is not a ‘mathematical issue’ or an averaging-errors issue.

The metric for energy content of a volume of air is kilojoules/kilogram. This has a nonlinear relationship to air temperature. So you cannot tell the energy content of air from temperature unless you also have the humidity of the air and can calculate its enthalpy (specific heat).

It really doesn’t matter how many measurements of the wrong metric you make; you cannot ever calculate the change in energy content of the air because of the non-linear relationship between air temperature and heat content. Just small changes in humidity would account for all the changes in measured temperatures, with NO energy change.
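To illustrate the point about temperature and energy content, here is a sketch using a common psychrometric approximation for the specific enthalpy of moist air, h = 1.006*T + w*(2501 + 1.86*T) in kJ per kg of dry air, with T in degrees C and w the humidity ratio in kg water per kg dry air. The coefficients are the usual textbook values, and the two air states are made up for the example:

```python
def moist_air_enthalpy(temp_c, humidity_ratio):
    """Approximate specific enthalpy of moist air, kJ per kg of dry air."""
    sensible = 1.006 * temp_c                           # dry-air term
    latent = humidity_ratio * (2501.0 + 1.86 * temp_c)  # water-vapour term
    return sensible + latent

# Two hypothetical states: warmer and drier versus cooler and more humid.
h_warm_dry = moist_air_enthalpy(30.0, 0.010)
h_cool_humid = moist_air_enthalpy(25.0, 0.012)

print(f"30 C, w=0.010: {h_warm_dry:.2f} kJ/kg")
print(f"25 C, w=0.012: {h_cool_humid:.2f} kJ/kg")
```

In this made-up pair the temperatures differ by 5 C while the computed enthalpies are nearly equal, so temperature alone would not reveal the energy content.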

0

windlord-sun

Reply to windlord-sun

November 9, 2020 6:34 am

Ian,

“Energy content” vs “Surface air temperature” …

What are you saying? That climate science is foolish to examine surface air temperature in the quest to answer “Is there any abnormal warming?”

Please clarify.

0

Pat Frank

Reply to windlord-sun

November 11, 2020 4:49 pm

wl-s, let me put it another way. Your analogy merely demonstrates that you’ve got no clue.

The USHCN raw temperatures have the identical systematic measurement calibration uncertainty as do the USHCN anomalies: ±0.5 C.

That’s because the calibration uncertainty is a property of the instrument in the field. Nothing removes it.

That’s what I discuss in my lower limit of uncertainty paper.

I don’t discuss adjusted temperatures. I don’t even discuss anomalies, except as derived.

I discuss the measurements themselves. You’ll find your first clue when you figure out what that means.

0

windlord-sun

Reply to windlord-sun

November 11, 2020 5:12 pm

Pat Frank, “You’ve got no clue.”

a) I am completely clued-in to the “ignore the plots of raw, pound the inaccuracy of the instruments, make a model and spout anomaly graphs” game;
b) What you persist in not becoming clued-in on is this: I utterly reject your POV.

Repeating: I do not care a smidge about the uncertainty or inaccuracy. I claim no one should care about it, if the baseline question is: Is there any abnormal warming?

I am proffering a totally different POV. Apparently, you don’t like it. But instead of refuting my position, you simply state my position is wrong simply because it is not yours!

A good debater will at least honor the opposition’s full platform and process, and refute it from that place. When you don’t do that, either:
1) you realize my argument destroys your project and you just want to squash me without honoring/refuting it on its terms; or
2) you really really really don’t understand my premise, and so pity me for being out on Mars or something, and are praying I’ll get a clue and sign on to your position.

We can only answer “Is there any abnormal warming” by examining unaltered, un-tweaked, un-massaged, raw, long term trends of weather station direct measurement, one by one, to detect abnormal sine waves in one or more of their datasets.

0

Pat Frank

Reply to windlord-sun

November 12, 2020 9:05 am

wl-s, how are you going to detect “abnormal warming” when the systematic measurement uncertainty in your data is so large that you can’t detect any warming at all?

That’s the case with the air temperature record. There is no doubt about that. The field calibration experiments have been done. The lower limit of uncertainty due to systematic measurement error is ±0.5 C.

You can have your own lovely POV and reject all disagreeably contrary POVs, but you’re still wrong.

0

Tim Gorman

Reply to Pat Frank

November 6, 2020 2:50 pm

While the new stations may have a 0.05C uncertainty today, what will it be in a year? Are the louvers in the shield going to be cleaned regularly? Are the fan blades going to be cleaned and balanced? Will dirt and insect grime be removed from around the temperature sensor itself on a regular basis? For all 10,000 stations?

Who is charged with doing this? And how many people are assigned this task?

I personally have no faith at all in the systemic uncertainty of these 10,000 stations remaining at +/- 0.05C. My guess is that within five years the routine maintenance of these stations will be slipshod at best if it is even done at all. Who knows what the uncertainty will be then.

0

windlord-sun

Reply to Tim Gorman

November 6, 2020 5:17 pm

Tim you are focused on “uncertainty.” Some of the measuring is primitive and sloppy, so we are slapping a .05C bound on them.

Here is an exaggerated example, for the purpose of illustrating a vital point …

The weather station at Zumwhere University has been logging TMAX, TMIN, SNOW, and PERC for 130 years. They have been doing it the same way forever, meticulously training each new helper on the protocols, including calibration of instruments. Their sine curve of TMAX shows a natural up and down traverse, with the top of each 35-year wave just slightly lower than the previous. This continues right into the measurements of 2020.

Now, a visitor from NOAA insists on calibrating for five days, comparing Zumwhere’s recordings against his mobile instruments, which are new and spectacularly sensitive, accurate, and sure. To the horror of Zumwhere U, they discover that they have been reporting TMAX and TMIN both 0.765 degrees C too high all this time. What a nightmare.

However …. they have been consistent. It does not matter that they might have “skewed” the global average gridded temp of NOAA by .00001. Nor that they have been in variation from a station 77 miles to the north of Zumwhere.

What matters is that no abnormal deformation of their sine curve ever showed. There was no abnormal warming at Zumwhere.
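The Zumwhere example can be put in numbers with a short sketch. It covers only the case described here, a constant offset, and the decadal TMAX values are invented for illustration:

```python
def ols_slope(xs, ys):
    """Ordinary least-squares slope of ys against xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    den = sum((x - mean_x) ** 2 for x in xs)
    return num / den

years = list(range(1890, 2021, 10))  # 14 decadal points, 1890 to 2020
# Hypothetical "true" TMAX values in C, invented for this sketch.
true_tmax = [30.1, 30.4, 30.2, 30.6, 30.9, 30.5, 30.2,
             30.0, 30.3, 30.6, 30.8, 30.5, 30.3, 30.4]
# The same record as reported with the constant 0.765 C offset from the example.
reported_tmax = [t + 0.765 for t in true_tmax]

slope_true = ols_slope(years, true_tmax)
slope_reported = ols_slope(years, reported_tmax)
print(f"slope (true):     {slope_true:.6f} C/yr")
print(f"slope (reported): {slope_reported:.6f} C/yr")
```

A fixed offset shifts every value by the same amount, so the fitted slope is unchanged. An error that varies over time is a different case and is not covered by this sketch.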

Now multiply that by 1200.

0

Pat Frank

Reply to Tim Gorman

November 6, 2020 7:06 pm

wl-s, none of your calibrations or meticulousnesses will remove the wind speed and irradiance errors that impose themselves on non-aspirated shields.

0

AndyHce

Reply to Tim Gorman

November 12, 2020 12:51 am

The 10,000 stations are not USCRN. I don’t know where Pat Frank gets +/-0.05C as the USCRN uncertainty, but the others have +/-0.5C uncertainty.

0

Pat Frank

Reply to Tim Gorman

November 12, 2020 9:43 am

Andy, the USCRN includes aspirated shelters. Aspiration makes all the difference. The air inside the shield accurately reflects the outside air.

Measurement accuracy is no longer impacted by insufficient wind-speed or irradiance. Measurement accuracy then approaches the lab-calibration of the sensor.

The manufacturer’s uncertainty for the aspirated shielded sensors, such as the Yankee MET2010, is ±0.05 C.

The USHCN sensors are not aspirated. They are subject to errors from insufficient wind-speed and irradiance.

The lower limit of USHCN measurement accuracy is ±0.5 C — 10 times higher than USCRN.

0

Steven Mosher

Reply to Pat Frank

November 6, 2020 5:56 pm

“However, when the USHCN 1981-2010 normal is subtracted from USCRN temperatures, the ±0.5 C USHCN normal uncertainty propagates into the USCRN anomalies.”

That’s not how it’s done

0

Joel O'Bryan

Reply to Steven Mosher

November 6, 2020 6:54 pm

You’ll have to swallow the poison pill to see what is in it. Thanks Nancy.

0

Pat Frank

Reply to Steven Mosher

November 6, 2020 7:07 pm

Doesn’t matter how the anomaly is calculated, Steve. The uncertainty in the USHCN normal will propagate into it.

0

RickWill

Reply to Pat Frank

November 6, 2020 6:01 pm

One apparently reliable temperature record is the Nino 34 tropical Pacific SST. The linked chart has that data for the satellite era:

https://1drv.ms/b/s!Aq1iAj8Yo7jNg3j-MHBpf4wRGuhf
This is an important region for temperature measurement because it is a good indicator of weather across the Pacific and its rim.

Any temperature trend for the last 40 years that varies from the trend in this data set has to be suspect.

0

ferdberple

Reply to Anthony Watts

November 6, 2020 4:30 pm

Bottom line – siting makes a difference.
=====
Population change also has an effect and is likely related to the siting problem as rural stations over time end up in major population centers.

0

Steven Mosher

Reply to ferdberple

November 6, 2020 5:57 pm

Nope

-1

fred250

Reply to Steven Mosher

November 7, 2020 4:22 am

Yep, it does,

…. no matter how good your mob are at intentionally ignoring the problem.

0

Steven Mosher

Reply to Anthony Watts

November 6, 2020 5:54 pm

comparing

https://www.ncdc.noaa.gov/temp-and-precip/national-temperature-index/time-series?datasets%5B%5D=uscrn&datasets%5B%5D=climdiv&datasets%5B%5D=cmbushcn&parameter=anom-tavg&time_scale=ann&begyear=2005&endyear=2020&month=10

Note

1. USCRN is UNADJUSTED GOLD STANDARD
2. USHCN is ADJUSTED and has Sites that are poor quality ( high CRN rating)
3. nCLIMDIV is also ADJUSTED and has sites that are poor quality (high CRN rating)

Result?

ADJUSTMENTS work. The Unadjusted gold standard USCRN SHOWS SLIGHTLY WARMER temps than adjusted data

0

windlord-sun

Reply to Steven Mosher

November 6, 2020 7:32 pm

I’ll stipulate this: USCRN certainly is unsurpassed in measuring surface air temp. It takes the breath away, the lengths to which NOAA goes to get this accomplished. I stipulate that the stations damn well produce hyper-accurate measurement of surface temp.

for the last 15 years or so.
114 stations in the contiguous United States

Fine.

This does not address my claim. My claim is that no purported abnormal warming is visible in RAW TMAX USHCN. If USCRN claims abnormal warming over the past 15 years, it ought to have dinged (to say the least) the sine curve of many stations in the United States. We ought to see the tracers of the abnormal warming. Instead, RAW USHCN shows cooling over the period 2005-2019.

Is the absolute unaltered recorded TMAX for the 114 stations available for download?
But there is a problem: Is there an old fashioned weather station close by each of the 114 stations? It would interest me to acquire the raw recordings from them and graph them with their twin in USCRN.

please confirm …
#2, By saying “USHCN is ADJUSTED” do you subsume within that claim that USHCN RAW is adjusted?

0

AndyHce

Reply to windlord-sun

November 12, 2020 12:55 am

Suppose one looked at those 1200 graphs which you believe show sine waves of periodic changes. What could one possibly see that could be pointed at as “abnormal warming”?

0

fred250

Reply to Steven Mosher

November 7, 2020 4:19 am

Yep USCRN has brought all the data manipulation to an end.

The temperatures leveled off.

NOTHING before USCRN has any relevance to REALITY at all.

It is still an agenda driven mal-fabricated mess.

Just like BEST is. !

0

Pat Frank

Reply to Steven Mosher

November 7, 2020 1:33 pm

Adjustments that hit a target. Typical.

And no uncertainty bars.

Let’s see, a lower limit uncertainty would be ±0.5 C for the USHCN anomalies, and the same ±0.5 C will propagate into the USCRN anomalies from the USHCN normal.

But all displayed as though with perfect accuracy and infinite precision in all measurements.

Utter incompetence. Whether it’s studied or not is the only question.

0

Steven Mosher

Reply to Anthony Watts

November 6, 2020 6:15 pm

Anthony you still have not shared the data on sites.

or published your study
no cookie

0

fred250

Reply to Steven Mosher

November 7, 2020 1:27 pm

YAWN,

Everyone had access to the surface station study

You have gotten SO PATHETIC now you have to try to be a yabbering mouthpiece for the WORST data series around.

0

Pat Frank

Reply to Steven Mosher

November 7, 2020 1:35 pm

You’re in no position to gloat, Steve.

0

Sheldon Walker

Reply to Anthony Watts

November 8, 2020 5:44 pm

If people are interested, I have written several articles about NOAA’s ClimDiv temperature series

Warning – they feature my “infamous” Global Warming Contour Maps (which Anthony doesn’t like)

USA Warming since 1900
https://agree-to-disagree.com/usa-warming

Tavg-Tmin-Tmax Warming
https://agree-to-disagree.com/tavg-tmin-tmax-warming

If you want to learn how Global Warming Contour Maps work, read this article:
Robot-Train Contour Maps
https://agree-to-disagree.com/robot-train-contour-maps

I don’t accept any responsibility for headaches caused by looking at Global Warming Contour Maps

0

Ron Long

November 6, 2020 6:54 am

Good presentation of data, Andy. It looks like, at least for CONUS, in 120 years the temperature has changed from -0.7 deg C to about +0.5 deg C, or about 1.2 deg C, or about 1.0 deg C per hundred years. I’m underwhelmed, and losing my current excuse for “survival liquids”, aka cold drinks.
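The arithmetic in this comment, written out as a quick check using only the figures quoted above:

```python
start_anomaly = -0.7  # deg C, start of the 120-year span (from the comment)
end_anomaly = 0.5     # deg C, end of the span (from the comment)
span_years = 120

change = end_anomaly - start_anomaly
rate_per_century = change / span_years * 100

print(f"change: {change:.1f} deg C over {span_years} years")
print(f"rate:   {rate_per_century:.1f} deg C per hundred years")
```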

0

A C Osborn

Reply to Ron Long

November 6, 2020 10:13 am

I think that you will find that those are the “adjusted” datasets, so that most of the change is due to the adjustments and not the actual measured temperatures.
As predicted in Menne et al.

0

brians356

November 6, 2020 6:58 am

I’d like to see a histogram of daily high temperature records for US stations vs time. Fully half of our states still publish all-time high records set prior to 1941. In my city, the local TV weather broadcast always shows the date of that day’s high temperature record. It’s amazing how many daily records were set in the 1930s, and even in the late 1800s. Let’s see a histogram which will display the trend over time.

0

Steve CASE

Reply to brians356

November 6, 2020 9:09 am

Here’s one from Milwaukee:
http://www.aos.wisc.edu/~sco/clim-history/stations/mke/MKE-HIGH-T-ANN.gif

0

Tim Gorman

Reply to Steve CASE

November 6, 2020 2:52 pm

Wow! Where’s the warming? These maximums certainly didn’t contribute to the “global average temperature” going up! Must be happening somewhere else.

0

rbabcock

November 6, 2020 7:11 am

Just as long as we can get to a hundredth of a degree accuracy, that’s all that counts.

0

Mr.

Reply to rbabcock

November 6, 2020 9:10 am

Yep. Just as in marketing-speak, a number like $5.99 is somehow more credible than $6

0

Jeff Alberts

November 6, 2020 7:22 am

“They switched to a dataset they call nClimDiv in March 2014. Where USHCN had a maximum of 1218 stations, the new nClimDiv network has over 10,000 stations and is gridded to a much finer grid, called nClimGrid. The nClimGrid gridding algorithm is new, it is called “climatologically aided interpolation” (Willmott & Robeson, 1995). The new grid has 5 km resolution, much better than the USHCN grid.”

Doesn’t matter. You still can’t average intensive properties and end up with anything meaningful.

0

Ian W

Reply to Jeff Alberts

November 6, 2020 7:53 am

+1
Especially if your ‘average’ is really the arithmetic mean of the highest and lowest temperature recorded. The supposed ‘increase in average temperatures’ could easily be due to higher night minimums; the peak high temperatures may even have fallen but the arithmetic mean is higher.
Then the difference in arithmetic mean is added to the reported maximum temperatures and it is claimed that ‘maximum temperatures will go up by….’. That allows scary headlines a lot more than nights will not be as cold in the next decades.
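A small sketch of the effect described above, where the (Tmax + Tmin) / 2 value rises even though the daily maximum falls. The two days are hypothetical numbers chosen only to show the arithmetic:

```python
def daily_mean(tmax, tmin):
    """The (Tmax + Tmin) / 2 value discussed in this comment."""
    return (tmax + tmin) / 2

# Hypothetical days: the later one has a lower high but a warmer night.
earlier = {"tmax": 32.0, "tmin": 15.0}
later = {"tmax": 31.5, "tmin": 17.0}

mean_earlier = daily_mean(**earlier)
mean_later = daily_mean(**later)

print(f"earlier: Tmax {earlier['tmax']}, mean {mean_earlier}")
print(f"later:   Tmax {later['tmax']}, mean {mean_later}")
```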

0

Jeff Alberts

Reply to Ian W

November 6, 2020 8:59 am

It’s much like Mann’s Bristlecones, and Briffa’s “One Tree” in Yamal. The outliers can overwhelm the rest, especially if you put your thumb on the scales.

0

Clyde Spencer

Reply to Ian W

November 6, 2020 10:12 am

Ian
What you called the “arithmetic mean of the highest and lowest temperature” is really more properly called the Mid-Range Value. While the arithmetical operations of arriving at the two are the same (i.e. add all and divide by the number of temps) the classical mean and mid-range have different statistical properties.

The average global temperature IS being influenced more by the daily lows than by the highs. I showed that in Fig. 2 at:
https://wattsupwiththat.com/2015/08/11/an-analysis-of-best-data-for-the-question-is-earth-warming-or-cooling/

Matthew Schilling

Reply to Jeff Alberts

November 6, 2020 8:06 am

The average is often meaningless. For instance, I enlisted in the Navy as a linguist. I selected a language that was so foreign to English it makes Spanish seem like a near twin of English. There were eleven of us in the class on Day 1. We faced our first test at the end of the first week. Four of us scored in the high 90’s, the other seven failed utterly. No one “kind of got it”. No one scored anywhere near the mean.
The mean of that first test offered no useful information about that class. Nor did it represent anyone in that room.
As for that class, the seven were removed over that first weekend. We remaining four graduated together 47 weeks later.

Krishna Gans

Reply to Matthew Schilling

November 6, 2020 9:31 am

Sample size isn’t meaningless 😀

Matthew Schilling

Reply to Krishna Gans

November 6, 2020 11:33 am

Krishna, averages would’ve been of more value after the class was whittled down to 4 than they were when the class was 11. But don’t let me interrupt you as you thrash about…

chemman

Reply to Krishna Gans

November 6, 2020 4:13 pm

Neither is it meaningful.
A larger sample size with bad data is far less useful than a small sample size with good data.

jtom

Reply to Krishna Gans

November 6, 2020 8:00 pm

Sample size is clearly meaningless for a great many averages. 350 million people in the US. The average person has 3.98 limbs. When no sample is average, the average can be nonsense, and the sample size is meaningless.

Matthew Schilling

Reply to jtom

November 7, 2020 6:43 am

+1

Matthew W

Reply to Matthew Schilling

November 6, 2020 5:42 pm

When U.S. air force discovered the flaw of averages

https://www.thestar.com/news/insight/2016/01/16/when-us-air-force-discovered-the-flaw-of-averages.html

Matthew Schilling

Reply to Matthew W

November 6, 2020 7:38 pm

+1

Gordon A. Dressler

November 6, 2020 7:37 am

Referencing the almost-two complete min-max anomaly cycles that have occurred from 2010 to 2020, as shown in Figure 2 of the above article: their amplitudes appear to be statistically unusual . . . there is only a single prior instance of a full cycle amplitude of equal magnitude shown, that over the period of 1920-1924. These three unusual-amplitude cycles have amplitudes of 1.3-1.6 °C. Maybe this is within the range of natural variability, maybe not; I don’t know if there is anything to conclude from this observation.

All “cycle” periods (based on counting successive local minimums or successive local maximums) of the data plotted in Figure 2 appear to be within a rather limited range of 5 to 7 cycles per 20 years, or an average of about 1 cycle every 3.3 years, which is generally consistent with the periods between successive El Ninos or successive La Ninas, so the plotted data has not been adjusted to take out this natural variability.

Nonetheless, Figure 2 clearly shows a cooling trend from about 1938 to about 1977, and a “pause” (or “hiatus”) in warming from 1998 to 2019.

So, it does appear that the data comprising Figure 2 has not (yet) been adjusted by AGW/CAGW “scientists”.

Andy May

Author

Reply to Gordon A. Dressler

November 6, 2020 9:48 am

Gordon, every time I look at the data between 2010 and 2019 (inclusive) it looks fishy. This decade is the focus of my work. Who knows what will turn up? I’ve really just started. I’ll write up stuff as soon as possible.

Gordon A. Dressler

Reply to Andy May

November 6, 2020 1:20 pm

Thanks Andy! I am very glad to know that you are looking into it.

Graemethecat

November 6, 2020 7:44 am

How does this graph comport with the well-known fact that the 1930’s were hotter than the 2000’s in the lower 48 states?

DMacKenzie

November 6, 2020 8:27 am

It seems that when all stations were manual Stevenson screen type, the temp anomaly was -0.5. With new automated aspirated stations, the anomaly is +0.5. As stations converted, the anomaly rose by a degree from about 1985 to 2005. You really have to ask whether this change is real or an artifact of changing the sampling methods.
The automated stations are sensitive to short term temp spikes due to ground convection “bubbles” that the older bird houses could not react to. This results in a downward adjustment, but is it enough? Plus, stations went from recording highs/lows on metal floats in the mercury thermometers to averaging continuous thermistor readings. Many stations moved locations from “handy-for-volunteers” to local airports, which experienced increasing levels of air traffic, jet engines, and runway paving over recent decades. The possibility of interpretive bias in the corrections, adjustment, and homogenization of these effects is large, much larger than the actual instrumentation uncertainty. The presently assumed rise of +1 just happens to be where different methodologies seem to agree with each other, given whatever interpretive bias has inadvertently been built into the data adjustment methods. As long as the raw data remains available, future analysis will reveal something different from today’s analysis, probably all within half a degree of the raw data, and “homogenizations” of over a degree will be ridiculed….

Geoff Sherrington

Reply to DMacKenzie

November 6, 2020 3:26 pm

DMac,
In Australia, the electronic temperature is taken over a one second interval. There is no mathematical averaging of continuous thermistor readings. Geoff S

Carbon Bigfoot

November 6, 2020 8:55 am

Bob Tisdale’s book “Extremes and Averages in Contiguous U.S. Climate” has graphs of 100 years of NOAA Contiguous U.S. Climate Data (2018 Edition).
A book that NOAA should have published and you cheap bastards should have bought to support his INDEPENDENT RESEARCH.

Gordon A. Dressler

Reply to Carbon Bigfoot

November 6, 2020 3:28 pm

Carbon Bigfoot, your phrase “. . . and you cheap bastards should have bought to support . . .” demands some clarification.

As in: are you really meaning to offend all WUWT readers/commenters by such an assertion? And/or do you really think it is the responsibility of all WUWT readers/commenters to fund INDEPENDENT RESEARCH?

“Fools rush in where angels fear to tread.”

Nick Stokes

November 6, 2020 9:57 am

Andy
“So, the problem may still exist in the GHCN dataset. I’ll try and check that out”
There is no corresponding issue in GHCN. It only arises because USHCN replaces missing month data by a local average so that every station in the “final” set has a reading. That is not done for GHCN. You need to be aware too that GHCN is now onto V4.

The reason that USHCN did that interpolation to ensure that each of 1218 stations had a reading (final) was that they were trying to average without taking anomalies. This endeavour was unwise, but the remedy more or less worked.

Andy May

Author

Reply to Nick Stokes

November 6, 2020 11:02 am

Thanks Nick. The dataset I have is labeled ghcnm.v4.0.1.20201010. I take this to mean GHCN, monthly, version 4.0.1, October 10, 2020. I’ve actually done quite a lot of work with it, but I have a way to go before I can post anything. I’ll be very interested in your comments.
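
A small sketch of that reading of the label, assuming the pattern is product, dotted version, then a YYYYMMDD date stamp. The pattern is inferred from this one label only, and the function name `parse_ghcnm_label` is hypothetical:

```python
from datetime import datetime

def parse_ghcnm_label(label):
    """Split a label such as 'ghcnm.v4.0.1.20201010' into its parts.

    Assumes the form <product>.v<version>.<YYYYMMDD>, as read in the comment above.
    """
    product, rest = label.split(".", 1)       # 'ghcnm', 'v4.0.1.20201010'
    version, stamp = rest.rsplit(".", 1)      # 'v4.0.1', '20201010'
    release_date = datetime.strptime(stamp, "%Y%m%d").date()
    return product, version.lstrip("v"), release_date

product, version, release_date = parse_ghcnm_label("ghcnm.v4.0.1.20201010")
print(product, version, release_date.isoformat())
```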

J Mac

November 6, 2020 10:15 am

Good comparison, Andy May! If Stokes and Mosher had a point to make in their prior comments, they failed to elucidate it with clarity. D’OH!

Nick Stokes

Reply to J Mac

November 6, 2020 10:22 am

The simple point is, as acknowledged here:
“Mosher said the USHCN is no longer the official record of the CONUS temperatures. This is correct as far as NOAA/NCEI is concerned. They switched to a dataset they call nClimDiv in March 2014.”

Tom Abbott

November 6, 2020 10:18 am

From the article: “While NOAA/NCEI has dropped USCHN in favor of a combination of USCRN and nClimDiv, the anomaly record from 1900 to today hasn’t changed in any significant way.”

My problem with the current US temperature record is it does not look like the Hansen 1999 US surface temperature chart, where 1934 was 0.5C warmer than 1998, which makes 1934 0.4C warmer than 2016, the so-called “hottest year evah!”. Hansen 1999 demonstrates that the US has been in a temperature downtrend since the 1930’s.

The charts mentioned in this post all show the US to be in a temperature uptrend and to be much warmer than the 1930’s. So who is wrong? Hansen, or the current group of Data Manipulators? One of them says the 1930’s was the hottest decade, the other group says the current decade is the hottest decade. They both can’t be correct.

I will go with Hansen 1999. That’s the chart that shows we don’t have anything to worry about from CO2. That’s the reason the alarmist Data Manipulators decided to manipulate the data to make things appear to be hotter today than at any time in the past. The historic temperature record, like Hansen 1999, puts the lie to these claims if anyone cared to pay attention to it.

Hansen 1999:

We are not experiencing unprecedented warming today. It was just as warm in the 1930’s as it is today, which means that there is no CO2-caused warming worth speaking about, since there is much more CO2 in the atmosphere now, than in the 1930’s, yet it is no warmer today than it was in the 1930’s.

CO2 is a benign gas that plays a very small role in the Earth’s atmospheric temperatures, and unmodified regional temperature charts from around the world show just that: It was just as warm in the recent past as it is today. CO2 has not caused the temperatures to climb abnormally. The proof is in the written surface temperature records from around the world.

Andy May

Author

Reply to Tom Abbott

November 6, 2020 11:09 am

Tom, I think you have a valid point. I plan on looking at the corrections applied to the GHCN raw data. The changes are mostly due to the corrections applied as far as I can tell.

Steven Mosher

Reply to Andy May

November 6, 2020 6:03 pm

Andy,

With GHCN-M you have to be careful because ONLY the US stations receive the TOB+PHA treatment, and only when there is metadata for the Time of Observation. The rest of the world gets ONLY PHA.

Finally, it is instructive to compare the same station in USHCN with its counterpart in GHCN.

fred250

Reply to Steven Mosher

November 7, 2020 1:32 pm

Before or after all the “mal-adjustments” to both sets??

dh-mtl

Reply to Tom Abbott

November 6, 2020 1:46 pm

The sea surface temperature records, ENSO and AMO, show the same thing. No warming.

Megs

Reply to Tom Abbott

November 6, 2020 3:28 pm

Thanks for bringing that up Tom. I looked at the graph in Andy’s article and my first thought was what happened to the 1930s temps, have they been permanently removed too? Along with the ‘medieval warming’ and the ‘little ice age’, going back a bit further.

You queried it much better than I could.

Tom Abbott

Reply to Tom Abbott

November 7, 2020 4:21 am

Hey! What do you know, my chart actually showed up in the post! I guess Anthony must be improving the comment software. Good!

Andy May

Author

November 6, 2020 11:08 am

J Mac, Nick is correct. Mosher was correct to a point. He also said that the process used to go from the raw data to the final anomalies had changed. It turns out it didn’t. The nClimDiv dataset uses exactly the same corrections. What changed was the number of stations used and the gridding algorithm.

J Mac

Reply to Andy May

November 6, 2020 12:42 pm

I understand that, Andy. Did any of that offer any distinction to or warrant any change to your original hypothesis? If no, it is a distinction without a difference.

Andy May

Author

Reply to J Mac

November 6, 2020 12:53 pm

No, it didn’t matter, because the graphs overlaid. I have all the GHCN data, but it is a huge file and takes hours to process on my computer. Fortunately, once I get all of it loaded into R and output to properly formatted RData files, I can work with it more efficiently. I’m nearly there. Even with GHCN/nClimDiv, the last 5-10 years look very strange. So I see your point.

windlord-sun

Reply to Andy May

November 6, 2020 3:11 pm

What program is “R?”

Last time I parsed out full daily records from GHCN it came to 500 million records. Is that your count?

windlord-sun

Reply to windlord-sun

November 6, 2020 3:46 pm

actually, 450,396,126 recordings between 1900 and 2019.

… and I agree, there is something bizarre in the recent data, as per this sine wave of it….

that can’t be right.

Andy May

Author

Reply to windlord-sun

November 7, 2020 9:40 am

R is a free statistical programming language. You can download it here:
https://www.r-project.org/

J Mac

Reply to Andy May

November 6, 2020 3:50 pm

Andy,
I could see the fidelity in the overlay graphs you provided. Stokes and Mosher’s comments served only to pointlessly distract from your valid and demonstrated hypothesis. They added nothing.

Derg

Reply to J Mac

November 6, 2020 10:07 pm

Bingo

windlord-sun

Reply to Andy May

November 9, 2020 6:54 am

Andy May: “R is a free statistical programming language. You can download it here:
https://www.r-project.org/”

Thank you, I’ll look into it.

Did you ever come up with a total number of recordings in GHCN?

One issue with GHCN is that many stations did not and/or are not reporting long term. Huge numbers of stations blinked on/off after a decade or two … or less! Many ‘just got started’ in the last few decades.

In my opinion, those short reports are useless for either POV: 1) looking at each station one by one to see if they reveal any abnormal warming; or 2) attempting to construct a model of precision through gridding, interpolation, meshing with proxies, etc.

Raw count of stations in GHCN: 40,145, with an average lifespan of 38 years.

Once I screened out those short ones, I came up with a long-term station count of 1863, of which 1593 were in the USA. That means that GHCN contains only 270 long-term non-US stations.

The filter was “more than 99 years of records, and still active now.”
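
A minimal sketch of that kind of screen, assuming each station is summarized by its first and last reporting year. The `Station` record, the example IDs, and the use of the first-to-last span as a stand-in for "years of records" are all assumptions for illustration; a real screen would count years that actually contain data:

```python
from dataclasses import dataclass

@dataclass
class Station:
    station_id: str   # hypothetical identifier, not a real GHCN ID
    first_year: int   # first year with any record
    last_year: int    # most recent year with any record

def is_long_term(st, current_year=2020, min_years=100):
    """True if the station spans more than 99 years and is still reporting."""
    span = st.last_year - st.first_year + 1
    return span >= min_years and st.last_year >= current_year

stations = [
    Station("EXAMPLE001", 1895, 2020),  # long and active: kept
    Station("EXAMPLE002", 1950, 2020),  # active but too short: dropped
    Station("EXAMPLE003", 1880, 1995),  # long but closed: dropped
]

long_term = [s.station_id for s in stations if is_long_term(s)]
print(long_term)
```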

Steven Mosher

Reply to Andy May

November 6, 2020 5:18 pm

My point is this

Global climate records DON’T USE USHCN, they DON’T USE nCLIMDIV

they use GHCN-M, which doesn’t use TOB

fred250

Reply to Steven Mosher

November 7, 2020 1:31 pm

They use some other sort of manipulation to try to bring the temperature fabrication in line with their “expectations”.

Spreading urban data all over the place doesn’t help relate to reality.

But you aren’t concerned about what is REAL, are you mosh, only what the scammers want to be shown as real.

windlord-sun

November 6, 2020 11:55 am

Stipulating — for a brief brief moment — that the suppression of 400 of the 1200 stations of USHCN starting in 1989 was justified …

Examining the RAW datasets for the remaining stations…if you look at the sine wave of each of them one by one, you find no perturbance that would signal abnormal warming. No matter what anomaly charts, temperature reconstructions by gridding, homogenizing, estimating and extrapolating, the fact that any purported warming claimed by those models is not visible amounts to a hard rejection of alarms about warming claimed by the models.

Richard Greene

November 6, 2020 3:04 pm

None of this matters:
A large percentage of numbers are wild guesses (aka infilled)
Almost no Southern Hemisphere temperature data before 1920.
Too little Southern Hemisphere data from 1920 to 1950.

The numbers are not fit for real science before UAH in 1979.
UAH has the potential for accuracy since it has little infilling, is measured in a consistent environment, and is measured where the greenhouse effect occurs.

The surface numbers are garbage (a scientific term) even before all the repeated “adjustments” and the use of a global average is not useful — no one lives in a global average temperature.

The one number global average temperature is a statistic, not a real measurement.

It also hides where the most warming has happened since 1979 (the upper half of the Northern Hemisphere) and when the most warming has happened (mainly in the coldest six months of the year and mainly at night). The use of a global temperature anomaly on a chart, with a range of 1 to 1.5 degrees C., is climate alarmist propaganda. The statement that winter nights in Alaska are warmer than they used to be tells a completely different story using the same “data”.

RickWill

Reply to Richard Greene

November 6, 2020 10:22 pm

But there is no “greenhouse effect”. The more water vapour in the atmosphere, the more energy is rejected:
https://1drv.ms/u/s!Aq1iAj8Yo7jNg2_DukRksyuhIkZ8

Think about it for 10 seconds – if water vapour caused the surface to warm, it would just all boil off and there would be no oceans.

The atmosphere goes into cyclic cloudburst mode when TPW reaches 38mm. That results in highly reflective cloud that shades ocean surface. It is the cause of monsoon and cyclones. Cyclones result in massive heat rejection. Cloud in cyclones can result in surface cooling when the sun is directly overhead.

“Greenhouse effect” is a fairy tale dreamt up by incompetents who have no understanding of atmospheric physics.

The tropical ocean surface temperature does not have a long term trend and cannot. There is a powerful thermostat that prevents the surface temperature ever exceeding 32C:
https://1drv.ms/b/s!Aq1iAj8Yo7jNg3j-MHBpf4wRGuhf

JimK

November 6, 2020 4:40 pm

Has anyone else noticed that Fig. 2 shows the 2020 temperature dip as the same as the 1900 peak? That doesn’t seem very catastrophic to me.

Steven Mosher

November 6, 2020 5:11 pm

“The data used to build the nClimDiv dataset is drawn from the GHCN (Global Historical Climate Network) dataset”

WRONG AGAIN

nClimDiv is built from a whole collection of sources

GHCN-D (DAILY)
ASOS
SNOTEL
RAWS
and a few others

fred250

Reply to Steven Mosher

November 7, 2020 4:28 am

And then matched to USCRN…..

Hadn’t you realised that yet, mosh

Not being a mathematician, you probably wouldn’t.

Andy May

Author

Reply to Steven Mosher

November 7, 2020 9:21 am

Steve, NOAA/NCEI say differently, to quote:
“The new divisional data set (nCLIMDIV) is based on the Global Historical Climatological Network-Daily (GHCN-D) and makes use of several improvements to the previous data set. ”

All the data is corrected for time-of-day. I’m sure adjustments have been made to the corrections; that is what I’m looking into. Why is the recent reconstruction so odd? Some changes have been made. ASOS, SNOTEL, and RAWS are in GHCN-D already. For a complete list of data sources for GHCN check the source flag list here:
https://docs.opendata.aws/noaa-ghcn-pds/readme.html
T=SNOTEL, U=RAWS, etc.

Steven Mosher

November 6, 2020 5:16 pm

“The nClimDiv dataset uses a lot more stations than USCHN and if the stations are well sited and well taken care of this is a good change. The USCRN dataset is from a smaller set of weather stations, but these are highly accurate and carefully located. I do not think the USCRN stations are part of the nClimDiv set but are used as an independent check on them. The two systems of stations are operated independently.”

WRONG AGAIN

Jesus andy.

USCRN stations are also compiled into GHCN-D

GHCN-D is one of the sources for nCLIMDIV

Also NOTE: the comparison of USCRN and nCLIMDIV PROVES that adjustments work and proves that siting doesn’t matter

get that

fred250

Reply to Steven Mosher

November 7, 2020 4:31 am

Yep, we GET that they adjust climdiv to match USCRN.

That is patently obvious.

USCRN has brought the warming manipulations under control

Before USCRN, they adjusted to an agenda.

USCRN stopped the warming in the US.

Andy May

Author

Reply to Steven Mosher

November 7, 2020 9:24 am

Steve, USCRN is not listed as a data source for GHCN-D. Check the list I linked in my previous comment. Since USCRN is supposed to be a reference, it doesn’t make sense to include it. But, if you can document it is a source of data for USHCN or nClimDiv, I would be interested.

Nick Stokes

Reply to Andy May

November 7, 2020 1:15 pm

Andy,
Stations for GHCN (and nClimDiv, USHCN) are chosen for their long record period, among other criteria. So USCRN stations haven’t qualified so far.

You may find some Moyhu facilities helpful here. This page lets you search for GHCN V4 stations (click radio button “GHCN V4 new!”). And this post has graphical presentation of comparisons of GHCN and USCRN, with this follow-up showing some time series results.

Andy May

Author

Reply to Nick Stokes

November 7, 2020 1:34 pm

Thanks Nick! They look like great links, I will spend a lot of time on them.

Steven Mosher

November 6, 2020 5:28 pm

“They looked very anomalous from 2015 through 2019. The same set of corrections are used in the GHCN dataset, which is the source of the data fed into nClimDiv. So, the problem may still exist in the GHCN dataset. I’ll try and check that out and report on it in a future post”

NO WRONG WRONG WRONG

USHCN data flow goes like this

INGEST FROM SOURCES
RUN TOB
RUN PHA

nCLIMDIV data goes like this
INGEST FROM SOURCES
RUN TOB
RUN PHA

GHCN – D goes like this
INGEST FROM SOURCES
RUN QA

GHCN M goes like this
INGEST from GHCN D
INGEST from ISTI
INGEST from other sources
RUN PHA

Note GHCN-M DOES NOT USE TOB.
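
The four flows listed above can be written down as plain data and compared. This is only a restatement of the list in this comment, not NOAA code; the names `PIPELINES` and `uses_step` are illustrative:

```python
# Processing steps as listed in the comment above. The dict layout and the
# names PIPELINES and uses_step are illustrative, not taken from any NOAA code.
PIPELINES = {
    "USHCN": ["INGEST FROM SOURCES", "RUN TOB", "RUN PHA"],
    "nClimDiv": ["INGEST FROM SOURCES", "RUN TOB", "RUN PHA"],
    "GHCN-D": ["INGEST FROM SOURCES", "RUN QA"],
    "GHCN-M": ["INGEST FROM GHCN-D", "INGEST FROM ISTI",
               "INGEST FROM OTHER SOURCES", "RUN PHA"],
}

def uses_step(dataset, step):
    """True if the named dataset's listed flow includes the given step."""
    return step in PIPELINES[dataset]

for name in PIPELINES:
    tob = uses_step(name, "RUN TOB")
    pha = uses_step(name, "RUN PHA")
    print(f"{name:9s} TOB={tob} PHA={pha}")
```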

For USHCN sources these are the files

Also, if you are having trouble with GHCN-M on your system, you are probably doing something wrong again.

Andy May

Author

Reply to Steven Mosher

November 7, 2020 9:39 am

Steve, I don’t know why you are saying I’m wrong. I agree with your list and never wrote otherwise. Monthly averages cannot be time of day bias corrected; that has to happen daily, obviously. The daily data used to create GHCN-M has to be bias adjusted; that’s common sense. Presumably, GHCN-D is used to make GHCN-M. From the readme file:
“GHCN-M version 4 currently contains monthly mean temperature for over
25,000 stations across the globe. In large part GHCN-M version 4 uses
the same quality control and bias correction algorithms as version 3.
The greatest difference from previous version is a greatly expanded
set of stations based on the large data holdings in GHCN-Daily as well
as data collected as part of the International Surface Temperature
Initiative databank (ISTI; Rennie et al. 2014).

There are currently three versions of GHCN-M version 4
QCU: Quality Control, Unadjusted
QCF: Quality Control, Adjusted, using the Pairwise Homogeneity
Algorithm (PHA, Menne and Williams, 2009).
QFE: Quality Control, Adjusted, Estimated using the Pairwise
Homogeneity Algorithm. Only the years 1961-2010 are provided.
This is to help maximize station coverage when calculating
normals. For more information, see Williams et al, 2012. ”

My processing is going fine, just slowly due to the huge amount of data.

Steven Mosher

November 6, 2020 5:48 pm

comparing

https://www.ncdc.noaa.gov/temp-and-precip/national-temperature-index/time-series?datasets%5B%5D=uscrn&datasets%5B%5D=climdiv&datasets%5B%5D=cmbushcn&parameter=anom-tavg&time_scale=ann&begyear=2005&endyear=2020&month=10

Note

1. USCRN is UNADJUSTED GOLD STANDARD
2. USHCN is ADJUSTED and has sites that are poor quality (high CRN rating)
3. nCLIMDIV is also ADJUSTED and has sites that are poor quality (high CRN rating)

Result?

ADJUSTMENTS work. The Unadjusted gold standard USCRN SHOWS SLIGHTLY WARMER temps than the adjusted data.

fred250

Reply to Steven Mosher

November 7, 2020 1:37 pm

“ADJUSTMENTS work.”

Yep, now they have USCRN to set a baseline for those “adjustments” to “adjust” to

And warming in the US has leveled off. Having good data was always going to do that.

Before 2005, they can “adjust” as they want, to fit their agenda.

AndyHce

November 7, 2020 3:33 am

Gavin Schmidt wrote an article on his RealClimate site about why temperature anomalies are presented instead of actual temperatures. Unfortunately I never have any success with searches to find older articles but I think it was in 2012 or 2014. Anyway, if I understood correctly he admitted that temperature measurements were at best accurate to +/- 0.5 C and that a legitimate calculation of global average was thus only accurate to +/- 0.5 C.

The problem with that, he said, was that, over the last 30 years, each year’s average comes out to the same number – it isn’t possible to see any change. But, by calculating anomalies to multiple decimal places, a trend shows up. The anomaly grows by hundredths of a degree year by year (many or most years). This presents convincing evidence that warming is occurring.

I got the impression that he wasn’t even trying to argue that the measurement error does not propagate through the anomaly calculations, only that he and his friends believe that the fact a trend emerges from the calculations’ intermediate values means, with a high probability, that the trend is real.

I think this conclusion is a consideration of philosophy rather than science. In hard terms, to the actual accuracy available, no trend exists in the measurements. However, this perhaps only means that the technology, or the technique, is not up to the real task. The world does seem to be giving signs of warming over the past 200 years or so, perhaps in more locations than not, and as has been pointed out various times, if there is a warming trend, there should be higher and higher temperatures as long as the trend continues.

Tim Gorman

Reply to AndyHce

November 7, 2020 5:35 am

If this was the calculation Gavin actually used then he was way off. The calculation of uncertainty in the global average would be:

sq rt(number of stations) * 0.5C.

No amount of extra digits in the calculation of the anomalies will change this. It won’t change it for an individual station or for an average of multiple stations. An individual anomaly of .001C will have the same uncertainty interval as the actual temperature, +/- 0.5C. Thus it doesn’t matter how many digits you calculate out to, it won’t help you distinguish anything meaningful.

As J.R. Taylor states in his Introduction to Error Analysis:
—————————————
Rule for Stating Answers
The last significant digit in any stated answer should usually be of the same order of magnitude (in the same decimal position) as the uncertainty.
—————————————-

Thus the anomaly calculated from a measurement with a +/- 0.5C uncertainty interval should not be stated past the tenths digit. Calculating out further is only fooling yourself.
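
A minimal sketch of the stated rule, assuming the "order of magnitude" of the uncertainty is taken from its leading digit. The function name `round_to_uncertainty` and the example value 0.137 are hypothetical:

```python
import math

def round_to_uncertainty(value, uncertainty):
    """Round value to the decimal position of the uncertainty's leading digit."""
    if uncertainty <= 0:
        raise ValueError("uncertainty must be positive")
    # Leading-digit position: 0.5 -> -1 (tenths), 0.05 -> -2 (hundredths).
    exponent = math.floor(math.log10(uncertainty))
    return round(value, -exponent)

# Hypothetical anomaly of 0.137 C, stated against two different uncertainties.
print(round_to_uncertainty(0.137, 0.5))    # stated to the tenths digit
print(round_to_uncertainty(0.137, 0.05))   # stated to the hundredths digit
```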

windlord-sun

Reply to Tim Gorman

November 7, 2020 7:27 am

Why does anyone care about a reconstructive model of the global average temperature?

AndyHce

Reply to Tim Gorman

November 7, 2020 11:36 am

I suppose I did not express the point clearly enough. The argument was not about a result with statistical certainty; it was a consideration of the (claimed) fact that the calculation to two decimal places continues, year by year (for more years than not, anyway), to trend upward. This ignores the inherent uncertainty. It answers the questions

Why is there a strong (though small) trend in this calculation?
If the results were random, how likely is it that they would show a trend over a long period?

with the opinion that the consistency of the trend most likely means it is real.

Or at least that was my understanding of the argument.

This is different from a conclusion that says the trend matters (unless perhaps it goes on for ten thousand years), that it relates in any meaningful way to anything else that goes on in the natural world.

fred250

Reply to AndyHce

November 7, 2020 1:39 pm

“The world does seem to be giving signs of warming over the past 200 years or so”

And THANK GOODNESS for that !!

But I prefer REAL warming , over FAKE warming.

kribaez

November 7, 2020 1:42 pm

Some commenters, notably Pat Frank and Tim Gorman above, seem to have suggested that an error in instrument precision must of necessity flow through to final estimates of uncertainty in temperature anomaly estimation. I will try to demonstrate here how and why it is possible to measure a variation in temperature anomaly which is less than the precision error in individual measurements.
First, if you are trying to take a single temperature measurement in a laboratory with an instrument which is accurate, but which records only to the nearest degree, say – a precision uncertainty of plus or minus 0.5 deg C – then repeat measurements cannot improve the precision of the result. Similarly, using 1000 accurate thermometers, all with the same precision, cannot improve the precision of the single temperature measurement.
Equally, in the same laboratory, if you wish to estimate the temperature change with time over a temperature range of less than 1 deg C, then repeat measurements over time with a single thermometer with a precision to the nearest degree is unlikely to yield any meaningful information.
Suppose however that instead of having one accurate thermometer, you have 11 thermometers, say, each with the same plus or minus 0.5 degree precision, and the same relative accuracy, but which are all calibrated to a slightly different base. Can you measure a temperature change of less than 1 deg C? Counter-intuitively, the answer is that yes you can, if the between-thermometer spread in calibration error is large enough.

And what about the isomorphic situation where you have 11 well calibrated thermometers sited in slightly different places with different temperatures, all within a grid-area or locale, and where you wish to estimate the uniform change in temperature of the locale? Again, the answer is that yes, you can measure a temperature change which is below the level of precision of the thermometers – and, moreover, the final estimate of change has an uncertainty which is significantly below the variance arising from a difference in two measurements due to precision uncertainty.
To illustrate why this is so, let us first consider the simple situation where the initial true temperature differences between the thermometers are regular. For this example, we assume that the first thermometer is well calibrated (accurate) and reads 0 deg C initially. When the first thermometer reads 0 degrees, the temperature of the second is 0.1 deg C, the temperature of the third is 0.2 deg C, and so on up to the 11th thermometer which is sited with initial temperature at 1 deg C. At their recording precision levels, when the first thermometer is at zero deg C, the first five of these thermometers will record (or will be read at) 0 deg C, while the next 6 thermometers will record (or be read at) 1.0 deg C. In order, therefore, the temperature recordings look like this:-
0 0 0 0 0 1 1 1 1 1 1 for the 11 thermometers, yielding an estimate of mean initial temperature of 0.55 deg C (= 6/11)

Now, starting at 0 deg C, (arbitrarily) we consider 30 increments of true-but-unknown temperature change each of 0.02 degree steps – making a total true change of 0.6 deg C uniformly warming the locale.
For the first four temperature increments, the thermometers all continue to record the same temperatures. However, on the fifth increment, when a cumulative warming of 0.1 deg C is added, this “trips” thermometer number 5 to change its record from 0 to 1 deg C. The temperature recordings then become:-
0 0 0 0 1 1 1 1 1 1 1 yielding a mean of 0.64 deg C.

The next trip occurs on the 10th incremental temperature step when 0.2 has been added. This causes thermometer number 4 to go from 0 deg to 1 deg, which raises the overall average recorded temperature to 0.73 deg C.

A plot of mean recorded temperatures against the true temperature change therefore yields a stair-step plot, as various thermometers are “tripped” into recording a higher integer value of temperature.
The starting and final values over this simulated “true” change of 0.6 deg C look like this:-
True Temp (deg C) | Thermometer readings  | Mean recorded temp (deg C)
0.0               | 0 0 0 0 0 1 1 1 1 1 1 | 0.55
0.6               | 1 1 1 1 1 1 1 1 1 2 2 | 1.18
Apparent temperature change from recorded temperatures = 1.18 – 0.55 = 0.63 deg C. Alternatively, a regression of recorded temperature against step-count yields a gradient of 0.0207 deg C/step. Over the 30 steps this gives an estimate of total temperature change of 30×0.0207 = 0.62 deg C. This is very close to the true-but-unknown value even though every reading is recorded only to a whole degree.
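For anyone who wants to check the arithmetic, the worked example above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: temperatures are held as integer hundredths of a degree so that the 0.5 rounding boundaries are exact, and halves are rounded up, which is what the readings quoted above imply. The names are made up for the sketch.

```python
# Sketch of the 11-thermometer stair-step example.
# Temperatures are integer hundredths of a degree so that x.5 boundaries are exact.
N_STEPS = 30                              # 30 increments of 0.02 deg C = 0.6 deg C
STEP = 2                                  # 0.02 deg C, in hundredths
initial = [10 * i for i in range(11)]     # true starts: 0.0, 0.1, ..., 1.0 deg C


def recorded(true_hundredths):
    """Reading to the nearest whole degree, halves rounded up (assumption)."""
    return (true_hundredths + 50) // 100


def mean_recorded(step):
    """Mean of the 11 recorded readings after `step` increments of warming."""
    readings = [recorded(t + STEP * step) for t in initial]
    return sum(readings) / len(readings)


means = [mean_recorded(k) for k in range(N_STEPS + 1)]

# End-point estimate of the change.
endpoint_change = means[-1] - means[0]

# Ordinary least-squares gradient of mean recorded temperature vs step count.
xs = list(range(N_STEPS + 1))
x_bar = sum(xs) / len(xs)
y_bar = sum(means) / len(means)
slope = (sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, means))
         / sum((x - x_bar) ** 2 for x in xs))

print(f"initial mean      = {means[0]:.2f}")
print(f"final mean        = {means[-1]:.2f}")
print(f"end-point change  = {endpoint_change:.2f}")
print(f"regression change = {N_STEPS * slope:.2f} (gradient {slope:.4f} per step)")
```

The unrounded end-point change is 7/11, about 0.64; the 0.63 quoted above comes from subtracting the two means after each has been rounded to two decimals.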

Please note that this illustration works for either of two cases:-
a) Estimating a change in temperature on a single body using multiple thermometers of the same precision but with (sufficient) spread in calibration.
b) Estimating a change in temperature over a locale using multiple thermometers located at points with a spread of different initial temperatures (as in the worked example above) with or without a spread in calibration accuracy.
The obvious weakness in the above simple example is that I have pre-selected thermometers with regularly spaced differences in their initial temperatures (or regularly sampled calibration errors if applied to Case (a) as defined just above). It raises the obvious question:- what happens if you have thermometers which are sited such that the initial temperature in a locale is sampled in a random fashion? To test this, I retained 11 thermometers, but ran a MC, sampling the initial thermometer temperatures from a one degree spread (U[0,1] distribution). The resulting mean estimate of temperature gain corresponded to the true solution (0.6) as expected, but the sample variance in the estimated temperature gain rose to 0.026. Increasing the number of temperature points within the locale improves the stair-step relationship (recorded temp vs true incremental temp) by adding more steps – a sort of smoothing – and decreases the variance of the final estimate of temperature gain in inverse proportion to the number of samples, so the variance associated with this 11 thermometer case can be reduced. However, even with just 11 temperature stations, the variance of 0.026 may be compared with the variance of a difference in two values each carrying a precision error of plus or minus 0.5; assuming two uniform distributions for precision error, this works out to be 0.17 – a far larger value.
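A minimal sketch of that kind of Monte Carlo test is given below. It is not the original run: the trial count, the seed, and the use of the simple end-point difference as the estimator are all assumptions made here for illustration. With the end-point estimator each thermometer either steps up by one degree or does not, with probability 0.6, so the variance for 11 thermometers comes out near 0.24/11, about 0.022. That is the same order as the 0.026 quoted above, which may have come from a different estimator or setup.

```python
import math
import random

random.seed(0)                 # fixed seed so the sketch is repeatable

N_THERMOMETERS = 11
TRUE_GAIN = 0.6                # deg C, as in the worked example
N_TRIALS = 20_000              # assumption: the original trial count is not stated


def read(temp):
    """Record to the nearest whole degree (halves round up)."""
    return math.floor(temp + 0.5)


def estimated_gain():
    """End-point estimate of the gain for one random siting of the thermometers."""
    starts = [random.random() for _ in range(N_THERMOMETERS)]   # U[0,1) start temps
    before = sum(read(t) for t in starts)
    after = sum(read(t + TRUE_GAIN) for t in starts)
    return (after - before) / N_THERMOMETERS


gains = [estimated_gain() for _ in range(N_TRIALS)]
mean_gain = sum(gains) / N_TRIALS
var_gain = sum((g - mean_gain) ** 2 for g in gains) / (N_TRIALS - 1)

# Difference of two independent U[-0.5, 0.5] precision errors:
# each has variance 1/12, so the difference has variance 1/6 (about 0.17).
var_two_readings = 2 * (1 / 12)

print(f"mean estimated gain      = {mean_gain:.3f}")
print(f"variance of the estimate = {var_gain:.4f}")
print(f"variance, two readings   = {var_two_readings:.2f}")
```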
In summary, it cannot be assumed that measurement error arising from instrument precision propagates through the calculation as an irreducible uncertainty.

0

Tim Gorman

Reply to kribaez

November 7, 2020 4:35 pm

“the first thermometer is well calibrated (accurate) and reads 0 deg C initially.”

You are assuming a 100% accurate reading with no uncertainty. You’ve already violated reality.

“To test this, I retained 11 thermometers, but ran a MC, sampling the initial thermometer temperatures from a one degree spread (U[0,1] distribution)”

Error is *NOT* uncertainty. There is no probability distribution for uncertainty.

“assuming two uniform distributions for precision error,”

There is no probability distribution for uncertainty. You are trying to equate the probability distribution of error with uncertainty. It just doesn’t work that way. Your first thermometer reading of 0C should have an uncertainty interval associated with it. E.g. 0 deg C +/- 0.5 deg C. It is that +/- 0.5 deg C that propagates.

If your statement “And what about the isomorphic situation where you have 11 well calibrated thermometers sited in slightly different places” is true then you are trying to combine independent values with uncertainty intervals. Uncertainties for independent values add by root sum square. You can’t get around that. Each and every one of the sites will have an uncertainty interval associated with it. They may or may not be equal uncertainty intervals but they still add root sum square.

0

Pat Frank

Reply to kribaez

November 7, 2020 4:54 pm

It’s not about precision, kribaez.

The uncertainty derives from systematic measurement error produced by uncontrolled environmental variables.

The error is revealed only by field calibration studies using well-sited and maintained standard instruments. The uncertainty in meteorological field station measurements is expressed as the rms of the systematic errors found during calibration.

The field station thermometers are exposed to environmental variables of unknown intensity that divert the measured temperature from the physically correct temperature. The error in each measurement is therefore unknown.

The only recourse is to apply the measured rms field calibration uncertainty to all field measurements.

Averaging field measurements subject to an applied field calibration uncertainty does not, ever, reduce the uncertainty.

0

kribaez

Reply to Pat Frank

November 8, 2020 5:49 am

Pat,

“It’s not about precision, kribaez.

The uncertainty derives from systematic measurement error produced by uncontrolled environmental variables. ”

I agree that there are multiple sources of uncertainty that go into the annual temperature index. My comment however IS about precision and only precision. One of the “beliefs” which seems to still be floating around is that it is impossible to estimate the change in annual temperature index to a higher precision than your best thermometer of the day. I am offering a clear worked example of the arithmetic to demonstrate why this is fallacious reasoning when a large number of thermometers are used. That does not mean that the temperature indices are ok, just that, in the general scheme of things, thermometer precision is way down the list in the ranked order of contributors to uncertainty in the annual temperature index.

0

Pat Frank

Reply to kribaez

November 8, 2020 9:01 am

No one disputes that random precision errors average away, kribaez.

All of my temperature record work concerns systematic measurement error arising from uncontrolled environmental variables.

The calibration uncertainty stemming from those errors does not average away.

0

AndyHce

Reply to Pat Frank

November 12, 2020 2:18 am

For thermometers that measure to the nearest degree C, isn’t the inherent uncertainty +/-0.5C, having nothing to do with environmental variables?
If not, what is the correct label for the fact that 20C on the thermometer just means somewhere between 19.5 and 20.4?
Does this have some effect on a sum or average of measurements that is different from uncertainty?

0

Pat Frank

Reply to Pat Frank

November 12, 2020 1:34 pm

Hi Andy, by ‘inherent uncertainty’ do you mean resolution?

Typically, the reading resolution of a LiG thermometer graduated in 1 C is given as ±0.25 C.

That is, even if the LiG thermometer had been calibrated and is known to be accurate to ±0.1 C over its whole range, the ability of a human reader to judge a temperature reading when the liquid meniscus is between two graduation marks is taken to be ±0.25 C.

If that LiG is in a field meteorological Stevenson screen, then the calibration uncertainty that recognizes the impact of environmental variables on the screen, and thus the thermometer, is ±0.5 C.

The total uncertainty in a given read temperature is then the rms of ±0.25 C and ±0.5 C, which is ±0.56 C.
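The arithmetic behind that figure is a root of the sum of squares of the two components (what the comment calls the rms). A short sketch, with variable names chosen here for illustration:

```python
import math

reading_resolution = 0.25   # deg C, visual reading of a 1 C graduated thermometer
field_calibration = 0.5     # deg C, field calibration uncertainty of the screen

# Combine the two components in quadrature.
total = math.sqrt(reading_resolution**2 + field_calibration**2)

print(f"combined uncertainty = +/-{total:.2f} C")
```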

That ±0.56 C does not average away.

Uncertainty bars at least that large should be on every single global air temperature data point since 1880 (GISS) or 1850 (UKMet).

0

Tim Gorman

Reply to kribaez

November 8, 2020 9:46 am

You cannot estimate precision to a higher value than your instrument allows. This would require having multiple measurements of the same thing using the same measurement device. Then the central limit theorem would apply. Even then, if you use significant digits properly you won’t actually get any more precision than the instruments provide. If your calculation is 93.81 +/- 0.3 then it should be rounded to 93.8 +/- 0.3 since you can’t distinguish past the tenths digit. The same thing would apply to the situation you are speaking to.

The uncertainty of one independent instrument measuring one thing adds to the uncertainty of a second independent instrument measuring a second thing. They add by root sum square. The growth of the uncertainty will mask any supposed increase in precision.

Large numbers of thermometers only add to the uncertainty associated with the group when they are combined. If you have 100 thermometers and each has an uncertainty of +/- 0.5 deg C then the total uncertainty of the 100 thermometers would be sqrt(100) * 0.5 = +/- 5 deg C. You can calculate anything you want out to however many digits you want but it won’t change the uncertainty. Going from 1 deg C +/- 0.5 deg C to 1.00001 deg C +/- 5 deg C is a meaningless exercise. You are no more sure of your answer than when you started. Using significant digits your result should actually go no further than the units digit if the uncertainty is +/- 5. Anything past that is just a false precision.

0

AndyHce

November 7, 2020 10:47 pm

Pat, Tim

Just as a thought experiment, the process I described above seems interesting. To make it simpler, consider that there are
● 10,000 widely distributed weather station thermometers that all meet the best site conditions,
● all are electronically monitored to eliminate human reading error
● all have been calibrated to the same standard
● their precision is 1 degree C
● accuracy is +/- 0.5 C
● they retain that precision and accuracy over 100 years without intervention
● they all restart the day at local midnight
● on site computation provides the NOAA standard of 1 reading per second averaged over 5 minutes to produce a recorded reading
● they all report the minimum temperature and the maximum temperature between one midnight and the next so that day’s average can be calculated
● NO adjustments are ever applied
● The actual temperature is slowly increasing such that it warms, averaged over all 10,000 sites, by 0.01C per year.

Their uncertainty is +/- 0.5 C for all measurements so a simple average of all 10,000 daily averages, and a yearly average of 365 days of daily averages, still has an uncertainty of +/-0.5 C, no?

For them to actually detect that warming would require more than 50 years of measurements, no?

If the averaging of all 10,000 daily readings, and all forward calculations of those individual readings, is carried out to three decimal places, then rounded to two at year’s end, the places to the right of the decimal are still individually meaningless but are they meaningless in combination? The uncertainty is much larger than the calculation precision but might the actual 0.01C trend be revealed?

Alternatively
Warming might or might not be occurring, the truth is completely unknown.

If the calculation precision of multiple decimal places is truly meaningless, should the distribution of +/- difference relative to the previous year not be random from year to year?

If a trend emerges, at the second decimal place in the year’s end calculation, isn’t there a probability distribution as to how often that could happen by chance?

If a calculation trend is consistent over many years, how many years would be necessary to say the trend is so improbable that it very probably is real?

0

windlord-sun

Reply to AndyHce

November 8, 2020 6:24 am

AndyHce,

You need 120 years minimum. The dataset we have, USHCN RAW, reveals apogee points 60 years apart. Tracking for two cycles will pin things down, yet three/four/five will be better.

I think NOAA should do just as you suggest. Expand USCRN from 145 stations in the US to 1200, then track the raw data for 120 years. Then forever.

However, NOAA should 1) not destroy the currently-ongoing collection of USHCN RAW; and 2) restore the data from the redacted 400 sites absented since 1989; and 3) assure all the USHCN stations they ought not make radical changes to their instruments and protocols. Just be consistent.

The same ought be done for satellites that can make a reasonable reading of surface air temperature. Track for 120 years in a consistent manner.

Now you have three plots of direct measurement. The parallel systems will act as running controls on each other. It will be a tremendous accomplishment, and inexpensive to implement.

Note: many in the orthodox climate profession have faced the fact that Alarm was over-heated [sarc] and things are not dire. Humans might think 120 years is an eternity. Nature says it is a blink of an eye. Do we wish we had direct measurement back through the entire Holocene and into the Wisconsin? Well, we don’t have it. But now we can have it forever in cross-confirmation.

Stay calm and track the raw many ways.

0

windlord-sun

Reply to windlord-sun

November 8, 2020 6:34 am

ADD: the 1200 USCRN sites ought to be located right next to the 1200 USHCN sites.

0

AndyHce

Reply to windlord-sun

November 12, 2020 2:27 am

The location of USCRN sites is a major part of what makes them what they are. If they were next to the USHCN sites they could not qualify for USCRN rating.

0

windlord-sun

Reply to AndyHce

November 12, 2020 5:58 am

Ok, I’ll change my wording.

In order to illuminate the in-place 1200 USHCN stations, 1200 new installations with the precision of those used by USCRN should be located right next to each. That way, we would learn with what factor to nudge the historic curve up or down. [This step is not necessary to answer the question “is there any abnormal warming?” The non-high-precision historical plots already can signal that, yes or no.]

Another approach — but watered down — would be to make the new installations mobile … then rotate them from one of the 1200 stations to the next every few years or so.

This is what happens in aerospace and precision manufacturing. Your Quality Assurance department welcomes outside calibration of all gages.

It would be better to make all new confirming stations.

Meanwhile, you can continue whatever it is you meant by this: “The location of USCRN sites is a major part of what makes them what they are. If they were next to the USHCN sites they could not qualify for USCRN rating.”

0

Pat Frank

Reply to AndyHce

November 12, 2020 1:43 pm

“That way, we would learn with what factor to nudge the historic curve up or down.”

Nudging historical USHCN station trends up or down relative to co-located USCRN measurements would not remove the uncertainty in the historical USHCN temperatures.

0

AndyHce

Reply to AndyHce

November 14, 2020 4:18 am

Many of the USHCN sites would not allow the USCRN equipment to function at USCRN accuracy. The site conditions are not suitable.

0

kribaez

Reply to AndyHce

November 8, 2020 8:14 am

Andy,
“Their uncertainty is +/- 0.5 C for all measurements so a simple average of all 10,000 daily averages, and a yearly average of 365 days of daily averages, still has an uncertainty of +/-0.5 C, no?”
You are correct, the answer is NO.
“For them to actually detect that warming would require more than 50 years of measurements, no?”
You are correct again, the answer is NO.
I think you are perhaps forgetting that during the course of a year, the temperature at any single point station undergoes a major change, covering a range from less than 10 degrees in the tropics to over 60 degrees in the higher latitudes. This makes a huge difference to the effect of measurement precision.
Let us consider just ONE thermometer in your idealised example. If the accuracy estimate comes from manufacturer’s instrument calibration, and relative accuracy is maintained over the measured temperature range, then it has no effect on the temperature change estimate. If instead, it represents an independent sampling error, which randomly applies to any measurement, then it is easy to add it in by quadrature.
However, let us look first at just the precision question. With just ONE of your thermometers, after 5 years, the estimate of the change in means after accounting for thermometer precision is 0.05 with a standard deviation (sd) of 0.021. In other words, after a uniform temperature gain of 0.05 deg C, the estimate of mean gain will be significantly different from zero – despite the fact that temperatures are recorded only to the nearest whole number.
With 10,000 thermometers, each with a different annual variation, the effect of precision becomes negligible and even after only one year of change the estimate of the change in the mean from daily recordings is 0.01 with a sd of 0.00021. If we add in the effect of a random accuracy problem (+/- 0.5 deg C) assumed to be independently applied to each measurement taken, then this increases the sd after the first year to about 0.0003. So the change in mean temperature after the first year with 10,000 thermometers should be both visible and significant in your idealised example.
I know that this is all counterintuitive. The best way to understand why it works this way is to spend 10 minutes on a spreadsheet. Try the following test for a single thermometer. Set up a sine function with a half-amplitude of 10 degrees (equivalent to a mid-latitude variation of 20 degrees) and a periodicity of 365 days. Calculate its value at daily intervals. Then define a second series by adding 0.05 degrees to the first series. Now round both series to the nearest integer (to account for precision) and take the difference of the means of the two series. The temperature difference for this case works out to be 0.0519. If you change your parameters and repeat a few thousand times, you should find that roughly 95% of the outcomes will fall in the range 0.01 to 0.09 with a mean of 0.05.
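For those who prefer a script to a spreadsheet, here is a rough Python sketch of the same test. It follows the recipe above for the single case; the repeated runs use randomly chosen half-amplitudes, phases and mean temperatures, and those ranges, the seed and the trial count are assumptions picked here for illustration, not the parameters used for the figures quoted above.

```python
import math
import random


def read(temp):
    """Record to the nearest whole degree (halves round up)."""
    return math.floor(temp + 0.5)


def mean_difference(shift=0.05, half_amplitude=10.0, phase=0.0,
                    mean_temp=0.0, days=365):
    """Difference of the means of two rounded daily series, one warmed by `shift`."""
    base = [mean_temp + half_amplitude * math.sin(2 * math.pi * d / 365 + phase)
            for d in range(days)]
    before = sum(read(t) for t in base) / days
    after = sum(read(t + shift) for t in base) / days
    return after - before


# The single case described above: half-amplitude 10 degrees, period 365 days.
single = mean_difference()
print(f"single case: {single:.4f}")

# Repeat with different parameters (ranges below are illustrative assumptions).
random.seed(1)
trials = [
    mean_difference(
        half_amplitude=random.uniform(5.0, 30.0),
        phase=random.uniform(0.0, 2 * math.pi),
        mean_temp=random.uniform(0.0, 25.0),
    )
    for _ in range(2000)
]
mc_mean = sum(trials) / len(trials)
share_inside = sum(0.01 <= t <= 0.09 for t in trials) / len(trials)

print(f"mean over trials:            {mc_mean:.4f}")
print(f"share within 0.01 to 0.09:   {share_inside:.2f}")
```

Each run counts the days on which the extra 0.05 degrees tips a reading over a rounding boundary, so any single result is a multiple of 1/365 and only the average over many runs settles near 0.05.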

0

Nick Stokes

Reply to kribaez

November 8, 2020 1:00 pm

“With 10,000 thermometers, each with a different annual variation, the effect of precision becomes negligible”
Indeed so. The uncertainty of global averages is due mainly to spatial sampling error, not precision, i.e. would you have got a different answer by putting the thermometers in different places?

0

Tim Gorman

Reply to Nick Stokes

November 8, 2020 6:09 pm

The uncertainty of global averages is due to measuring different things with different measuring devices, each with an uncertainty interval. Uncertainties in independent measurements don’t cancel, they only keep adding up. Uncertainty is not error. It doesn’t really matter where you site the thermometers as far as uncertainty is concerned. Siting may affect bias in measurement, i.e. *error*, but it won’t affect uncertainty.

From: https://www.nbi.dk/~petersen/Teaching/Stat2016/Week2/AS2016_1202_SystematicErrors.pdf
=======================================================
Measurements are taken with a steel ruler, the ruler was calibrated at 15 C, the
measurements done at 22 C.

This is a systematic bias and not a systematic uncertainty! To neglect this effect is a
systematic mistake.

Effects can be corrected for! If the temperature coefficient and lab temperature is
known (exactly), then there is no systematic uncertainty.

If we correct for effect, but corrections are not known exactly, then we have to
introduce a systematic uncertainty (error propagation!).

In practice (unfortunately): often not corrected for such effects, but then just
“included in sys. uncertainties”.
===============================================

Errors can be corrected, many times using statistical processes. Uncertainty can’t be corrected.

I hearken back to Donald Rumsfeld: there are known knowns, there are known unknowns, and there are unknown unknowns.

You can handle known knowns statistically, e.g. random measurement errors, bias errors, etc. You can’t do so with known unknowns, they just are uncertainties and aren’t susceptible to statistical reduction. Unknown unknowns can’t even be determined to exist let alone corrected.

0

AndyHce

Reply to Nick Stokes

November 9, 2020 1:34 pm

Nick
In regard to spatial sampling error, the validity of the calculated average rather depends on what one calls that average. If one insists that one is measuring the average of the planet, that sampling error might be huge. If one merely admits that one is measuring the average of the places sampled, the sampling error is zero (assuming all instruments report data at all the times they are supposed to report). Is that not correct?

0

Nick Stokes

Reply to AndyHce

November 9, 2020 4:27 pm

Yes, there isn’t much point in doing it unless you are estimating the true global average. As I said, the sampling error is the main component, but it isn’t “huge”. There are a lot of samples there. You can estimate it very well by taking random sub-samples.

0

Pat Frank

Reply to Nick Stokes

November 11, 2020 4:55 pm

Nick, “The uncertainty of global averages is due mainly to spatial sampling error, not precision. ”

The uncertainty of global averages is due mainly to systematic measurement error arising from uncontrolled environmental variables. Not spatial sampling error, not precision.

0

Tim Gorman

Reply to AndyHce

November 8, 2020 11:16 am

Andy,

Uncertainty adds as root sum square. If you have 10000 independent stations each with a +/- 0.5 uncertainty the final uncertainty of the combined group will be sqrt(10000) * 0.5 = +/-(100 * 0.5) = +/- 50 deg.

Think about averaging a daily max with an uncertainty of +/- 0.5 and a daily minimum with an uncertainty of +/- 0.5. The uncertainty of the two independent measurements combined would be:

sqrt[(0.5)^2 + (0.5)^2] = sqrt[0.25 + 0.25] = sqrt[2 * 0.25] = +/- (1.4 * 0.5) = +/- 0.7

So now, when you are creating a combined average daily value you are starting with an uncertainty of each data point at +/- 0.7 instead of +/- 0.5. So combining the first two stations will give an uncertainty of sqrt(2) * .7 = 1.0

You’ve already started off with an uncertainty larger than the difference you are trying to see. Is the trend downward or upward?

0

Pat Frank

Reply to Tim Gorman

November 8, 2020 12:18 pm

Good description, Tim. If each of the 10000 ±0.5 C uncertainty temperatures is used in some step-wise calculation, then the final uncertainty is ±50 C, where the ±0.5 C uncertainty is a calibration measure of systematic errors stemming from uncontrolled variables.

If the 10000 ±0.5 C uncertainty temperatures are directly averaged, then the uncertainty in the average is the root-mean-square (rms). That’s sqrt[(10000*(0.5)^2)/9999] = ±0.5 C.

In either case, the uncertainty never averages away.
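As a quick check of the two figures in this comment, the expressions can be evaluated directly. The sketch below only reproduces the arithmetic as written, for 10000 measurements each carrying ±0.5 C; the variable names are chosen here for illustration.

```python
import math

n = 10_000
u = 0.5    # deg C, calibration uncertainty applied to every measurement

# Step-wise use of all n values: root-sum-square of n equal terms.
stepwise = math.sqrt(n * u**2)

# The rms expression given above for a direct average: sqrt[n * u^2 / (n - 1)].
rms = math.sqrt(n * u**2 / (n - 1))

print(f"step-wise: +/-{stepwise:.0f} C")
print(f"rms:       +/-{rms:.2f} C")
```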

0

AndyHce

Reply to Pat Frank

November 9, 2020 2:38 pm

Pat,
By directly averaged do you mean add the 10,000 measurements, then divide by 10,000?

0

Pat Frank

Reply to AndyHce

November 11, 2020 4:57 pm

Andy: yes.

Sorry for the delay in replying. This post didn’t load into my browser for 2 or 3 days. Until today, in fact.

0

Nick Stokes

Reply to Tim Gorman

November 8, 2020 1:04 pm

“the final uncertainty of the combined group will be…”
That is the uncertainty of the sum. But to get the average, you divide the sum by 10000 (which has no uncertainty). So the uncertainty of the average is 0.005°C.

0

Pat Frank

Reply to Nick Stokes

November 11, 2020 5:05 pm

Nick, “So the uncertainty of the average is 0.005°C.”

Really funny, Nick.

The final uncertainty in an empirical average is the rms.

If the calibration systematic uncertainty of each measurement is ±0.5 C, then the uncertainty in the average of 10000 measurements is sqrt{[(10000)*(0.5)^2]/9999} = ±0.5 C.

The uncertainty never gets smaller, and never averages away.

0

Tim Gorman

November 8, 2020 5:45 pm

Nick,

“That is the uncertainty of the sum. But to get the average, you divide the sum by 10000 (which has no uncertainty). So the uncertainty of the average is 0.005°C.”

No. You’ve never bothered to study up on Dr. Taylor’s textbook, have you? Perhaps this one will work better for you:

https://www.nbi.dk/~petersen/Teaching/Stat2016/Week2/AS2016_1202_SystematicErrors.pdf
=========================================================
“Even with infinite statistics, the error on a result will never be zero!
Such errors are called “systematic uncertainties”, and typical origins are:
• Imperfect modeling/simulation
• Lacking understanding of experiment
• Uncertainty in parameters involved
• Uncertainty associated with corrections
• Theoretical uncertainties/limitations

While the statistical uncertainty is Gaussian and scales like 1/sqrt N,
the systematic uncertainties do not necessarily follow this rule.

When statistical uncertainty is largest, more data will improve precision.
When systematic uncertainty is largest, more understanding will improve precision.”
================================================================

Uncertainty doesn’t average. When averaging the temperatures you divide by the number of temperatures. You do *NOT* divide by N or sqrt of N when totaling uncertainty. The total uncertainty is the root sum square of the uncertainties. You do not average uncertainty, you calculate the quadrature sum of the uncertainties.

Error is not uncertainty. Why is this so hard to get across to so many on this blog?

0

Nick Stokes

Reply to Tim Gorman

November 8, 2020 7:01 pm

“Perhaps this one will work better for you:”
OK, I look at Petersen’s notes here (p 12-13). And what does it say? With big red letters saying to commit to memory?:
“What is the uncertainty on the mean? “
Uncertainty of mean= σ/√N
where σ is your 0.5 and N is your 10000. As you calculated, the uncertainty of the sum is σ*√N. And to get the uncertainty of the mean, divide by N.

As of course you must. Dividing by N is just rescaling. If you rescale a number, you must equally rescale its uncertainty.
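The rescaling step can be sketched numerically. The first part below just evaluates σ*√N and then divides by N. The second part is a small simulation which ASSUMES the errors are independent, zero-mean and Gaussian with standard deviation σ; that assumption is the very point in dispute in this thread, so the simulation illustrates the formula only for that case. The smaller N, the seed and the trial count are choices made here for speed.

```python
import math
import random

n = 10_000
sigma = 0.5                        # deg C, per-measurement uncertainty

u_sum = sigma * math.sqrt(n)       # uncertainty of the sum
u_mean = u_sum / n                 # the same figure rescaled by the exact divisor n

print(f"sum:  +/-{u_sum:.0f}")
print(f"mean: +/-{u_mean:.3f}")

# Empirical check with a smaller n.
# ASSUMPTION: independent, zero-mean Gaussian errors with standard deviation sigma.
random.seed(2)
n_small, trials = 100, 2000
means = [sum(random.gauss(0.0, sigma) for _ in range(n_small)) / n_small
         for _ in range(trials)]
grand = sum(means) / trials
sd_of_mean = math.sqrt(sum((m - grand) ** 2 for m in means) / (trials - 1))

print(f"simulated sd of the mean (n={n_small}): {sd_of_mean:.3f}")
print(f"sigma / sqrt(n) for n={n_small}:        {sigma / math.sqrt(n_small):.3f}")
```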

0

Tim Gorman

Reply to Nick Stokes

November 9, 2020 5:32 pm

I know you’ve been schooled on this by Jim Gorman. Apparently it went in one ear and out the other.

Uncertainty has no mean therefore the term “uncertainty on the mean” is meaningless for uncertainty. In order to have a mean you must have a probability distribution. Uncertainty has no probability distribution. You can’t pick a point in the uncertainty interval and say “this point has the highest probability of being the true value”.

Uncertainty on the mean has to do with the central limit theorem. If you have a random variable with a Gaussian distribution then you can reduce the uncertainty on the mean statistically.

Everything you write is based on having a random variable with a probability distribution – i.e. random error when taking multiple measurements of the same thing using the same measurement device. In this case you can reduce the uncertainty of the mean using the central limit theorem.

σ is a statistical description of a probability distribution which has a mean.

If I give you a temperature of 20 deg C +/- 0.5 deg C can you tell me what point in the interval of 19.5 deg C and 20.5 deg C has the highest probability of being the true value?

If you can’t then that means there is no probability distribution associated with that interval – which leads to there being no mean and no standard deviation (i.e. σ). With no mean how do you then reduce the uncertainty of the mean?

0

Nick Stokes

Reply to Tim Gorman

November 10, 2020 1:33 am

“Uncertainty has no mean therefore the term “uncertainty on the mean” is meaningless for uncertainty.”
You quoted Petersen’s notes as the authority. And I quoted them verbatim (p 12):
“What is the uncertainty on the mean? And how quickly does it improve with
more data?”
Answer, σ/√N
Correctly, he makes no requirement that anything be normally distributed.
This is basic stats 101.

0

Tim Gorman

Reply to Nick Stokes

November 10, 2020 6:13 am

Nick,

Please read the entirety of Petersen:

“While the statistical uncertainty is Gaussian and scales like 1/sqrt N ,
the systematic uncertainties do not necessarily follow this rule”

Systematic uncertainties do not necessarily follow this rule.

Once again you are confusing ERROR with UNCERTAINTY. Error is a statistical uncertainty with a probability distribution. Systematic uncertainty is *NOT* error and has no probability distribution.

You also ignored the fact that he specifically mentioned GAUSSIAN distribution, i.e. NORMAL distribution.

Do you have a reading disability we need to know about?

0

Pat Frank

Reply to Tim Gorman

November 11, 2020 5:11 pm

Tim, Nick is just making a diversion. He knows you’re right.

Look, he wrote, “If you rescale a number, you must equally rescale its uncertainty”

Nick’s rescaling a number is not the same as taking an empirical average. But his choice of phrasing makes it seem as though it is the same. He’s trying to shift the ground of the argument, so as to lead you into a skewed contention from which he can eventually emerge triumphant.

0

AndyHce

Reply to Tim Gorman

November 9, 2020 2:47 pm

Making the assumption that few of the temperature measurements of the past 120 to 150 years are fraudulent, i.e. people were measuring and recording to the best of their ability and the quality of the circumstances, not trying to mislead, then each measurement, for the sake of this discussion, can be taken to be accurate to +/- 0.5C. Of course in reality some instruments must have been incorrect enough that their reading had a much larger error, but assume that those were few enough to have little effect upon the total.

The straightforward calculations of yearly global averages vary only a few degrees from year to year over the entire period. The importance of such a temperature variation for humans, frogs, or life in general, is not part of the current question.

What is the meaning of two or more digits of uncertainty relative to the calculated averages made to the nearest +/- 0.5C? Can it be claimed that the “real” global average might differ +/- 20C, 30C, or more, from what is calculated? I get the claim from Pat Frank’s articles that the uncertainty isn’t equivalent to degrees of temperature variance, but to say simply that the uncertainty is some value does not inform what that means. If it does not have a real world meaning then it is surely less relevant than calculating the number of angels on a pin head.

0

Tim Gorman

Reply to AndyHce

November 9, 2020 6:26 pm

Uncertainty *has* a real world meaning. Think about a civil engineer designing a bridge. He calculates the forces on the various trusses based on all kinds of different loads (e.g. vehicles, wind, ice, etc). Then he can specify the trusses needed to match the compression, tension, and torsional forces that are calculated. Someone then has to order the trusses that are specified in the “catalogs” by ability to withstand a certain force level +/- an uncertainty interval. Trusses whose maximum negative uncertainty interval is *higher* than the forces applied to the trusses had better be the ones ordered in order to provide a safety margin for the bridge.

In this case the uncertainty interval of the trusses very much has a real world meaning.

The same thing applies when combining independent temperature readings. The uncertainty rules apply here just as they do anywhere else and they give the result a real world meaning. You may not like what that real world meaning says, but it is still the real world meaning. The fact that combining the uncertainty intervals of independent measurements to generate daily, monthly, or annual averages gives you a total uncertainty interval that is larger than the differences you are trying to find is why so many climate scientists just ignore the rules about uncertainty. If they didn’t ignore them then they would have to admit that their results are meaningless.

The term “variance” has meaning in a probability distribution. Uncertainty has no probability distribution. Probability distributions require one to assign a probability to each possible population member. If I tell you the temperature is 20 deg C +/- 0.5 deg C can you tell me which point in the interval of 19.5 deg C to 20.5 deg C has the highest probability of being the true value?

If you can’t do so, isn’t the reason that there is no probability distribution that describes the interval? If there is no probability distribution then there can be no standard deviation, no variance, and no mean.

Since each station is independent of all others each station has its own temperature curve over a year. If you consider the curve to represent a statistical population with a mean and a variance then how do you combine the populations from Station 1 and Station 2? The climate scientists just average the mean temperature from the curve of Station 1 with the mean temperature of the curve for Station 2 and assume that tells the whole story. But this is wrong. The variances of the two curves can be significantly different and that must be handled somehow as well if you truly want to look at the “climate”.

So how should you combine these multiple independent data sets? Darned if I know. Do you?

0

windlord-sun

Reply to Tim Gorman

November 9, 2020 7:23 pm

Hi Tim,

I had rather decided not to respond to the ongoing discussion about averaging, which is running strong in this thread in both complexity and earnestness, since my pointed posts to the contrary, which answer your question, have already been proffered several times and met with silence.

However, I have changed my mind. Your pointed question can not be ignored; it is right to the crux of the matter.

Consider flipping the worldview: instead of the effort to arrive at global or regional “Average Temperature,” instead face the fact we can’t get there with models, gridding, estimating, etc, and instead trade up to “examine 1200 individual cases.”

In addition to “we can’t get there,” add “… and the illusion that we can construct a model — of one statistic: global average — of the prior 150 years, while tempting, is in fact misleading. We can’t go on illusions.

Here’s my graph, once more, of USHCN RAW:

http://theearthintime.com

That sine curve is NOT intended to arrive at “an average.” Instead, it is an amalgam of a spaghetti graph. Imagine an animation that would lay down each of 1200 curves on top of each other. Not only does “average” not serve to describe the nature of the result, even “trend” falls short, because we are used to seeing a straight trendline, which in the case of this “Method” hides the crucial reality: temp rises and falls in a sine curve with oscillations inside oscillations.

An individual weather station might have inaccuracy [uncertainty] within and of itself. However, 120 years of recording 43,800 times in a sufficiently accurate yet consistent manner, reveals one giant truth. “Does this station’s plot display any abnormal warming or cooling?” Now look at each station’s curve, and ask the same question 1200 times.

I’ll say no more for now.

::::: windlord :::::

0

Tim Gorman

Reply to windlord-sun

November 10, 2020 6:45 am

windlord:

“Consider flipping the worldview: instead of the effort to arrive at global or regional “Average Temperature,” instead face the fact we can’t get there with models, gridding, estimating, etc, and instead trade up to “examine 1200 individual cases.””

I agree with this. There is no such thing as a “global climate”. There is barely even such a thing as “regional climate”. Climate is mostly local. Take Nebraska, Iowa, and Kansas – all part of the same region but clearly having different climates. What is typically considered the “Central Plains” have widely varying climates based on longitude and latitude. Locations as close as Lincoln, NE and Topeka, KS have different climates.

“In addition to “we can’t get there,” add “… and the illusion that we can construct a model — of one statistic: global average — of the prior 150 years, while tempting, is in fact misleading. We can’t go on illusions.”

I agree totally.

“which in the case of this “Method” hides the crucial reality: temp rises and falls in a sine curve with oscillations inside oscillations.”

Again, correct.

“reveals one giant truth. “Does this station’s plot display any abnormal warming or cooling?” Now look at each station’s curve, and ask the same question 1200 times.”

In which case you will probably find some that show some warming, some that show some cooling, and some that show nothing. You still can’t combine these into one whole.

0

Pat Frank

Reply to windlord-sun

November 11, 2020 6:37 pm

wl-s’s graph of USHCN raw has no uncertainty bars on the temperature points, from the known systematic measurement error.

Put ±0.5 C uncertainty bars on the points, and the sine curve goes through a mean that has no known physical meaning.

He doesn’t know whether the sine curve is real or artifactual. His entire analysis is based upon a blind assumption of accuracy. An assumption that we know, for a fact, is wrong.

0

windlord-sun

Reply to Pat Frank

November 11, 2020 8:19 pm

That is absolutely stunning. Seriously, my head is swirling and my jaw is on the floor.

I’ve known for quite a while that the obsession with “Global Average Temperature” was virulent and blinding, but this exchange with Pat Frank has me nearly immobile. I can barely type.

Pat Frank, are you willing to state what your bottom line, basement, foundational quest is with regard to climate? What question are you determined to answer?

To anyone else, all I can say is, 1218 stations reporting 50 million recordings of TMAX alone over 120+ years…

It does not matter if my station has had protocol errors;
It does not matter if my station has a gauge out of calibration;
We have been consistent:
I’ve given NOAA my 40,000+ dailies;
I’ve graphed them;
They show the sine curve that all of nature loves and follows;

Viewing 1218 graphs, one by one, even if some or all of them are high or low within themselves, reveals the ultimate (and only) measurement flow of temperature. “Accuracy” is irrelevant.

[recreation begins…]

Is there any abnormal warming?
Not at my station.

I asked the same question at a conference of 436 of my fellow weather station directors. A few said yes. A few said they saw drastic cooling. The rest saw their data forming an oscillation with a period of about 60 years, a sine wave.

We all agreed to not put our data in a cement mixer in quest for an “Average,” since, being rational scientists, we know that is both impossible vis a vis credibility, and not needed.

We agreed to immediately start up an email thread if any of us detects any abnormal warming.

[recreation ends…]

0

Tim Gorman

Reply to windlord-sun

November 12, 2020 6:03 am

wls,

“To anyone else, all I can say is, 1218 stations reporting 50 million recordings of TMAX alone over 120+ years…”

All I can say is that Pat is correct. If you put 0.5C error bars on that graph then you can’t even see most of the sine wave. Try it. Assume the mean is 65.5F. Since 0.5C is 0.9F, the error bars would run from 66.4F to 64.6F. Print the graph and then cut a strip of paper as wide as the span from 64.6F to 66.4F. Now use that strip to cover up that area on the graph. Most of what you see on the graph disappears. You won’t be able to see enough to generate a sine wave.

Now, if you have actually combined 120 stations to obtain points on the graph then your actual uncertainty interval will approach +/- 5degC. So the uncertainty bars will run from about 75degF to 56degF.

That means that *any* point within that range can be the true value, you just don’t know. You could generate anything you want out of that kind of range of true values.

I know this is hard to grasp. But it is fundamental to understanding why combining a bunch of single independent measurements of totally different independent things using totally different measuring devices, all without regard to systematic uncertainty, is just destined to give a result that is meaningless.

Is there any doubt in your mind that uncertainty of independent measurements of independent things using independent measurement devices adds as root sum square?

“It does not matter if my station has had protocol errors;
It does not matter if my station has a gauge out of calibration;
We have been consistent:
I’ve given NOAA my 40,000+ dailies;
I’ve graphed them;
They show the sine curve that all of nature loves and follows;”

Protocol errors are ERRORS, not uncertainty. Calibration is ERROR, not uncertainty. Error and uncertainty are not the same thing. Pat has tried to explain this over and over but it doesn’t seem to be getting through to people. Error is measuring a 2″x4″ as 7.92′ when it is actually 8.08′ long. Uncertainty is going to a huge pile of 2″x4″ studs and selecting ten of them to build a wall for your house. They won’t all be exactly 8′ long. Some will be shorter and some longer. That’s uncertainty. When you build the wall and attach your sheet rock you’ll have gaps where the longer studs are (that’s what trim boards are for – to cover the gaps) or you won’t be able to fit the sheet rock where the shorter studs are. That isn’t an issue of measurement error, calibration error, or any other kind of error, it’s uncertainty.

At least at your single station you have been using the same measurement device but you don’t measure the same thing every day. Environmental conditions (wind, humidity, clouds, etc) all add a degree of uncertainty to each daily measurement in addition to the inherent uncertainty added by your measurement device itself. If your overall uncertainty for each measurement is 0.5C then you still must add that uncertainty bar to your measurements and see if you can actually distinguish anything. And when you are combining multiple measurements into an annual average remember the root sum square rule for uncertainty.

If NOAA, NASA, and all the so-called climate scientists were honest they would have to admit that they simply can’t tell what year has been the hottest when they are trying to distinguish 0.1C or 0.01C differences, not when their uncertainty interval is more than 1C!

0

windlord-sun

Reply to windlord-sun

November 12, 2020 9:13 am

Tim Gorman,

The curving wave at my website, which is an amalgam of 50-million recordings, and most of the waves of the 1200 individual sites in USHCN, take the shape of Nature’s favorite form: an oscillating sine wave, accompanied by harmonics of longer and shorter periods.

Per your paragraph cited below, that is a coincidence?

“At least at your single station you have been using the same measurement device but you don’t measure the same thing every day. Environmental conditions (wind, humidity, clouds, etc) all add a degree of uncertainty to each daily measurement…”

Or, if not a coincidence, the fluctuation of ‘conditions’ — which you would think would be a random distribution — instead wonderfully reveals they occur in a regular oscillation?

“And when you are combining multiple measurements into an annual average ….”

Again, you avoid my point. I am not obsessed with “an average.” A spaghetti graph, or amalgam, does not seek to claim an average. It illuminates the curving trend visually.

What is your explanation for the formation of the affecting conditions into an organic shape?

0

Pat Frank

Reply to windlord-sun

November 12, 2020 10:46 am

wl-s, “Pat Frank, are you willing to state what your bottom line, basement, foundational quest is with regard to climate? What question are you determined to answer?”

My original intention was to discover for myself whether the IPCC claim is true, that human CO2 emissions are warming the climate.

That was in 2001, when the TAR came out. I decided to find out for myself, and began to read the primary literature.

After two years of study, it was clear the IPCC could not possibly know what they claimed to know.

In 2006 I read Brohan, et al., “Uncertainty estimates in regional and global observed temperature changes: A new data set from 1850,” J. Geophys. Res. 111, D12106, doi:10.1029/2005JD006548.

That paper revealed that they assumed all temperature measurement error is random, and averages away to insignificance.

That was a shock. It led me to look for and examine published field calibration experiments. Published calibrations showed very significant systematic sensor errors coming from uncompensated environmental variables.

On discovering this, it became clear that the IPCC (and Brohan, et al.) could not possibly know what they claimed to know about air temperatures.

I started out just wanting to know, wl-s. My critical view came honestly from what I discovered.

I’ve learned from direct experience (much documented) that the entire field of consensus climatology ignores data quality. They don’t understand it, they’re not interested to know about it, they reject with hostility any effort to draw their attention to it.

So, you can go ahead and fit your sine curves to low-resolution data, wl-s. No matter how pretty your curves are, they’re physically worthless.

You wrote, ““Accuracy” is irrelevant.” Incredible.

That position makes everything you do irrelevant.

0

Pat Frank

Reply to windlord-sun

November 12, 2020 5:02 pm

wl-s, “That is absolutely stunning. Seriously, my head is swirling and my jaw is on the floor.

“I’ve known for quite a while that the obsession with “Global Average Temperature” was virulent and blinding, but this exchange with Pat Frank has me nearly immobile. I can barely type.”

Bite the bullet of low-resolution measurements, wl-s. Nothing anyone does will improve past measurements that are inherently poor.

Contrary to your writing, I am not obsessed with global average temperature. I’m focused on individual measurements of air temperature, such as are taken using a LiG thermometer in a Stevenson screen or an MMTS sensor, at a meteorological station.

Individual air temperature measurements, as they are recorded off individual instruments. Raw. Unadjusted. Unchanged. As-is. Each one, each time.

The lower limit of calibration uncertainty in Raw. Unadjusted. Unchanged. As-is. Each one, each time. air temperature measurements from each and every one of the USHCN stations is ±0.5 C.

There’s no getting around it. No sine wave fit will change that fact. A fit through any air temperature trend is not the most probably true mean.

Systematic error is not random. It is not normally distributed. It does not have a zero mean. Fits through temperature trends that include systematic error are not physically reliable.

Period. End of story.

Going forward, an air temperature compiled using low-resolution measurements will necessarily and ineluctably produce an unreliable record.

The entire field is pursuing a chimera. And they *will not* see it.

0

AndyHce

Reply to Tim Gorman

November 10, 2020 12:20 am

Tim,
You wrote
“If I tell you the temperature is 20 deg C +/- 0.5 deg C can you tell me which point in the interval of 19.5 deg C to 20.5 deg C has the highest probability of being the true value?

If you can’t do so, isn’t the reason that there is no probability distribution that describes the interval? If there is no probability distribution then there can be no standard deviation, no variance, and no mean.”

The question, in my thought experiment, is about something that does have a distribution and a trend. The distribution is over years, the trend is in the second decimal place of the yearly average temperature calculation. The question is not about the probability of each value that makes up the trend. It is not about the likely meaninglessness of the yearly global average calculation vis a vis the fate of the world, no matter its precision or accuracy. It is about the fact that there is a consistent trend (in this thought experiment) in the result, as there is in the data published by the “climate agencies” even though that trend is below the precision and the accuracy of the data.

If the individual measurements are unbiased, as specified, it seems there should be a random distribution of those values to the right of the decimal. An apparent trend could be random, just as many consecutive tosses of a fair coin producing the same value IS random, but a trend that continues long enough suggests that the coin is not, in fact, fair. The probability that it is not fair grows larger and larger as the same result is produced again and again (yes, the improbability grows, but the fact of the coin’s balance doesn’t change, regardless of what the numbers suggest).

Can the trend in the calculated average indicate that a trend beyond the ability to measure directly does actually exist and is being detected? If not, is there some other rigorous explanation for the trend? I suspect that for the climate data it is just a random result that casually appears to have a trend, but that doesn’t seem to be the way the believers see it. Can there be any answer, one way or the other?

0

kribaez

Reply to AndyHce

November 10, 2020 3:55 am

Andy,

A lot of Tim’s comments on this subject will damage your mental health. When taking multiple measurements over a range of values using an unbiased thermometer with a precision error of plus or minus 0.5, the precision error for each individual measurement is sampled from a uniform distribution U[-0.5, +0.5]. This is NOT an approximation. It is an exact distribution. The only condition is that the range of values sampled is larger than the range of the uniform distribution. Since in the real world the temperature variation is generally several orders of magnitude larger than this uncertainty range over the annual cycle, it is a condition which is generally easily met. However, it would make little difference whether the distribution was approximated (as it often needs to be) or exact, as it is in this case. The variance of a sample dataset is an arithmetic property of the dataset. It does not need to be attached to a distribution for us to be able to make use of it in theoretical calcs.

The uncertainty associated with this Uniform distribution is easily calculated from theory. The variance of the individual precision error = 1/12 = 0.0833. The standard deviation is the sqrt of this. The variance of the MEAN temperature over a one year period with daily measurements is then (.0833) / 365 = 0.000228 . The standard deviation is then sqrt(.000228) = 0.01511 . This can equally be obtained using the formula correctly stated by Nick Stokes above:
Standard Deviation of the mean = σ/√N.
The uncertainty attached to this mean value arising from precision error is now closely approximated by a Normal distribution.

However, in your 10,000 thermometer example, you were interested in detecting a CHANGE in mean values from the starting point.
The variance of the difference between any two (annual) means = 2 x Var(mean) = 2×0.000228 = 0.000457.
The standard deviation of the difference between the two (annual) means is the sqrt(0.000457) = 0.021 , which I hope is the same sd value that I quoted previously when working through your example. For just a single well, therefore, you would expect to be able to detect a significant change in temperature after just 5 years ( = 0.05 temperature change as per your assumption). With 10,000 thermometers all varying differently, the standard deviation on the difference between two means is reduced by a factor of 1/√N, so the standard deviation becomes 0.021/100 = 0.00021.
So your instincts about “values to the right of the decimal” are correct, and in your idealised example, you could detect a uniformly applied shift in temperature of 0.01 degrees (i.e. after the first year).

Please note that while Tim is making assertions on stuff that he evidently does not understand, the above calculation is readily testable using a Monte Carlo.
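A minimal sketch of such a Monte Carlo check follows. It covers only the idealised case described above: an unbiased thermometer read to the nearest degree, with each day's rounding error independent of the others. The seasonal cycle, the day-to-day noise, the trial count and the function names are assumptions made for illustration and come from nowhere in this thread. It says nothing about systematic error.

```python
import math
import random
import statistics

def annual_mean_rounding_error(rng, days=365):
    """Mean of (reading - true value) over one year, thermometer read to the nearest degree."""
    total = 0.0
    for day in range(days):
        # Hypothetical true temperature: seasonal cycle plus day-to-day weather noise.
        true_temp = 15.0 + 10.0 * math.sin(2 * math.pi * day / days) + rng.gauss(0.0, 3.0)
        reading = round(true_temp)  # precision error lies in [-0.5, +0.5]
        total += reading - true_temp
    return total / days

def simulate(trials=2000, seed=1):
    """Standard deviation of the annual-mean rounding error across many simulated years."""
    rng = random.Random(seed)
    errors = [annual_mean_rounding_error(rng) for _ in range(trials)]
    return statistics.pstdev(errors)

theory = math.sqrt((1.0 / 12.0) / 365)  # sigma / sqrt(N), about 0.0151
simulated = simulate()
print(f"theory    {theory:.5f}")
print(f"simulated {simulated:.5f}")
```

Under these assumptions the two printed numbers should land close to each other. Changing the assumed cycle or noise level changes little as long as the true temperatures range over several whole degrees.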

I should re-emphasise that this is ONLY trying to deal with the precision question. There are still far more important sources of uncertainty in the adjusted temperature records.

0

Tim Gorman

Reply to AndyHce

November 10, 2020 7:06 am

Andy,

“The question, in my thought experiment, is about something that does have a distribution and a trend. The distribution is over years, the trend is in the second decimal place of the yearly average temperature calculation.”

If your instruments only have a precision of 1 decimal place then there is no way to determine the second decimal place. So no matter how long of a time period you use, you are only fooling yourself that you can see a second decimal place.

“even though that trend is below the precision and the accuracy of the data.”

You can only see a trend below the precision and accuracy of the data by ignoring the rules for significant digits in the data. If there is only one significant digit then no amount of calculation can extend that. You will be able to see a trend in the first decimal point but not beyond.

“If the individual measurements are unbiased, as specified, it seems there should be a random distribution of those values to the right of the decimal.”

Nope, you are confusing error with uncertainty. Error has a random distribution, uncertainty does not. If you can’t assign a probability to each point in the uncertainty interval then how do you know you have a random distribution?

You *can* see trends even in measurements that have uncertainty. But the difference MUST be outside the range of uncertainty. Take the 20C measurement again. If it has a +/- 0.5C uncertainty then until you get outside that uncertainty range, 19.5 to 20.5, you don’t know if there is a difference or not. If you begin to measure 21C with +/- 0.5C then you are on the cusp of beginning to see an actual increase you can identify for sure. But that increase of 1C in your reading is far beyond trying to identify a 0.1C or a 0.01C difference trend.

Add in the fact that when combining 100 different, independent populations with uncertainty intervals, the total uncertainty interval grows by sqrt(100) (i.e. 10), an entire order of magnitude. Now you not only have to see a difference of 1C in your measurements, you have to see a difference of 2.5C before you are sure you’ve actually seen an increase.

“Can the trend in the calculated average indicate that a trend beyond the ability to measure directly does actually exist and it is being detected?”

Follow the rules of significant digits and you can answer this yourself. If you have measurements to the tenth of a degree then how can the average give you more than a tenth of a degree? If your average turns out to be a repeating decimal does that mean that the average now has an infinite precision?

0

Tim Gorman

Reply to AndyHce

November 10, 2020 7:49 am

kribaez,

“A lot of Tim’s comments on this subject will damage your mental health. When taking multiple measurements over a range of values using an unbiased thermometer with a precision error of plus or minus 0.5, the precision ERROR for each individual measurement is sampled from a uniform distribution U[-0.5, +0.5]. This is NOT an approximation.” (caps are mine, tg)

You have already started off describing error, not uncertainty.

“It is an exact distribution. The only condition is that the range of values sampled is larger than the range of the uniform distribution.”

Uncertainty is not a probability distribution, not even a uniform one.

“Since in the real world the temperature variation is generally several orders of magnitude larger than this uncertainty range over the annual cycle”

You have already made the mistake of claiming that the uncertainty interval for a combined set of independent populations never grows with the combination. As I pointed out, if you come up with a daily average using a maximum and minimum temperature using an instrument with a +/- 0.5C uncertainty range, your uncertainty associated with the average grows from +/- 0.5C to +/- 0.7C. If you combine 365 daily averages from *one* station to get a yearly average then your uncertainty for the yearly average grows by sqrt(365) = 19. Thus your uncertainty interval becomes +/- (19 * 0.7) = +/- 13C. If your calculated average is 20C then the true value could be between 7C and 33C.
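The arithmetic in the paragraph above can be written out as a short sketch. It only reproduces the root-sum-square steps as this comment states them; whether that rule is the right one to apply to an average is the very point in dispute in this thread. The variable names are made up for illustration.

```python
import math

u_reading = 0.5  # stated uncertainty of a single reading, deg C

# Daily value built from a maximum and a minimum reading, combined in quadrature.
u_daily = math.sqrt(u_reading**2 + u_reading**2)

# Growth factor claimed in the comment for combining 365 daily values.
growth = math.sqrt(365)
u_yearly = growth * u_daily

print(round(u_daily, 2), round(growth, 1), round(u_yearly, 1))
# prints: 0.71 19.1 13.5
```

The comment rounds the intermediate values to 0.7 and 19 before multiplying, which is where its figure of about 13C comes from.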

“The uncertainty associated with this Uniform distribution is easily calculated from theory.”

When you assigned a probability distribution to an uncertainty interval you made a wrong step. If you can’t tell me which point in an interval from 19.5C to 20.5C is most likely the true value then how do you even know you have a uniform distribution? It could be a skewed Gaussian distribution, a Poisson distribution (discrete), or an exponential distribution (continuous) of some kind. The fact is that an uncertainty interval has no probability distribution, period. When you artificially assign one to the interval it suddenly becomes something other than an uncertainty interval.

“However, in your 10,000 thermometer example, you were interested in detecting a CHANGE in mean values from the starting point.”

You already admitted that you can’t tell me the mean (the most likely value in an interval) of an uncertainty interval so how can you have a mean?

You just keep on pretending that error is uncertainty. That tells me you are a mathematician or a computer programmer and not an experimental scientist.

“With 10,000 thermometers all varying differently, the standard deviation on the difference between two means is reduced by a factor of 1/√N, so the standard deviation becomes 0.021/100 = 0.00021.”

From Peterson again:
=====================================
While the statistical uncertainty is Gaussian and scales like 1/sqrt N, the systematic uncertainties do not necessarily follow this rule.

When statistical uncertainty is largest, more data will improve precision.
When systematic uncertainty is largest, more understanding will improve precision.
====================================

Statistical uncertainty is ERROR. Systematic uncertainty is not error.

“Please note that while Tim is making assertions on stuff that he evidently does not understand, the above calculation is readily testable using a Monte Carlo.”

Please, when you have to convert uncertainty to error in order to come up with some kind of probability distribution it is clear that it is you that doesn’t understand. You already admitted you can’t tell me which point in an uncertainty interval is most likely the true value so how can you then turn around and assume there is a probability distribution associated with the uncertainty interval? Even assuming a uniform distribution is assuming something you don’t know just to make the uncertainty interval fit into your world view of everything being a random variable with a probability distribution.

0

AndyHce

November 12, 2020 6:25 am

Tim,
“If you can’t assign a probability to each point in the uncertainty interval then how do you know you have a random distribution?”

In the case of the +/-0.5C uncertainty of individual measurements, which is due to the basic fact of only being able to measure to the nearest integer, each possible value in the interval – to + surely has an equal probability of being the “true” temperature. How many possible values are there? Is there a quantum limitation to the smallest change of temperature possible? Probably, but the thermometer cannot measure quantum quantities.

Anyway, kribaez seems correct in asserting the distribution is uniform; what else could it be? It has been many years (of non-use) since my statistics classes so, without significant study, I can’t verify the calculations presented. They may indeed be totally correct for some circumstances but I don’t see how they could apply to independent measurements made with many different instruments, in many different environments, with many unknown factors between them. For kribaez, I acknowledge that my not understanding something doesn’t affect its validity, plus or minus.

The so small calculated variance kribaez shows is too strange to be real for the temperature record. The actual variance of interest seems more likely to be the difference between the largest and smallest measurement. Given that, a change in one year surely can’t be given significance as indicating any actual change, and five years still seems much too little data for a conclusion. However, a continued trend over some longer period might be very difficult to explain away – even if the calculated values aren’t the real amount of trend. Single bit analogue to digital, or digital to analogue converters don’t measure anything per sample or small number of samples, but they steer the data in the correct direction, ultimately producing very good results.

You wrote
“In this case the uncertainty interval of the trusses very much has a real world meaning.”

That uncertainty interval is expressed in terms of real mechanical forces. Choosing parts that equal or exceed the uncertainty of the stresses that are expected must be done by using actual measurements of those same mechanical forces. The uncertainty is not a unit-less number, the trusses must be selected in terms of some units such as foot-pounds of whatever type of stress. Therefore the bridge example cannot be comparable to something which you say has no units of measure that relate to something that can be found in the real world, outside of the calculations. If it can be said what it isn’t, there must be something that it is.

Regardless, I apparently am still not getting my point across. The answer has to be something other than it exceeds the certainty level so it has no meaning if the result is not random.
Take it as stipulated that there is a significant uncertainty associated with each measurement and that the combination of measurements in calculations does not reduce the uncertainty or even that it greatly increases the uncertainty.

In making any series of calculations it is common practice to use intermediate values with more precision than is significant. There is a very good reason for this. If each step is only to the precision of the data, rounding errors in multiple steps might make the final product larger or smaller than reality. By calculating to more digits, and not rounding until all calculations are done, one obtains the more correct result.
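A small sketch of the rounding point, using made-up numbers: three hypothetical intermediate results are summed, once with each value rounded to one decimal before adding and once with rounding left to the end.

```python
values = [0.44, 0.44, 0.44]  # hypothetical intermediate results, not measurements

# Round every intermediate value to one decimal place, then add.
rounded_early = round(sum(round(v, 1) for v in values), 1)

# Carry the extra digits through the sum and round only once at the end.
rounded_late = round(sum(values), 1)

print(rounded_early, rounded_late)
# prints: 1.2 1.3
```

The early-rounded result is off by a tenth only because of the order of the rounding steps, which is the reason for carrying extra digits until the calculation is finished.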

In general, unless I am confused and somehow only work with special cases when I calculate things, in a series of calculations, the digits beyond the significant digits of the operands might each be any value from 0 to 9. If not, there must exist an arithmetic proof of the restrictions.

If the calculation for “global temperature” to two decimal digits comes to 15.01 one year, then 15.02 the next, then 15.03 the next, increasing by .01 each year – without any manipulation to produce such a series – then even though the places to the right of the decimal are not meaningful or valid as a measure of temperature, such a result is highly unlikely to be random. If not random it comes from something in particular.

The trend need not be so even, the values might be 2, 3, etc., varying from year to year. Some years might even produce lower value digits, a real trend isn’t necessarily uniform. However, if they go in one direction most of the time, either upward or downward, they produce a trend (even if a trend of nothing but the numbers).

Generating a “random” set of temperatures, of the same total number of values and all within the same interval as the real temperatures, then doing the same calculations of average with them as are done with the temperature measurements, will almost surely produce a random set of values for the intermediate digits of each “year’s” worth of fake temperatures. Thus for this random set there would not be a trend as discussed in the previous paragraph. Again I acknowledge there might be some arithmetic proof that explains there can be a “false” trend, just as there are certain mathematical tricks that can take a person through a series of steps, using many different starting values, yet always end up with the same ultimate result, but it seems unlikely.
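The experiment described in this paragraph could be sketched roughly as follows. The interval of -10 to 40, the 30 "years", the 365 readings per year and the helper names are all assumptions chosen for illustration; the fake readings are drawn independently, so there is no underlying trend by construction.

```python
import random

def yearly_means(years=30, readings_per_year=365, low=-10.0, high=40.0, seed=7):
    """Average of whole-degree fake readings for each 'year', kept to two decimals."""
    rng = random.Random(seed)
    means = []
    for _ in range(years):
        readings = [round(rng.uniform(low, high)) for _ in range(readings_per_year)]
        means.append(round(sum(readings) / readings_per_year, 2))
    return means

def longest_rising_run(values):
    """Length of the longest run of consecutive year-on-year increases."""
    best = run = 0
    for prev, cur in zip(values, values[1:]):
        run = run + 1 if cur > prev else 0
        best = max(best, run)
    return best

means = yearly_means()
print(means[:5])
print("longest run of year-on-year increases:", longest_rising_run(means))
```

With independent fake readings, a long unbroken run of increases would be very improbable, so a persistent run in real data would at least call for some explanation other than chance.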

When Nic Lewis presented his initial analysis of why a paper was wrong (the paper claimed a new method and concluded that ocean heat uptake was 60% greater than previously believed), there was initially considerable noise from the activist community. Some showed calculations that matched the paper’s result (the paper itself provided the data but not the means by which the authors came to their conclusion). Lewis replied with chapter and verse from a handbook of standard statistical methods, showing that his results were correct and that his opponents’ approach was not valid FOR THE TYPE OF DATA. The arguments stopped and the paper was withdrawn, although its authors did not, in their withdrawal statement, admit the major error that Lewis revealed.

For years there has been this argument on various climate sites as to whether statistical calculations of a certain type that result in higher precision for the calculated data than the data the instruments can produce are applicable to temperature data (different instruments at different sites measuring different conditions). For this article Tim has referenced a certain statistical textbook for several rules verifying some particulars of his calculations, but still no one seems to have a standard source that addresses the central question of applicability in any unambiguous way.

If a standard weather thermometer (+/-0.5C) is paired with a reference thermometer (+/-0.001C) then it can be said that the integer reading thermometer is off the correct temperature by so many thousandths (or hundredths, or tenths if you choose a less extreme reference) of a degree. Does this make the difference, in this circumstance, for any particular measurement, an error of the standard thermometer or a measure of its particular uncertainty?


Tim Gorman

Reply to AndyHce

November 12, 2020 9:06 am

“Anyway, kribaez seems correct in asserting the distribution is uniform; what else could it be?”

It can be uncertainty, which has no probability. In a uniform distribution you can calculate the probability of each point in the distribution – they are all equal. But the *true value* in an uncertainty interval has a higher probability than all other points in the interval; the problem is that you don’t know what the actual true value is. It is uncertain. If you assign equal probabilities to all points in the interval then you have, in essence, claimed that they are *all* the true value. That makes no sense at all.

“The actual variance of interest seems more likely to be the difference between the largest and smallest measurement.”

Even the largest and smallest measurements each have an uncertainty interval associated with them. So is it the difference between the top of the uncertainty interval for the largest measurement and the bottom of the uncertainty interval for the smallest measurement? Or is it the difference between the bottom of the uncertainty interval for the largest measurement and the top of the uncertainty interval for the smallest measurement? Or is it somewhere in between?

Remember, the difference between one measurement, i.e. year 0, and the next measurement, year 1, has to be greater than the uncertainty interval in order to know for certain whether you saw any difference between them. If the difference you calculate is 0.01 but the uncertainty is +/-0.5 then how do you know for sure that you actually saw that difference of 0.01?

For the global climate the differences seen over a long period of time, even 30 years, remain within the uncertainty interval. So how do you know if you’ve seen a difference at all?
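As a rough sketch in Python, here is one possible reading of that criterion: two values count as distinguishable only if each lies outside the other's +/- interval. The function name and the example numbers are assumptions for illustration, and other conventions (for example, requiring the two intervals not to overlap at all) would be stricter.

```python
def difference_is_resolvable(a, b, u_a, u_b):
    """True only if each value lies outside the other's +/- interval."""
    return abs(a - b) > max(u_a, u_b)

# A 0.01 step with +/-0.5 uncertainty cannot be told apart from no change.
print(difference_is_resolvable(15.01, 15.02, 0.5, 0.5))  # False
# A full-degree step with the same uncertainty can.
print(difference_is_resolvable(15.0, 16.0, 0.5, 0.5))    # True
```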

“That uncertainty interval is expressed in terms of real mechanical forces. Choosing parts that equal or exceed the uncertainty of the stresses that are expected must be done by using actual measurements of those same mechanical forces. ”

You can calculate the forces, at least within the uncertainty of your various estimates of loading. All you need to do then is order trusses whose lower bounds of uncertainty exceed the forces you have calculated. The only measurement that is required is by the manufacturer, to ensure that his characterization of the strengths of his product is accurate.

“Therefore the bridge example cannot be comparable to something which you says has no units of measure”

I never claimed that uncertainty has no units of measure. If you’ll look back I always said something like +/- 0.5degC. The indicator C means temperature in Centigrade. That *is* a unit of measure.

“In making any series of calculations it is common practice to use intermediate values with more precision than is significant.”

Calculations are not measurements. And the intermediate values only go one digit further than your precision.

As Dr. Taylor notes in his textbook: “To reduce inaccuracies caused by rounding, any numbers used in subsequent calculations should normally retain at least one significant figure more than is finally justified. At the end of the calculations, the final answer should be rounded to remove these extra, insignificant figures.”

Remember, measurements are not calculated. If you are calculating an average of measurements then in the final answer you still round to the significant figure determined by the precision of the instrument doing the rounding and by the rule “The last significant figure in any stated answer should usually be of the same order of magnitude (in the same decimal position) as the uncertainty.” (again from Dr. Taylor’s textbook). If the uncertainty is in tenths then round to the tenths digit.
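A small Python sketch of that rounding rule, assuming a positive uncertainty that is kept to one significant figure; the helper name and the sample numbers are hypothetical, and this is only one way to implement the rule quoted from the textbook.

```python
import math

def round_to_uncertainty(value, uncertainty):
    """Round value to the decimal position of the uncertainty's leading digit."""
    # Decimal position of the leading digit, e.g. 0.5 -> tenths, 0.05 -> hundredths.
    exponent = math.floor(math.log10(abs(uncertainty)))
    ndigits = -exponent
    return round(value, ndigits), round(uncertainty, ndigits)

# An average computed as 15.0123 with +/-0.5 uncertainty is stated to the tenths.
print(round_to_uncertainty(15.0123, 0.5))   # (15.0, 0.5)
# With +/-1 uncertainty only the units digit is kept.
print(round_to_uncertainty(15.0123, 1.0))   # (15.0, 1.0)
```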

“If the calculation for “global temperature” to two decimal digits comes to 15.01 one year, then 15.02 the next, then 15.03 the next, increasing by .01 each year – without any manipulation to produce such a series – then even though the places to the right of the decimal are not meaningful or valid as a measure of temperature, such a result is highly unlikely to be random. If not random it comes from something in particular.”

You are missing the point. How does the global temperature get calculated to the hundredths digit when the uncertainty is at least in the units digit? All you *really* know is that year one is 15 and year two is 15. Calculating year one and then year two are *not* intermediate calculations. They are separate calculations and each needs to be rounded to the same magnitude as the uncertainty interval.

“However, if they go in one direction most of the time, either upward or downward, they produce a trend (even if a trend of nothing but the numbers).”

How do you know which way they go most of the time? You are violating the rules for significant figures in order to try and gain more precision than you actually have.

“The expected result of generating a “random” set of temperatures, of the same total number of values, and all within the same interval of the real temperatures, then doing the same calculations of average with them as are done with the temperature measurements, will almost surely produce a random set of values for each “year’s” worth of fake temperatures’ intermediate digits. ”

The issue is not the calculation, the issue is not knowing the true value. Of what use is it to generate a whole bunch of different “random” sets of temperatures when you don’t know if they are the true values or not? Does it help you in any way to determine the true values? Does it help you distinguish between two points that are within the uncertainty interval?

“Again I acknowledge there might be some arithmetic proof that explains there can be a “false” trend”

No one is saying anything about a “false” trend. The issue is that you simply can’t determine if there is a trend, either a true trend or a false trend, unless the temperatures being compared are outside of the uncertainty interval of both.

“For years there has been this argument on various climate sites as to whether statistical calculations of a certain type that result in higher precision for the calculated data than the data the instruments can produce are applicable to temperature data (different instruments at different sites measuring different conditions).”

And for years those who think they can gain precision using calculations while ignoring uncertainty, be it for temperatures or the mass of a proton, are only fooling themselves. It’s like arguing over how many angels fit on the head of a pin. Who can measure it precisely enough to know for sure?

“still no one seems to have a standard source that addresses the central question of applicability in any unambiguous way.”

There are all kinds of sources. Dr. Taylor’s textbook is just one of many. Go here for another discussion of the issue: https://www.sjsu.edu/people/ananda.mysore/courses/c1/s0/ME120-11_Uncertainty_Analysis.pdf

Note carefully the statement: “Taking the square root of the sum-of-squares is an effective way to combine uncertainties into one value, and squaring each contributing term before taking the sum has some important advantages:” Root-sum-square again. You will find this *everywhere* on the internet.
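For illustration, root-sum-square combination can be sketched in a few lines of Python. The function name and the example uncertainties are made up, and the sketch assumes the contributing uncertainties are independent and expressed in the same units.

```python
import math

def root_sum_square(uncertainties):
    """Combine independent uncertainties as the square root of the sum of squares."""
    return math.sqrt(sum(u * u for u in uncertainties))

# Two hypothetical contributions of +/-0.3 and +/-0.4 combine to about +/-0.5.
print(round(root_sum_square([0.3, 0.4]), 3))  # 0.5
```

Note that the combined value is larger than either contribution on its own but smaller than their straight sum of 0.7.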

Go to the NIST site or look up the GUM document. You are really interested in NIST Type B uncertainty. Don’t confuse “standard uncertainty of the mean” with “random uncertainty” with “systematic uncertainty”. They are all different and the first two are involved with probability functions of random error and not with uncertainty.

“If a standard weather thermometer (+/-0.5C) is paired with a reference thermometer (+/-0.001C) then it can be said that the integer reading thermometer is off the correct temperature by so many thousandths (or hundredths, or tenths if you choose a less extreme reference) of a degree.”

What do you mean by “paired with”? If they are both in the same site measuring the same thing then why would you use two different devices? And remember, like the Argo floats, the uncertainty of the measurement is not determined by the precision of the thermistor used as a detector but by the entire system itself. If the 0.001C detector is housed in a system which causes a 0.5C uncertainty in the measurement, then the precision of the detector is meaningless.

