Guest Essay by Kip Hansen

This essay is long-ish — and is best saved for a time when you have time to read it in its entirety. It will be worth the wait and the eventual effort. It comes in three sections: a Primer on Averages, a general discussion of Fruit Salad metrics, and a more in-depth discussion of an example using a published study.
NB: While this essay is using as an example a fairly recent study by Catherine M. O’Reilly, Sapna Sharma, Derek K. Gray, and Stephanie E. Hampton, titled “Rapid and highly variable warming of lake surface waters around the globe” [ .pdf here; poster here, AGU Meeting video presentation here ], it is not meant to be a critique of the paper itself — I will leave that to others with a more direct interest. My interest is in the logical and scientific errors, the informational errors, that can result from what I have playfully coined “The First Law of Averages”.
Averages: A Primer
As both the word and the concept “average” are subject to a great deal of confusion and misunderstanding in the general public and both word and concept have seen an overwhelming amount of “loose usage” even in scientific circles, not excluding peer-reviewed journal articles and scientific press releases, let’s have a quick primer (correctly pronounced “prim – er”), or refresher, on averages (the cognizanti can skip this bit and jump directly to Fruit Salad).

and, of course, the verb meaning to mathematically calculate an average, as in “to average”.
Since there are three major types of “averages” — the mode, the median, and the mean — a quick look at these:



Several of these definitions refer to “a set of data”… In mathematics, a set is a well-defined collection of distinct objects, considered as an object in its own right. (For example, the numbers 2, 4, and 6 are distinct objects when considered separately, but when they are considered collectively they form a single set of size three, written {2,4,6}.)
This image summarizes the three different common “averages”:

Here we see the Ages at which patients develop Stage II Hypertension (severe HBP – high blood pressure) along the bottom (x-axis) and the Number of Patients along the left vertical axis (y-axis). This bar graph or histogram shows that some patients develop HBP fairly young, in their late 30 and 40s, after 45 the incidence increases more or less steadily with advancing age to peak in the mid-60s, falling off after that age. We see what is called a skewed distribution, skewed to the right. This shewdness (right or left) is typical of many real world distributions.
What we would normally call the average, the mean, calculated by adding together all the patient’s ages at which they developed HBP and dividing by the total number of patients, though mathematically correct, is not very clinically informative. While it is true that the Mean Age for Developing HPB is around 52 it is far more common to develop HPB in one’s late 50s to mid- 60s. There are medical reasons for this skewing of the data — but for our purposes, it is enough to know that those outlying patients who develop HPB at younger ages skew the mean — ignoring the outliers at the left would bring the mean more in line with the actual incidence figures.
For the medically inclined, this histogram hints that there may be two different causes or disease paths for HPB, one that causes early onset HPB and one related to advancing age, sometimes known as late High Blood Pressure.
(In this example, the Median Age for HPB is not very informative at all.)
Our HPB example can be read as “Generally, one begins their real risk of developing late HPB in their mid-40s and the risk continues to increase until their mid-60s. If you haven’t developed HPB by 65 or so, your risk decreases with additional years, though you still must be vigilant.”
Different data sets have different information values for the different types of averages.
Housing prices for an area are often quoted as Median Housing Costs. If we looked at the mean, the average would be skewed upward by the homes preferred by the wealthiest 1% of the population, homes measured in millions of dollars (see here, and here, and here).
Stock markets are often judged by things like the Dow Jones Industrial Average (DJIA) [which is a price-weighted average of 30 significant stocks traded on the New York Stock Exchange (NYSE) and the NASDAQ and was invented by Charles Dow back in 1896]. A weighted average is a mean calculated by giving values in a data set more influence according to some attribute of the data. It is an average in which each quantity to be averaged is assigned a weight, and these weightings determine the relative importance of each quantity on the average. The S&P 500 is a stock market index tracks the 500 most widely held stocks on the New York Stock Exchange or NASDAQ. [A stock index … is a measurement of the value of a section of the stock market. It is computed from the prices of selected stocks, typically a weighted average.]
Family incomes are reported by the US Census Bureau annually as the Median Household Income for the United States [$55,775 in 2015].
Life Expectancy is reported by various international organizations as “average life expectancy at birth” (worldwide it was 71.0 years over the period 2010–2013). “Mathematically, life expectancy is the mean number of years of life remaining at a given age, assuming age-specific mortality rates remain at their most recently measured levels. … Moreover, because life expectancy is an average, a particular person may die many years before or many years after the “expected” survival.” (Wiki).
Using any of the major internet search engines to search phrases including the word “average” such as “average cost of a loaf of bread”, “average height of 12-year-old children” can keep one entertained for hours.
However, it is doubtful that you will be more knowledgeable as a result.
This series of essays is an attempt to answer this last point: Why studying averages might not make you more knowledgeable.
Fruit Salad
We are all familiar with the concept of comparing Apples and Oranges.

Sets to be averaged must be homogeneous, as in comparable and not so heterogeneous as to be incommensurable.


Problems arise, both physically and logically, when attempts are made to find “averages” of non-comparable or incommensurable objects — objects and/or measurements, which do not logically or physically (scientifically) belong in the same “set”.
The discussion of sets for Americans schooled in the 40s and 50s can be confusing, but later, younger Americans were exposed to the concepts of sets early on. For our purposes, we can use a simple definition of a collection of data regarding a number of similar, comparable, commensurable, homogeneous objects, and if a data set, the data being itself comparable and in compatible measurement units. (Many data sets contains many sub-sets of different information about the same set of objects. A data set about a study of Eastern Chipmunks might include sub-sets such as height, weight, estimated age, etc. The sub-sets must be internally homogeneous — as “all weights in grams”.)
One cannot average the weight and the taste of a basket of apples. Weight and taste are not commensurable values. Nor can one average the weight and color of bananas.
Likewise, one cannot logically average the height/length of a set like “all animals living in the contiguous North American continent (considered as USA, Canada, and Mexico)” Why? Besides the difficulty in collecting such a data set, even though one’s measurements might all be in centimeters (whole or fractional), “all animals” is not a logical set of objects when considering height/length. Such a set would include all animals from bison, moose and Kodiak bears down through cattle, deer, dogs, cats, raccoons, rodents, worms, insects of all descriptions, multi-cellular but microscopic animals, and single-celled animals. In our selected geographical area there are (very very roughly) an estimated one quintillion five hundred quadrillion (1,500,000,000,000,000,000) insects alone. There are only 500 million humans, 122 million cattle, 83 million pigs and 10 million sheep in the same area. Insects are small and many in number and some mammals are comparatively large but few in number. Uni- and multicellular microscopic animals? Each of the 500 million humans has, on average, over 100 trillion (100,000,000,000,000 ) microbes in and on their body. By any method — mean, median, or mode — the average height/length of all North American animals would be literally vanishing small — so small that “on average” you wouldn’t expect to be able to see any animals with unaided eyes.
To calculate an average of any type that will be physically, scientifically meaningful as well as logical and useful, the set being averaged must itself make sense as a comparable, commensurable, homogenous collection of objects with data about those objects being comparable and commensurable.
As I will discuss later, there are cases where the collection (the data set) seems proper and reasonable, the data about the collection seems to be measurements in comparable units and yet the resulting average turns out to be non-physical — it doesn’t make sense in terms of physics or logic.
These types of averages, of disparate, heterogeneous data sets — in which either the measurements or the objects themselves are incommensurable — like comparing Apples and Oranges and Bananas — give a results which can be labelled Fruit Salad and have applicability and meaning that ranges from very narrow through nonsensical to none at all.
“Climate Change Rapidly Warming World’s Lakes”
This is claimed as the major finding of a study by Catherine M. O’Reilly, Sapna Sharma, Derek K. Gray, and Stephanie E. Hampton, titled “Rapid and highly variable warming of lake surface waters around the globe” [ .pdf here; poster here, AGU Meeting video presentation here ]. It is notable that the study is a result of the Global Lake Temperature Collaboration (GLTC) which states: “These findings, the need for synthesis of in situ and remote sensing datasets, and continued recognition that global and regional climate change has important impacts on terrestrial and aquatic ecosystems are the motivation behind the Global Lake Temperature Collaboration.”
The AGU Press Release regarding this study begins thus: “Climate change is rapidly warming lakes around the world, threatening freshwater supplies and ecosystems, according to a new study spanning six continents.”
“The study, which was funded by NASA and the National Science Foundation, found lakes are warming an average of 0.61 degrees Fahrenheit (0.34 degrees Celsius) each decade. That’s greater than the warming rate of either the ocean or the atmosphere, and it can have profound effects, the scientists say.”
All this is followed by scary “if this trend continues” scenarios.
Nowhere in the press release do they state what is actually being measured, averaged and reported. [See “What Are They Really Counting?”]
So, what is being measured and reported? Buried in the AGU Video presentation, Simon Hook, of JPL and one of the co-authors, in the Q&A session, reveals that “these are summertime nighttime surface temperatures.” Let me be even clearer on that — these are summertime nighttime skin surface water temperatures as in “The SST directly at the surface is called skin SST and can be significantly different from the bulk SST especially under weak winds and high amounts of incoming sunlight …. Satellite instruments that observe in the infrared part of the spectrum in principle measure skin SST.” [source] When pressed, Hook goes on to clarify that the temperatures in the study are greatly influenced by satellite measurement as the data is in large part satellite data, very little data is actually in situ [“in its original place or in position “ — by hand or buoy, for instance] measurements. This information is, of course, available to those who read the full study and carefully go through the supplemental information and data sets — but it is obscured by the reliance on stating, repeatedly “lakes are warming an average of 0.61 degrees Fahrenheit (0.34 degrees Celsius) each decade.“
What kind of average? Apples and Oranges and Bananas. Fruit Salad.
Here is the study’s map of the lakes studied:

One does not need to be a lake expert to recognize that these lakes range from the Great Lakes of North America and Lake Tanganyika in Africa to Lake Tahoe in the Sierra Nevada Mountains on the border of California and Nevada. Some lakes are smaller and shallow, some lakes are huge and deep, some lakes are in the Arctic and some are in the deserts, some lakes are covered by ice much of the year and some lakes are never iced over, some lakes are fed from melting snow and some are feed by slow-moving equatorial rivers.
Naturally, we would assume, that like Land Surface Temperature and Sea Surface Temperature, the Lake Water Temperature average in this study is weighted by lake surface area. No, it is not. Each lake in the study is given equal value, no matter how small or large, how deep or how shallow, snow fed or river fed. Since the vast majority of the study’s data is from satellite observations, the lakes are all “larger”, small lakes, like the reservoir for my town water supply, are not readily discerned by satellite.
So what do we have when we “average” the [summertime nighttime skin surface] water temperature of 235 heterogeneous lakes? We get a Fruit Salad — a metric that is mathematically correct, but physically and logically flawed beyond any use [except for propaganda purposes].
This is freely admitted in the conclusion of the study, which we can look at piecemeal: [quoted Conclusion in italics]
“The high level of spatial heterogeneity in lake warming rates found in this study runs counter to the common assumption of general regional coherence.”
Lakes are not regionally responding to a single cause — such as “global warming”. Lakes near one another or in a defined environmental region are not necessarily warming in similar manners or for the same reason, and some neighboring lakes have opposite signs of temperature change. The study refutes the researcher’s expectation that regional surface air temperature warming would correspond to regional lake warming. Not so.
“Lakes for which warming rates were similar in association with particular geomorphic or climatic predictors (i.e., lakes within a “leaf”) [see the study for the leaf chart] showed weak geographic clustering (Figure 3b), contrary to previous inferences of regional-scale spatial coherence in lake warming trends [Palmer et al., 2014; Wagner et al., 2012]. “
Lakes are warming for geomorphic (having to do with form of the landscape and other natural features of the Earth’s surface) and local climate — not regionally, but individually. This heterogeneity implies lack of a single or even similar causes within regions. Lack of heterogeneity homogeneity means that these lakes should not be consider a single set and thus should not be averaged together to find a mean. [ This correction made 17 Feb 2022 upon re-reading — kh ]
“In fact, similarly responding lakes were broadly distributed across the globe, indicating that lake characteristics can strongly mediate climatic effects.”
Globally, lakes are not a physically meaningful set in the context of surface water temperature.
“The heterogeneity in surface warming rates underscores the importance of considering interactions among climate and geomorphic factors that are driving lake responses and prevents simple statements about surface water trends; one cannot assume that any individual lake has warmed concurrently with air temperature, for example, or that all lakes in a region are warming similarly.”
Again, their conclusion is that, globally, lakes are not a physically meaningful set in the context of surface water temperature yet they insist on finding a simple average, the mean, and basing conclusions and warnings on that mean.
“Predicting future responses of lake ecosystems to climate change relies upon identifying and understanding the nature of such interactions.”
The surprising conclusion shows that if they want to find out what is affecting the temperature of any given lake, they will have to study that lake and its local ecosystem for the causes of any change.
A brave attempt has been made at saving this study with ad hoc conclusions — but most are simply admitting that their original hypothesis of “Global Warming Causes Global Lake Warming” was invalidated. Lakes (at least Summertime Nighttime Lake Skin Surface Temperatures) may be warming, but they are not warming even in step with air temperatures, not reliably in step with any other particular geomorphic or climatic factor, and not necessarily warming even if air temperatures in the locality are rising. As a necessary outcome, they fall back on the “average” lake warming metric.
This study is a good example of what happens when scientists attempt to find the averages of things that are dissimilar — so dissimilar that they do not belong in the same “set”. One can do it mathematically — all the numbers are at least in the same units of degrees C or F — but such averaging gives results that are non-physical and nonsensical — a Fruit Salad resulting from the attempt to average Apples and Oranges and Bananas.
Moreover, Fruit Salad averages not only can lead us astray on a topic but they obscure more information than they illuminate, as is clearly shown by comparing the simplistic Press Release statement “lakes are warming an average of 0.61 degrees Fahrenheit (0.34 degrees Celsius) each decade” to the actual, more scientifically valid findings of the study which show that each lake’s temperature is changing due to local, sometimes even individual, geomorphic and climate causes specific to each lake and casting doubt on the idea of global or regional causes.
Another example of a Fruit Salad metric was shown in my long-ago essay Baked Alaska? which highlighted the logical and scientific error of averaging temperatures for Alaska as a single unit, the “State of Alaska”, a political division, when Alaska, which is very large, consists of 13 distinct differing climate regions, which have been warming and cooling at different rates (and obviously with different signs) over differing time periods. These important details are all lost, obscured, by the State Average.
Bottom Line:
It is not enough to correctly mathematically calculate the average of a data set.
It is not enough to be able to defend the methods your Team uses to calculate the [more-often-abused-than-not] Global Averages of data sets.
Data sets must be homogeneous, physically and logically. They must be data sets of like-with-like, not apples-and-oranges. Data sets, even when averages can be calculated with defensible methods, must have plausible meaning, both physically and logically.
Careful critical thinkers will be on the alert for numbers which, though the results of simple addition and division, are in fact Fruit Salad metrics, with little or no real meaning or with meanings far different than the ones claimed for them.
Great care must be taken before accepting that any number presented as an average actually represents the idea being claimed for it. Averages most often have very narrow applicability, as they obscure the details that often reveal the much-more-important actuality [which is the topic of the next essay in this series].
# # # # #
Note on LOTI, HadCRUT4, etc.: It is my personal opinion that all combined Land and Sea Surface Temperature metrics, by all their various names, including those represented as indexes, anomalies and ‘predictions of least error’, are just this sort of Fruit Salad average. In physics if not Climate Science, temperature change is an indicator of change in thermal energy of an object (such as of a particular volume of air or sea water). In order to calculate a valid average of mixed air and water temperatures, the data set must first be equal units for equivalent volumes of same material (which automatically excludes all data sets of sea surface skin temperatures, which are volume-less). The temperatures of different volumes of different materials, even air with differing humidity and density, cannot be validly averaged without being converted into a set of temperature-equivalent-units of thermal energy for that material by volume. Air and water (and stone and road surfaces and plowed fields) have much different specific heat capacities thus a 1 °C temperature change of equal volumes of these differing materials represents greatly differing changes in thermal energy. Sea Surface (skin or bulk) Temperatures cannot be averaged with Surface Air Temperatures to produce a physically correct representation claimed as a change in thermal (heat) energy — the two data sets are incommensurable and such averages are Fruit Salad.
And yet, we see every day, these surface temperature metrics represented in exactly that non-physical way — as if they are quantitative proof of increasing or decreasing energy retention of the Earth climate system. This does not mean that correctly measured air temperatures at 2 meters above the surface and surface sea water temperatures (bulk — such as Argo floats at specific depths) cannot tell us something, but we must be very careful in our claims as to what they tell us. Separate averages of these data sets individually are nonetheless still subject to all the pitfalls and qualifications being presented in this series of essays.
Our frequent commenter, Steven Mosher, recently commented that:
“The global temperature exists. It has a precise physical meaning. It’s this meaning that allows us to say…
The LIA was cooler than today…it’s the meaning that allows us to say the day side of the planet is warmer than the nightside…The same meaning that allows us to say Pluto is cooler than earth and mercury is warmer.”
I must say I agree with his statement — and if Climate Scientists would limit its claims for various Global Temperature averages to these three concepts, their claims would be far more scientifically correct.
NB: I do not think it is correct to say “It has a precise physical meaning.” It may have a precise description but what it means for the Earth’s climate is far from certain and does not approach precise by any measure.
I expect opinions may vary on this issue.
# # # # #
Author’s Comment Policy:
I am always anxious to read your ideas, opinions, and to answer your questions about the subject of the essay, which in this case is Fruit Salad Averages as defined above.
As regular visitors know, I do not respond to Climate Warrior comments from either side of the Great Climate Divide — feel free to leave your mandatory talking points but do not expect a response from me.
I am interested in examples of Fruit Salad Averages from the diverse range of science and engineering fields in which WUWT readers work.
# # # # #
Discover more from Watts Up With That?
Subscribe to get the latest posts sent to your email.
as to what they tells us
You seem to be channeling Gollum here… 🙂
nasty hobitses
https://judithcurry.com/2016/02/10/are-land-sea-temperature-averages-meaningful/
Are land + sea temperature averages meaningful?
Eustace and Mark ==> Thank you for paying close attention to words. I like words and appreciate those who appreciate them.
My fingers are sometimes independent minded….
Good; here’s another little flub. The first comma should be a semicolon. As it stands, it forces the reader to re-read the sentence to understand it:
Roger Knights ==> Cruel Roger, to play on my weakness with semi-colons….only my Vassar educated editor understands the darkness that surrounds these vile creatures….
Truthfully, I write as if I were speaking — which sometimes plays havoc with punctuation-according-to-Style-Guides.
You are probably exactly right —
excellent presentation. this is a crucial point.
did you know you have an above average number of legs?
It is one thing to talk about “data sets” and another thing entirely to talk about “estimate sets” (data sets after “adjustment”).
Why would neighbouring lakes have different? Maybe one is covered by algae.
My guess is that the researchers gathered the data, couldn’t find anything useful, and thrashed about with different methods of analysis until they found a ‘significant’ result. It’s similar to the dark chocolate hoax. This XKCD cartoon also explains the phenomenon.
The researchers did a bunch of work. They can’t say they didn’t find anything because nobody will publish that. The result is there for everyone to see.
commie ==> The study is from “…the Global Lake Temperature Collaboration (GLTC) which states: “These findings, the need for synthesis of in situ and remote sensing datasets, and continued recognition that global and regional climate change has important impacts on terrestrial and aquatic ecosystems are the motivation behind the Global Lake Temperature Collaboration.” ”
Nearly all of the data in the study is from satellite data, as I point out.
They’re admitting that the study had a preordained outcome. That’s not science, it’s advocacy.
If a weighted average (rather than arithmetic mean) had produced the desired outcome, that’s what they would have used.
This is where a principle component analysis would come in handy. If you do it right, you feed everything into it, and it tells you what the important modes of variation are. In the case of global temperature measurements, it would probably make the UHI effect and data tampering pop right out, but you would also get to see how things vary with the season.
Forest: not to my knowledge. Certainly nothing in the hockey stick had anything to do with PCA.
“My guess is that the researchers gathered the data, couldn’t find anything useful, and thrashed about with different methods of analysis until they found a ‘significant’ result. ”
A practice now so common that it has even acquired a name: “significance-chasing” or “p-chasing”.
I didn’t know it had a name but, yes, it does. link Scientists admit there’s a problem with science but then you get folks like Neil deGrasse Tyson telling us that we have to accept the authority of science.
“Certainly nothing in the hockey stick had anything to do with PCA.”
You are wrong. The misuse of PCA is central to MBH 98 and MBH 99.
“Why would neighbouring lakes have different? Maybe one is covered by algae.”
I don’t know, but consider these NWS reports from Lake Champlain (One of their study lakes) yesterday. The temps are daytime and near shore. Yesterday was a clear, sunny, coolish day so clouds probably aren’t a factor.
Quite a lot of variation there on one largish lake.
Note: I don’t know where the USGS gauge at Burlington is located, but it’s unlikely to be even a kilometer from the gauge at the King St Ferry Dock. And Colchester Reef is only 12-15km NorthWest of Burlington. I’m guessing that the Burlington gauge is busted or was misread.
Although you finally get to it at the end, even your assessment method for defining Fruit Salad Average (FSA) is your own opinion. The people who create metrics such as “Global Mean Temperarure” have a different opinion, obviously. Thus we’ve really gained no ground here.
My foray into global warming skepticism falls along your reasonings, not to mention the act of measurement itself. But many many people in the sciences feel the methods of collection and interpretation are perfectly valid. If they weren’t, this all would’ve collapsed long ago.
One might refer to an average calculated from a mix of data, “adjusted” data (estimates) and ‘infilled” data (SWAGs) a “dogs breakfast” average, calculated after “processing”.
firetoice2014 ==> While I use the phrase “dog’s breakfast” in everyday speech (a nasty habit from my days as a merchant mariner), it has been ages since I’ve seen it in print. Thank you!
I wonder – if we averaged a set composed of ‘dog’s breakfasts’ and ‘dog’s dinners’ could we outline the parameters of a ‘dog’s brunch’? We would have to make sure that all the dogs used the term ‘dinner’ to denote a midday repast. Additionally, does the use of one or the other of these expressions bear any correspondence to the user’s regional or social stratum origin? Anybody got a blank grant application that I could use?
OldGuy ==> I picked it up from Brits/Aussies/Kiwis in the Merchants … in the 1970s. I don’t know if it is in common use in the States.
Since forever we have kept horses, and dogs, the phrase instantly conjured images of my dogs (we loved them all!) happily banging out the doggie door first thing in the morning to make a cursory fence patrol then settling into the horse paddock for breakfast.
While “many, many people in science FEEL [their] methods of collection and interpretation are perfectly valid” doesn’t mean they are valid. Kip’s review/ analysis and lesson is pretty darn good and not just his opinion. I have had to deal with more than one “fruit salad” submitted paper or analysis in my career. We were using the work to make regulations that would dramatically affect lots of people’s livelihoods. The authors would argue vehemently that their analysis was valid. Often it took very little to demonstrate how invalid their “averaging” was.
Ed ==> Thank you — can you say what field you are in?
Did you really mean to refer to the “shrewdness” if a skewed distribution? Or “patients who develop HPB at younger ages shew the mean.”?
There is a typo’ (probably autocorrect!) just after the graph on the different averages: “… This shrewdness (right or left)…”
The last sentence in the paragraph below the histogram lists “shrewdness” – I believe you meant skewness.
tadchem, Clyde, and Kokoda ==> Cheers my heart to know that others read carefully and pay attention to the words.
My editor was again unavailable yesterday — thus a few of these little typos have slipped through.
Skewidity, surely!
Or perhaps skewedness?
I am shrouded in obliquity today…
🙂
” If you haven’t developed HPB by 65 or so, your risk decreases with additional years, though you still must be vigilant.”
It may well be that those who are prone to develop HPB have been removed from the pool from heart attacks before the age of 65. Thus, those remaining are less likely to develop HPB.
Incidentally, I spotted another typo’. “Skew” was changed to “shew.”
Clyde ==> Thanks again.
Regarding the ‘meaning’ of a global average temperature, in the geophysical and the ecological senses of reality it is meaningless. No physical feature and no biological organism experiences an *average* temperature. All objects experience the local, transient temperature at their immediate location. There is no uniformity. There are only varying degrees of variability. The mesquite bush outside my office window can experience a minute change of temperature from minute to minute as clouds pass over, a small change from hour to hour as storms come and go, a larger change from night to day, and a tremendous change throughout the year. The bush has adapted to these changes and survives them all, and the ‘average’ temperature is irrelevant. It can survive temperatures over 100°F (as it did yesterday) and temperatures under 20° F (as it did 6 months ago). The pertinent life-threatening changes in its environment involve many other factors such as insects, browsing animals, wildfire, flash floods, etc.
Yes, but could it survive 101.6F and 21.6F (another 1C increment)? “Enquiring minds want to know.”
Yes. One can imagine the limited utility of a number said to be the “average summer temperature” of Honolulu, or the even more limited utility of the “average summer temperature” of Denver; but the “average” of those two numbers has lost all utility. It describes nothing useful.
The biggest fruit salad of them all is … the average person.
This particular “salad” does not even consider any measurable quantities. It’s just a stick figure of the mind.
Absolutely!! : “The average person has less than 2 legs, 2 eyes or 2 kidneys.”
While technically true it is a meaningless average. The “mode” would be far more meaningful.
“He uses statistics as a drunken man uses lampposts – for support rather than for illumination.”
(Andrew Lang)
“as a drunken man uses lampposts ”
Last drunken man I saw near a lamppost seemed to want to hold it up with a stream of liquid !
a stick figure on the serengeti of the mind…lol
Outstanding post.
ristvan ==> Thank you, sir.
Absolutely yes, well written stuff.
Just one more reason I do not weep when the budgets of government science are cut. Fewer of these ridiculous studies to pollute whatever remains of real science.
Science is hard to do. Good science takes years. These goombas found some data and ran it through their statistical packages that they don’t understand. I hope I didn’t pay for any of it.
Thanks Kip – a very good start.
~ ~ ~ ~ ~
Many years ago as sensors and algorithms (S&A) were being introduced to environmental studies, I wrote an essay for my instructor that included a look to the future.
Prior to S&A, land cover studies involved field excursions – usually students with clipboards criss-crossing an area and writing things like “block A has Pine trees, block B is a grass field, block C is the 3rd fairway.”
During the introduction of S&A, automated reports would be generated and then a “ground-truth” field trip would be conducted to see if the algorithms did, if fact, distinguish between the land cover types.
My paper’s hypothesis was that reliance on satellites to do the work of poorly paid graduate students was not necessarily a good thing.
[PS: On one “ground truth” trip, while a student was intent on recording land cover, a dog walked up and peed on his leg.]
John F. Hultquist == Anthony Watts’ Surface Station Project was that sort of “ground truthing” about the physical circumstances surrounding individual weather stations. I did a report on the station In Santo Domingo, Dominican Republic. (I did not get peed on).
Kip, the pronunciation is Prime- er, meaning basic or first. Children go to a primary school, not a primmery school. They are prime numbers, not primm numbers; the primary reason, not primmery reason. Jodie Foster couldn’t pronounce it either.
jsuther2013 ==> Oddly, M-W nonetheless insists on prim-er — listen to this link:
https://www.merriam-webster.com/dictionary/primer?pronunciation&lang=en_us&dir=p&file=primer01.
Of course, you are right for Primary School, and Primary, and Primarily, and “primer” — a type of base-coat for paints (we use a lot of primers on the boat) —
but for
1
: a small book for teaching children to read
2
: a small introductory book on a subject
3
: a short informative piece of writing
it is prim-er, with the short “i”.
Kip, M-W gives two pronunciations.
prim·er \ˈpri-mər, chiefly British ˈprī-mər\
It appears both [i] and [ī] are possible. Like sit [sit] and sight [sīt]. Never heard primmer before… I’m chiefly learnt-English-as-foreign-language.
In my traditional English education during the 50s and 60s there is only one pronunciation, and it is not primm-er.
(I say traditional because it was long before the modern ‘progressive’ education systems that produce the high number of illiterates we now have in the UK and which have brought about the significant drop in standards of UK education when measured against international standards)
Obviously only a matter of varied usage, semi-dialect. In NZ it happens to be primmer just for the 1st 4 classes, 2yrs where we learnt to read and write; and become at least partly-civilised. Still spelt primer, though.
I have heard “prim-er” for an elementary reader, not “prime-er”, with a long I. The pronunciation is irregular.
That makes no sense, since the purpose of a primer is to prime the student’s reading skills. Like priming a pump. You never prim a pump.
So what if it makes no sense? That was the pronunciation used in California some 50 years ago.
Kip,
I agree completely with you that there is an overabundance of ‘Fruit Salad’ in climatology papers. I have complained previously here that averaging sea surface temperatures with land air temperatures distorts what is happening. Similarly, I have advocated reporting the global average of diurnal highs and lows individually, rather than a grand average, because there are different processes at work controlling the highs and lows and, again, those processes are hidden by a grand average. Further, I have suggested that land averages be grouped by climatic zones to see if all are responding similarly, which I believe they are not. It is surprising to me just how poor the analysis of temperature data has been, and that so few people have remarked that “the emperor has no clothes.”
Quite so.
Clyde,
With the current data quality and infill (guessing) your proposed analysis is likely impossible. We would have to deploy something similar to the Climate Reference Network sites to all the areas considered representative and then wait a century to have enough data to do the analysis. People today don’t seem to have that sort of patience, especially with chicken littles out there proclaim thermogeddon in the next 50 years.
OweninGA,
There is no question that the global temperature data set was never intended for the use that climatologists are trying to apply it to. However, we do currently have some who are using the sparsely-monitored Arctic temperatures to make broad statements about what is happening in the Arctic. I’m suggesting that those who are the ‘professionals’ in the field should be similarly looking at the other climate zones. The data available may not be optimal, but they might still provide some insights, if someone bothered to look. That effort alone might help to provide specifications for what an optimal climate monitoring network might need. Yes, the mentality seems to be one of “Ready, fire, aim.”
Kip, I had previously commented in other Threads about the absurd “averaging” of SST and LST’s. Additionally, it is passing strange that we find nothing wrong with “averaging” Arctic and Equatorial temperatures; differences in humidity, height of the tropopause, etc.
Dave ==> No one goes to this much work without a specific purpose in mind. The specific purpose of measuring/deriving the Earth’s Mean Surface Temperature, in the political climate today, it to prove that CO2 concentrations increasing in the atmosphere are causing to Earth system to retain extra energy incoming from the Sun (and that this is somehow dangerous instead of beneficial).
Since Surface Temperature, regardless of how calculated, does not represent the amount of energy retained (or not retained) it is either just interesting for its own sake (for the curious) or a fool’s errand.
+ Many. Thanks.
The Global Average temperature and GCM’s are a construct. Maybe even a useful construct but still a “fruit salad” and casting of bones to attempt to foresee the future. What the results of this averaging exercise means will only be known in a few hundred years when folks look back on this period of time with the knowledge of hindsight.
INSIGHT: The Paris Accord is founded on fruit salad.
Bon appétit !
careful there- the fruits are using the wrong restrooms…
An example that immediately comes to mind is the so-called “pay gap” between men and women. Claims of a significant difference in pay are based on averaging all salaries of men and all salaries of women. This averaging method does not consider factors such as different jobs (comparing an oil field worker with a fast food worker), different levels of experience, and different hours worked (full-time vs. part-time). No significant gap is found when comparing salaries of men and women with the exact same job, who work the exact same hours, who have the same levels of education and experience, etc.
More social science and political than natural or hard science, but it is a good example of how how to lie with statistics. It is also a good example of how entrenched false claims can become, even when debunked by disparate sources. Even when people know the truth, they may still promote fruit salad based conclusions that “support” their narrative (noble cause corruption). Example: the Department of Labor has tweeted the false statistic as fact, even though a) they should know better, and b) they do know better, as evidenced by their own website’s refutation.
Ally ==> Yes, lumping “men” and “women” into two categories and averaging each for comparison is a Fruit Salad exercise. I have seen better analyses of the gap — the gap gets smaller with closer attention to avoiding Fruit Salad — but may still exist as a societal artifact.
A Fruit Salad, indeed – and for the same reason as trying to “average” a global, or national, or State temperature. There are economic “microclimates” that vary just as much as meteorological ones.
I was very well paid as a “Senior Application Developer” – for my area. Just up the road, less than 90 miles away, I would have been somewhat underpaid – and darn near poverty level if I had been making that much and living in San Francisco, New York, etc.
+10….thanks for that. I had been wondering about this claim as it did not seem right, but I couldn’t put my finger on why.
Very good argument on a basic point. Where I was raised in Northern California, much of the influence on temperature was just how close one was to the ocean, and how many mountains were between you and the ocean. It could be 64F at the coast, 80F in the Santa Clara Valley (one range of mountains), and 95F in the Central Valley (two ranges of mountains). An average temperature would not mean much.
No need to travel so far to disprove the concept of an average temperature.
A comparison of the temps at your house between the front (concrete/hard surface driveway) to the back (grass/soft surface) would suffice.
“Travel far”? The coast was about an hour and a half travel time away, as is the Central Valley.
Excellent, excellent, excellent.
In your first figure you show the mean to be 52. This is the mean of the ages at which at least one patient developed HPB, not the mean of all the patients ages at which they developed HPB. That latter mean would be 57.38 if I read the chart correctly and is actually the more normal meaning of mean.
David Kleppinger ==> The HBP chart is taken from a web search for charts about averages. The MEAN is the mathematical mean of age of all patients added together divided by number of patients. The MEDIAN is the age at which there are as many patients above this age as below this age. The MODE is the age with the highest number of patients.
I can not vouch for the medical accuracy of the chart — not my data.
I’m just saying that was a bad chart to show as an example.
What that chart is showing as the mean appears to be the high value + low value / 2.
If you calculate it, the mean of the all the patient ages is 57.4, the mean of the ages (without considering the number of patients at that age) is 53.2.
David ==> You have a very good eye, sir. There is something amiss (in the numbers). It is quite possible that the chart does not show all the outliers at either end, thus making the mean and median appear to be in the wrong places. (again, not my chart)
The concepts in the essay are correctly stated — only a maths or stats eye would catch what you have caught!
Points adjacent in space or time are as different as apples and oranges.
Doug Huffman ==> More context please.
Excellent post Mr. Hansen. Reminds me of this.
A scientist is standing with one arm in a freezer and the other arm in an oven. He says “on average I’m quite comfortable”.
“The global temperature exists. It has a precise physical meaning.” – agreed, but we haven’t been measuring the true global temperature, and I’m not sure we can. The true global temperature would be to capture all global temperatures at the same point in time and then average them. And then we need this same capture to happen for each unique surface area that’s in sunlight. If there is a time variance of when each temperature value included in the average is taken then we haven’t achieved the ‘precise physical meaning’ of a global temperature average.
“allows us to say the day side of the planet is warmer than the night side” – it does? Does it allow us to determine how the hemispheres compare? A single ‘global temperature’ value does not allow us to do any of that.
A ‘precise’ global temperature value would allow us to determine if the Earth is warming at the rate of 0.0000023 C degrees per hour to prove that we’ll have 2 C degrees warming in 100 years. I don’t believe we have that level of precision. And since we don’t, any proxy average global temperature value needs to have an error margin included.
To me bigger question is the meaning of the average of a non-linear quantity , temperature , rather than their 4th power , energy — in which computations are linear . That is , is the most meaningful average
temps 4. ^ avg .25 ^rather than
temps avgwhere
: avg dup +/ swap % ;Where I’ve taken the liberty of expressing the computations in CoSy‘s RPN Forth syntax and left out the StefanBoltzmann constant which drops out anyway because I think the difference in the computations is easily understood however expressed .
Bob Armstrong ==> It gets complicated….
The CO2 Global Warming hypothesis is that increasing concentrations of CO2 will cause the Earth system to retain additional energy (until it eventually regains equilibrium). As some part of this energy expresses as sensible heat, they measure temperatures. Temperatures of the atmosphere and seas are not proven to be reliable measures of retained energy — energy is not even being measured (with the exception of incoming and outgoing radiation). no attempt has been made to determine where that energy goes and how it is stored.
One might ask this question: when energy arrives from the Sun, at the Earth’s surface, what energy transformation take place? Some is stored as sensible heat, some is transformed into stored chemical energy, some moves the air around, some moves the seas, some is transformed into the energy potential of water moved from sea level to the tops of mountain ranges, etc etc.
I’ve presented the computations for radiative equilibrium for arbitrary spectra , perhaps most succinctly at http://cosy.com/Science/warm.htm .
I have recently become absolutely certain that the Al Gore , James Hansen GHG spectral “heat trapping” paradigm is false on the most basic physical level .
Gravity is left out of the equations and it cannot be . I have come to view this as “settled” despite it being apparently outside the ken of the “climate science” community — altho more here are realizing it .
The “nail in the paradigm” is that light itself is heated ie : blue shifted , on descent into a gravitational well as demonstrated by the Pound–Rebka experiment , https://en.wikipedia.org/wiki/Pound%E2%80%93Rebka_experiment , based on the Mössbauer , https://en.wikipedia.org/wiki/M%C3%B6ssbauer_effect , effect .
If light is “heated” by gravity , it is only a matter of working out the equations to calculate the heating of all matter including atmospheres as one descends into a gravitational well .
—
That was somewhat of an interjection of my current state of understanding . The point I raised is a straight forward mathematical one . The general computation which leads to the 255K meme , a specific case I include in this slide ,
http://cosy.com/Science/AGWpptHypotheticalSpectra.jpg
assumes a point or , in the more common derivation a flat disk . Thus whether the average is performed over temperatures or energies and then converted to temperatures makes no difference . However , the implicit assumption in the standard ” disk % 4 ” computation is in terms of energy .
But , taking just 2 temperatures , say 310. for the tropics and 260. for the poles , we get
f( 310. 260. )f +/ 2. _f %f |>| 285.00
f( 310. 260. )f T>Psb +/ 2. _f %f P>Tsb |>| 288.23
where
: sb ( -- StefanBoltzmanConstant ) 5.6704e-8 _f ;
: P>Tsb ( Power -- Temperature ) sb %f .25 _f ^f ; | SB power to temperature
: T>Psb ( P -- T ) 4. _f ^f sb *f ;
( Again , excuse the crudeness of the CoSy notation compared to a polished APL due to it executing right at the x86 register level , but if anybody appreciates what’s being done , definitely check out CoSy . )
As you can see , in this example you get either 285.00 or 288.23 depending on whether you average temperature or average kinetic energy density ( power ) .
That’s my real conundrum . Which is more real ?
Take-away: Global warming is threatening fruit salads everywhere.