Heat Waves vs. Observed Data

John Entwistle

A few months back one of the commenters on this WUWT article: https://wattsupwiththat.com/2021/07/12/the-truth-about-heat-waves/ posted a comment to the effect that the number of heatwaves is increasing, and cited this article on the EPA website: https://www.epa.gov/climate-indicators/climate-change-indicators-heat-waves .  Embedded in the EPA article is this diagram:

I was disappointed that the article doesn’t provide data, methodologies, or calculations. It also used some terms such as “the local threshold” and “heat wave season” without defining them. I was prompted by this comment to channel my inner Willis to see if I could replicate this chart.

What is a heat wave?

I started with the definition of a heat wave. I found several delightfully ambiguous definitions on the internet.  From  Mirriam Webster we have: “a period of unusually hot weather”; Wikipedia defines it as: “A heat wave is a prolonged period of unusually hot weather.”; while from NOAA there is: “A heat wave is simply a period of unusually hot weather that typically lasts two or more days. The temperatures have to be outside the historical averages for a given area.” These definitions leave a lot of room for interpretation. Who gets to decide what is unusual? Historical averages over what period of time? Average over the whole period or a moving average? What is the local threshold and who determines its value?  How far outside the historical averages?

Getting Some Data

I began my search for data by looking for some actual temperature records from my area. I wanted daily observed data, not monthly summaries or grid estimates. I live in central New Jersey, just outside of New York, and I was surprised to find that, despite the fact that there have been private colleges / universities and military bases from all branches within a stone’s throw of my house since the revolution, no local institution has recorded, or at least published, a historical daily temperature record. The best that Climate Data Online (https://www.ncdc.noaa.gove/cdo-web/ could offer was three discontinuous data sets from different locations, the oldest starting… in 1960!  On WUWT it’s often been stated that “climate is 30 years.” If I was to look for heat waves from the ‘60s that meant that I needed to find data from at least 1930 in order to look for heat waves in the 60s. 

I poked around the internet and found some sources that had daily temperature observations in New York starting in the late 1800s:

High Plains Regional Climate Center: High Plains Regional Climate Center – CLIMOD (unl.edu)  at the University of Nebraska Lincoln. This site has a good data set for New York’s Central Park.

Cornell University:  weather.nysaes.cals.cornell.edu/history/ has records taken at their experimental farm outside of Geneva, NY.

I used Python scripts to scrape or parse these data sets into CSV files. There was some missing data, particularly in the early years, so discarded the records from before 1900. The Cornell site seems to have stopped posting half way through 2020, so I dropped the last six months of that data set to align with the end of 2019. There were still a few holes, so I filled in the gaps by calculating the average of the ten days before each gap and plugged the hole with that average.

Data SetDate RangeNumber of rowsMissing HighsMissing Highs %Missing LowsMissing Lows %
New York Central Park1900 through 2020441956070
Cornell  1900 through 199943892110.25%134.31%
Temperature data sets used in this article

Back to the question of “usual temperature”. How do we determine the usual temperature for any given day of the year? It bothers me that we always think about usual / normal / average temperatures in terms of calendar months. A calendar month is a convenient fiction invented, in the case of our calendar, by the Romans. Months have little or nothing to do with actual climate related phenomena.

Rather than using an average of the days of a month, or the n days prior to a day, I thought it might make more sense to look at the temperatures for each individual day and location across the ~120 periods of each data set. This approach gives us a mean temperature for each day of the year that we can then use as the basis for our “two plus days above usual” evaluation. Using Python, Pandas, and Matplotlib I was able to quickly generate a data series for each day of the year over the ~120 years of each temperature record, calculate the mean, standard deviation, and a least squares trend line for each day, and plot these to see how the temperature has changed year over year for that day.

120 Years of Daily Temperatures

Every year citizens all across the US get together on the fourth of July, US Independence Day, to eat hot dogs, drink beer, and complain about how hot it is.  A plot of the temperature on US Independence Day in Central Park from 1900 to 2020 looks like this:

July Fourth in New York’s Central Park over a 120-year period – Is this why New Yorkers go to the Hamptons?

The mean in this graph is the mean high temperature value for this day of the year, in this case day 185.

Cornell University has an experimental farm not too far from Geneva, NY, about midway between Rochester and Syracuse. They’ve been collecting temperature data since 1891, though they stopped updating their web site mid-way through 2020.  If we look at July 4 in Geneva from 1900 through 2019 we get a slightly different picture than the one in Central Park:

July Fourth in Geneva, NY over a 119-year period

It is interesting to note that the minimum temperatures in both locations have increased over the last 120 years.

I thought it might be fun to have a look at similar daily plots for some days that might actually be climate related – the solstices and equinoxes.  Here is Central Park again:

NY Central Park temperature of solstice/equinox days over the same 120-year period

The same four solstice / equinox days look like this in Geneva, NY:

Temperature of the same solstice/equinox days in Geneva, NY over the same 119-year period

Heat Waves

To find the heat waves we need to find two or more consecutive days above normal. I decided to use one standard deviation above the mean temperature for that day of the year as “normal” for that day. I will do this for the entire year, not an arbitrary “heat wave season”. After all, if it is really warm in the middle of winter, isn’t that a heat wave?

To start, I identified every day above this threshold value. Here is the count of these ‘hot days’ per year in Central Park:

 1992 had quite a few hot days in Central Park!

The plot of hot days for Geneva, NY looks like this:

Those hot days seemed to occur in 1922 in upstate NY

From here it’s a simple matter to count the number of consecutive hot days to find the heat waves:

NY Central Park number of heat waves per year 1900 – 2020

Geneva, NY number of heat waves per year 1900 – 2019

Now let’s look at intensity and duration. I’ll choose intensity to mean the average temperature of all heat waves that year.

NY Central Park average temperature per heat wave per year 1900 – 2020

Geneva, NY average temperature per heat wave per year 1900 – 2019

These are interesting pictures. Does it mean that heatwaves are getting cooler, or that the percentage of heat waves in the cooler months has increased in the latter half of the data period?

Finally, we can plot the average length of the heat waves each year over the same period:

NY Central Park average duration of heat waves per year 1900 – 2020

Heat wave average duration per year in upstate New York 1900 – 2019

Summarizing by Decade – The EPA Chart

Averaging the above heat wave numbers for each decade from 1900 – 2019 gives us charts similar to those provided by the EPA:

NY Central Park heat waves per decade 1900 – 2019

Cornell Experimental Farm, Geneva, NY heat waves per decade 1900 – 2019

Interestingly, the number of heatwaves per decade in both Central Park and upstate New York in the period 1960 – 2019 show an increase in the number of heat waves each decade, similar to the EPA graph, though it looks like they used a higher threshold because the number of heatwaves they cite is much lower. The period from 1960 to 2020 doesn’t appear to tell the whole story, though, as we can see that there was a warmer period in both data sets in the 30s and 40s. The intensity and duration charts that I’ve produced are quite different from EPA’s in each location. 

One Chart to Rule them All

The EPA only had one set of charts representing an unspecified region, I suppose the whole Earth? I have two sets of charts, one for the city and one for upstate.  Combining the two by averaging the decadal values for each location gives these plots:

I’m not sure what these combined charts tell us. Averaging the counts of the heatwaves across the two locations does seem to mask the steady rise in the number of heatwaves in Central Park. The intensity and duration charts that we produced here bear no resemblance to those produced by the EPA. It would be nice to know what data they used to produce their charts.

What is clear is that a distance of approximately 300 miles and an altitude difference of 780 feet has produced very different temperature data in these two locations. The heatwave charts seem to show that even thirty years is not long enough to understand the climate in either location. It would be great to have another hundred years of data to see if that seemingly cyclical pattern in the number of heatwaves is real.

All data and calculations can be found here: https://github.com/jentwistle3/Heatwave_public

The climate data they don't want you to find — free, to your inbox.
Join readers who get 5–8 new articles daily — no algorithms, no shadow bans.
4.8 36 votes
Article Rating
84 Comments
Geoff Sherrington
November 19, 2021 3:08 pm

If readers want to get a stress headache, have a read of recent literature on marine heat waves. Unfettered alarmism seems to dominate cold, hard scientific assessment. Again. Geoff S

leitmotif
November 19, 2021 3:27 pm

” I was prompted by this comment to channel my inner Willis to see if I could replicate this chart.”

Add a little back radiation here, a bit of back radiation there, maybe ramp up the ECS. Remember, it is imperative you don’t supply any evidence for any assertions you might make or else you will have the lukewarmists banging on your front door.

That should channel your inner Willis, John.

Don’t forget to delete my comment, though. That would not be inner Willis if you didn’t.

Reply to  leitmotif
November 19, 2021 3:33 pm

Typical Leitmotif. Full of personal attacks, plus lies that I don’t supply evidence, and without a single relevant scientific comment on the subject at hand.

And of course, the big lie, that I’ve deleted his comments. Despite being sorely tempted to place his trash in the appropriate receptacle, I can’t recall ever doing that.

Leitmotif, you are a lying sack of bovine-processed hay.

w.

Abolition Man
Reply to  Willis Eschenbach
November 19, 2021 5:05 pm

Willis,
That’s an insult to sacks of bovine-processed hay!
At least THEY can be utilized for important things like methane digesters or compost piles!

leitmotif
Reply to  Abolition Man
November 20, 2021 2:52 pm

Abolition Man is a Willis Groupie.

You obviously like junk science too.

leitmotif
Reply to  Willis Eschenbach
November 20, 2021 3:03 pm

 Full of personal attacks, plus lies that I don’t supply evidence”

Erm… Well you don’t. You just utter BS about back radiation causing surface warming. You don’t even refer to any experimental evidence where this surface warming is measured.

You are also a serial liar, Willis. You have deleted many of my comments over the last year and those of rick will, eben and I also believe Zig Zag Wanderer. If you don’t believe me go back and check your blogs in the last 12 months.

If anyone is a lying sack of sh*t it’s you Willis.

Btw, I have background of STEM subjects to degree level. What is your background? Oh, wait a minute … psychology. Yes, that’s it. Not really a science, is it?

Geoff Sherrington
November 19, 2021 4:04 pm

You need to be careful when using local observations to explain heatwave properties. For example, UHI effects and marine cooling in coastal towns.
For many cities, especially coastal ones, the heatwaves, however defined, originate away from the city. Example, Melbourne Australia gets many of its worst heat waves blown in from central Australia, more than 1,000 miles away. It is the weather and climate of central Australia, combined with wind strength and direction, that governs most heatwave properties in coastal cities to the S-E.
Probably the same with Florida. Does it get heat waves when central US is cool? Geoff S

November 19, 2021 4:08 pm

Very well done analysis. The Urban Heat Island for New York has been commented on several time. I recently updated my Urban Heat Island web site with several more Land Surface Temperature maps, using Analysis Ready Data from the USGS Earth Explorer web site. Some of you might find the maps of interest.
http://www.urbanhi.net/uhi-cities/new-york.html

Prjindigo
November 19, 2021 6:30 pm

The problem with the whole “global warming” thing is that more than 200% of the data is fabricated either through manipulation of actual readings into homogeny or through fraudulent inputs to force table A to perform like expected table B. This means the margin of error on the output is up around 4000% Yes, that’s a four digit number indicating 40x as much error as the original individual data inputs.

This clearly indicates that most climatologists shouldn’t even be allowed to operate single-ply toilet paper.

Chris Wright
November 20, 2021 3:53 am

John, that’s a nice piece of work. Many thanks. As mentioned, it would be great if you could extend it to as many other sites as possible.
When I looked at the four EPA graphs, two things were obvious – and they do not cast the EPA in a good light.
First, the graphs start in the 1960’s, although of course there is good data going back to at least the 1900’s. It’s obvious why they did this: the warm 1930’s is just too inconvenient, so they had to go. Sadly this fraudulent practice is common. Shortly after Biden became presiden, the US wildfire agency deleted all the wildfire data before 1980. Of course, the deleted data shows that US wildfires were far worse during the 1930’s and that today’s wildfires are fairly average.
Second, the graphs show no sign of the cooling in the 1970’s – another example of hiding inconvenient data. In contrast, your graphs clearly show a cool 1970’s. During those years many climate scientists were telling us we were doomed by global cooling. How quickly can climate science bandwagons change!
Still, we shouldn’t be surprised by this scientific malfeasance. In many areas, including climate and air particles, the EPA “science” is hopelessly corrupt.
Chris

Reply to  Chris Wright
November 20, 2021 9:51 am

Hi Chris
I am in the process of collecting data from other sites and will follow up with another post when I have the chance. Many of these older data sets have large gaps that I am loath to fill in so it takes a lot of surfing to find decent ones. Then too there is the problem that many are urban or airport, which may bias the results. That darned day job keeps intruding as well.

November 20, 2021 9:26 am

Regarding the claim that the EPA article “doesn’t provide data, methodologies, or calculations”: I saw these in the EPA article:

“For consistency across the country, Figures 1 and 2 define a heat wave as a period of two or more consecutive days when the daily minimum apparent temperature (the actual temperature adjusted for humidity) in a particular city exceeds the 85th percentile of historical July and August temperatures (1981–2010) for that city.”

Figure 2 has maps showing where these major cities are, and they can be identified. comment image

More methodology statements are in the article farther down.

I don’t see lack of stating methodology, etc. being the problem. I see the stated methodology, apparently chosen to maximize indication of increase of heatwaves, being the problem.

Rick W Kargaard
November 20, 2021 3:51 pm

I’m not sure what these combined charts tell us”
They tell us no more or no less than all the other charts used to demonstrate aspects of global climate change. They do seem to be honest representations of data available but as such only become confusing visuals.
Graphics are too often accepted as a true representation of reality which makes them effective for purposeful misdirection. To be believable they must use unassailable data. If such (observational or experimental) data exists (which I doubt) it can only pertain to the past. The chart may tell you what was, but that may have no relationship at all to what is or what may be.
Thanks for making an already confusing topic even more confusing. I am probably not helping.

RoHa
November 20, 2021 7:04 pm

She certainly can can-can.