Proxy Science and Proxy Pseudo-Science

Guest post by Pat Frank

It’s become very clear that most published proxy thermometry since 1998 [1] is not at all science, and most thoroughly so because Steve McIntyre and Ross McKitrick revealed its foundation in ad hoc statistical numerology. Awhile back, Michael Tobis and I had a conversation here at WUWT about the non-science of proxy paleothermometry, starting with Michael’s comment here and my reply here. Michael quickly appealed to his home authorities at, Planet3.org. We all had a lovely conversation that ended with moderator-cum-debater Arthur Smith indulging a false claim of insult to impose censorship (insulting comment in full here for the strong of stomach).

But in any case, two local experts in proxy thermometry came to Michael’s aid: Kaustubh Thimuralai, a grad student in proxy climatology at U. Texas, Austin and Kevin Anchukaitis, a dendroclimatologist at Columbia University. Kaustubh also posted his defense at his own blog here.

Their defenses shared this peculiarity: an exclusive appeal to stable isotope temperature proxies — not word one in defense of tree-ring thermometry, which provides the vast bulk of paleotemperature reconstructions.

The non-science of published paleothermometry was proved by their non-defense of its tree-ring center; an indictment of discretionary silence.

Nor was there one word in defense of the substitution of statistics for physics, a near universal in paleo-thermo.

But their appeal to stable isotope proxythermometry provided an opportunity for examination. So, that’s what I’m offering here: an analysis of stable isotope proxy temperature reconstruction followed by a short tour of dendrothermometry.

Part I. Proxy Science: Stable Isotope Thermometry

The focus is on oxygen-18 (O-18), because that’s the heavy atom proxy overwhelmingly used to reconstruct past temperatures. NASA has a nice overview here. The average global stable isotopic ratios of oxygen are, O-16 = 99.757%, O-17 = 0.038 %, O-18 = 0.205 %. If there were no thermal effects (and no kinetic isotope effects), the oxygen isotopes would be distributed in minerals at exactly their natural ratios. But local thermal effects cause the ratios to depart from the average, and this is the basis for stable isotope thermometry.

Let’s be clear about two things immediately: first, the basic physics and chemistry of thermal isotope fractionation is thorough and fully legitimate. [2-4]

Second, the mass spectrometry (MS) used to determine O-18 is very precise and accurate. In 1950, MS of O-18 already had a reproducibility of 5 parts in 100,000, [3] and presently is 1 part in 100,000. [5] These tiny values are represented as “%o,” where 1 %o = 0.1% = 0.001. So dO-18 MS detection has improved by a factor of 5 since 1950, from (+/-)0.05%o to (+/-)0.01%o.

The O-18/O-16 ratio in sea water has a first-order dependence on the evaporation/condensation cycle of water. H2O-18 has a higher boiling point than H2O-16, and so evaporates and condenses at a higher temperature. Here’s a matter-of-fact Wiki presentation. The partition of O-18 and O-16 due to evaporation/condensation means that the O-18 fraction in surface waters rises and falls with temperature.

There’s no dispute that O-18 mixes into CO2 to produce heavy carbon dioxide – mostly isotopically mixed as C(O-16)(O-18).

Dissolved CO2 is in equilibrium with carbonic acid. Here’s a run-down on the aqueous chemistry of CO2 and calcium carbonate.

Dissolved light-isotope CO2 [as C(O-16)(O-16)] becomes heavy CO2 by exchanging an oxygen with heavy water, like this:

CO2 + H2O-18 => CO(O-18) + H2O-16

This heavy CO2 finds its way into the carbonate shells of mollusks, and the skeletons of foraminifera and corals in proportion to its ratio in the local waters (except when biology intervenes. See below).

This process is why the field of stable isotope proxy thermometry has focused primarily on O-18 CO2: it is incorporated into the carbonate of mollusk shells, corals, and foraminifera and provides a record of temperatures experienced by the organism.

Even better, fossil mollusk shells, fossil corals, and foraminiferal sediments in sea floor cores promise physically real reconstructions of O-18 paleotemperatures.

Before it can be measured, O-18 CO2 must be liberated from the carbonate matrix of mollusks, corals, or foraminifera. Liberation of CO2 typically involves treating solid CaCO3 with phosphoric acid.

3 CaCO3 + 2 H3PO4 => 3 CO2 + Ca3(PO4)2 + 3 H2O

CO2 is liberated from biological calcium carbonate and piped into a mass spectrometer. Laboratory methods are never perfect. They incur losses and inefficiencies that can affect the precision and accuracy of results. Anyone who’s done wet analytical work knows about these hazards and has struggled with them. The practical reliability of dO-18 proxy temperatures depends on the integrity of the laboratory methods to prepare and measure the intrinsic O-18.

The paleothermometric approach is to first determine a standard relationship between water temperature and the ratio of O-18/O-16 in precipitated calcium carbonate. One can measure how the O-18 in the water fractionates itself into solid carbonate over a range of typical SST temperatures, such as 10 C through 40 C. A plot of carbonate O-18 v. temperature is prepared.

Once this standard plot is in hand, the temperature is regressed against the carbonate dO-18. The result is a least-squares fitted equation that tells you the empirical relationship of T:dO-18 over that temperature range.

This empirical equation can then be used to reconstruct the water temperature whenever carbonate O-18 is known. That’s the principle.

The question I’m interested in is whether the complete physico-chemical method yields accurate temperatures. Those who’ve read my paper pdf on neglected systematic error in the surface air temperature record, will recognize the ‘why’ of focusing on measurement error. It’s the first and minimum error entering any empirically determined magnitude. That makes it the first and basic question about error limits in O-18 carbonate proxy temperatures.

So, how does the method work in practice?

Let’s start with the classic: J. M. McCrea (1950) “On the Isotopic Chemistry of Carbonates and a Paleotemperature Scale“[3], which is part of McCrae’s Ph.D. work.

McCrae’s work is presented in some detail to show the approach I took to evaluate error. After that, I promise more brevity. Nothing below is meant to be, or should be taken to be, criticism of McCrae’s absolutely excellent work — or criticism of any of the other O-18 authors and papers to follow.

McCrae made truly heroic and pioneering experimental work establishing the O-18 proxy temperature method. Here’s his hand-drawn picture of the custom glass apparatus used to produce CO2 from carbonate. I’ve annotated it to identify some bits:

Figure 1: J. McCrae’s CO2 preparative glass manifold for O-18 analysis.

I’ve worked with similar glass gas/vacuum systems with lapped-in ground-glass joints, and the opportunity for leak, crack, or crash-tastrophe is ever-present.

McCrae developed the method by precipitating dO18 carbonate at different temperatures from marine waters obtained off East Orleans, MA, on the Atlantic side of Cape Cod, and off Palm Beach, Florida. The O-18 carbonate was then chemically decomposed to release the O-18 CO2, which was analyzed in a double-focusing mass spectrometer, which they apparently custom built themselves.

The blue and red lines in the Figure below show his results (Table X and Figure 5 in his paper). The %o O-18 is the divergence of his experimental samples from his standard water.

Figure 2, McCrae, 1950, original caption (color-modified): “Variation of isotopic composition of CaCO3(s) with reciprocal of deposition temperature from H2O (Cape Cod series (red); Florida water series (blue)).” The vertical lines interpolate temperatures at %o O-18 = 0.0. Bottom: Color-coded experimental point scatter around a zero line (dashed purple).

The lines are linear least square (LSQ) fits and they reproduce McCrae’s almost exactly (T is in Kelvin):

Florida: McCrae: d18O=1.57 x (10^4/T)-54.2;

LSQ: d18O=1.57 x (10^4/T)-53.9; r^2=0.994.

Cape Cod: McCrae: d18O=1.64 x (10^4/T)-57.6;

LSQ: d18O=1.64 x (10^4/T)-57.4; r^2=0.995.

About his results, McCrae wrote this: “The respective salinities of 36.7 and 32.2%o make it not surprising that there is a difference in the oxygen composition of the calcium carbonate obtained from the two waters at the same temperature.(bold added)”

The boiling temperature of water increases with the amount of dissolved salt, which in turn affects the relative rates that H2O-16 and H2O-18 evaporate away. Marine salinity can also change from the influx of fresh water (from precipitation, riverine, or direct runoff), or from upwelling, from wave-mixing, and from currents. The O-16/O-18 ratio of fresh water, of upwelling water, or of distant water transported by currents, may differ from a local marine ratio. The result is that marine waters of the same temperature can have different O18 fractions. Disentangling the effects of temperature and salinity in a marine O-16/O-18 ratio can be difficult to impossible in paleo-reconstructions.

The horizontal green line at %o O18 = zero intersects the Florida and Cape Cod lines at different temperatures, represented by the vertical drops to the abscissa. These show that the same dO-18 produces a difference of 4 C, depending on which equation you choose, with the apparent T covarying with a salinity change of 0.045%o.

That means if one generates a paleotemperature by applying a specific dO18:T equation to paleocarbonates, and one does not know the paleosalinity, the derived paleotemperature can be uncertain by as much as (+/-)2 C due to a hidden systematic covariance (salinity).

But I’m interested in experimental error. From those plots one can estimate the point scatter in the physico-chemical method itself as the variation around the fitted LSQ lines. The point scatter is plotted along the purple zero line at the bottom of Figure 2. Converted to temperature, the scatter is (+/-)1 C for the Florida data and (+/-)1.5 C for the Cape Cod data.

All the data were determined by McCrae in the same lab, using the same equipment and the same protocol. Therefore, it’s legitimate to combine the two sets of errors in Figure 2 to determine their average, and the resulting average uncertainty in any derived temperature. The standard deviation of the combined errors is (+/-)0.25 %o O-18, which translates into an average temperature uncertainty of (+/-)1.3 C. This emerged under ideal laboratory conditions where the water temperature was known from direct measurement and the marine O18 fraction was independently measured.

Next, it’s necessary to know whether the errors are systematic or random. Random errors diminish as 1/sqrtN, where N is the number of repetitions of analysis. If the errors are random, one can hope for a very precise temperature measurement just by repeating the dO-18 determination enough times. For example, in McCrae’s work, 25 repeats reduces the average error in any single temperature by 1.3/5 => (+/-)0.26 C.

To bridge the random/systematic divide, I binned the point scatter over (+/-)3 standard deviations = (+/-)99.7 % certainty of including the full range of error. There were no outliers, meaning all the scatter fell within the 99.7 % bound. There are only 15 points, which is not a good statistical sample, but we work with what we’ve got. Figure 3 shows the histogram plot of the binned point-scatter, and a Gaussian fit. It’s a little cluttered, but bear with me.

Figure 3: McCrae, 1950 data: (blue points), binned point scatter from Figure 2; red line, Two-Gaussian fit to the binned points; dashed green lines, the two fitted Gaussians. Thin purple points and line: separately binned Cape Cod point scatter; thin blue line and points, separately binned Florida point scatter.

The first thing to notice is that the binned points are very not normally distributed. This immediately suggests the measurement error is systematic, and not random. The two Gaussian fit is pretty good, but should not be taken as more than a numerical convenience. An independent set of measurement scatter points from a different set of experiments may well require a different set of Gaussians.

The two Gaussians imply at least two modes of experimental error operating simultaneously. The two thin single-experiment lines are spread across scatter width. This demonstrates that the point scatter in each data sets participates in both error modes simultaneously. But notice that the two data sets do not participate equivalently. This non-equivalence again indicates a systematic measurement error that apparently does not repeat consistently.

The uncertainty from systematic measurement error does not diminish as 1/sqrtN. The error is not a constant offset and does not subtract away in a difference between data sets. It propagates into a final value as (+/-)sqrt[(sum of N errors)^2/(N-1)].

The error in any new proxy temperature derived from those methods will probably fall somewhere in the Figure 3 envelope, but the experimenter will not know where. That means the only way to honestly present a result is to report the average systematic error, and that would be T(+/-)1.3 C.

This estimate is conservative, as McCrae noted that, “The average deviation of an individual result from the relation is 0.38%o.”, which is equivalent to an average error of (+/-)2 C (I calculated 1.95 C; McCrae’s result). McCrae wrote later, “The average deviation of an individual experimental result from this relation is 2°C in the series of slow precipitations just described.”

The slow precipitation experiments were the tests with Cape cod and Florida water, shown in Figure 2, and McCrae mentioned their paleothermal significance at the end of his paper, “The isotopic composition of calcium carbonate slowly formed from aqueous solution has been noted to be usually the same as that produced by organisms at the same temperature.”

Anyone using McCrae’s standard equations to reconstruct a dO-18 paleotemperature must include the experimental uncertainty hidden inside them. However, they are invariably neglected. I’ll give an example below.

Another methodological classic is Sang-Tae Kim et al. (2007) “Oxygen isotope fractionation between synthetic aragonite and water: Influence of temperature and Mg2+ concentration“.[6]

Kim, et al., measured the relationship between temperature and dO-18 incorporation in Aragonite, a form of calcium carbonate found in mollusk shells and corals (the other typical form is calcite). They calibrated the T:dO-8 relationship at five temperatures, 0, 5, 10, 25, and 40 C which covers the entire range of SST. Figure 4a shows their data.

Figure 4: a. Blue points: Aragonite T:dO-18 calibration experimental points from Kim, et al., 2007; purple line: LSQ fit. Below: green points, the unfit residual representing experimental point-scatter, 1-sigma = (+/-)0.21. b. 3-sigma histogram of the experimental unfit residual (points) and the 3-Gaussian fit (purple line). The thin colored lines plus points are separate histograms of the four data sub-sets making up the total.

The alpha in “ln-alpha” is the O-18 “fractionation factor,” which is a ratio of O-18 ratios. That sounds complicated, but it’s just (the ratio of O-18 in carbonate divided by the ratio of O-18 in water): {[(O-18)c/(O-16)c] / [(O-18)w/(O-16)w]}, where “c” = carbonate, and “w” = water.

The LSQ fitted line in Figure 4a is 1000 x ln-alpha = 17.80 x (1000/T)-30.84; R^2 = 0.99, which almost exactly reproduces the published line, 1000 x ln-alpha = 17.88 x (1000/T)-31.14.

The green points along the bottom of Figure 4a are the unfit residual, representing the experimental point scatter. These have a 1-sigma standard deviation = (+/-)0.21, which translates into an experimental uncertainty of (+/-)1 C.

In Figure 4b is a histogram of the unfit residual point scatter in part a, binned across (+/-)3-sigma. The purple line is a three-Gaussian fit to the histogram, but with the point at -0.58,3 left out because it destabilized the fit. In any case, the experimental data appear to be contaminated with at least three modes of divergence, again implying a systematic error.

Individual data sub-sets are shown as the thin colored lines in Figure 4b. They all spread across at least two of the three experimental divergence modes, but not equivalently. Once again, that means every data set is uniquely contaminated with systematic measurement error.

Kim, et al., reported a smaller analytical error (+/-)0.13, equivalent to an uncertainty in T = (+/-)0.6 C. But their (+/-)0.13 is the analytical precision of the mass spectrometric determination of the O-18 fractions. It’s not the total experimental scatter. Residual point scatter is a better uncertainty metric because the Kim, et al., equation represents a fit to the full experimental data, not just to the O-18 fractions found by the mass spectrometer.

Any researcher using the Kim, et al., 2007 dO-18:T equation to reconstruct a paleotemperature must propagate at least (+/-)0.6 C uncertainty into their result, and better (+/-)1 C.

I’ve done similar analyses of the experimental point-scatter in several studies used to calibrate the T:O-18 temperature scale. Here’s a summary of the results:

Study______________(+/-)1-sigma______n_____syst err?____Ref.

McCrae________________1.3 C_________15_____Y________[3]

O’Neil_________________29 C_________11______?________[7]

Epstein_______________0.76 C________25______?_________[8]

Bemis________________1.7 C_________14______Y________[9]

Kim__________________1.0 C_________70______Y________[6]

Li____________________2.2 C__________5______________[10]

Friedman______________1.1 C__________6______________[11]

O’Neil’s was a 0-500 C experiment

All the Summary uncertainties represent only measurement point scatter, which often behaved as systematic error. The O’Neil 1969 point scatter was indeterminate, and the Epstein question mark is discussed below.

Epstein, et al., (1953), chose to fit their T:dO-18 calibration data with a second-order polynomial rather than with a least squares straight line. Figure 5 shows their data with the polynomial fit, and for comparison a LSQ straight line fit.

Figure 5: Epstein, 1953 data fit with a second-order polynomial (R^2 = 0.996; sigma residual = (+/-)0.76 C) and with a least squares line (R^2 = 0.992; sigma residual = (+/-) 0.80 C). Insets: histograms of the point scatter plus Gaussian fits; Upper right, polynomial; lower left, linear.

The scatter around the polynomial was pretty Gaussian, but left a >3-sigma outlier at 2.7 C. The LSQ fit did almost as well, and put the polynomial 3-sigma outlier within the 3-sigma confidence limit. The histogram of the linear fit scatter required two Gaussians, and left an unfit point at 2.5-sigma (-2 C).

Epstein had no good statistical reason to choose the polynomial fit over the linear fit, and didn’t mention his rationale. The poly fit came closer to the high-temperature end-point at 30 C, but the linear fit came closer to the low-T end-point at 7 C, and was just as good as through the internal data points. So, the higher order fit may have been an attempt to save the point at 30 C.

Before presenting an application of these lessons, I’d like to show a review paper, which compares all the different dO-18:T calibration equations in current use: B. E. Bemis, H. J. Spero, J. Bijma, and D. W. Lea, Reevaluation of the oxygen isotopic composition of planktonic foraminifera: Experimental results and revised paleotemperature equations. [9]

This paper is particularly valuable because it reviews the earlier equations used to model the T:dO18 relationship.

Figure 6 below reproduces an annotated Figure 2 from Bemis, et al. It compares several T:dO-18 calibration equations from a variety of laboratories. They have similar slopes but are offset. The result is that a given dO-18 predicts a different temperature, depending on which calibration equation one chooses. The Figure is annotated with a couple of very revealing drop lines.

Figure 6: Original caption”Comparison of temperature predictions using new O. universa and G. bulloides temperature:dO-18 relationships and published paleotemperature equations. Several published equations are identified for reference. Equations presented in this study predict lower temperatures than most other equations. Temperatures were calculated using the VSMOW to VPDB corrections listed in Table 1 for dO-18w values.”

The green drop lines show that a single temperature associates with dO-18 values ranging across 0.4 %o. That’s about 10-40x larger than the precision of a mass spectrometer dO-18 measurement. Alternatively, the horizontal red extensions show that a single dO-18 measurement predicts temperatures across a ~1.8 C range, representing an uncertainty of (+/-)0.9 C in choice of standards.

The 1.8 C excludes the three lines, labeled 11-Ch, 12-Ch, and 13-Ch. These refer to G. bulloides with 11-, 12-, and 13-chambered shells. Including them, the spread of temperatures at a single dO-18 is ~3.7 C (dashed red line).

In G. bulloides, the number of shell chambers increases with age. Specific gravity increases with the number of chambers, causing the G. bulloides to sink into deeper waters. Later chambers sample different waters than the earlier ones, and incorporate the ratio of O-18 at depth. Three different lines show the vertical change in dO-18 is significant, and imply a false spread in T of about 0.5 C.

Here’s what Bemis, et al., say about it (p. 150), “Although most of these temperature:d18O relationships appear to be similar, temperature reconstructions can differ by as much as 2 C when ambient temperature varies from 15 to 25 C.”

That “2 C” reveals a higher level of systematic error that appears as variations among the different temperature reconstruction equations. This error should be included as part of the reported uncertainty whenever any one of these standard lines is used to determine a paleotemperature.

Some of the variations in standard lines are also due to confounding factors such as salinity and the activity of photosynthetic foraminiferal symbionts.

Bemis, et al., discuss this problem on page 152: “Non-equilibrium d18O values in planktonic foraminifera have never been adequately explained. Recently, laboratory experiments with live foraminifera have demonstrated that the photosynthetic activity of algal symbionts and the carbonate ion concentration ([CO32-]) of seawater also affect shell d18O values. In these cases an increase in symbiont photosynthetic activity or [CO32-] results in a decrease in shell d18O values. Given the inconsistent SST reconstructions obtained using existing paleotemperature equations and the recently identified parameters controlling shell d18O values, there is a clear need to reexamine the temperature:d18O relationships for planktonic foraminifera.”

Bemis, et al., are thoughtful and modest in this way throughout their paper. They present a candid review of the literature. They discuss the strengths and pitfalls in the field, and describe where more work needs to be done. In other words, they are doing honest science. The contrast could not be more stark between their approach and the pastiche of million dollar claims and statistical maneuvering that swamp AGW-driven paleothermometry.

When the inter-methodological ~(+/-)0.9 C spread of standard T:dO-18 equations is added as the rms to the (+/-)1.34 C average measurement error from the Summary Table, the combined 1-sigma uncertainty in a dO-18 temperature =(+/-)sqrt(1.34^2+0.9^2)=(+/-)1.6 C. That doesn’t include any further invisible environmental confounding effects that might confound a paleo-O18 ratio, such as shifts in monsoon, in salinity, or in upwelling.

A (+/-)1.6 C uncertainty is already 2x larger than the commonly accepted 0.8 C of 20th century warming. T:dO-18 proxies are entirely unable to determine whether recent climate change is in any way historically or paleontologically unusual.

Now let’s look at Keigwin’s justly famous Sargasso Sea dO-18 proxy temperature reconstruction: (1996) “The Little Ice Age and Medieval Warm Period in the Sargasso Sea.” [12] The reconstructed Sargasso Sea paleotemperature rests on G. ruber calcite. G. ruber has photosynthetic symbionts, which induces the T:dO-18 artifacts mentioned by Bemis, et al. Keigwin is a good scientist and attempted to account for this by applying an average G. ruber correction. But removal of an average bias is effective only when the error envelope is random around a constant offset. Subtracting the average bias of a systematic error does not reduce the uncertainty width, and may even increase the total error if the systematic bias in your data set is different from the average bias. Keigwin also assumed an average salinity of 36.5%o throughout, which may or may not be valid.

More to the point, no error bars appear on the reconstruction. Keigwin reported changes in paleotemperature of 1 C or 1.5 C, implying a temperature resolution with smaller errors than these values.

Keigwin used the T:dO-18 equation published by Shackleton in 1974,[13] to turn his Sargasso G. ruber dO-18 measurements into paleotemperatures. Unfortunately, Shackleton published his equation in the International Colloquium Journal of the French C.N.R.S., and neither I nor my French contact (thank-you Elodie) have been able to get that paper. Without it, one can’t directly evaluate the measurement point scatter.

However in 1965, Shackleton published a paper demonstrating his methodology to obtain high precision dO-18 measurements. [14] Shackleton’s high precision scatter should be the minimum scatter in his 1974 T:dO-18 equation.

Shackleton, 1965 made five replicate measurements of the dO-18 in five separate samples of a single piece of Italian marble (marble is calcium carbonate). Here’s his Table of results:

Reaction No. _1____2____3____4____5____Mean____Std dev.

dO-18 value__4.1__4.45_4.35__4.2__4.2____4.26%___0.12%o.

Shackleton mistakenly reported the root-mean-square of the point scatter instead of the standard deviation. No big deal, the true 1-sigma = (+/-)0.14%o; not very different.

In Shackleton’s 1965 words, “The major reason for discrepancy between successive measurements lies in the difficulty of preparing and handling the gas.” That is, the measurement scatter is due to the inevitable systematic laboratory error we’ve already seen above.

Shackleton’s 1974 standard T:dO-18 equation appears in Barrera, et al., [15] and it’s T = 16.9 – 4.38(dO-18) + 0.10(dO-18)^2. Plugging Shackleton’s high-precision 1-sigma=0.14%o into his equation yields an estimated minimum uncertainty of (+/-)0.61 C in any dO-18 temperature calculated using the Shackleton T:dO-18 equation.

At the ftp site where Keigwin’s data are located, one reads “Data precision: ~1% for carbonate; ~0.1 permil for d18-O.” So, Keigwin’s independent dO-18 measurements were good to about (+/-)0.1%o.

The uncertainty in temperature represented by Keigwin’s (+/-)0.1%o spread in measured dO-18 equates to (+/-)0.44 C in Shackleton’s equation.

The total measurement uncertainty in Keigwin’s dO-18 proxy temperature is the quadratic sum of the uncertainty in Shackleton’s equation plus the uncertainty in Keigwin’s own dO-18 measurements. That’s (+/-)sqrt[(0.61)^2+(0.44)^2]=(+/-)0.75 C. This represents measurement error, and is the 1-sigma minimum of error.

And so now we get to see something possibly never before seen anywhere: a proxy paleotemperature series with true, physically real, 95% confidence level 2-sigma systematic error bars. Here it is:

Figure 7: Keigwin’s Sargasso Sea dO-18 proxy paleotemperature series, [12] showing 2-sigma systematic measurement error bars. The blue rectangle is the 95% confidence interval centered on the mean temperature of 23.0 C.

Let’s be clear on what Keigwin accomplished. He reconstructed 3175 years of nominal Sargasso Sea dO-18 SSTs with a precision of (+/-)1.5 C at the 95% confidence level. That’s an uncertainty of 6.5% about the mean, and is a darn good result. I’ve worked hard in the lab to get spectroscopic titrations to that level of accuracy. Hat’s off to Keigwin.

But it’s clear that changes in SSTs on the order of 1-1.5 C can’t be resolved in those data. The most that can be said is that it’s possible Sargasso Sea SSTs were higher 3000 years ago.

If we factor in the uncertainty due to the (+/-)0.9 C variation among all the various T:dO-18 standard equations (Figure 6), then the Sargasso Sea 95% confidence interval expands to (+/-)2.75 C.

This (+/-)2.75 C = (uncertainty in experimenter d-O18 measurements) + (uncertainty in any given standard T:dO-18 equation) + (methodological uncertainty across all T:dO-18 equations).

So, (+/-)2.75 C is probably a good estimate of the methodological 95% confidence interval in any determination of a dO-18 paleotemperature. The confounding artifacts of paleo-variations in salinity, photosynthesis, upwelling and meteoric water will bring into any dO-18 reconstruction of paleotemperatures, further errors that are invisible but perhaps of analogous magnitude.

At the end, it’s true that the T:dO18 relationship is soundly based in physics. However, it is not true that the relationship has produced a reliably high-resolution proxy for paleotemperatures.

Part II: Pseudo-Science: Statistical Thermometry

Now on to the typical published proxy paleotemperature reconstructions. I’ve gone through a representative set of eight high-status studies, looking for evidence of science. Evidence of science is whether any of them make use of physical theory.

Executive summary: none of them are physically valid. Not one of them yields a temperature.

Before proceeding, a necessary word about correlation and causation. Here’s what Michael Tobis wrote about that, “If two signals are correlated, then each signal contains information about the other. Claiming otherwise is just silly.”

There’s a lot of that going around in proxythermometry, and clarification is a must. John Aldrich has a fine paper [16] describing the battle between Karl Pearson and G. Udny Yule over correlation indicating causation. Pearson believed it, Yule did not.

On page 373, Aldrich makes a very relevant distinction: “ Statistical inference deals with inference from sample to population while scientific inference deals with the interpretation of the population in terms of a theoretical structure.”

That is, statistics is about the relations among numbers. Science is about deductions from a falsifiable theory.

We’ll see that the proxy studies below improperly mix these categories. They convert true statistics into false science.

To spice up the point, here are some fine examples of spurious correlations, and here are the winners of the 1998 Purdue University spurious correlations contest, including correlations between ice cream sales and death-by-drowning, and between ministers’ salaries and the price of vodka. Pace Michael Tobis, each of those correlated “signals” so obviously contains information about the other, and I hope that irony lays the issue to rest.

Diaz and Osuna [17] point out that distinguishing, “between alchemy and science … is (1) the specification of rigorously tested models, which (2) adequately describe the available data, (3) encompass previous findings, and (4) are derived from well-based theories. (my numbers, my bold)”

The causal significance of any correlation is revealed only within the deductive context of a falsifiable theory that predicts the correlation. Statistics (inductive inference) never, ever, of itself reveals causation.

AGW paleo proxythermometry will be shown missing Diaz and Osuna elements 1, 3, and 4 of science. That makes it alchemy; otherwise known as pseudoscience.

That said, here we go: AGW proxythermometry:

1. Thomas J. Crowley and Thomas S. Lowery (2000) “How Warm Was the Medieval Warm Period?.” [18]

They used fifteen series: three dO-18 (Keigwin’s Sargasso Sea proxy, GISP 2, and the Dunde Ice cap series), eight tree-ring series, the Central England temperature (CET) record, an Iceland temperature (IT) series, and two plant-growth proxies (China phenology and Michigan pollen).

All fifteen series were scaled to vary between 0 and 1, and then averaged. There was complete and utter neglect of the physical meaning of the five physically valid series (3 x dO18, IT, and CET). All of them were scaled to the same physically meaningless unitary bound.

Think about what this means: Crowley and Lowry took five physically meaningful series, and discarded the physics. That made the series fit to use in AGW-related proxythermometry.

There is no physical theory that coverts tree ring metrics into temperatures. That theory does not exist and any exact relationship remains entirely obscure.

So then how did Crowley and Lowery convert their unitized proxy average into temperature? Well, “The two composites were scaled to agree with the Jones et al. instrumental record for the Northern Hemisphere…,” and that settles the matter.

In short, the fifteen series were numerically adjusted to a common scale, averaged, and scaled up to the measurement record. Then C&L reported their temperatures to a resolution of (+/-)0.05 C. Measurement uncertainty in the physically real series was ignored in their final composite. That’s how you do science, AGW proxythermometry style.

Any physical theory employed?: No

Strictly statistical inference?: Yes

Physical content: none.

Physical validity: none.

Temperature meaning of the final composite: none.

2. Timothy J. Osborn and Keith R. Briffa (2006) The Spatial Extent of 20th-Century Warmth in the Context of the Past 1200 Years.” [19]

Fourteen proxies — eleven of them tree rings, one dO-18 ice core (W. Greenland) — were divided by their respective standard deviation to produce a common unit magnitude, and then scaled into the measurement record. The ice core dO-18 had its physical meaning removed and its experimental uncertainty ignored.

Interestingly, between 1975 and 2000 the composite proxy declined away from the instrumental record. Osborn and Briffa didn’t hide the decline, to their everlasting credit, but instead wrote that this disconfirmation is due to, “the expected consequences of noise in the proxy records.”

I estimated the “noise” by comparing its offset with respect to the temperature record, and it’s worth about 0.5 C. It didn’t appear as an uncertainty on their plot. In fact, they artificially matched the 1856-1995 means of the proxy series and the surface air temperature record, making the proxy look like temperature. The 0.5 C “noise” divergence got suppressed and looks much smaller than it really is. Actual 0.5 C “noise” error bars scaled onto the temperature record of their final Figure 3 would have made the whole enterprise theatrically useless, no matter that it is bereft of science in any case.

Any physical theory employed?: No

Strictly statistical inference?: Yes

Physical uncertainty in T: none.

Physical validity: none.

Temperature meaning of the composite: none.

3. Michael E. Mann, Zhihua Zhang, Malcolm K. Hughes, Raymond S. Bradley, Sonya K. Miller, Scott Rutherford, and Fenbiao Ni (2008) “Proxy-based reconstructions of hemispheric and global surface temperature variations over the past two millennia.” [20]

A large number of proxies of multiple lengths and provenances. They included some ice core, speleothem, and coral dO-18, but the data are vastly dominated by tree ring series. Mann & co., statistically correlated the series with local temperature during a “calibration period,” adjusted them to equal standard deviation, scaled into the instrumental record, and published the composite showing a resolution of 0.1 C (Figure 3). Their method again removed and discarded the physical meaning of the dO-18 proxies.

Any physical theory employed?: No

Strictly statistical inference?: Yes

Physical uncertainty in T: none.

Physical validity: none.

Temperature meaning of the composite: none.

4. Rosanne D’Arrigo, Rob Wilson, Gordon Jacoby (2006) “ On the long-term context for late twentieth century warming .” [21]

Tree ring series from 66 sites, variance adjusted, scaled into the instrumental record and published with a resolution of 0.2 C (Figure 5 C).

Any physical theory employed?: No

Strictly statistical inference?: Yes

Physically valid temperature uncertainties: no

Physical meaning of the 0.2 C divisions: none.

Physical meaning of tree-ring temperatures: none available.

Temperature meaning of the composite: none.

5.Anders Moberg, Dmitry M. Sonechkin, Karin Holmgren, Nina M. Datsenko and Wibjörn Karlén (2005) “Highly variable Northern Hemisphere temperatures reconstructed from low- and high-resolution proxy data.” [22]

Eighteen proxies: Two d-O18 SSTs (Sargasso and Caribbean Seas foraminiferal d-O18, and one stalagmite d-O18 (Soylegrotta, Norway), seven tree ring series. Plus other composites.

The proxies were processed using an excitingly novel wavelet transform method (it must be better), combined, variance adjusted, intensity scaled to the instrumental record over the calibration period, and published with a resolution of 0.2 C (Figure 2 D). Following standard practice, the authors extracted the physical meaning of the dO-18 proxies and then discarded it.

Any physical theory employed?: No

Strictly statistical inference?: Yes

Physical uncertainties propagated from the dO18 proxies into the final composite? No.

Physical meaning of the 0.2 C divisions: none.

Temperature meaning of the composite: none.

6. B.H. Luckman, K.R. Briffa, P.D. Jones and F.H. Schweingruber (1997) “Tree-ring based reconstruction of summer temperatures at the Columbia Icefield, Alberta, Canada, AD 1073-1983.” [23]

Sixty-three regional tree ring series, plus 38 fossilwood series; used the standard statistical (not physical) calibration-verification function to convert tree rings to temperature, overlaid the composite and the instrumental record at their 1961-1990 mean, and published the result at 0.5 C resolution (Figure 8). But in the text they reported anomalies to (+/-)0.01 C resolution (e.g., Tables 3&4), and the mean anomalies to (+/-)0.001 C. That last is 10x greater claimed accuracy than the typical rating of a two-point calibrated platinum resistance thermometer within a modern aspirated shield under controlled laboratory conditions.

Any physical theory employed?: No

Strictly statistical inference?: Yes

Physical meaning of the proxies: none.

Temperature meaning of the composite: none.

7. Michael E. Mann, Scott Rutherford, Eugene Wahl, and Caspar Ammann (2005) “Testing the Fidelity of Methods Used in Proxy-Based Reconstructions of Past Climate.” [24]

This study is, in part, a methodological review of the recommended ways to produce a proxy paleotemperature made by the premier practitioners in the field:

Method 1: “The composite-plus-scale (CPS) method, “a dozen proxy series, each of which is assumed to represent a linear combination of local temperature variations and an additive “noise” component, are composited (typically at decadal resolution;…) and scaled against an instrumental hemispheric mean temperature series during an overlapping “calibration” interval to form a hemispheric reconstruction. (my bold)”

Method 2, Climate field reconstruction (CFR): “Our implementation of the CFR approach makes use of the regularized expectation maximization (RegEM) method of Schneider (2001), which has been applied to CFR in several recent studies. The method is similar to principal component analysis (PCA)-based approaches but employs an iterative estimate of data covariances to make more complete use of the available information . As in Rutherford et al. (2005), we tested (i) straight application of RegEM, (ii) a “hybrid frequency-domain calibration” approach that employs separate calibrations of high (shorter than 20-yr period) and low frequency (longer than 20-yr period) components of the annual mean data that are subsequently composited to form a single reconstruction, and (iii) a “stepwise” version of RegEM in which the reconstruction itself is increasingly used in calibrating successively older segments. (my bold)”

Restating the obvious: CPS: Assumed representative of temperature; statistical scaling into the instrumental record; methodological correlation = causation. Physical validity: none. Scientific content: none.

CFR: Principal component analysis (PCA): a numerical method devoid of intrinsic physical meaning. Principal components are numerically, not physically, orthogonal. Numerical PCs are typically composites of multiple decomposed (i.e., partial) physical signals of unknown magnitude. They have no particular physical meaning. Quantitative physical meaning cannot be assigned to PCs by reference to subjective judgments of ‘temperature dependence.’

Scaling the PCs into the temperature record? Correlation = causation.

‘Correlation = causation is possibly the most naive error possible in science. Mann et al., unashamedly reveal it as undergirding the entire field of tree ring proxy thermometry.

Scientific content of the Mann-Rutherford-Wahl-Ammann proxy method: zero.

Finally, an honorable mention:

8. Rob Wilson, Alexander Tudhope, Philip Brohan, Keith Briffa, Timothy Osborn, and Simon Tet (2006), “Two-hundred-fifty years of reconstructed and modeled tropical temperatures.”[25]

Wilson, et al, reconstructed 250 years of SSTs using only coral records, including dO-18, strontium/calcium, uranium/calcium, and barium/calcium ratios. I’ve not assessed the latter three in any detail, but inspection of their point scatter is enough to imply that none of them will yield more accurate temperatures than dO-18.

However, all the Wilson, et al., temperature proxies had real physical meaning. What a great opportunity to challenge the method, and discuss the impacts of salinity, biological disequilibrium, and how to account for them, and explore all the other central elements of stable isotope marine temperatures.

So what did they do? Starting with about 60 proxy series, they threw out all those that didn’t correlate with local gridded temperatures. That left 16 proxies, 15 of which were dO-18. Why didn’t the other proxies correlate with temperature? Rob Wilson & co., were silent on the matter. After tossing two more proxies to avoid the problem of filtering away high frequencies, they ended up with 14 coral SST proxies.

After that, they employed standard statistical processing: divide by the standard deviation, average the proxies together (they used the “nesting procedure,” which adjusts for individual proxy length), and scale up to the instrumental record.

The honorable mention for these folks derives from the fact that they used only physically real proxies, and then discarded the physical meaning of all of them.

That puts them ahead of the other seven exemplars, who included proxies that had no known physical meaning at all.

Nevertheless,

Any physical theory employed?: No

Strictly statistical inference?: Yes

Any physically valid methodology? No.

Physical meaning of the proxies: present and accounted for, and then discarded.

Temperature meaning of the composite: none.

Summary Statement: AGW-related paleo proxythermometry as ubiquitously practiced consists of composites that rely entirely on statistical inference and numerical scaling. They not only have no scientific content, the methodology actively discards scientific content.

Statistical methods: 100%.

Physical methods: nearly zero (stable isotopes excepted, but their physical meaning is invariably discarded in composite paleoproxies).

Temperature meaning of the numerically scaled composites: zero.

The seven studies are typical, and representative of the entire field of AGW-related proxy thermometry. As commonly practiced, it is a scientific charade. It’s pseudo-science through-and-through.

Stable isotope studies are real science, however. That field is cooking along and the scientists involved are properly paying attention to detail. I hereby fully except them from my general condemnation of the field of AGW proxythermometry.

With this study, I’ve now examined the reliability of all three legs of AGW science: Climate models (GCMs) here (calculations here), the surface air temperature record here (pdf downloads, all), and now proxy paleotemperature reconstructions.

Every one of them thoroughly neglects systematic error. The neglected systematic error shows that none of the methods – not one of them — is able to resolve or address the surface temperature change of the last 150 years.

Nevertheless, the pandemic pervasiveness of this neglect is the central mechanism by which AGW alarmism survives. This has been going on for at least 15 years; for GCMs, 24 years. Granting integrity, one can only conclude that the scientists, their reviewers, and their editors are uniformly incompetent.

Summary conclusion: When it comes to claims about unprecedented this-or-that in recent global surface temperatures, no one knows what they’re talking about.

I’m sure there are people who will dispute that conclusion. They are very welcome to come here and make their case.

References:

1. Mann, M.E., R.S. Bradley, and M.S. Hughes, Global-scale temperature patterns and climate forcing over the past six centuries. Nature, 1998. 392(p. 779-787.

2. Dansgaard, W., Stable isotopes in precipitation. Tellus, 1964. 16(4): p. 436-468.

3. McCrea, J.M., On the Isotopic Chemistry of Carbonates and a Paleotemperature Scale. J. Chem. Phys., 1950. 18(6): p. 849-857.

4. Urey, H.C., The thermodynamic properties of isotopic substances. J. Chem. Soc., 1947: p. 562-581.

5. Brand, W.A., High precision Isotope Ratio Monitoring Techniques in Mass Spectrometry. J. Mass. Spectrosc., 1996. 31(3): p. 225-235.

6. Kim, S.-T., et al., Oxygen isotope fractionation between synthetic aragonite and water: Influence of temperature and Mg2+ concentration. Geochimica et Cosmochimica Acta, 2007. 71(19): p. 4704-4715.

7. O’Neil, J.R., R.N. Clayton, and T.K. Mayeda, Oxygen Isotope Fractionation in Divalent Metal Carbonates. J. Chem. Phys., 1969. 51(12): p. 5547-5558.

8. Epstein, S., et al., Revised Carbonate-Water Isotopic Temperature Scale. Geol. Soc. Amer. Bull., 1953. 64(11): p. 1315-1326.

9. Bemis, B.E., et al., Reevaluation of the oxygen isotopic composition of planktonic foraminifera: Experimental results and revised paleotemperature equations. Paleoceanography, 1998. 13(2): p. 150Ð160.

10. Li, X. and W. Liu, Oxygen isotope fractionation in the ostracod Eucypris mareotica: results from a culture experiment and implications for paleoclimate reconstruction. Journal of Paleolimnology, 2010. 43(1): p. 111-120.

11. Friedman, G.M., Temperature and salinity effects on 18O fractionation for rapidly precipitated carbonates: Laboratory experiments with alkaline lake water ÑPerspective. Episodes, 1998. 21(p. 97Ð98

12. Keigwin, L.D., The Little Ice Age and Medieval Warm Period in the Sargasso Sea. Science, 1996. 274(5292): p. 1503-1508; data site: ftp://ftp.ncdc.noaa.gov/pub/data/paleo/paleocean/by_contributor/keigwin1996/.

13. Shackleton, N.J., Attainment of isotopic equilibrium between ocean water and the benthonic foraminifera genus Uvigerina: Isotopic changes in the ocean during the last glacial. Colloq. Int. C.N.R.S., 1974. 219(p. 203-209.

14. Shackleton, N.J., The high-precision isotopic analysis of oxygen and carbon in carbon dioxide. J. Sci. Instrum., 1965. 42(9): p. 689-692.

15. Barrera, E., M.J.S. Tevesz, and J.G. Carter, Variations in Oxygen and Carbon Isotopic Compositions and Microstructure of the Shell of Adamussium colbecki (Bivalvia). PALAIOS, 1990. 5(2): p. 149-159.

16. Aldrich, J., Correlations Genuine and Spurious in Pearson and Yule. Statistical Science, 1995. 10(4): p. 364-376.

17. D’az, E. and R. Osuna, Understanding spurious correlation: a rejoinder to Kliman. Journal of Post Keynesian Economics, 2008. 31(2): p. 357-362.

18. Crowley, T.J. and T.S. Lowery, How Warm Was the Medieval Warm Period? AMBIO, 2000. 29(1): p. 51-54.

19. Osborn, T.J. and K.R. Briffa, The Spatial Extent of 20th-Century Warmth in the Context of the Past 1200 Years. Science, 2006. 311(5762): p. 841-844.

20. Mann, M.E., et al., Proxy-based reconstructions of hemispheric and global surface temperature variations over the past two millennia. Proc. Natl. Acad. Sci., 2008. 105(36): p. 13252-13257.

21. D’Arrigo, R., R. Wilson, and G. Jacoby, On the long-term context for late twentieth century warming. J. Geophys. Res., 2006. 111(D3): p. D03103.

22. Moberg, A., et al., Highly variable Northern Hemisphere temperatures reconstructed from low- and high-resolution proxy data. Nature, 2005. 433(7026): p. 613-617.

23. Luckman, B.H., et al., Tree-ring based reconstruction of summer temperatures at the Columbia Icefield, Alberta, Canada, AD 1073-1983. The Holocene, 1997. 7(4): p. 375-389.

24. Mann, M.E., et al., Testing the Fidelity of Methods Used in Proxy-Based Reconstructions of Past Climate. J. Climate, 2005. 18(20): p. 4097-4107.

25. Wilson, R., et al., Two-hundred-fifty years of reconstructed and modeled tropical temperatures. J. Geophys. Res., 2006. 111(C10): p. C10007.

0 0 votes

Article Rating

183 Comments

Oldest

Newest Most Voted

Inline Feedbacks

View all comments

kim2ooo

April 3, 2012 4:02 am

Watching this thread with interest.

Jessie

April 3, 2012 4:05 am

Sweet, thanks Pat Frank.
Besides this great learning curve
and
Spencer shows compelling evidence of UHI in CRUTem3 data
http://wattsupwiththat.com/2012/03/30/spencer-shows-compelling-evidence-of-uhi-in-crutem3-data/
I have organised my weekend around some serious study and social events. And enlightenment.
Thank you for penning, and the hard work undertaken in providing such a comprehensive piece.

Mark Smith

April 3, 2012 4:10 am

Deriving a physical theory in rigorous method would hideously difficult for any of the nonradiological processes. Take tree rings, the number of factor that go into tree growth are enormous and many factors like local rainfall are simply unknown at the start before you even derive how it would affect the trees.

Robin

April 3, 2012 4:21 am

A tour de force! What Journal would publish this, I wonder? Pat Frank has really gone to town, and one can only hope that the authors of the studies that he has examined will have the courage to respond to his challenge.
As someone who has been involved in industrial applications of the statistics of precision estimation I can only contribute my congratulations.
Robin

johnfpittman

April 3, 2012 4:22 am

Did you send M Tobis amd A Smith a copy? And an invitation to post? I would like to read the response.
You do need to change your rankings though. ANY that use methods as outlined here:
7. Michael E. Mann, Scott Rutherford, Eugene Wahl, and Caspar Ammann (2005) “Testing the Fidelity of Methods Used in Proxy-Based Reconstructions of Past Climate.” [24]
get -1 from biophysics. The meta data from the Climategate 1 emails indicated that as early as 1998 physical evidence existed that refuted the basic methodological assumption outlined in the referenced study.
I beleive that is almost all of them that use tree rings.

Ric Werme

Editor

April 3, 2012 4:33 am

There are several URL link problems in this post. Typical is Ross McKitrick revealed itswhich goes to http://wattsupwiththat.com/2012/04/03/proxy-science-and-proxy-pseudo-science/%E2%80%9D (that’s a quote mark on the end).
I’d volunteer to clean them up, but I have neither access nor time today.

01wmarsh

April 3, 2012 4:34 am

Interesting post, although I will admit that a good deal of it was over my head. 🙂
I’m still confused by the link to the ‘insulting comment’ provided. Traveling to the link destination yields a post on this blog called “”Climate flicker” at the end of the last glacial period” by Mr Watts. Was that the intended destination and, if it was I’m having a hard time identifying what part of the post and/or comments were deemed insulting.

Keitho

Editor

April 3, 2012 4:37 am

Dear Pat Frank thank you so very much for that cogent and eminently readable article.
It does leave one wondering what on Earth these people think they are doing and more importantly why.
I will be spreading this message far and wide. Once again thanks.
(BTW many of the links you provide just go 404 )

Bill Marsh

April 3, 2012 4:38 am

Interesting post, although I will admit most was over my head.
I am confused about the link you provided to the ‘insulting comment’. Traveling the link yields an Anthony Watts post ”Climate flicker” at the end of the last glacial period”. Was that the intended destination? If so, I am having trouble identifying exactly what was insulting about the post. Could you be more specific?

Peter Miller

April 3, 2012 4:50 am

Wow!
Absolutely, definitely not something which will find its way into the next IPCC.Report.
I always enjoy some genuine factual analysis which does the Team and their Cause a well-deserved piece of no good.

jim

April 3, 2012 4:51 am

Keith: “…what on Earth these people think they are doing and more importantly why. ”
JK: Collecting research grants, award money, fame and some business income.
Thanks
JK

Dan Johnston

April 3, 2012 4:51 am

Well done. Your analysis shows just how easy it is to make good science go bad.

kadaka (KD Knoebel)

April 3, 2012 4:55 am

Arghh! Pull the post, pull it now! More than half the links are bad from an added double-quote at the end, and three links in the first two paragraphs are trying to reference this post.
Please fix!

David A

April 3, 2012 5:02 am

Thank you Pat Frank. I read the entire article which is certainly above a layman’s pay grade. However the basic understanding of how systemic error is not recognised, physical mechanism not accounted for, and physical uncertainties, not properly accounted for, comes through.
It started a little slow in that the link in this sentance appears to have nothing to do with M&Ms work, but is instead an article on rapid climate flux 12,000 years ago. “It’s become very clear that most published proxy thermometry since 1998 [1] is not at all science, and most thoroughly so because Steve McIntyre and Ross McKitrick revealed its foundation in ad hoc statistical numerology.”
Did M&M point out an entirely different line of criticism to the same papers in their evaluations?
Do all the reconstructions, including those showing a strong MWP linked at CO2 science, have similar limitations? (I understand if you have not looked at all of them, 😉 It is also true that the numerous solar reconstructions, which statisticaly have a greater correlation to assumed T then CO2, also lack a definitive physical mechanism which can be tested? Finally, if the MWP was as warm or warmer then present, then, assuming the “teams” flat lined CO2 levels, are the climate science models senstivity to CO2 likely very wrong? …and is this why they work so hard to minimize the MWP? (sorry about asking five questions)

Dr Burns

April 3, 2012 5:03 am

Bad links:
>>I’ve now examined the reliability of all three legs of AGW science: Climate models (GCMs) here (calculations here), the surface air temperature record here (pdf downloads, all),

Bloke down the pub

April 3, 2012 5:03 am

Pity the links to the correlation isn’t causation were broken, they could have been useful.
Question, in science is correlation without causation better or not than causation without correlation?

j ferguson

April 3, 2012 5:05 am

Thank you, Pat Frank,
this is a very compelling review.
btw, these links don’t work:
“With this study, I’ve now examined the reliability of all three legs of AGW science: Climate models (GCMs) here (calculations here), the surface air temperature record here (pdf downloads, all), and now proxy paleotemperature reconstructions.”
I look forward to their repair and reading the pdfs.
I’ve had a lot of personal difficulty reconciling my suspicion that the tree-ring paelothermometry was nonsense with the possibility that it actually is nonsense. You’ve helped immensely.
Thanks again.

Lance

April 3, 2012 5:05 am

The emperor has no clothes.
Of course he has been strutting around naked for years now.
Hopefully Mr Frank’s work will help to increase the laughter in the crowd until the old fellow retreats in embarrassment.

tallbloke

April 3, 2012 5:11 am

A tour de force. Clearly written and presented, and a damning indictment of the big money paleo hockey team. The salinity issue is particularly interesting wrt to millenial scale series. The TOC is around a thousand years, and is possibly temperature related. The salinity variable may either be amplifying or diminishing the temperature signal in results. but which is it?
Minor note.
There are spurious quote marks on the ends of the spurious correlations links which are breaking them.

ColinW

April 3, 2012 5:13 am

[some of the web links in the article have a spurious quotation mark ” at the end of the link, breaking them]

David A

April 3, 2012 5:15 am

Oh, one more comment Mr Frank. Do you suppose this is somewhat why one of the team members (Edward Cook) suggested they all (the team) get together, do their best combined work, after which they would know “snip” all about less then 100 year reconstructions, and “snip all” about greater then 100 year reconstructions as far as the TRUTH (his caps, not mine) relating to T?
Yes Sir, I think in the climategate e-mails you have a “Willis like elevator speech” to summarize your article, which is perhaps why, when being secretly honest, they thought they should then publish, retire, and not leave a forwarding address. Link to entire quote here. http://www.google.com/url?sa=t&rct=j&q=&esrc=s&frm=1&source=web&cd=1&ved=0CCYQFjAA&url=http%3A%2F%2Fjunkscience.com%2F2011%2F11%2F26%2Fclimategate-2-0-we-know-f-all%2F&ei=Ael6T7meE6muiQLrx-GpCg&usg=AFQjCNFvGcCz8RH_ZMAR7fHomqJ6T7435g&sig2=1FHlOXR2iVKjLfEZlxaXAQ

Kaboom

April 3, 2012 5:22 am

The three PDF download links near the end are all broken due to an extra ” at the end of the URL. Please fix and delete this comment 🙂

James Ard

April 3, 2012 5:23 am

I’m not a scientist, but using trees as a temperature proxy seems crazy. Both temperature and co2 levels affect the growth rate of trees, among other factors, how do you attribute what growth to what?

Craig Goodrich

April 3, 2012 5:24 am

Thanks for an illuminating analysis. One point, though: all of your links end with a double-quote, which confuses Internet Explorer and have to be removed manually to retrieve the page.

Bill Illis

April 3, 2012 5:27 am

The δ18O isotopes vary considerably depending on temperature, latitude, altitude, proximity to the ocean, rate of precipitation, seasonally and local climate and geologic effects. On long time scales over millions of years, the amount of δ18O declines in sedimentary rock due to diagenesis.
So there are many formulae on how to convert δ18O into a local temperature and a global temperature. Climate science has often misused the different formulae to obtain whatever number they want to.
The best explanation I have found of how to use it properly is by Jan Veizer (although there are several textbooks on this). This is only for those who might end up working with the data.
http://www.science.uottawa.ca/eih/ch3/ch3.htm
Basically, you have to know how the δ18O varies with temperature (and other potential impactors such as seasonal changes) for the particular location and source you are using. If you on top of a 3 km high glacier, you need a different formula than if you taken deep sea rock cores at the equator.
Other than that, however, the δ18O isotopes are the best temperature proxy we have.

David A

April 3, 2012 5:31 am

James Ard says:
April 3, 2012 at 5:23 am
I’m not a scientist, but using trees as a temperature proxy seems crazy. Both temperature and co2 levels affect the growth rate of trees, among other factors, how do you attribute what growth to what?
——————————————————————-
Well James, you can’t, unless you are a climate scientist, and part of the team, then you can feel the soul of the tree.
“…..I have wondered about trees.
They are sensitive to light, to moisture, to wind, to pressure.
Sensitivity implies sensation. Might a man feel into the soul of a tree
for these sensations? If a tree were capable of awareness, this faculty
might prove useful. ”
“The Miracle Workers” by Jack Vance

Dr Burns

April 3, 2012 5:37 am

Pat,
An interesting paper. I’d be keen to hear your comments about the claimed errors in CRU’s measured ‘global temperature’ rather than just proxies.
CRU states of global average temperatures:
“Annual values are approximately accurate to +/- 0.05°C (two standard errors) for the period since 1951.” Global temperatures are presented to 0.001 degrees back to 1850.
http://www.cru.uea.ac.uk/cru/data/temperature/
CRU errors seem to be calculated on the assumption that ‘global temperature’ is a physical quantity that has random errors. More measurements are claimed to reduce these errors, as they would for random errors.
Temperatures around 1951 were recorded to +/- 0.5 degrees
http://www.srh.noaa.gov/ohx/dad/coop/EQUIPMENT.pdf page 11
The error for more than 90% of individual weather stations is greater than 1.0 degree
http://www.surfacestations.org/
The accuracy of measurements seems little better than a finger held aloft. It seems to me that if I used my forefinger as a temperature sensor, and took a sufficient number of readings, I too could claim an accuracy of +/- 0.05°C for my forefinger.
I’d appreciate your comments as to what the true accuracy of CRU temperatures might be.

leftturnandre

April 3, 2012 5:38 am

I guess it’s time to ponder some more about the water isotope paleothermometer. I’ll prepare something for that.
Hint: Non calor sed umor

David A

April 3, 2012 5:38 am

tallbloke says:
April 3, 2012 at 5:11 am
—————————————————
Tallbloke, I also was curious if the proxy signa, relating to T, was biased up or down with more salinity.

proskeptic

April 3, 2012 5:40 am

Apologies for hijacking this thread for a moment but you guys have seen this interview with James Lovelock from 2010 right? It’s probably old news – but wtf? And the Comments section! A bunch of supposed liberals with nothing to say about their hero advocating a suspension of democracy!
http://www.guardian.co.uk/environment/blog/2010/mar/29/james-lovelock
“We need a more authoritative world. We’ve become a sort of cheeky, egalitarian world where everyone can have their say. It’s all very well, but there are certain circumstances – a war is a typical example – where you can’t do that. You’ve got to have a few people with authority who you trust who are running it. And they should be very accountable too, of course.
But it can’t happen in a modern democracy. This is one of the problems. What’s the alternative to democracy? There isn’t one. But even the best democracies agree that when a major war approaches, democracy must be put on hold for the time being. I have a feeling that climate change may be an issue as severe as a war. It may be necessary to put democracy on hold for a while.”
[snip . . OT . . kbmod]

Mickey Reno

April 3, 2012 6:08 am

Thanks for your thorough and elucidating look at the state of the art of the modern ‘science’ of proxy-paleo-thermometry. Your honorable mention example #8 actually had me laughing.
I’m left wondering exactly when it became acceptable in any science to ignore the axiom that correlations don’t imply causation? And why are scientific journals allowing papers that make such presumptions to be published? This is one of the most stunning parts of climate science as practiced by “Real” climate scientits.

Bill Wood

April 3, 2012 6:09 am

Correlation does not equal causation. Causation normally follows the arrow of time. Two simultaneous events indicate they may have common causation. It is up to physical science to develop the theoretical basis for a causative relatonship.
One of the constant relationshps is hemlines to stock market performance. This makes Chairman Bernanke the ultimate arbiter of fashion. I am looking forward to this summer if he continues to flood the markets with liquidity.

richardscourtney

April 3, 2012 6:15 am

Several commentators miss the main point in the above article; i.e.
Correlation implies nothing about causation
but
absence of correlation disproves direct causation.
So, anything which assumes causation exists because correlation exists is pseudoscience. And almost all proxy studies in climatology make that assumption.
Simply, I am dumbfounded by the quotation in the article which reports Michael Tobis having written;
“If two signals are correlated, then each signal contains information about the other. Claiming otherwise is just silly.”
An undergraduate would obtain a fail mark for writing that in an assignment.
It is important to note that the assumption of correlations resulting from unknown causal links is common to almost all proxy studies used in ‘climate science’.
As an addendum, correlation may suggest the possibility of a causal link (because absence of correlation disproves direct causation) but that possibility needs to be investigated before any assumption that a causal link exists. When (n.b. only when) the mechanism of a causal link between two parameters has been demonstrated to exist then the correlation may be assumed to be an indicator of one parameter by the other.
Richard

Steve Richards

April 3, 2012 6:18 am

Stunning!
Someone who understands science, statistics and is prepared to put his head above the parapet.

trbixler

April 3, 2012 6:22 am

Impressive review of so much pseudo science that has received so much fanfare and money. Thank You for the effort and Thank You to Anthony for the place to post this information.

Orson Olson

April 3, 2012 6:29 am

Long and detailed; thoughtful and meaty. Thank you Pat Frank. Many of us have wanted such a tutorial for a great many years. Now we’ve got one. Thanks again!

elmer

April 3, 2012 7:14 am

The problem with ice cores is they aren’t annual rings or layers, they just indicate cold/warm temperatures. In Minnesota one storm can produce several different layers of snow types. Tree rings are also problematic, although they are annual rings, temperature is just one variable in a trees annual growth. Water, soil and EVEN CO2 all have an impact on the tree’s annual growth. If CO2 increases in a certain area the trees will grow faster regardless of temperature.

Dixon

April 3, 2012 7:14 am

This is brilliant (if rather more than a primer 🙂
Presumably the paleotemp from a shell is some kind of time and growth-weighted average of the temperature while the animal was alive? If so, and given how much the temperature in the water column can vary in coastal environments I can see potential pitfalls in using this sort of information as a temperature proxy (as well as depth/animal size). For instance off Western Australia last March we had an extreme heatwave which literally wiped out huge swathes of Haliotis Roeii (abalone). The mortality was caused by a few days of especially hot still weather which allowed water temperature to climb above those tolerated by the abalone. These shells will now be over-represented in any deposit, and the growth periods within them not reflective of the heat event at all.
How would seasonal variation in water temps get captured in the isotope ratios?
Are we sure that a shell can sit in a liquid CO2 rich medium without any exchange processes occurring between the shell matrix and the liquid? Presumably the assumption in this type of paleothemo approach is that there are none (much as ice cores are allegedly sealed and stable over the period the core is accumulating, and is removed and stored). It would be interesting to know if this has been considered?
BTW I loved your distinctions between statistical and scientific inferences, that should be explicitly stated more often!
Thank you.

elmer

April 3, 2012 7:30 am

It’s a self fullfilling prophecy
Increased CO2 levels make tree rings bigger.
Bigger tree rings are being used to show an increase in temperature.
Therefore increased CO2 levels cause temperatures to increase.

Max Hugoson

April 3, 2012 7:36 am

I’d be very careful about upsetting the vendor’s tables in the Temple, around Easter time.
You know what happens to prophets in Jerusalem. The ‘religious authorities’ are looking for someone to betray you right now. Don’t pray in any gardens and be glad the Ides of March are behind us!

theduke

April 3, 2012 7:48 am

I have used the phrase, “Correlation is not causation” frequently in posts about climate and I now have a much better understanding of what I meant!

wayne Job

April 3, 2012 8:03 am

Mr Frank you have been indeed frank and if your analysis is indeed correct you have laid down a guantlet. The gloves are off and expect a bare knuckle fight, pulling the rug out from these leading lights of global warming will most likely elicit a severe response. Thank you it is time they got a blood nose.

Joe Born

April 3, 2012 8:04 am

I’d like to come back and read this again when the links work.

Brian D

April 3, 2012 8:04 am

Was it just me or did i just see the scientific version of Hulk Hogan body slamming Andre the giant?

gnomish

April 3, 2012 8:14 am

thank you for a most cogent article. it was a pleasure to read. it is a pleasure to know there is someone out there remaining who is capable of writing informatively, implying that he credits the reader with intelligence.
contains no hawafeena!

Gary Pearse

April 3, 2012 8:21 am

This thorough and very readable work, which includes detail on how the oxygen isotopes respond to temperature and their incorporation into calcium carbonate is masterful – all so that the reader can properly follow the criticism of paleoproxies- deserves very wide distribution. I think you are too generous in your “Granting integrity, one can only conclude that the scientists, their reviewers, and their editors are uniformly incompetent.” Incompetence without doubt but the level of integrity has been fully revealed in climategate and other places as being disgustingly absent. The AGW industry has sprung from ideology and a greed fed by an unprecedented deluge of cash for the destruction of civilization. Honest science could only get in the way of this movement.
Sadly, there is a slim chance of publishing this excellent paper in the A-list journals. Roy Spencer’s recent bemoaning the unliklihood of his paper being published (on UHI accounting for much of the uptrend in local (and probably global) temperature, suggests that a new generation of scientific journals has become necessary for the health of science – please don’t give Nature, Scientific American, Proceedings of the Royal Society…. a pardon after this nightmare has ended.

wmconnolley

April 3, 2012 8:42 am

So many words, and so little point. Meanwhile, the stream of science flows on around and over you, and doesn’t even notice you exist.
The obvious thing to look at is the comparison of borehole thermometry to D-O18 in Greenland.

wayne

April 3, 2012 8:54 am

It’s great to see some good and proper science in climatology every now and then. Well written and constructed Pat, and I have to agree with your conclusion. I have little to add, you seem to have covered it all.

woodNfish

April 3, 2012 9:06 am

“Granting integrity, one can only conclude that the scientists, their reviewers, and their editors are uniformly incompetent.”
Sorry Dr. Frank, but these people have no integrity. They are outright frauds and they know it and they are doing it anyway.

Stephen Richards

April 3, 2012 9:08 am

Thank you for penning, and the hard work undertaken in providing such a comprehensive piece.
I echo this and all the above, Pat. A nice piece of cogent argument which goes to the heart of how science is done and written.
Many thanks. If only more people from the establishment would do the same.

ckb

Editor

April 3, 2012 9:12 am

“So many words, and so little point. Meanwhile, the stream of science flows on around and over you, and doesn’t even notice you exist.
The obvious thing to look at is the comparison of borehole thermometry to D-O18 in Greenland.”
Oh look, a little comment from a small man. Content free, per usual. Do you disagree with the analysis? Do tell…

Zeke

April 3, 2012 9:13 am

“So then how did Crowley and Lowery convert their unitized proxy average into temperature? Well, “The two composites were scaled to agree with the Jones et al. instrumental record for the Northern Hemisphere…,” and that settles the matter.
In short, the fifteen series were numerically adjusted to a common scale, averaged, and scaled up to the measurement record. Then C&L reported their temperatures to a resolution of (+/-)0.05 C. Measurement uncertainty in the physically real series was ignored in their final composite.”
This is a memorable lesson in forming a larger system from smaller systems. This is where the malpractice will lie. Then it gets fed into a computer model, which is basically a scientific diagram which you don’t get to look at.

James Ard

April 3, 2012 9:14 am

Years ago I would never have believed Elmer’s self fulfilling prophecy comment could be anything but a joke. But my experience watching these frauds leads me to believe that’s exactly why trees were selected as a proxy.

Dennis Nikols, P. Geo

April 3, 2012 9:17 am

Excellent review and explanation. One must now ask: One, how do these authors and the reviewers of their work justify publishing results without error bars and statements of systematic error? Two, how is it that unscientific foolishness such as paleo temperatures from tree rings even gets into the “scientific literature”?

Gary

April 3, 2012 9:20 am

It also should be mentioned that Nick Shackleton was a careful and prolific scientist who pioneered mass-spectrometry and performed thousands of O-18 analyses. A major contribution he made was refining the dating of sediment cores which is fundamental to their use as proxies.

Jan

April 3, 2012 9:21 am

I think these are the correct links with respect to this:
Michael quickly appealed to his home authorities at, Planet3.org (http://planet3.org/2012/02/21/singers-proxy-argument-refuted/) . We all had a lovely conversation that ended with moderator-cum-debater Arthur Smith indulging a false claim of insult to impose censorship (insulting comment in full here (http://planet3.org/moderated-comments/#comment-4647) for the strong of stomach).
You need to scroll down to the bottom of the page, past the flowering crab to see the discussion:
http://planet3.org/2012/02/21/singers-proxy-argument-refuted/
The link was moderated and put into the ‘bore hole’ and shown in full here:
http://planet3.org/moderated-comments/#comment-4647

LamontT

April 3, 2012 9:21 am

Very nice. Yes things CAN be used as a proxy for temperature BUT it requires knowing a lot about what was going on with the proxy. You can’t simply say that something is a proxy and then blindly go and apply it across the board. This is one of the points where climate science broke down.
They declared as a flat out rule that tree rings are always a proxy for temperature. And then use any and all tree rings that way in any and all regions. Then when they can actually compare their proxy to actual recorded temperatures in various places it is found not to work that well. It is clear that they didn’t stop and wonder about the mechanics of what is going on and investigate it. Instead they hand waved and went on.
The reality is anything used as a proxy only works some of the time when the conditions are right. And you need to know those conditions before you can safely use it as a temperature proxy. There are other things that can cause the same effect that is assigned to temperature. This means that temperature isn’t always what creates measured data. And that means that you can’t simply use something like tree rings or other proxy’s blindly as temperature proxies. You must know a lot more about each measured point and what is going on as a secondary effect before you can reliably use any set as a proxy. And this is a massive amount of work that isn’t being done.
Essentially once something has been declared a proxy it gets blindly used from then on and that can’t work. Not reliably. Sad really I suspect that the entire proxy business will be thrown out because of the defense of it rather than people sitting down and doing the massive amount of work needed to determine valid versus invalid proxies. Oh and if you can’t determine the secondary factors for any time period you can’t reliably use the proxy for that time period. A big pain but this is the reality of science. it isn’t neat and orderly but a lot of work if you want to do it right.

James Sexton

April 3, 2012 9:23 am

wmconnolley says:
April 3, 2012 at 8:42 am
So many words, and so little point. Meanwhile, the stream of science flows on around and over you, and doesn’t even notice you exist.
The obvious thing to look at is the comparison of borehole thermometry to D-O18 in Greenland.
===========================================================
Thanks Billy! Your thoughtful insights are a welcomed contribution to the discussion!!
Did you just do a hand wave? Don’t look here….. look there!! I’m a bit disappointed. After such a lengthy essay, the response is “go look at Greenland.” 😐
In the meantime, science hasn’t continued. Like a damn built with mired and muddied thought, unwarranted assumptions, untestable hypothesis’s and braced with ideological pursuits, the flow of science, particularity in the climate arena, has essentially stood still going on about 2-3 decades now, with only trickles released from time to time.
But, thanks to people like Pat Frank, the dam will burst soon enough.

Don Keiller

April 3, 2012 9:32 am

Michael Tobis was correct when he said “If two signals are correlated, then each signal contains information about the other. Claiming otherwise is just silly.”
Any climate reconstruction whose authors include “Jones” or “Mann” will be devoid of real science and vice versa.

Share this:

Like this:

Related