Guest Post by Willis Eschenbach
People sometimes ask why I don’t publish in the so-called scientific journals. Here’s a little story about that. Back in 2004, Michael Mann wrote a mathematically naive piece about how to smooth the ends of time series. It was called “On smoothing potentially non-stationary climate time series“, and it was published in Geophysical Research Letters in April of 2004. When I read it, I couldn’t believe how bad it was. Here is his figure illustrating the problem:

Figure 1a. [ORIGINAL CAPTION] Figure 1. Annual mean NH series. (blue) shown along with (a) 40 year smooths of series based on alternative boundary constraints (1) – (3). Associated MSE scores favor use of the ‘minimum roughness’ constraint.
Note the different colored lines showing different estimates of what the final averaged value will be, based on different methods of calculating the ends of the averages. The problem is how to pick the best method.
I was pretty naive back then. I was living in Fiji for one thing, and hadn’t had much contact with scientific journals and their curious ways. So I innocently thought I should write a piece pointing out Mann’s errors, and suggesting a better method. I append the piece I wrote back nearly a decade ago. It was called “A closer look at smoothing potentially non-stationary time series.”
My main insight in my paper was that I could actually test the different averaging methods against the dataset by truncating the data at various points. By doing that you can calculate what you would have predicted using a certain method, and compare it to what the true average actually turned out to be.
And that means that you can calculate the error for any given method experimentally. You don’t have to guess at which one is best. You can measure which one is best. And not just in general. You can measure which one is best for that particular dataset. That was the insight that I thought made my work worth publishing.
Now, here comes the story.
I wrote this, and I submitted it to Geophysical Research Letters at the end of 2005. After the usual long delays, they said I was being too hard on poor Michael Mann, so they wouldn’t even consider it … and perhaps they were right, although it seemed pretty vanilla to me. In any case, I could see which way the wind was blowing. I was pointing out the feet of clay, not allowed.
I commented about my lack of success on the web. I described my findings over at Climate Audit, saying:
Posted Oct 24, 2006 at 2:09 PM
…
[Mann] recommends using the “minimum roughness” constraint … apparently without noticing that it pins the endpoints.
I wrote a reply to GRL pointing this out, and advocating another method than one of those three, but they declined to publish it. I’m resubmitting it.
w.
So, I pulled out everything but the direct citations to Mann’s paper and resubmitted it basically in the form appended below. But in the event, I got no joy on my second pass at publishing it either. They said no thanks, not interested, so I gave up. I posted it on my server at the time (long dead), put a link up on Climate Audit, and let it go. I was just a guy living in Fiji and working a day job, what did I know?
Then a year later, in 2007 Steve McIntyre posted a piece called “Mannomatic Smoothing and Pinned End-points“. In that post, he also discussed the end point problem.
And now, with all of that as prologue, here’s the best part.
In 2008, after I’d foolishly sent my manuscript entitled “A closer look at smoothing potentially non-stationary time series” to people who turned out to be friends of Michael Mann, Dr. Mann published a brand new paper in GRL. And here’s the title of his study …
“Smoothing of climate time series revisited”
I cracked up when I saw the title. Yeah, he better revisit it, I thought at the time, because the result of his first visit was Swiss cheese.
And what was Michael Mann’s main insight in his new 2008 paper? What method did he propose?
“In such cases, the true smoothed behavior of the time series at the termination date is known, because that date is far enough into the interior of the full series that its smooth at that point is largely insensitive to the constraint on the upper boundary. The relative skill of the different methods can then be measured by the misfit between the estimated and true smooths of the truncated series.”
In other words, his insight is that if you truncate the data, you can calculate the error for each method experimentally … curious how that happens to be exactly the insight I wasted my time trying to publish.
Ooooh, dear friends, I’d laughed at his title, but when I first read that analysis of “his” back in 2008, I must admit that I waxed nuclear and unleashed the awesome power that comes from splitting the infinitive. The house smelled for days from the sulfur fumes emitted by my unabashed expletives … not a pretty picture at all, I’m ashamed to say.
But before long, sanity prevailed, and I came to realize that I’d have been a fool to expect anything else. I had revealed a huge, gaping hole in Mann’s math to people who were obviously his friends … and while for me it was an interesting scientific exercise, for him it represented much, much more. He could not afford to leave the hole unplugged or have me plug it.
And since I had kindly told him how to plug the hole, he’d have been crazy to try something else. Why? Because my method worked … hard to argue with success.
The outcome also proved to me once again that I could accomplish most anything if I didn’t care who got the credit.
Because in this case, the sting in the tale is that at the end of the day, my insights on how to deal with the problem did get published in GRL. Not only that, they got published by the guy who would have most opposed their publication under my name. I gotta say, whoever is directing this crazy goat-roping contest we call life has the most outré, wildest sense of humor imaginable …
Anyhow, that’s why I’ve never pushed too hard to try to publish my work in what used to be scientific journals, but now are perhaps better described as popular science magazines. Last time I tried, I got bit … so now, I mostly just skip getting gnawed on by the middleman and put my ideas up on the web directly.
And if someone wants to borrow or steal or plagiarise my scientific ideas and words and images, I say more power to them, take all you want. I cast my scientific ideas on the electronic winds in the hope that they will take root, and I can only wish that, just like Michael Mann did, people will adopt my ideas as their own. There’s much more chance they’ll survive that way.
Sure, I’d prefer to get credit—I’m as human as anyone, or at least I keep telling myself that. So an acknowledgement is always appreciated.
But if you just want to just take some idea of mine and run, sell it under another brand name, I say go for it, take all you want, because I’ve learned my lesson. The very best way to keep people from stealing my ideas is to give them away … and that’s the end of my story.
As always, my best wishes for each of you … and at this moment my best wish is that you follow your dream, you know the one I mean, the dream you keep putting off again and again. I wish you follow that dream because the night is coming and no one knows what time it really is …
w.
[UPDATE] In my above-mentioned comment on Steve McIntyre’s blog, I mentioned the analysis of Mannian smoothing by Willie Soon, David Legates, and Sallie Baliunas, entitled Estimation and representation of long-term (>40 year) trends of Northern-Hemisphere-gridded surface temperature: A note of caution.
Dr. Soon has been kind enough to send me a copy of that study, which I have posted up here. My thanks to him, it’s an interesting paper.
=====================================================
APPENDIX: Paper submitted to GRL, slightly formatted for the web.
—————
A closer look at smoothing potentially non-stationary time series
Willis W. Eschenbach
No Affiliation
[1] An experimental method is presented to determine the optimal choice among several alternative smoothing methods and boundary constraints based on their behavior at the end of the data series. This method is applied to the smoothing of the instrumental Northern Hemisphere (NH) annual mean, yielding the best choice of these methods and constraints.
1. Introduction
[2] Michael Mann has given us an analysis of various ways of smoothing the data at the beginning and the end of a time series of data (Mann 2004, Geophysical Research Letters, hereinafter M2004).
These involve minimizing different boundary conditions at those boundaries, and are called the “minimum norm”, “minimum slope”, and “minimum roughness” methods. These methods minimize, in order, the zeroth, first, and second derivatives of the smoothed average. M2004 describes the methods as follows:
“To approximate the ‘minimum norm’ constraint, one pads the series with the long-term mean beyond the boundaries (up to at least one filter width) prior to smoothing.
To approximate the ‘minimum slope’ constraint, one pads the series with the values within one filter width of the boundary reflected about the time boundary. This leads the smooth towards zero slope as it approaches the boundary.
Finally, to approximate the ‘minimum roughness’ constraint, one pads the series with the values within one filter width of the boundary reflected about the time boundary, and reflected vertically (i.e., about the ‘‘y’’ axis) relative to the final value. This tends to impose a point of inflection at the boundary, and leads the smooth towards the boundary with constant slope.” (M2004)
[3] He then goes on to say that the best choice among these methods is the one that minimizes the mean square error (MSE) between the smoothed data and the data itself:
“That constraint providing the minimum MSE is arguably the optimal constraint among the three tested.” (M2004)
2. Method
[4] However, there is a better and more reliable way to choose among these three constraints. This is to minimize the error of the final smoothed data point in relation, not to the data itself, but to the actual final smoothed average (which will only be obtainable in the future). The minimum MSE used in M2004 minimizes the squared error between the estimate and the data points. But this is not what we want. We are interested in the minimum mean squared error between the estimate and the final smoothed curve obtained from the chosen smoothing method. In other words, we want the minimum error between the smoothed average at the end of the data and the smoothed average that will actually be obtained in the future, when we have enough additional data to determine the smoothed average exactly.
[5] This choice can be determined experimentally, by realizing that the potential error increases as we approach the final data point. This is because as we approach the final data point, we have less and less data to work with, and so the potential for error grows. Accordingly, we can look to see what the error is with each method in the final piece of data. This will be the maximum expected error for each method. While we cannot determine this for any data nearer to the boundary than half the width of the smoothing filter, we can do so for all of the rest of the data. It is done by truncating the data at each data point along the way, calculating the estimated value of the final point in this truncated dataset using the minimum norm, slope, and roughness methods, and seeing how far they are from the actual value obtained from the full data set.
[6] In doing this, a curious fact emerges — if we calculate the average using the “minimum roughness” method outlined above, the “minimum roughness” average at the final data point is just the final data point itself. This is true regardless of the averaging method used. If we reflect data around both the time axis and the y-axis at the final value, the data will be symmetrical around the final value in both the “x” and “y” directions. Thus the average will be just the final data point, no matter what smoothing method is used. This can be seen in Fig. 1a of M2004:

ORIGINAL CAPTION: Figure 1. Annual mean NH series. (blue) shown along with (a) 40 year smooths of series based on alternative boundary constraints (1)–(3). Associated MSE scores favor use of the ‘minimum roughness’ constraint. (Mann 2004)
[7] Note that the minimum roughness method (red line) goes through the final data point. But this is clearly not what we want to do. Looking at Fig. 1, imagine a “smoothed average” which, for a data set truncated at any given year, must end up at the final data point. In many cases, this will yield wildly inaccurate results. If this method were applied to the data truncated at the high temperature peak just before 1880, for example, or the low temperature point just before that, the “average” would be heading out of the page. This is not at all what we are looking for, so the choice that minimizes the MSE between the data and the average (the “minimum roughness” choice) should not be used.
[8] Since the minimum roughness method leads to obvious errors, this leaves us a choice between the minimum norm and minimum slope methods. Fig. 2 shows the same data set with the point-by-point errors from the three methods (minimum norm, minimum slope, and minimum roughness) calculated for all possible points. (The error for the minimum roughness method, as mentioned, is identical to the data set itself.)
[9] To determine these errors, I truncated the data set at each year, starting with the year that is half the filter width after the start of the start of the dataset. Then I calculated the value for the final year of the truncated data set using each of the different methods, and compared it to the actual average for that year obtained from the full data set. I am using a 41-year Gaussian average as my averaging method, but the underlying procedure and its results are applicable to any other smoothing method. I have used the same dataset as Mann, the Northern Hemisphere mean annual surface temperature time series of the Climatic Research Unit (CRU) of the University of East Anglia [Jones et al., 1999], available at http://www.cru.uea.ac.uk/ftpdata/tavenh2v.dat.

Figure 2. Errors in the final data point resulting from different methods of treating the end conditions. The “minimum roughness” method error for the dataset truncated at any given year is the same as the data point for that year.
3. Applications
[10] The size of the errors of the three methods relative to the smoothed line can be seen in the graph, and the minimum slope method is clearly superior for this data set. This is verified by taking the standard deviation of each method’s point-by-point distance from the actual average. Minimum roughness has the greatest deviation from the average, a standard deviation of 0.110 degrees. The minimum norm method has a standard deviation of 0.065 degrees from the actual average, while the minimum slope’s standard deviation is the smallest at 0.048.
[11] Knowing how far the last point in the average of the truncated data wanders from the actual average allows us to put an error bar on the final point of our average. Here are the three methods, each with their associated error bar (all error bars in this paper show 3 standard deviations, and are slightly offset horizontally from the final data point for clarity).

Figure 3. Potential errors at the end of the dataset resulting from different methods of treating the end conditions. Error bars represent 3 standard deviations. The minimum slope constraint yields the smallest error for this dataset.
[12] Note that these error bars are not centered vertically on the final data point of each of the series. This is because, in addition to knowing the standard deviation of the error of each end condition, we also know the average of each error. Looking at Fig. 2, for example, we can see that the minimum norm end condition on average runs lower than the true Gaussian average. Knowing this, we can improve our estimate of the error of the final point. In this dataset, the centre of the confidence limits for the minimum norm will be higher than the final point by the amount of the average error.
3.1 Loess and Lowess Smoothing
[13] This dataset is regular, with a data point for each year in the series. When data is not regular but has gaps, loess or lowess smoothing is often used. These are similar to Gaussian smoothing, but use a window that encompasses a certain number of data points, rather than a certain number of years.
[14] When the data is evenly spaced, both lowess and loess smoothing yield very similar results to Gaussian smoothing. However, the treatment of the final data points is different from the method used in Gaussian smoothing. With loess and lowess smoothing, rather than using less and less data as in Gaussian smoothing, the filter window stays the same width (in this case 41 years). However, the shape of the curve of the weights changes as the data nears the end.
[15] The errors of the loess and lowess averaging can be calculated in the same way as before, by truncating the dataset at each year of the data and plotting the value of the final data point. Fig. 4 shows the errors of the two methods.

Figure 4. Lowess and loess smoothing along with their associated end condition errors.
[16] The end condition errors for lowess and loess are quite different, but the average size of the errors is quite similar. Lowess has a standard deviation of .062 from the lowess smoothed data, and loess has a standard deviation of .061 from the loess smoothed data. Fig 5 shows the Gaussian minimum slope (the least error of the three M2004 end conditions), and the lowess and loess smoothings, with their associated error bars.

Figure 5. Gaussian, lowess and loess smoothing along with their associated error bars. Both lowess and loess have larger errors than the Gaussian minimum slope error.
[17] Of the methods tested so far, the error results are as follows:
METHOD Standard Deviation of Error Gaussian Minimum Roughness 0.111 Gaussian Minimum Norm 0.065 Lowess 0.062 Loess 0.061Gaussian Minimum Slope 0.048
[18] Experimentally, therefore, we have determined that of these methods, for this data set, the Gaussian minimum slope method gives us the best estimate of the smoothed curve which we will find once we have enough additional years of data to determine the actual shape of the curve for the final years of data.
3.2 Improved and Alternate Methods
[19] At least one better method of dealing with the end conditions exists. I call it the “minimum assumptions” method, as it makes no assumptions about the future state of the data. It simply increases the result of the Gaussian smoothing by an amount equal to the weight of the missing data. Gaussian smoothing works by multiplying each data point within the filter width by a Gaussian weight. This weight is greatest for the central point of the filter. From there it decreases in a Gaussian “bell-shaped” curve for points further and further away from the central point. The weights are chosen so that the total of the weights summed across the width of the filter adds up to 1.
[20] Let us suppose that as the center of the filter approaches the end of the dataset, the final two weights do not have data associated with them because they are beyond the end of the dataset. The Gaussian average is calculated in the usual manner, by multiplying each data point with its associated weight and summing the weighted data. The final two points, of course, do not contribute to the total, as they have no data associated with them.
[21] However, we know the total of the weights for the other data points. Normally, all of the weights would add up to 1, but as we approach the end of the data there are missing data points within the filter width. Their total of the existing data points might only be say 0.95, instead of 1. Knowing that we only have 95% of the correct weight, we can approximate the correct total by dividing the sum of the existing weighted data points by 0.95. The net effect of this is a shifted weighting which, as the final data point is approached, shifts the center of the weighting function further and further forwards toward the final data point.
[22] The standard deviation of the error of the minimum slope method, calculated earlier, was 0.048. The standard deviation of the error of the minimum assumptions method is 0.046. This makes it, for this data set, the most accurate of the methods tested. Fig. 6 shows the difference between these two methods at the end of the data set.

Figure 6. Gaussian minimum slope and minimum assumptions error bars. The minimum assumptions method provides the better estimate of the future smoothed curve.
[23] We can also improve upon an existing method. The obvious candidate for improvement is the minimum norm method. It has been calculated by padding the data with the average of the full dataset, from the start to the end of the data. However, we can choose an alternate interval on which to take our average. We can calculate (over most of the dataset) the error resulting from any given choice of interval. This allows us to choose the particular interval that will minimize the error. For the dataset in question, this turns out to be padding the end of the dataset with the average of the previous 5 years of data. Fig 7 shows the individual errors from this method, compared with the minimum assumptions method. Since the results from the two very different methods are quite similar, this increases confidence in the conclusion that these are the best of the alternatives.

Figure 7. Smoothed data (red), minimum assumptions errors (green), tuned minimum norm (previous 5-year average) errors (blue)
[24] The standard deviation of the error from the minimum norm with a 5-year average is slightly smaller than from the minimum assumptions method, 0.045 versus 0.046.
4. Discussion
[25] I have presented a method for experimentally determining which of a number of methods yields the closest approximation to a given smoothing of a dataset at the ends of the dataset. The method can be used with most smoothing filters (Gaussian, loess, low-pass, Butterworth, or other filter). The method also experimentally determines the average error and the standard deviation of the error of the last point of the dataset. Although the Tuned Minimum Norm method yields the best results for this dataset, this does not mean that it will give the best results for other datasets. It also does not mean that the Tuned Minimum Norm method is the best smoothing method possible; there may be other smoothing methods out there, known or unknown, which will give a better result on a given dataset.
[26] The method for experimentally determining the smoothing method with the smallest end-point error is as follows:
1) For each data point for which all of the data is available to determine the exact smoothed average, determine the smoothed result that would be obtained by each candidate method if that data point were the final point of the data. (While this can be done by truncating the data at each point, padding the data if required, and calculating the result, it is much quicker to use a modified smoothing function which simply treats each data point as if it were the last point of the dataset and applies the required padding.)
2) For each of these data points, subtract the actual smoothed result of the given filter at that point from the smoothed result of treating that point as if it were the final point. This gives the error of the smoothing method for the series if it were truncated at that data point.
3) Take the average and the standard deviation of all of the errors obtained by this analysis.
4) Use the standard deviation of these errors to determine the best smoothing method.
5) Use the average and the standard deviation of these errors to establish confidence limits at the final point of the smoothed data.
5. Conclusions
1) The Minimum Roughness method will always yield the largest standard deviation of the endpoint error in relation to the smoothed data and is thus the worst method to choose.
2) For any given data set, the best method can be chosen by selecting the method with the smallest standard deviation of error as measured on the dataset itself.
3) The use of an error bar at the end of the smoothed average allows us to gauge the reliability of the smoothed average as it reaches the end of the data set.
References
Mann, M., 2004, On smoothing potentially non-stationary climate time series, Geophysical Research Letters, Vol. 31, 15 April 2004
butterworth has one or two mails from Mann
david moon said:
To hell it doesn’t. What exactly do you think a Fourier Transform (more particularly, an FFT) does? It correlates a data against an orthonormal basis set of sinusoids.
Indeed, the only thing an FFT will “detect” (it doesn’t really “detect” anything) is a sinusoid, and if there are none there, it is a relatively useless tool.
White noise is stationary, just so you know, and it consists of a sum of sinusoids at all frequencies.
As an EE, you should have learned the above points in your first systems analysis class. You should have also learned about stationarity if you moved on to deeper signal processing concepts.
Do you understand why this happens, i.e., why the Gibbs phenomenon results, and hence, how a window reduces the effect? An FFT – which is what your scope does (spectrum analyzers are better at this, btw), applied to a block of data “looks” like a repeated copy of that data in the time (and frequency) domain, i.e., the waveform is your data and it has period of the length of the data. At the end of each block into the next block is a discontinuity (end of data to start of data) which appears as a wideband signal and results in ringing in the time domain and spectral leakage (scalloping) in the frequency domain. Windowing reduces the size of the discontinuity and hence, the magnitude of the effect.
You and others keep harping on “frequencies.” I’m guessing none of you have ever dealt with non-stationary data. Hint: “frequency” does not mean much with non-stationary data, and thus the concept of “high-pass/low-pass” is equally meaningless.
Indeed.
This is only a certainty if they are linear phase, which is only guaranteed with a symmetric impulse response. Gaussian filters do meet this criteria, btw. Delay is an immaterial point w.r.t. this discussion, however.
Bart’s got it right, above, not that I’m surprised. But again, even an optimal model (w.r.t. any given criteria) will fail as soon as the statistics of the data change, which is what happens with non-stationary data.
Mark
I said:
I should note that this does not occur if the data consists only of sinusoids with an integer number of cycles. EEs know this, too, or should. This is why ADC manufacturers always pick “tones” that have frequencies that are integer multiples (er, divisors?) of the length of the FFT they are plotting – they get a nice clean line and it is easy to see the non-linearities of the sample-hold mechanism (or whatever the input circuitry looks like).
Mark
In response to Fred from Canuckistan re. puddles etc, if a day old puppy piddled a puddle then Michael Mann’s ethics would drown, IMHO.
I use binomial smoothing and taper it towards the ends of the data set with the last average point being the mrean of the last two data points. It gives good results I think but I have not gone back and calculated the error as you truncate the data back in time so I guess it looks like it gives good results. Lets face it these are not precision numbers, they are about trends and the bigger pictures. All this obsessing about whether a trend is established after 10, 12, 13, 15 or 20 years is all a bit cretinous really.
————————————————————————————-
I had exactly the same thought. Even though you are not making a formal claim to GRL it reads like a public “accusation”.
Plagiarism is one of the high crimes in academia and I would gather that Mann would launch a legal attack, or at very least a twit, if this was without merit.
This seems like it may be a risky post. I would love to see if GRL passed it on to Mann, this would be a game changer and worth pursuing if you are confident this is the case. The way I read it is that both yourself and Steve McIntyre at the least were finding fault in his math so his re-visit may have been inevitable.
This could escalate.
Welcome to life at the bottom.
David Moon says: “Gaussian averaging in the time domain- how is the Gaussian function truncated? It has a parameter alpha or equivalently standard deviation. Is this specified or standard in the climate change field? (excuse my ignorance)”
I don’t think most of them would know a gaussian bell if they sat on one. !
“These “moving average” type filters are Finite Impulse Response filters and necessarily have a delay 1/2 the length. Infinite Impulse Response (IIR) filters can be designed for a desired frequency response with much less delay”
Less delay, how much? Please expand. No filter can see into the future. IIR is all history so there has to be phase delay. They also take forever to spin up. If you want any kind of accuracy you have to throw away a huge amount of the result that will not have converged.
Thanks, Willis. Very instructive, in more than one way.
A prediction can never claim to be ‘science.’ It can bebased on science, but the predicted future remains to e seen, and there is no certaintly that predictions will bear out – since we have never been there, we have no surety of what will happen.
A bit before 65 million years ago, I am sure evreyone was quite sure a meteorite would not appear to disrupt everything.
We seem to have been caught quite off-guard by the housing market collapse, even though many of us everyday ppl saw it coming.
I did when my mortgage guy told me he could get me 50% more house than I was asking.
Just because a scientist uses scientific methods and evidence to make a prediction does not mean it will happen.
The essence of science is skepticism, and second to that the essence is makig a prediction, declaring what would be observed if the predicton is correct, a priori, then making the oservation to see if the prediction is true.
with no observed data to compare to hypothesis, there is no hypothesis testing. Hence a prediction can never be science.
Hey, it was just a trick, a clever trick.
I understand Willis’s reluctance to get his hands dirty – it’s a pretty filthy pig he would be wrestling – but I would ask, at what point do we confront the liars, cheats and stealers and force them to expose themselves publicly for what they are? It’s a dilemma, for sure, but one that will have to be faced sooner or later.
One might think that Willis would have an excellent defense – truth – with the opportunity to demonstrate all of the “pig’s” and the others’ lies, cheating and stealing publicly, plus a countersuit of his own against the “pig” and the journal that wouldn’t publish Willis’s rebuttal for conspiracy to deny him his fee speech rights. Are there any lawyers posting here on WUWT who can weigh in on this? I’m certainly no authority here.
All of this said – Willis is doing a tremendous service, and I will support whatever he decides to do here..
I agree with the sentiment that its better for your ideas to be thrown into the public to increase the public’s scientific knowledge. But if scientists in any way betray the trust the people give them, they should be fired and banned from life from academia and Government employ after that forever more. This is especially important in this case, which if true probably required multiple people to throw the idea to Dr. Mann. Yes, I see him as being the “sacrificial goat” where several guys who read this paper probably tossed the idea over to him simply to attempt to get him to do the work.
Perhaps FOIA requests should be made to determine if the emails can be located about this? If he is guilty of violating the trust that the people put into him as a scientist, he deserves to do the time. Like they say, don’t do the crime if you can’t handle the time. FOIA these emails and see where they take you. I am willing to bet that where there is one instance of plagiarism there are many more. Just look out for that “plausible denibility” which these guys are so good at acquiring. I wouldn’t be surprised if he told a graduate student to do the evil deed, and then took that work under his own name. Dr. Mann seems slimy enough to do something like that.
“As always, my best wishes for each of you … and at this moment my best wish is that you follow your dream, you know the one I mean, the dream you keep putting off again and again. I wish you follow that dream because the night is coming and no one knows what time it really is …”
Thanks Willis. I am on it.
Technical question:
When smoothing within a time series one assumes that the system has a homogeneous behaviour (e.g. normal or Gaussian distribution). But at the edge of the dataset the smoothing transforms itself into a prediction that the system will continue to behave as during the core period.
And there is no way to prove or disprove such hypothesis because there is less data available to make the statistical test.
Can statistics be good enough to detect a step change when a systems begins to pass from one stationary state to another one? No, it is an impossible task without making some hypothesis that will most probably be proven false when it will be time to have hindsight.
If, just as an example, one assumes that climate data is erratic then any extrapolation is valid and probably as wrong ay any other one.
If, just another example, one assumes that climate passes from one stationary state to another one, then any extrapolation will miss the step change between two quasi stable periods.
if, as a last example, one assumes that the value of a financial instrument will continue to grow as in the past then you get some surprises (as in 2007 with the subprime crisis).
If, as a very last example, a temperature time period remains more or less stable for 15 years while it had shown a constant increase for some decades, then this tells that the system is not behaving as it was assumed to be. Then the scientiste must begin to look at other models and underlying physical phenomena. More research is needed…
I wouldn’t say the following is true in all fields of science, at all times, but having trained as a scientist, in that country still a British colonial outpost after more than 200 years we call Australia, it is certainly true in some fields, and in some cases.
There are certain scientists who are in the game for recognition. They don’t care what others think, or what the truth is, a person is only of use to them as a commodity they can use. And to them you only have the right to make a contribution if you rise through the establishment, pandering all the way, and then you can publish and make a difference. To them it is very much a class based system, the way society should be, and those at the top get special privelages.
I’m quite sure, having personally experienced it, that the old British class system and values still permeates in various scientific fields and strands; there are indeed still those who become scientists mostly for that reason: to relive the glory days where gentlemen were gentlemen, and everyone else towed the line.
Mark T- my comments were somewhat simplified and broad-brush. I just think it’s odd that all these discussions of smoothing and averaging rarely describe what effect the process has when viewed in the frequency domain. Conversely, why not design the filtering in the frequency domain, e.g., Nth order Butterworth low-pass with cut-off frequency 1/30 yr.
I once worked with some engineers that inserted a “moving average” to clean up some noisy data. I asked if they knew they had made a digital filter, and did they know the frequency response? The answer was No and No. And then it was inserted in the feedback of a closed-loop control system. The overall system operation was totally screwed.
Agreed that a symmetrical FIR filter has its peak at the center, which is where this article started- what do you do when the data runs out 20 years from the end of the data set?
Mark T says:
March 30, 2013 at 9:59 am
This notion assumes a priori that the “signal” consists of sinusoids, which is all you will get back. Sorry, but this is not superior in any real sense of the word.
Your ignorance of mathematics is on display for all to see. Any function without discontinuities and singularities can be represented by the sum of sines. Sorry, that’s not a complete definition, but it will do here. You don’t have to use sines. Sines are merely convenient in some treatments of Fourier analysis.
Willis Eschenbach says:
March 30, 2013 at 10:12 am
Sorry, Willis, I guess I didn’t make it clear. I tried to include a verbal explanation of why I thought the method was BS. But truncation is still BS. That opinion depends on the assumption I understand your process. Let’s say you truncate data from the right side where those points have a fit slope of 1. Now you test various smoothing methods A, B, and C, and then restore the missing data. Then you check how well you did using methods A, B, and C. You find method A worked the best. Now replace the right side data with another set of points with a fit slope of -1 (or anything else you can imagine that makes sense). Does method A still do the best job of smoothing/fitting the new data set? Not necessarily. How does truncation test the intrinsic superiority of various smoothing methods? Seems too much like climate science to me.
Willis Eschenbach says:
March 30, 2013 at 10:34 am
GRAB THE DAMN DATASET AND SHOW US!!
Screaming doesn’t help, dude. Where are your precious data?
I hereby acknowledge Willis as the instigator and discoverer of this work. Poor ol Mann doesn’t know what’s gonna hit him when MSM starts turning on AGW (as is now happening), as the realize how they have been had by this bunch of fraudsters. I hope Mann is fired by the University of Pennsylvania as he is a disgrace to science and higher education in general
Hoser! NO WONDER WILLIS SCREAMS!
The data set is in paragraph [9] of the original paper, and no, I’m not going to put the link here and make it easier. It just proves Willis’ original contention that you are bloviating and have not read his paper. Assumed knowledge is just assumptions and not knowledge at all.
Read the paper. Do the math. Complain about specific issues. It’s easy when you think about it.
pbh
I do not think GRL will weant any involvment in this. The timeline does not support them.
Like I said about greening the deserts, the heck with bridges, lay a pipeline to the nearest big water and put Michael Mann at the end of it. If Mann can suck in just one tenth as much as he blows out, you’ll have it in thirty seconds. Guaranteed!
Megalomaniac!
Hoser says:
March 31, 2013 at 5:19 am
You can write but you can’t read?
Oh, I get it–you have a ghost writer.
Willis, take a look at the DILBERT cartoon of 31 March 2013 (https://blu177.mail.live.com/default.aspx?id=64855#n=2080954&fid=1&mid=eb93fe7d-99c5-11e2-95fc-00215ad6a644). Who comes to mind?
Solomon Green says:
March 30, 2013 at 12:06 pm
The coming absence of lawsuits screams guilt!
DaveE.
Marcott et al have replied through Realclimate. From a qiuick read it all it seems to be a major attempt to get rid of the uptick and pretend it didn;t happen or if it did none of the stuff is “robust”anyway….Cannot wait to see SM demolish this new HS.