Guest Post by Willis Eschenbach
I’ve been involved in climate science for a while now, this is not my first rodeo. And I’ve read so many pseudo-scientific studies that I’m starting to develop a list of signs that indicate when all is not well with a particular piece of work.
One sign is whether, how, and when they cite the IPCC “Bible”, their “IPCC Fourth Assessment Report”. The previous report was called the “T. A. R.” for “Third Assessment Report”, but the most recent one is called “AR4” rather than the “F. A. R. “, presumably to avoid using the “F-word”. This report is thousands upon thousands of pages of … of … of a complex mix of poorly documented “facts”, carefully selected computer model runs, good science, blatantly political screeds from Greenpeace and the World Wildlife fund, excellent science, laughable errors, heavily redacted observations, poor science, “data” which turns out to be computer model output, claims based on unarchived data, things that are indeed known and correctly described, shabby science, alarmist fantasies, things they claim are known that aren’t known or are incorrectly described, post-normal science, overstated precision, and understated uncertainty. That covers most of the AR4, at least.
Since many of the opinions expressed therein are vague waffle-mouthed mush, loaded with “could” and “may” and “the chance of” and “we might see by 2050”, you can find either support or falsification within its pages for almost any position you might take.
I have an “IPCC fail-scale” that runs from 1 to 30. The higher the number, the more likely it is that the paper will be quoted in the next IPCC report, and thus the less likely it is that the paper contains any actual science.
I’d seen some high-scoring papers, but a team of unknowns has carried off the prize, and very decisively, with a perfect score of 30 out of 30. So how does my “IPCC Fail-Scale” work, and how did the newcomers walk off with the gold?
First, there are three categories, “how”, “whether”, and “when”. They are each rated from zero to ten. The most important of these is how they cite the IPCC report in the text. If they cite it as something like “IPCC Fourth Assessment Report: Climate Change 2007 (AR4), Volume I, pages 37-39 and p. 40, Footnote [3]”, they get no points at all. That’s far too scientific and too specific. You could quickly use that citation to see if it supports their claims, without blindly searching and guessing at what they are citing. No points at all for that.
If they cite it as “IPCC Fourth Assessment Report: Climate Change 2007 (AR4), Volume I” I award them five points for leaving out the page and paragraph numbers. They get only two points if they just omit the paragraph. And they get eight points if they leave out the volume. Leaving out a URL so their version can’t be found gets a bonus point. But to get the full ten points, they have to disguise the report in the document. They can’t seem to be building their castles on air. So how did the winning paper list the IPCC Fourth Assessment Report in their study?
They list it in the text as “Solomon 2007”. That’s absolutely brilliant. I had to award the full ten points just for style. Plus they stuck the landing, because Susan Solomon is indeed listed as the chief culprit in the IPCC documents, and dang, I do like the way they got around advertising that they haven’t done their homework. 10 full points.
Next, where do they cite it? Newcomers to the field sometimes cite it way at the end of their study (0 to 5 points) or in the middle somewhere (six to nine points). But if you have real nerve, you throw it in as your very first reference. That’s what got them the so-called “brownie point”, the extra score named after the color of their nose, the final point that improves their chances of being in the Fifth Assessment Report. Once again, 10 out of 10 points to the winner, “Solomon 2007” is the first reference out of the box.
Finally, do they cite the IPCC at all? Of course, the authors not citing the IPCC Report greatly improves the odds that the author has actually read, understood, and classified the IPCC document as a secondary source, so no points if they don’t cite it, 10 points if they cite it. One points per occurrence for citing it indirectly through one of their citations, to a maximum of 8. And of course, the winner has ten points in this category as well.
And what is this paragon of scientific studies, this ninja reference-master of analyses, this brazen grab by the newcomers for the crown?
Quite appropriately, it is a study which shows that when the Arctic is warmer, we should expect Northern winters to be colder.
Lately there have been a string of bitterly cold winters … who would have guessed? Well, as the authors of the study point out, none of the climate models guessed it, that’s for sure.
The study is “Arctic warming, increasing snow cover and widespread boreal winter cooling“, by Judah L Cohen, Jason C Furtado, Mathew A Barlow, Vladimir A Alexeev and Jessica E Cherry. This study proves once again that in the topsy-turvy world of climate science, all things are explainable by the AGW hypothesis … but only in hindsight.
It’s also a curious study in that the authors, who are clearly AGW supporters, are baldly stating that the climate models are wrong, and trying to explain why they are wrong … man, if I say the models are wrong, I get my hand slapped by the AGW folks, but these authors can say it no problem. It does put them into a difficult position, though, explaining why their vaunted models got it wrong.
Finally, if they are correct that a warmer Arctic has cooler winters, then for the average Arctic temperature to be rising, it would have to be much, much warmer in the summers. I haven’t seen any data supporting that, but I could have missed it. In fact, thinking about cooling winters, one of the longest underlying claims was that CO2 warming was going to lead to warming winters in the extra-tropics and polar regions … what happened to that claim?
CONCLUSIONS in no particular order
• I have no idea if what they are claiming, about snow and cold being the result of warming, is correct or not. They say:
Understanding this counterintuitive response to radiative warming of the climate system has the potential for improving climate predictions at seasonal and longer timescales.
And they may be right in their explanation. My point was not whether they are correct. I just do love how every time the models are shown to be wrong, it has the “possibility of improving climate predictions”. It’s never “hmmm … maybe there’s a fundamental problem with the models.” It’s always the Panglossian “all is for the best in the best of all possible worlds.” From their perspective, this never ever means that the models were wrong up until now. Instead, it just makes them righter in the future. They’ve been making them righter and even righterer for so long that any day now we should reach righterest, and in all that time, the models have never been wrong. In fact, we are advised to trust them because they are claimed to do so well …
• Mrs. Henninger, my high school science teacher, had very clear rules about references. The essence of it was the logical scientific requirement that the reader be able to unambiguously identify exactly what you were referencing. For example, I couldn’t list “The Encyclopedia Britannica, Volume ‘Nox to Pat'” as a reference in a paper I submitted to her. I’d have gotten the paper back with a huge red slash through that reference, and deservedly so.
Now imagine if I’d cited my source as just “The Encyclopedia Britannica”? A citation to “The Encyclopedia Britannica” is worse than no citation, because it is misleading. It lends a scientifically deceptive mask of actual scholarship to a totally unsupported claim. And as a result …
• Citing the IPCC TAR in its entirety, without complete volume, page, and if necessary paragraph numbers, is an infallible mark of advocacy disguised as science. It means that the authors have drunk the koolaid, and that the reviewers are asleep at the switch.
• Mrs. Henninger also would not let us cite secondary sources as being authoritative. If we wanted a rock to build on, it had to, must be, was required to refer to the original source. Secondary sources like citing Wikipedia were anathema to her. The Encyclopedia Britannica was OK, but barely, because the articles in the Britannica are signed by the expert who wrote each article. She would not accept Jones’s comments on Smith’s work except in the context of discussing Smith’s work itself.
But the IPCC is very upfront about not doing a single scrap of science themselves. They are just giving us their gloss on the science, a gloss from a single highly-slanted point of view that assumes what they are supposed to be setting out to establish.
As a result, the IPCC Reports are a secondary source. In other words, if there is something in the IPCC report that you are relying on, you need to specify the underlying original source. The IPCC’s comments on the original source are worthless, they are not the science you are looking for.
• If the global climate models were as good as their proprietors claim, if the models were based on physical principles as the programmers insist … how come they all missed it? How come every one of them, without exception, got the wrong answer about cold wintertimes?
• And finally, given that the models are unanimously wrong on the decadal scale, why would anyone place credence in the unanimity of their predictions of the upcoming Thermageddon™ a century from now? Seriously, folks, I’ve written dozens of computer models, from the simple to the very complex. They are all just solid, fast-calculating embodiments of my beliefs, ideas, assumptions, errors, and prejudices. Any claim that my models make is nothing more than my beliefs and errors made solid and tangible. And my belief gains no extra credibility simply because I have encoded it plus the typical number of errors into a computer program.
If my beliefs are right, then my model will be accurate. But all too often, my models, just like everyones’ models, end up being dominated by my errors and my prejudices. Computer climate models are no different. The programmers didn’t believe that arctic warming would cause cooler winters, so guess what? The models agree, they say that arctic warming will cause warmer winters. Fancy that. Now that the modelers think it will happen, guess what future models will do.
Now think about their century-long predictions, and how they can only reflect the programmers beliefs, prejudices, and errors … here is the part that many people don’t seem to understand about models:
The climate models cannot show whether our beliefs are correct or not, because they are just the embodiment of our beliefs. So the fact that their output agrees with our beliefs means nothing. People keep conflating computer model output and evidence. The only thing it is evidence of is the knowledge, assumptions, and theoretical mistakes of the programmers. It is not evidence about the world, it is only evidence of the programmers’ state of mind. And if the programmers don’t believe in cooling winters accompanying Arctic warming, the models will show warmer winters. As a result, the computer models all agreeing that the winters will be warmer is not evidence about the real world. No matter how many of the models agree, no matter how much the modelers congratulate each other on the agreement between their models, it’s still not evidence.
My best to all,
w.

The notion that a warmer Arctic means colder NH winters is directly falsifiable via a simple thought experiment. Just make the Arctic warmer and warmer… until it is physically impossible to have a cold NH winter. Now at what point did the original notion fail? There is no point of failure — because it’s all a failure. Similarly, the authors must therefore hold that if the Arctic gets colder, the NH winters will be warmer — similarly absurd. The paper is not science.
“As a result, the computer models all agreeing that the winters will be warmer is not evidence about the real world. No matter how many of the models agree, no matter how much the modelers congratulate each other on the agreement between their models, it’s still not evidence”
If you with evidence mean that only something that has been measured and agree with the assumption made is an evidence then I agree on above statement. But for me a model output that agree with measured data and predicts future data with the same accuracy as it has modeled past known data is “evidence” that I may be on the right track understanding the process. Maybe evidence is not the right “scientific” word in my case.
I can only agree on what you write about what a model represent, your own prejudice, assumptions, etc. I am doing an own study on simple models for modeling the cause for the CO2 increase and one thing that struck me is that a model using the exact same formula but with completly different input values in 2 cases can give an output that matches real world data. When you look at the details of how the model came to the same conclusion in both cases there will be some factors that are completly different but you have no real world data that can tell you which one that is most accurate on that specific factor, so which model do you put most trust on?
The one that best fits your assumption ofcourse…
I wonder how many AGW model runs that has ended with a result that matches real world data but they do not fit the modelers prejudice of what should be the main factors and the result is thus thrown away as faulty and never gets documented.
conrad clark says:
February 1, 2012 at 10:12 pm
Quit trying to hijack the thread, Conrad. Go to school or to a nunnery and learn some manners. Better yet, take your nastiness and just go away!
Here is the IARC web page at uaf.edu.
http://ine.uaf.edu/werc/projects/seward/team.html
Here is a video telling us we must live within our means. And of course, the children…
I actually think AR4 is quite moderate. You do not find the doomsday predictions of several meter increase in sea level or any certain predictions extreme weather events in the IPCC assessment reports. Those worst predictions, which are typically cited in the news, are usually not cited in the IPCC reports.
The estimate of sea level rise in AR4 is 39 cm rise from 1990 to 2100, down from estimated 48 cm for the same period in the TAR. The rise is thought to be caused by both thermal expansion of sea water and glacier melting.
AR4 do say that we know for certain that the global temperature has increased substantially in the last century, and that the level of GHG has also increased, but they do not claim that we know for certain that human activity is a cause of the temperature increase. AR4 says that it is very likely that human activity is a cause for the temperature increase, but this is not certainty. I think this comes as a surprise to many.
Willis and Conrad, what’s with the pissing contest? None of you brought anything good to that part of the discussion, and both of you came out of it looking bad. Willis, it really is not worth getting dragged into the mud like that.
From my point of view, Conrad had some interesting thoughts in that first comment. Had he put them a bit differently, or had Willis been a bit more overbearing, we might have learned something. Now, Conrad will leave this site thinking that WUWT is a place full of brats and whacky skeptics. 30 points for both of you, and that is leaving out the bonuses for self-referencing.
when you’re good, you’re awesome.
great article willis.
I notice that computer models didn’t prevent design flaws from creeping into the F-35 Joint Strike Fighter.
http://www.wired.com/dangerroom/2011/12/joint-strike-fighter-13-flaws/
And the climate system is vastly more complex than this little object.
Why isn’t AR4 called PHOARRR?
@ur momisugly Gary Mount .. from that link…
“Trillion-Dollar Jet Has Thirteen Expensive New Flaws”
add… “that have been found so far”
trouble with the AGW hypothesis, is that it is based on one MAJOR FLAW.. being that CO2 causes warming…. everything else, models included, is just corollaries from a flawed hypothesis.
Willis
I am currently reviewing the draft AR5 like, no doubt, many others on this site. Not having been involved in the process before I was quickly astonished by the amount of assertion, speculation and conjecture that was immediately apparent within the areas I am competent to comment on.
I had a very circular argument with the IPCC in Geneva when I queried a comment that ‘unpublished research has shown’ regarding a particularly contentious aspect of ocean warming. I queried as to who authored the unpublished research and what it actually said, which brought forth a string of emails between us as I asked for a copy (which we are permitted to see)
This culminated in the surprising (to me) response that only cited (unpublished or otherwise) material i.e that with a number and a correspomnding reference to the authors, could be provided. A (seemingly) wild assertion with no citation could not be provided as it…er…had no citation.
I know Mosh has written elsewhere as to how much better the draft of Ar5 is than Ar4. That may be so, but it therfore beggars the imagination as to what the draft AR4 was like.
It may well be of course that the final AR5 may be very different to its draft-but if it isn’t I can only comment that the eventual summary for policymakers-which previously seemed to bear little relationship to the original final assessment- is going to be well worth scrutinising. It is likely to contain numerous examples of puffery and hype.
tonyb
Sometimes models are programmed because the programmer has little or no idea what the outcome will be. Monte carlo methods are often used in such models. The results are sometimes the opposite of what the modeler was expecting.
I agree with anders valland.
http://wattsupwiththat.com/2012/02/01/cites-and-signs-of-the-times/#comment-882392
No way to conduct a dialogue. A very interesting article none-the-less, and I take it’s point.
I just love the way the programmers “adjust” their models to correlate with the past – especially when they totally failed before the “fixes” or should I say fudges and new routines to model the past – and then they proudly say – “see, we were right and it’s worse than we thought”.
I still remember GIGO and managed to program in that vein several times.
To help me slim down I built a predictive computer model of my weight given diet and exercise as inputs. So far the model output has been within 99.2% of my measured weight. By running this model 23 weeks and 1 day forward I have evidence that I will be lighter than air before the end of the year. Alert the media: I am going to fly like Shazam!
Anders Valland says:
February 1, 2012 at 11:24 pm
Anders, if you think Conrad had some interesting thoughts, why the heck haven’t you discussed them with Conrad? I’m certainly not stopping you. Speak up, man, and ask him questions or comment on his ideas. Because I assure you that blaming me because you and Conrad haven’t been discussing interesting things goes nowhere.
Me, when a man starts off the discussion by attacking me, and tops it by saying I should go back to school, I bite back. If you don’t like it, skip my replies. Some of us defend ourselves. If that offends you, look elsewhere. If you want someone who will blow in Conrad’s ear and rub his tummy when he comes into the discussion and acts like a pompous prat, you are reading the wrong man’s work. I’m not that guy.
Anders, you are about the 413th person who has come through the door whose first words are to tell me I ‘m doing it all wrong, that my comments are too boring, that my topics are not topical, or what I say is too overbearing, or I’m too dismissive of serial stupidity, that I should write shorter pieces, or longer pieces, that I shouldn’t puncture pomposity, or a hundred other reasons why I’m wronger than wrong. You are merely today’s version of, in Spiro T. Agnew’s memorable words, the “nattering nabobs of negativity”. By tomorrow there will be another frustrated would-be “Miss Manners” popping up to tell me that my morals or my methods or my mannerisms are terribly wrong.
So here’s the deal for me, here’s how I resolve it. It’s like what the song says:
“But it’s all right now,
I learned my lesson well.
You can’t please everybody, so
You got to please yourself.”
What will Conrad think when leaving this site? I haven’t a clue. I have written over 200 posts on WUWT, and I had a million page views last year of my work. I’m positive that not all of the million were happy with what I said. If 1% of them were dissatisfied, that’s hundreds and hundreds of folks out there who, like you, feel I should go back to school, or feel I should have been nicer to poor Conrad, or have some other complaint about my actions.
So, I can’t please everybody, and although the percentage is small, that’s still a lot of unpleased people. Some of them, like Conrad, want to start out by lecturing me, telling me I’m a jerk who doesn’t understand computers and should go back to school. Others clutch their pearls and gasp in horror when I call someone out for coming here and starting by lecturing me and going back to school.
You may be accustomed to people who trim their sails based on polls, and votes from the readers, and expressions of concern, and the like. Me, I’m a seaman who follows the stars and the winds. Focus groups mean nothing to me. So please, save your expressions of concern for someone who might appreciate them.
The thing that all you guys seem to be missing is this. The Alexa statistics show WUWT is far and away the most influential climate science blog on the web. The blog statistics show I am far and away the most popular guest author on the site. I have been pillorying serial stupidity and not putting up with abuse since I started writing here. Why should I change now?
Hard to argue with success, WUWT has been very successful, and generally my motto is, if it ain’t broke don’t fix it. So no, I fear I won’t take your well-meant and perhaps reasonable advice, Anders. I am who I am, I don’t suffer either abuse or fools gladly, it’s no good for me to pretend otherwise, and some folks actually appreciate my honesty. I welcome people who come with honest questions or scientific objections. People who just come to abuse me? Yes, they might leave dissatisfied, as you suggest Conrad might be going.
Or they might stick around and contribute to the conversation. Up to them.
All the best,
w.
“Seconding everything Willis said in response to conrad clark.
conrad clark wrote:
WTF! You don’t calibrate your models, run them against an independent test set, align your initial guesses with objective reality?”
Adding my tuppence worth… This time last year the Formula one racing teams were putting the finishing touches to their 2011 cars ready for testing. McLaren, who employ some of the best computer modellers and simulators in the world, put their new car on the track and expected it to be a competitive car. However, it was a case that the “models” were wrong. Do McLaren do all the calibration and testing required? Hell yes to the nth degree. They even sent their 2010 car to the first 2011 test to get accurate measurements to use for calibration and comparison for the parameters of their 2011 car, but a computer model cannot possibly integrate everything that happens in reality AND it does indeed integrate the assumptions (prejudices and beliefs) of the people who design the system and code the models.
McLaren had gone from the front of the grid at the end of 2010 to the back of the grid (in terms of pace) at the second test of 2011, which is a massive loss of pace. What the computer models told them about the 2011 car was wrong.
Unlike climate change “experts”, McLaren engineers HAVE to deal in reality as the results on the track cannot be fudged. They recognised that there was an error in their simulations and put in place measures to correct them. They constantly take the data gathered from the race weekends and enter that into the simulation software to calibrate their software against reality and that is with the relatively small set of parameters effecting a race car in any given weather on any given track. Even then, the simulator is wrong to a greater or lesser degree from the reality when the team gets to an actual race weekend.
Climate models have to incorporate an almost infinitely more complex system, with far less predictability, in a chaotic, non-linear integration of competing and synchronous climatic systems and sub-systems. The bias of the people designing the models, (or at least those who give them the data and operational parameters) are going to have a massive effect on what the models tell them.
I know enough about computer modelling and computer simulations to know that they are a valuable tool in running multiple variants to gather general data quickly, and only when filled with predictable logic, but they are less than useless as a predictive tool when dealing with chaotic systems with multiple complex, unpredictable and many unknown feedbacks. Tossing a coin is as effective a predictive tool, as a computer model of the climate.
Models are useful in that they can be used to encapsulate a hypothesis that in some sense is mathematically consistent. This is better than a hypothesis that is not mathematically consistent. But I would agree that they should not be cited as if their output were empirical observations.
Willis do us all a favour and don’t change.
Why say in a couple of sentences what can be said in six pages?
d.
Agnostic says:
February 2, 2012 at 12:15 am
Great, Agnostic. The vote is now 9,412 to 3 that I’m a jerk. Thanks for voting.
I notice that you and Anders haven’t had a dialog with Conrad at all, but you don’t like my dialog with him. If you can do better, why aren’t you doing so?
w.
As a non-scientist following these discussions for some time, it seems to me there is a philosophy of science question here that is more general than those raised by climate science: When is computer code “science”? Is it ever science? Can rules be laid down that would make it science? When Einstein writes down E=MC**2, any scientist on the planet (or elsewhere for that matter) can understand it, formulate experiments to test it, draw conclusions from it, etc. Where are we when someone writes a big, complex computer program claiming to be a representation of reality? At best it seems like the doodles that perhaps Einstein used as motivation before he actually formulated the science. Writing code to fit past data and then make predictions seems little more than a sophisticated form of curve fitting. If it doesn’t work or new data doesn’t fit, just tweak it and run it again. After it is tweaked, is it the same model or something different? In essence, running the code is not sufficient to say that the code is “science”. Somehow, the construction of the code and the assumptions of the code have to be accessible and atomically adjustable to start to say that some model is “science”. And most importantly, models must be capable of being compared to experimental/observed data; if they do not agree, then they are not right and must be rejected as bogus, failed attempts to grok reality.
Doug Cotton there is a problem with your paper. Page 2 is corrupted with overprinting. So I got all but that part.
What I could read I liked and it is nice to see that someone else agrees that heat cannot flow spontaneously from cold to hot. This is a major failing of the GHG theory.
Climatereason says:
“I am currently reviewing the draft AR5 like, no doubt, many others on this site. Not having been involved in the process before I was quickly astonished by the amount of assertion, speculation and conjecture that was immediately apparent within the areas I am competent to comment on. ”
The combination in AR5 of assertion, speculation, conjecture embedded in an array of good science, mediocre science, bad science mixed with politics and financially dependent ego’s is a recipe for disaster.
Willis,
I agree with a comment given earlier: please don’t ever change!
Your articles are all gems.