Usoskin Et Al. Discover A New Class of Sunspots

Guest Post by Willis Eschenbach

There’s a new post up by Usoskin et al. entitled “Evidence for distinct modes of solar activity”. To their credit, they’ve archived their data, it’s available here.

Figure 1 shows their reconstructed decadal averages of sunspot numbers for the last three thousand years, from their paper:

usoskin figure 2Figure 1. The results of Usoskin et al.

Their claim is that when the decadal average sunspot numbers are less than 21, this is a distinct “mode” of solar activity … and that when the decadal average sunspot numbers are greater than 67, that is also a separate “mode” of solar activity.

Now, being a suspicious fellow myself, I figured I’d take a look at their numbers … along with the decadal averages of Hoyt and Schatten. That data is available here.

I got my first surprise when I plotted up their results …

Figure 2 shows their results, using their data.

decadal sunspot numbers usoskinFigure 2. Sunspot numbers from the data provided by Usoskin et al.

The surprising part to me was the claim by Usoskin et al. that in the decade centered on 1445, there were minus three (-3) sunspots on average … and there might have been as few as minus ten sunspots. Like I said, Usoskin et al. seem to have discovered the sunspot equivalent of antimatter, the “anti-sunspot” … however, they must have wanted to hide their light under a bushel, as they’ve conveniently excluded the anti-sunspots from what they show in Figure 1 …

The next surprise involved why they chose the numbers 21 and 67 for the breaks between the claimed solar “modes”. Here’s the basis on which they’ve done it.

usoskin Figure 3Figure 3. The histogram of their reconstructed sunspot numbers. ORIGINAL CAPTION: Fig. 3. A) Probability density function (PDF) of the reconstructed decadal sunspot numbers as derived from the same 106 series as in Fig. 2 (gray-filled curve). The blue curve shows the best-fit bi-Gaussian curve (individual Gaussians with mean/σ being 44/23 and 12.5/18 are shown as dashed blue curves). Also shown in red is the PDF of the historically observed decadal group sunspot numbers (Hoyt & Schatten 1998) (using bins of width ΔS = 10).

The caption to their Figure 3 also says:

Vertical dashed lines indicate an approximate separation of the three modes and correspond to ±1σ from the main peak, viz. S = 21 and 67.

Now, any histogram has the “main peak” at the value of the “mode”, which is the most common value of the data. Their Figure 3 shows the a mode of 44, and a standard deviation “sigma” of 23. Unfortunately, their data shows nothing of the sort. Their data has a mode of 47, and a standard deviation of 16.8, call it 17. That means that if we go one sigma on either side of the mode, as they have done, we get 30 for the low threshold, more than they did … and we get 64 for the high threshold, not 67 as they claim.

So that was the second surprise. I couldn’t come close to reproducing their calculations. But that wouldn’t have mattered, because I must admit that I truly don’t understand the logic of using a threshold of a one-sigma variation above and below not the mean, not the median, but the mode of the data … that one makes no sense at all.

Next, in the right part of Figure 1 they show a squashed-up tiny version of their comparison of their results with the results of Hoyt and Schatten … the Hoyt-Schatten data has its own problems, but let’s at least take a look at the difference between the two. Figure 4 shows the two datasets during the period of overlap, 1615-1945:

comparison usoskin and hoyt sunspotsFigure 4. Decadal averages of sunspots, according to Hoyt-Schatten, and also according to Usoskin et al.

Don’t know about you, but I find that result pretty pathetic. In a number of decades, the difference between the two approaches 100% … and the results don’t get better as they get more modern as you’d expect. Instead, at the recent end the Hoyt-Schatten data, which at that point is based on good observations, shows about twice the number of sunspots shown by the Usoskin reconstruction. Like I said … not good.

Finally, and most importantly, I suspect that at least some of what we see in Figure 3 above is simply a spurious interference pattern between the length of the sunspot cycles (9 to 13 years) and their averaging period of ten years. Hang on, let me see if my suspicions are true …

OK, back again. I was right, here are the results. What I’ve done is picked a typical 12-year sunspot cycle from the Hoyt-Schatten data. Then I replicated it over and over starting in 1600. So I have perfectly cyclical data, with an average value of 42.

But once we do the decadal averaging? … well, Figure 5 shows that result:

pseudo sunspot dataFigure 5. The effect of decadal averaging on 12-year pseudo-sunspot cycles. Upper panel (blue) shows pseudo-sunspot counts, lower panel (red) shows decadal averaging of the upper panel data.

Note the decadal averages of the upper panel data, which are shown in red in the lower panel … bearing in mind that the underlying data are perfectly cyclical, you can see that none of the variations in the decadal averages are real. Instead, the sixty-year swings in the red line are entirely spurious cycles that do not exist in the data, but are generated solely by the fact that the 10-year average is close to the 12-year sunspot cycle … and the Usoskin analysis is based entirely on such decadal averages.

But wait … it gets worse. Sunspot cycles vary in length, so the error caused by the decadal averaging will not be constant (and thus removable) as in the analysis above. Instead, decadal averaging will lead to a wildly varying spurious signal, which will not be regular as in Figure 5 … but which will be just as bogus.

In particular, using a histogram on such decadally averaged data will lead to very incorrect conclusions. For example, in the pseudo-sunspot data above, here is the histogram of the decadal averages shown in red.

histogram pseudo sunspot dataFigure 6. Histogram of the decadal average data shown Figure 5 above.

Hmmm … Figure 6 shows a peak on the right, with secondary smaller peak on the left … does this remind you of Figure 3? Shall we now declare, as Usoskin et al. did, and with equal justification, that the pseudo-sunspot data has two “modes”?

CONCLUSIONS:

In no particular order …

1. The Usoskin et al. reconstruction gives us a new class of sunspots, the famous “anti-spots”. Like the square root of minus one, these are hard to observe in the wild … but Usoskin et al. have managed to do it.

2. Despite their claims, the correlation of their proxy-based results with observations is not very good, and is particularly bad in recent times. Their proxies often give results that are in error by ~ 100%, but not always in the same direction. Sometimes they are twice the observations … sometimes they are half the observations. Not impressive at all.

3. They have set their thresholds based on a bizarre combination of the mode and the standard deviation, a procedure I’ve never seen used.

4. They provided no justification for these thresholds other than their histogram, and in fact, you could do the same with any dataset and declare (with as little justification) that it has “modes”.

5. As I’ve shown above, the shape of the histogram (which is the basis of all of their claims) is highly influenced by the interaction between the length(s) of the sunspot cycle and the decadal averaging.

As a result of all of those problems, I’m sorry to say that their claims about the sun having “modes” simply do not stand up to close examination. They may be correct, anything’s possible … but their analysis doesn’t even come near to establishing that claim of distinct solar “modes”.

Regards to all,

w.

THE USUAL: If you disagree with me or someone else, please quote the exact words that you disagree with. That way, we can all understand just what it is that you object to.

0 0 votes
Article Rating

Discover more from Watts Up With That?

Subscribe to get the latest posts sent to your email.

140 Comments
Inline Feedbacks
View all comments
Matthew R Marler
February 22, 2014 7:41 pm

Thank you again to Willis Eschenbach, Bob Tisdale and Leif Svalgaard.

February 22, 2014 7:41 pm

timetochooseagain says:
February 22, 2014 at 6:47 pm
Could we calibrate the data Usoskin used against more accurate sunspot data
Yes, we could [and should], but then we don’t find the same hockey stick with late 20th century data going through the roof [like temps].
Jean Parisot says:
February 22, 2014 at 6:48 pm
Why are we averaging relatively sparse data sets that can be visualized with today’s technology?
The modern sunspot data is produced [deliberately] with the technology of centuries ago.
bushbunny says:
February 22, 2014 at 7:01 pm
I can not rationalize where they got the data from for 3,000 years ago.
Sunspots are magnetic fields that when extended from the sun keeps some cosmic rays out of the solar system. Cosmic rays smash into Nitrogen and Oxygen atoms in the atmosphere converting them to Carbon 14 which is radioactive. Carbon is plant food, so the radioactive Carbon ends up in tree rings which we can count and so measure how active cosmic rays [and thus the Sun] were thousands of years ago.

February 22, 2014 7:45 pm

JBirks says:
February 22, 2014 at 7:34 pm
But is that even possible in an environment where the scientists themselves are going rogue to promote their own causes?
From what I know about Ilya, Usoskin believes in his stuff, so has not gone rogue.

Duncan
February 22, 2014 7:54 pm

Maybe I’m just tired, but usually I understand basic math better than this.
[quote Willis]
The caption to their Figure 3 also says:
Vertical dashed lines indicate an approximate separation of the three modes and correspond to ±1σ from the main peak, viz. S = 21 and 67.
Now, any histogram has the “main peak” at the value of the “mode”, which is the most common value of the data. Their Figure 3 shows the a mode of 44, and a standard deviation “sigma” of 23. Unfortunately, their data shows nothing of the sort. Their data has a mode of 47, and a standard deviation of 16.8, call it 17. That means that if we go one sigma on either side of the mode, as they have done, we get 21 for the low threshold, just as they did … but we get 74 for the high threshold, not 67 as they claim.
[/quote]
47 – 17 is 21?
47 + 17 is 74?
[You’re right, I was the tired one. Mea culpa. The point remains. I’ve corrected the post. Many thanks, w.]

Matthew R Marler
February 22, 2014 7:59 pm

Also shown in red is the PDF of the historically observed decadal group sunspot numbers (Hoyt & Schatten 1998) (using bins of width ΔS = 10).
Binning by decadal groups of sunspot numbers, then producing the pdf of that, produces less precise estimates of the pdf of the sunspot numbers than what the authors did, which was to fit mixtures of Gaussians to the unbinned annual counts. For reference, see the soon to be published book “Analysis of Neuronal Data” by R. Kass, U. Eden and E. Brown (Springer, 2014), and other texts on non-parametric density estimation.
Thus this is inaccurate: Their Figure 3 shows the a mode of 44, and a standard deviation “sigma” of 23.
That mean and sd refer to the mean and sd of the fitted Gaussian, so 44 +/- 23 is correct for mean +/- sd. The individual Gaussians are scaled so that the sum of their areas is 1, rather than the area of each.
I grant you that’s little evidence for a break around 21 defining 2 separate “modes of solar activity”, and there is no evidence for a break around 67 defining another “mode of activity”. Their paper does not describe this “mixture distribution” modeling (as it is called) in any detail. The full figure 3 has alternatives, names of fitting algorithms, and references to sources.

Matthew R Marler
February 22, 2014 8:02 pm

oops: That mean and sd refer to the mean and sd of the fitted Gaussian, so 44 +/- 23 is correct for mean +/- sd.
I meant in that sentence to refer to the fitted Gaussian with the largest error.

Matthew R Marler
February 22, 2014 8:03 pm

oh, man. I meant “the fitted Gaussian with the largest area”. grumble, grumble.

Owen in GA
February 22, 2014 8:10 pm

47+17=64
7+7=14 carry the one
4+1+1=6
I’ve been known to carry an extra one from time to time myself.

bobl
February 22, 2014 8:12 pm

A classic nyquist rate problem , they are generating an intermodulation product within the frequency band of the signal by sampling at a rete below the nyquist rate that is less than twice the frequency of the data. Their result is therefore total rubbish, the 11-13 year sunspot cycle has been aliased to a 2- 3 year signal in the average data, Nyquist says it must be thus.

Tad
February 22, 2014 8:55 pm

Numerology. Now a recognized branch of Science.

DonV
February 22, 2014 9:04 pm

Hear! Hear! bobl. Right on!

February 22, 2014 9:44 pm

FWIW,
last 1000 years sure has a lot of similarity to this :
http://en.wikipedia.org/wiki/File:2000_Year_Temperature_Comparison.png

February 22, 2014 9:47 pm

Jeff L says:
February 22, 2014 at 9:44 pm
FWIW, last 1000 years sure has a lot of similarity to this :
http://en.wikipedia.org/wiki/File:2000_Year_Temperature_Comparison.png

I think that is the whole idea…

February 22, 2014 10:18 pm

Tom says:
February 22, 2014 at 4:58 pm
“This is astrophysics, not astronomy. Astrophysicists are happy just to get the decimal in the right place. A sporadic deviation of only 100% is the same as nailing it, in astrophysics”.
So, by your logic, all space launches could have had been 10 times heavier, or lighter. F=ma=10ma. You then imply the equivalence of a deviation of an order of magnitude with doubling, or halving. You could be a taxpayer-funded climastrologist with those error bars, deserving public ridicule. It seems to me that you have crossed that line by a factor of between 2 and 10.
February 22, 2014 at 5:49 pm
That’s not my logic, Tom, it’s entirely yours, a straw man based on things I didn’t say, and displaying your stellar ignorance at the same time. If you don’t know the difference between astrophysics and astronomy, I’d strongly recommend you not open your mouth on either subject.

David L
February 22, 2014 10:20 pm

Serious question: how do you measure sunspot activity from thousands of years ago? What proxy is used?
Second: anytime you are dealing with bounded data, in this case sunspots can’t physically be less than zero, Gaussian fits (normal distributions) are not statistically nor physically meaningful, nor are averages and standard deviations. Technically one would apply non-normal distributions such as log-normal fit, and for statistical central tendency report the mode and range of the data, or perform a log transformation on the sunspot activity and then fit with gaussians and report averages and standard deviations of the log transformed data set.

February 22, 2014 10:22 pm

David L says:
February 22, 2014 at 10:20 pm
Serious question: how do you measure sunspot activity from thousands of years ago?
See my explanation upthread at February 22, 2014 at 7:41 pm

February 22, 2014 11:05 pm

Willis I cannot see how you missed this
67 -1=66
21 + 1 = 22
66/22 = 3
Its related to orbits and phi

crosspatch
February 22, 2014 11:08 pm

Sunspots are magnetic fields that when extended from the sun keeps some cosmic rays out of the solar system. Cosmic rays smash into Nitrogen and Oxygen atoms in the atmosphere converting them to Carbon 14 which is radioactive. Carbon is plant food, so the radioactive Carbon ends up in tree rings which we can count and so measure how active cosmic rays [and thus the Sun] were thousands of years ago.

Wouldn’t that assume cosmic rays entering the solar system at a constant rate? I mean, sure, if cosmic rays entered the solar system at a constant rate, then the proxies for cosmic ray collisions with the atmosphere would also be proxies for solar magnetic activity. But do we really have evidence that the cosmic rays are constant on century timescales? If the solar system were to pass through an area of greater cosmic ray activity then the amount of 14C could vary with no variation in solar magnetic field.
E.g. Could the solar system be “hit” by a burst of cosmic rays from a nova long ago that we can no longer see and wouldn’t that affect these 14C and 10Be production rates with no change in solar activity?

AndyG55
February 22, 2014 11:24 pm

Just a point.
I feel it is very much a politeness to email the principle author of any paper that is being discussed so that they may join in and make their point of view.
I think it is very disingenuous if this is not done in every instance.
People have a right of reply, and it is only proper that they be informed of the discussion on what is one of the main scientific discussion blogs on the web.
I strongly suggest that Anthony make this a point of policy.

charles nelson
February 22, 2014 11:53 pm

Three thousand years of solar activity reconstructed from ‘Tree Rings’.

February 23, 2014 12:18 am

I have lost complete faith in any agency with an agenda to promote CAGW.
Having said that, I am just waiting for the Sun go quiet long enough to show its affects on climate.
Maybe then we can get science back on track.
Time for a reboot.

Patrick
February 23, 2014 12:18 am

Has someone said no-one knows yet? Because I am fairly sure no-one knows what the state of the sun was back 3000 years ago.

Verified by MonsterInsights