#COVID19 Through A Glass, Weekly

Guest Post by Willis Eschenbach

A couple of days ago, I got to looking at the daily record of US deaths from the coronavirus. It’s shown in Figure 1 below:

Figure 1. US daily deaths. Created on May 5, but shows May 4th data.

So … have the US deaths peaked, and if so when? Hard to tell. However, I looked at that graph in Figure 1 and I thought “It looks like the data might be reflecting lower counts on the weekends”. 

Now, my go-to method for determining the existence, period, and amplitude of underlying repeating cycles in a dataset is the curious method called “CEEMD”. That stands for Complete Ensemble Empirical Mode Decomposition. I discuss the method here. It is a way to decompose a signal into underlying signals. It’s called “complete” because when you add all the underlying signals back together, it gives you back the original signal.

Once all possible underlying cycles have been removed from the data, what remains is called the “CEEMD Residual”. This residual is an excellent indicator of the overall trend of the data. Here is an overview of the CEEMD decomposition of the daily deaths data shown in Figure 1.

Figure 2. CEEMD complete decomposition of the data shown in Figure 1. The top panel is the raw data. Panels C1 to C4 are the empirical modes. Finally, at the bottom is the CEEMD residual.

As you can see, two of the four empirical modes (C2 and C4) are weak, with very low amplitude. Modes C1 and C3, on the other hand, show a much stronger signal. We can see the periods and strengths of each of the empirical modes C1-C4 in Figure 3, which shows the periodogram of each of the empirical modes C1-C4.

Figure 3. Periodograms of each of the empirical modes shown in Figure 2. The strongest signal is the seven-day signal, showing that my guess about weekends was likely correct. There is also a significant amount of energy in the first overtone of the 7-day signal, with a period of 3.5 days.

So … how does this analysis work out in practice? Here is the same data as in Figure 1, along with the CEEMD residual.

Figure 4. US daily deaths, along with the CEEMD residual. Data from May 4th, analyzed May 6th.

Well, I’d have to say that that looks like good news … it would be excellent if we were indeed 20 days past the peak.

And here is a look with the underlying 7-day signal overlaid on the daily data.

Figure 5. As in Figure 4, but overlaid with the seven-day empirical mode signal (Mode C3). The overlaid empirical mode C3 is shown for illustrative purposes only. You can see that when the empirical mode is added to the residual it will be a good match to the data.

This is a most interesting result. It shows one of the reasons that I use the CEEMD analysis—it breaks the raw data down into meaningful underlying signals. In this case, early in the spread of the virus at the left-hand side of the graph, the 7-day signal (blue line) was quite small. But now that there are a large number of deaths the 7-day signal is much larger. It is this kind of a result that is unobtainable by say standard Fourier analysis.

Finally, I prefer the CEEMD residual method over say a Gaussian smooth because it goes all of the way out to both the start and finish of the data. Not only that, but the information out near the ends is meaningful. Here’s a comparison of the CEEMD residual with a Gaussian filter.

Figure 6. Daily US deaths, CEEMD residual, and 7-day Full-Width to Half Maximum (FWHM) Gaussian smooth of the data. This is data from May 4th, processed on May 6th. Treatment of the Gaussian smooth near the endpoints is discussed in the Appendix here.

As you can see, the Gaussian smooth is high at the start of the daily deaths data, and low at the end of the data. The Gaussian smooth is dropping at the right-hand end, and the CEEMD Residual is turning upwards.

And two days later, here’s the situation:

Figure 7. More recent data, from May 6th, daily deaths and CEEMD Residual

At the right-hand end of the graph, the CEEMD residual was already foreshadowing the turn from decreasing to increasing, at the same time that the Gaussian smoothing was wrongly indicating a further decrease (see Figure 6). As I said, the CEEMD residual contains important information out at the ends.

Conclusions? Well, my first one would be that attempting to analyze coronavirus death data without removing the repeating weekly variations is … well, I’ll call it “overly optimistic” and leave it there.

My next conclusion is that the CEEMD residual is an excellent indicator of the ever-changing and oft-deceptive central tendency in time series data. 

Next, about a week ago the CDC changed its guidance on the reporting of deaths involving the COVID virus. Rather than make an explicit distinction between deaths WITH coronavirus and deaths FROM coronavirus, they said to enter COVID-19 on the death certificate if the physician SUSPECTS that the coronavirus MIGHT have CONTRIBUTED to the death … “suspects the virus might have contributed” to the death??? Could they possibly be more vague?

The size of the effect of this change on the way the US reports the death count is unknown, but it can only increase the purported count, not decrease the count. As a result, we cannot be sure that the increase in deaths is real and not just a change in reporting

Finally, it appears that the US has peaked in terms of daily deaths. Might be another peak to come, might be two more peaks, might be no more peaks, but in any case but it appears we’ve passed the first peak.

Stay well, dear friends. When I was a young man, an old geezer (who was likely about my age now) told me “Son, when you have your health you have everything!”

But back then, I didn’t understand …

w.

PLEASE: Quote the exact words you are discussing in your comment. This avoids endless misunderstandings and problems.

0 0 votes
Article Rating

Discover more from Watts Up With That?

Subscribe to get the latest posts sent to your email.

234 Comments
Inline Feedbacks
View all comments
whiten
May 8, 2020 9:27 am

Or;
Taylor Swift – Wildest Dreams

fernanda811
May 8, 2020 12:20 pm

It seems to me that one of the errors of our country is not having carried out a total quarantine from the first moment that it had the first case of coronavirus in the country ….. those sifras had to be less and more controllable … which was the main mistake of the country? underestimate the virus.

Vuk
May 8, 2020 1:57 pm

Swedish Covid-19 death rate has overtaken Ireland and now is just behind Holland, the next in line is France, which was badly affected and had very strict lock-down.
http://www.vukcevic.co.uk/EuropeCV.htm
Is it now question of time when Sweden catches up with France or it may have to introduce a lock-down to arrest rise in the death rate of its population. About 14% of Swedish population was non Swedish born, hence it could be very vulnerable to more serious CV infection/death rate.

richard
Reply to  Vuk
May 9, 2020 2:46 am

No lock down, S Korea had very few deaths.

Vuk
Reply to  richard
May 9, 2020 4:41 am

Hi Richard, Mosher is your expert on Korea.
Swedish model is in more trouble, in the last 12 hours their death rate has overtaken both Ireland and Netherlands and it is racing ahead, might catch up with France soon. Introducing strict lock down now might take a week or two to slow down the death rate.
I’ve just updated the link, so have another look.
What any individual thinks at the moment, be it Donald Trump, Boris Johnson, Vladimir Putin, Kim Jong Un, Xi Jinping, Mosher, Richard, Vuk or anyone else for that matter, at this stage is not much better than a guess. Only thing we can do is follow the data, good or bad, see what happens and hope for the best. Keep safe, just in case.

richard
Reply to  Vuk
May 11, 2020 2:44 am
Ron
Reply to  richard
May 9, 2020 5:26 am

South Korea closed schools, canceled all big events including holy mass, it is mandatory to wear masks if you want to go into a bar, restaurant or club you have to put your name on a list to allow tracking. Tracking App is mandatory as well and the app is using credit card information.

May 8, 2020 2:48 pm

Willis, what you have done is definitely more sophisticated than what I have with UK deaths. But my method is on eline of R code:

dow=rep(0,7); for(i in 1:floor(length(dM)/7)) dow=dow+dM[(7*i-6):(7*i)]; print(dow)

And here’s the current result, for Saturdays through Fridays:
4774 4512 2769 2990 5323 4662 4847

I don’t need a chi-squared test to see that the sub-3000’s are significant. In a recent email on the subject I said: “Either divine intervention is reducing mortality on a Sunday or less reporting is done on a Sunday.”

I think we can agree there is a pervasive weekly effect arising from something.

Rich.

JohnM
Reply to  See - owe to Rich
May 9, 2020 12:14 pm

Check-out the even later reporting on bank-holiday weekends, such as this weekend….todays (Sat) figures are 346 deaths…Friday was a holiday. By Tuesday the figures will have increased, as they do after every weekend.

Steven Mosher
May 8, 2020 5:20 pm

Lock down

https://twitter.com/DerrickVanGenn2/status/1258455696405868545/photo/1

let me repeat.

Lockdowns done right work, try to minimize damage to both hospital system and economy, while
maximizing impact on virus.

Lockdowns done wrong.. not so good.

richard
Reply to  Steven Mosher
May 9, 2020 2:50 am

When is lock down ever right-

The rise in the suicide rate caused by lockdowns in Australia is predicted to exceed deaths from the Wuhan coronavirus by a factor of ten, the Australian reported Thursday.

Vuk
Reply to  Steven Mosher
May 9, 2020 4:26 am

You may be right there Steven.
Swedish model is in a bit of a trouble, in the last 48 hours their death rate has overtaken both Ireland and Netherlands
http://www.vukcevic.co.uk/EuropeCV.htm
and it is racing ahead, might catch up with France soon. Introducing strict lock down now might take a week or two to slow down the death rate.

Jeff Smathers
May 8, 2020 6:50 pm

I am seeing the analogy of bloodletting a patient and the economy…. by allowing the patient to bleed out , the fractional ratio of virus in their blood can almost go near zero. Unfortunately the patient void of the virus is dead…much like the economy.

ren
May 9, 2020 12:40 am

However the increase in the rate of developing antibodies means that the much sought-after group immunity – where there are so many people immune to the virus that it has little or no opportunity of spreading – is still a long way off.

Experts generally reckon that it would take a minimum of 70% of people with antibodies before group immunity is present. At a rate of 3% every three weeks, that target would only be attained in late July 2021.
https://www.brusselstimes.com/all-news/belgium-all-news/110496/national-security-council-used-phone-data-to-help-inform-decisions/

Vuk
May 9, 2020 3:31 am

Here are details of the SAGE’s (science advising group) to the British government.
https://www.theguardian.com/world/2020/may/08/revealed-uk-scientists-fury-over-attempt-to-censor-covid-19-advice#img-2
What is going on here, you might ask?
Don’t bother, they will not tell you.

richard
May 9, 2020 3:54 am
richard
May 9, 2020 5:35 am
Terry Bixler
May 9, 2020 7:11 am

The aftermath of lockdowns will be felt by families for many years to come. There is no model that predicts when the economy will recover. Locking down the healthy made no sense. An attempt to protect the most vulnerable would have made some sense. Of course a focus on treatment is of the highest importance. Statements of the obvious. Here in California our lockdown has been proposed to extend until 4 weeks after there are no virus deaths, at least that is what I gleaned from listening to his newscast. Economy be damned.

Ron
May 9, 2020 7:18 am

Interesting update of euromomo data, more age groups:

https://www.euromomo.eu/graphs-and-maps

COVID-19 kills unprecedented number of people in the age 65-74y than the flu, more people <65y, more people from 75-84y and more 85y+ with that age having the smallest difference in relative increase.

ren
May 9, 2020 8:22 am

“Fighting the inflammatory process
He also explains that the cytokine storm is the body’s immune state in which it begins to produce various types of substances that on the one hand are designed to fight the inflammatory process, but on the other can also intensify the inflammatory response. So, as a consequence, the patient’s condition can get much worse.
 – Knowing about the occurrence of such a situation, we reach for Tocilizumab – a drug that counteracts the inflammatory storm arising as a result of virus infection – says prof. Życińska. He points out that Tocilizumab is an antibody that blocks an important substance that enhances the development of inflammation – interleukin 6.”
The Australasian Society for Clinical Immunology and Allergy recommend tocilizumab be considered as an off-label treatment for those with COVID-19 related acute respiratory distress syndrome. It states this because of its known benefit in cytokine storms caused by a specific cancer treatment, and that the cytokine storm may be a contributor to mortality in severe COVID-19.[36]

On 11 March 2020, Italian physician Paolo Ascierto reported that tocilizumab appeared to be effective in three severe cases of COVID-19 in Italy.[37] On 14 March 2020, three of the six treated patients in Naples had shown signs of improvement prompting the Italian Pharmacological Agency (AIFA) to expand testing in five other hospitals.[38] Roche and the WHO are each launching separate trials for its use in severe COVID-19 cases.[39]

In March 2020 a randomized study, at 11 locations in China, which should conclude by 31 May 2020, started to compare favipiravir versus tocilizumab versus both.[40]
https://en.wikipedia.org/wiki/Tocilizumab

ren
Reply to  ren
May 9, 2020 9:29 am

The Central Clinical Hospital of the Ministry of Interior in Warsaw uses a modern monoclonal antibody to fight COVID-19. – The effects are spectacular – says prof. Katarzyna Życińska from the CSK MSWiA. – Patients who received the medicine after a few days were disconnected from the respirator – he emphasizes.
So far, 20 patients have been given the drug, all responded “spectacularly”

Matthew R Marler
May 9, 2020 9:00 am

Willis, good effort with the data that are available to you, and us. You can’t always despair of imperfect data and shy away from them. In this case, the liabilities in case definition, error rates in the tests, and vicissitudes of reporting are severe.

Thank you for the essay.

Roger Welsh
May 9, 2020 11:30 am

I do wonder as to the bone fide deaths FROM covid19 as opposed to WITH!

I cannot find any site/publication that can,with honesty, supply this information.

Figures used are as supplied!!!

Jurgen
Reply to  Roger Welsh
May 9, 2020 12:11 pm

The reason is explained in this video-interview. It is just not possible to diagnose this with medical certainty. Too many co-morbidities and other factors are involved. Dr. Wolfgang Wodarg explains from his expertise on dying patients and the involvement of a mix of viruses and many other factors in that process.

https://off-guardian.org/2020/05/07/watch-corona-crisis-what-really-happened-and-how-to-learn-from-it/

Jurgen
Reply to  Roger Welsh
May 9, 2020 2:09 pm

My own take on this problem: it is impossible at this point to separate the corona signal from the lockdown and panic signals.

Broadie
May 9, 2020 1:49 pm

Weekly reports less? You don’t say Willis.
Look at this saw tooth!

https://depts.washington.edu/labmed/covid19/

The real artifacts affecting the infection rate curves I believe were the publicity of the ‘Plannedemic’ and the availability of tests and testing facilities. There is nothing like looking in the storeroom and finding all the test kits have been used up to crush a curve. The other curve crusher may be medical staff at the coal face realizing they have been had.

The Mortality curves are affected by the actual Lockdown, Cold weather and the CDC directive to add anything that looks like Covid-19 to the Schedule 1 of the Death Certificate. NVSS directive 24th March.

https://www.cdc.gov/nchs/data/nvss/coronavirus/Alert-2-New-ICD-code-introduced-for-COVID-19-deaths.pdf

What I find interesting from the UW Virology dashboard is the virus appears to run at about %10 of those presenting with symptoms in the early period and now at %5 after lock down or warmer weather. Why?

If the usual suspects were going to choose a virus for a ‘Plannedemic’ initiating the ‘Slump to get Trump’, this was a beauty as it is prevalent everywhere not like say Ebola.

mister bitcoin
May 9, 2020 6:54 pm

Yitzak Ben Israel hypothesis is the virus peaks on day 42 and declines rapidly after day 56 regardless of country or policy.

He looked at USA, Israel, Sweden, Germany, Italy, S Korea, etc (20 countries total).

I find his hypothesis most convincing.

Him and Michael Levitt

also: zero lupus or rheumatoid arthritis patients have tested positive for covid

Pachygrapsus
May 9, 2020 7:38 pm

Late to the party on this one…
I appreciate the analysis of the data Willis, but unfortunately the data is crap. Too many hidden variables, too much incentive to lie, and far too much that’s hidden. For example, here in NJ there is virtually no risk to otherwise healthy people. About 50% of the deaths have been patients in long term care facilities. Once that is understood, the real question is why the rest of us had to go into lockdown? 4,000 excess deaths in a state with 10 million people? That’s basically a rounding error.

Heartless? Perhaps, but the numbers are still inflated. Every COVID case is a COVID death, regardless of comorbidity? You’ll never get the answer to the question of who many everyday people developed COVID and died. Never. As soon as I see hundreds (literally!) of MSNBC videos attacking the president over the virus, I know that the “science” has been coopted by people who will say anything to make this Trump’s fault.

Also, their fawning praise of Dr. Fauci was reminiscent of their love for DeGrasse Tyson. Just another media hyped celebrity who happened to believe their narrative. Well, good for him. In the meantime, let the infirm and the elderly continue the lockdown, put them in Hazmat suits if you like, but let the rest of us get back to work. I want the restaurants, sports venues, and bars opened and I hope I never see another mask.

May 9, 2020 11:08 pm

Willis
Thanks – the advantages of the CEEMD decomposition-smoothing seem very compelling. The performance at the end is particularly impressive. According to Luukko et al., the Finnish authors of the “libeemd” paper you referred to (and posted on ResearchGate),

https://www.researchgate.net/publication/280114554_Introducing_libeemd_A_program_package_for_performing_the_ensemble_empirical_mode_decomposition

this is due to a clever approach to the problem of artificial twists at the end due to the truncation of the series:

Several ways have been proposed to mitigate the end effects by adding artificial extrema to the ends of the data, such as simple wave forms defined by the extrema near the end (Huang et al, 1998). We have adopted the method described by Wu and Huang (2009), where additional extrema are added to the ends of the data by linear extrapolation of the previous two extrema. However, if the extrapolated extremum is less extremal than the last data point, the value of the last data point is used as an additional extremum instead. This method successfully reduces the end effects while avoiding the possible complications of more complex data extrapolation.

This extrapolation if additional extrema beyond the ends of the data series allowed your fit for instance to “predict” the uptick after the end of the series that happened to finish in a weekend (lower reported deaths).

Did your method incorporate the “CEEMDAN” modification which improved removal of noise by separately averaging for each IMF (intrinsic mode function)?

They also discuss the stopping problem which I guess always afflicts iterative methods. The approach they used was again a nice one, to quote:

Therefore Huang et al (1999) proposed a simpler stopping criteria, in which iteration is stopped when the number of zero crossings and extrema differ at most by one and that these numbers stay the same for S consecutive iterations. This criterion was extensively studied by Huang et al (2003) and the optimal range for the S-number was found to be from 3 to 8. Our code … used S=4.

Did you have to choose an S value?
I would be interested to try out the CEEMD method although my software skills are rather limited.

Reply to  Phil Salmon
May 11, 2020 5:12 am

You’re right, R is the next software that I should try to get to grips with.

Bear
May 10, 2020 7:22 am

FYI, here’s an additional data source that has a lot more details and goes down to the state level. YMMV

https://covidtracking.com

brians356
Reply to  Bear
May 13, 2020 12:35 pm

Looked good, but their splash screen stopped me dead in my tracks:

We’re tracking racial and ethnic data from every state that reports it—and pushing those that don’t to start. Together with the Antiracist Research & Policy Center, we’re analyzing this data to uncover the true impact of the outbreak on vulnerable communities.

Sigh.

Daniel Godet
May 10, 2020 2:26 pm

Thanks for this week end analysis,
There are many other aspects, I believe:
– bank holidays (at Oster, the previous Friday, or the following Monday, May 1st, May 8th, etc.)
– the way fatalities from nursing homes were added to the deaths from hospitals, e g in France the catch up from nursing homes took place in several days split over approximately 2 week before the reporting became regular
– at some moment deaths at home will probably be added one way or another
For this ex appreciating Gauss curves, a Singapore based institute publishes curves for each country including an « end of pandemics date » per country. If trust is to be granted, it would mean for the US an end by mid October, vs end of Sept for UK, mid or end August in Western continental Europe.
https://ddi.sutd.edu.sg/covid-19
https://user-images.strikinglycdn.com/res/hrscywv4p/image/upload/c_limit,fl_lossy,h_1000,w_500,f_auto,q_auto/679545/
594764_580902.jpeg
comment image
comment image

Robert Hristoski
May 17, 2020 8:53 am

Hi Willis,

What is not sufficiently commented is the curious 7 days periodicity. Why should people, on average, die less on weekends and peak around Wednesdays?

Best,

Robert

Robert Hristoski
Reply to  Willis Eschenbach
May 18, 2020 1:02 am

Sounds plausible. Thanks.
Robert