Journal of Current Scientific Research

Journal of Current Scientific Research

Current Issue Volume No: 1 Issue No: 1

Research-article Article Open Access
  • Available online freely Peer Reviewed
  • Mathematical Modeling And Epidemic Prediction Of COVID-19 And Its Significance To Epidemic Prevention And Control Measures

    Li Yichi 1     Wang Bowen 2     Peng Ruiyang 3     Zhou Chen 4     Zhan Yonglong 5     Zhang Xueying 4     Liu Zhuoxun 6     Jiang Xia 7     Zhao Bin 1
       

    1 School of Science, Hubei University of Technology, Wuhan, Hubei, China. 

    2 School of Electrical and Electronic Engineering, Hubei University of Technology, Wuhan, Hubei, China. 

    3 School of Computer, Hubei Polytechnic University, Huangshi, Hubei, China. 

    4 School of Economics and Management, Hubei University of Technology, Wuhan, Hubei, China. 

    5 School of Computer Science, Hubei University of Technology, Wuhan, Hubei, China. 

    6 Normal School of Vocational and Technical Education, HuBei University of Technology, Wuhan, Hubei, China. 

    7 Hospital, Hubei University of Technology, Wuhan, Hubei, China. 

    Abstract

    Background

    Since receiving unexplained pneumonia patients at the Jinyintan Hospital in Wuhan, China in December 2019, the new coronavirus (COVID-19) has rapidly spread in Wuhan, China and spread to the entire China and some neighboring countries. We establish the dynamics model of infectious diseases and time series model to predict the trend and short-term prediction of the transmission of COVID-19, which will be conducive to the intervention and prevention of COVID-19 by departments at all levels in mainland China and buy more time for clinical trials.

    Methods

    Based on the transmission mechanism of COVID-19 in the population and the implemented prevention and control measures, we establish the dynamic models of the six chambers, and establish the time series models based on different mathematical formulas according to the variation law of the original data.

    Findings

    The results based on time series analysis and kinetic model analysis show that the cumulative diagnosis of pneumonia of COVID-19 in mainland China can reach 36,343 after one week (February 8, 2020), and the number of basic regenerations can reach 4.01. The cumulative number of confirmed diagnoses will reach a peak of 87,701 on March 15, 2020; the number of basic regenerations in Wuhan will reach 4.3, and the cumulative number of confirmed cases in Wuhan will reach peak at 76,982 on March 20. Whether in Mainland China or Wuhan, both the infection rate and the basic regeneration number of COVID-19 continue to decline, and the results of the sensitivity analysis show that the time it takes for a suspected population to be diagnosed as a confirmed population can have a significant impact on the peak size and duration of the cumulative number of diagnoses. Increased mortality leads to additional cases of pneumonia, while increased cure rates are not sensitive to the cumulative number of confirmed cases.

    Interpretation

    Chinese governments at various levels have intervened in many ways to control the epidemic. According to the results of the model analysis, we believe that the emergency intervention measures adopted in the early stage of the epidemic, such as blocking Wuhan, restricting the flow of people in Hubei province, and increasing the support to Wuhan, had a crucial restraining effect on the original spread of the epidemic.

    It is a very effective prevention and treatment method to continue to increase investment in various medical resources to ensure that suspected patients can be diagnosed and treated in a timely manner.

    Based on the results of the sensitivity analysis, we believe that enhanced treatment of the bodies of deceased patients can be effective in ensuring that the bodies themselves and the process do not result in additional viral infections, and once the pneumonia patients with the COVID-19 are cured, the antibodies left in their bodies may prevent them from reinfection COVID-19 for a longer period of time.

    Author Contributions
    Received Feb 19, 2020     Accepted Mar 17, 2020     Published Mar 20, 2020

    Copyright© 2020 Li Yichi, et al.
    License
    Creative Commons License   This work is licensed under a Creative Commons Attribution 4.0 International License. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

    Competing interests

    The authors have declared that no competing interests exist.

    Funding Interests:

    Citation:

    Li Yichi, Wang Bowen, Peng Ruiyang, Zhou Chen, Zhan Yonglong et al. (2020) Mathematical Modeling And Epidemic Prediction Of COVID-19 And Its Significance To Epidemic Prevention And Control Measures Journal of Current Scientific Research. - 1(1):19-36
    DOI 10.14302/issn.2766-8681.jcsr-21-3719

    Introduction

    Introduction

    Since December 2019, many unexplained cases of pneumonia with cough, dyspnea, fatigue, and fever as the main symptoms have occurred in Wuhan, China in a short period of time. 12

    China's health authorities and CDC quickly identified the pathogen of such cases as a new type of coronavirus, which the World Health Organization (WHO) named COVID-19 on January 10, 2020. 3

    On January 22, 2020, the Information Office of the State Council of the People's Republic of China held a press conference introduced the relevant situation of pneumonia prevention and control of new coronavirus infection.

    On the same day, the People's Republic of China's CDC released a plan for the prevention and control of pneumonitis of new coronavirus infection, including the COVID-19 epidemic Research, specimen collection and testing, tracking and management of close contacts, and propaganda, education and risk communication to the public. 4

    Wuhan, China is the origin of COVID-19 and one of the cities most affected by it. The Mayor of Wuhan stated at a press conference on January 31, 2020 that Wuhan is urgently building Vulcan Mountain Hospital and Thunder Mountain Hospital patients will be officially admitted on February 3 and February 6. 5 By 24:00 on February 6, 2020, a total of 31,161 confirmed cases, including 636 deaths, were reported in the Chinese mainland, 22,112 confirmed cases, including 618 deaths, were reported in Hubei province, and 11,618 confirmed cases, including 478 deaths, were reported in Wuhan city. The spread of COVID-19 and various interventions have had an incalculable negative impact on People's daily lives and the normal functioning of society. Cities in China's Hubei province have issued varying degrees of closures and traffic restrictions. 6

    In fact, there are many imminent questions about the spread of COVID-19. How many people will be infected tomorrow? When will the inflection point of the infection rate appear? How many people will be infected during the peak period? Can existing interventions effectively control the COVID-19? What mathematical models are available to help us answer these questions?

    The COVID-19 is a novel coronavirus that was only discovered in December 2019, so data on the outbreak is still insufficient, and medical means such as clinical trials are still in a difficult exploratory stage. 7 So far, epidemic data have been difficult to apply directly to existing mathematical models, and questions need to be addressed as to how effective the existing emergency response has been and how to invest medical resources more scientifically in the future and so on. Based on this, this article aims to study the gaps in this part.

    Results

    Results TS Model-Based Estimates

    We use sequence diagrams and autocorrelation functions of the original data to determine the stationarity of these time series, and to smooth the series whose average and variance are not always constant. In the exponential smoothing method, we perform a natural logarithmic transformation on the series to omplete the smoothing process. In the ARIMA and ARIMAX models, we use the first-order difference or the second- order difference to smooth the original sequence. Using the above processing, we can obtain the time series analysis model summary information of the number of confirmed cases in mainland China as shown in Table 2.

    Summary of 6 time series models of cumulative confirmed cases in mainland China.
      Method   Stability treatment   Model  Fitting effect Ljung- Box Q(18)  Number of Outliers   Serial number
    Stationary R-squared R-squared Normalized BIC Sig.
     Exponential smoothing method _ Brown 0.245 0.996 11.338 0.787 0 1
    Natural logarithmic transformation  Brown  0.605  0.992  12.039  0.958  0  2
     ARIMA First order difference ARIMA (0,1,0) 0.235 0.961 13.644 0.000 0 3
    Two order difference ARIMA (0,2,0) 0.687 0.997 11.223 0.912 0 4
      ARIMAX First order difference ARIMA X(0,1,0) 0.977 0.999 10.634 0.987 0 5
    Two order difference ARIMA X(0,2,0) 0.208 0.997 11.793 0.997 0 6

    As shown in Table 2, we have established multiple time series analysis models for the number of confirmed cases in mainland China. By comparing, we can initially find that the Brown model is a natural logarithmic transformation of the original sequence and the second order ARIMAX (0,1,0) model for difference processing seems to be more suitable. Among them, the Brown model has a stationary R-square of 0.605, the Ljung-Box Q (18) test statistic has a value of 0.958, the stationary R-square of the ARIMAX (0,1,0) model is 0.977, and the value of the Ljung-Box Q (18) test statistic is 0.987. According to Table 2, we find that ARIMAX (0,1,0) model is the best of the six time series models in terms of goodness of fit and Ljung box Q (18) test results. We preliminarily think that Brown model and ARIMAX (0,1,0) model should have good statistical significance, and they should be able to predict the number of confirmed cases of new coronavirus pneumonia in mainland China in the next week. To further diagnose the results of these time series models, we make their autocorrelation function diagrams of the residuals as shown in Figure 2.

    Residual ACF of six time series models of cumulative confirmed cases in mainland China.

    In the function graph, we can judge whether the residuals still contain valid information of the time series. Through Figure 2, we can intuitively find that there may still be some valid information in the residuals of model 3. After observing the Residual ACF of other models, we think the residual sequences of Model 2 and Model 5 can be regarded as white noise sequences.29 That is, to a certain extent, the Brown model with natural logarithmic transformation of the original sequence and ARIMAX (0,1,0) models can be used to predict the short-term development of the confirmed number of patients with new type of coronavirus pneumonia in mainland China. As shown in Figure 3 and Figure 4 below, we have made a short-term prediction of the number of confirmed cases of new coronavirus pneumonia in mainland China based on Brown model and ARIMAX (0,1,0) model after second-order difference processing. We can clearly see from Figure 3 and Figure 4 that the prediction of the exact number of cases diagnosed in the past time by these two models is in good agreement with the real value, while the ARIMAX (0,1,0) model processed by second-order difference is more consistent with the historical data. The predictions of these two models for the number of confirmed cases of new coronavirus pneumonia in the next week are relatively close, but we noticed that under the 95% confidence interval, the ARIMAX (0,1,0) after second-order difference processing the upper and lower limits of the model's prediction of the number of confirmed cases are very small, so we finally selected the ARIMAX (0,1,0) model that was subjected to second-order difference processing in six time series models.

    Prediction chart of cumulative number of confirmed cases in mainland China based on logarithm transformation Brown model. Prediction chart of cumulative number of confirmed cases in mainland China based on ARIMAX(0,1,0).

    Then the number of confirmed cases in mainland China is predicted. Based on the predictions in Figure 4, we believe that the number of confirmed cases of new coronavirus pneumonia in mainland China in a week (February 8, 2020) may be as high as 36,343.

    We also made six time series models of cumulative confirmed cases in Hubei Province as shown in Table 3 below and the autocorrelation function diagrams of the residuals of these 6 models are shown in Figure 5. Based on the information in Table 3 and Figure 5, we selected the ARIMAX (0,0,0) model made a short- term prediction of the number of confirmed cases of pneumonitis of the new coronavirus in Hubei Province, and plotted the prediction of the ARIMAX (0,0,0) model as shown in Figure 6.

    Summary of 6 time series models of cumulative confirmed cases in Hubei Province.
    Method Stability treatment Model Fitting effect Ljung- Box Q(18) Number of Outliers Serial number
    Stationary R-squared R-squared Normalized BIC Sig.
    Exponential smoothing method _ Brown 0.136 0.994 11.228 0.958 0 7
    Natural logarithmic transformation Brown 0.021 0.987 11.928 0.845 0 8
    ARIMA First order difference ARIMA (0,1,0) 0.205 0.958 13.141 0.000 0 9
    Two order difference ARIMA (0,2,0) 0.264 0.994 11.200 0.977 0 10
    ARIMAX _ ARIMA X(0,0,0) 0.999 0.999 10.383 0.806 0 11
    First order difference ARIMA X(0,1,0) 0.973 0.999 10.279 0.655 0 12
    Residual ACF of six time series models of cumulative confirmed cases in Hubei Province. Prediction chart of cumulative number of confirmed cases in Hubei Province Based on ARIMAX(0,0,0).

    According to Figure 6, we believe that the number of confirmed cases of pneumonitis with new coronavirus in Hubei Province will rise to 26,455 in a week (February 8, 2020). It is worth mentioning that because the time series model is based on the original sequence itself time series models based on different mathematical formulas may not be suitable for long-term prediction of the spread of the COVID-19 epidemic, but these models can provide rapid prediction of short-term transmission of the COVID-19 epidemic. So as to provide reference value for all levels of departments and hospitals in the next few days to implement effective intervention and prevention of the spread of new coronavirus.

    SEIQDR-Based Estimates

    According to the data released by the National Health Construction Commission of China, we set the data on January 10 as the initial value. On January 10, the transmission of COVID-19 only occurred in Hubei Province, of which 41 were confirmed, 0 were suspected, 2 were cured, 2 were infected with the COVID-19 but not yet sick, and 0 people were ill but not isolated, namely:

    E(0) = 2 , I (0) = 0 , Q(0) = 0 , D(0) = 41 , R(0) = 2 .

    Since COVID-19 originates from Wuhan, Hubei, the above initial value can also be used as the national initial value. Based on this, we use the least square method to calculate k and errors SSE of Hubei Province and Mainland China as follows:

    kH= 0.815 , SSEH= 2236 .

    kN= 0.553 , SSEN= 2527 .

    Among them, kH means Hubei Province’s k value, kN means the k value of mainland China, because Hubei Province is the birthplace of the epidemic, with a large number of patients and limited medical resources, a large number of mild patients are self-isolated at home, which increases the transmission time after the incubation period. 30 The number of cases outside Hubei Province is small, and medical resources are sufficient, making the k value small. The latent can get timely treatment after the onset, and reduce the transmission time after the onset. It can be seen from the comparison of k value that it is a very correct and effective decision for medical staff from other provinces in mainland China to go to Wuhan and Hubei to fight the epidemic. It can be seen from Figure 7 that the SEIQDR model based on the dynamic propagation of COVID-19 has a good fitting effect with the original data in the duration period (from January 10, 2020 to February 9, 2020), and the fitting curve of the number of confirmed cases is basically consistent with the duration data.

    Matching effect chart of the real value and predicted value of the cumulative number of confirmed cases in China and Wuhan.

    We use SEIQDR model to predict the spread of COVID-19 in mainland China and Wuhan, and get the development trend of the cumulative number of confirmed cases in mainland China and Wuhan as shown in Figure 8. Based on the prediction results in Figure 8, we believe that the cumulative number of confirmed cases in Wuhan will continue to rise within 70 days from January 10, 2020, and the cumulative number of confirmed cases in Wuhan may reach a peak around March 20, 2020, the peak is 76982, that is to say, after the cumulative number of confirmed cases in Wuhan reached 76982 on March 20, there may no new confirmed cases appear.

    Prediction results of cumulative number of confirmed cases in China and Wuhan.

    To observe the development trend of the cumulative number of confirmed cases in mainland China, we believe that when the cumulative number of confirmed cases in Wuhan reaches a peak, the growth rate of confirmed cases in mainland China will decrease rapidly, and the cumulative number of confirmed cases in mainland China will peak around March 15, 2020, with a peak of 87,701. In order to give a clearer mathematical description of the spread of COVID-19 in Wuhan and Mainland China, we analyze the infection rate and basic regeneration number of the new coronavirus. Using the diachronic data and the calculation formula for the infection rate given above, the infection rate is fitted, we let the basic regeneration number of COVID-19 be R(t) , we use the following formula to calculate the basic regeneration number of COVID-19:

    As can be seen from the above formula, there is a close relationship between the basic regeneration number and the infection rate. In the above formula, t is the average latency of the COVID-19, and we also set the initial value of the average latency to 7. From this, we can get the trend of infection rate and basic regeneration number of 2019 nCoV in Wuhan and Mainland China over time as shown in Figure 9 below. As shown in Figure 9, during the period from January 10 to February 9, 2020, No matter in mainland China or Wuhan, the infection rate and basic regeneration number of COVID-19 continue to decline, and may continue to decline in the future. In our opinion, this may indicate that the emergency intervention and special control measures taken by the Chinese government to block Wuhan City, restrict the flow of people in Hubei Province and enhance the medical resources in the severely affected areas in the early stage of COVID-19 transmission have played a crucial role in the spread of the epidemic. According to the results of SEIQDR model, we believe that during the period from January 10 to February 9, 2020, the average basic regeneration number of COVID-19 in mainland China is 4.01, while the average basic regeneration number in Wuhan is 4.3. The infection rate of COVID-19 in mainland China should be reduced to 0 in 45 days after January 10, 2020, that is, around February 25.

    Infection rate band basic regeneration number R0 in China and Wuhan.

    According to the results of the SEIQDR model, we can also study more about the development trend of the COVID-19 epidemic in mainland China, Hubei Province, and Wuhan. Without the loss of generality, other parameters in the SEIQDR model remain initially set. Under the circumstances, we analyze some parameters in mainland China to explore the possible changes in the cumulative number of confirmed diagnoses. We have made a trend chart of the number of confirmed cases in mainland China when the average time required for the suspected population to be transformed into the confirmed population changes as shown in Figure 10.

    The changes of different  in the cumulative of confirmed cases in mainland China during the COVID-19 transmission process.

    In Figure 10, The reciprocal of s indicates that it takes the average time for the suspected population to be diagnosed as the confirmed population, that is = dqd , as shown in Figure 10, at that time of = 1 / 5 , it took an average of 5 days for suspected cases to be diagnosed as confirmed cases for necessary isolation and treatment. If the various preventive measures remain unchanged, the cumulative number of confirmed patients in mainland China will reach a peak after 73 days on January 1, 2020. The peak time was 94731 people. At that time of = 1 / 3 , patients can get relatively timely isolation and treatment after the onset of the disease, in which case the peak number of cumulative diagnoses will be reduced to 87701, a relative decrease of 7030. According to Figure 10, we can find that the peak number increases as gets smaller of cumulative diagnoses, which means that if the number of patients diagnosed with suspected patients will increase rapidly if they are not diagnosed in time. We believe that this trend may not be obvious enough within 30 days after January 10, however, once the epidemic situation becomes serious, the rapid increase in the number of confirmed cases and the difficulty in timely diagnosis and treatment may bring great challenges to the prevention and control of the epidemic in mainland China.

    Finally, we analyzed the sensitivity of mortality and cure rate in SEIQDR model. The Figure 11 shows the change in the cumulative number of confirmed cases with mortality and cure rate in mainland China. In Figure 11, on the left is the cumulative number of confirmed diagnoses with death during the transmission of COVID-19 in mainland China, the change of the cure rate is the change of the cumulative number of diagnoses with the cure rate on the right.

    Sensitivity analysis results of mortality and cure rate.

    We found that when the mortality rate gradually increased from 2% to 10%, the cumulative number of diagnoses increased by about 2,000. We believe that ,this means that the bodies of patients killed by COVID 19 may still carry a certain number of new infectious coronaviruses. According to the results of this simulation, we suggest that mainland China, especially Hubei Province and Wuhan City, should pay attention to the treatment of the dead bodies of the dead patients, try to ensure that the body itself and the handling process do not cause additional contagion. We also gradually increased the cure rate from 2% to 10%, and found that the cumulative number of confirmed diagnoses has almost no fluctuation. We think this means that once the pneumonia patients with the new coronavirus are cured, the antibodies left in their bodies may make them no longer a member of the susceptible population infected with the new coronavirus. In the early stage of COVID-19 transmission, as we did not know about COVID-19 and clinical trials were difficult to proceed immediately, our modeling and results provided reference values for intervention and prevention and clinical trials at all levels.

    Discussion

    Discussion

    There is no doubt that the propagation of COVID-19 in the population will be affected by the intricacies of many factors. In the early stage of the COVID-19 propagation, it is difficult to establish a dynamic propagation model with parameters to be estimated and obtain fairly accurate simulation results, but the preliminary estimation of parameters such as average latency and mortality through existing data may be helpful for solving important parameters such as infection rate and rehabilitation rate, which will help us have a more accurate grasp of the transmission trend of COVID-19. On the other hand, statistical modeling of the spread of new coronavirus pneumonia in the population based on time series analysis is a thing that can be done immediately after getting the latest data every day, because the dynamic model of the time series is based on the law of the data itself. Although this method often requires sufficient data to support it, in the early stages of epidemic transmission, this method can still be used to more accurately predict the indicators of epidemic transmission in the short term, so as to provide intervention control at all levels of the departments and Policy implementation provides short-term emergency prevention programs.

    Conclusion

    Affiliations:
    Affiliations: