
Predict COVID-19 Spreading With C-SMOTE


Formal Metadata

Title
Predict COVID-19 Spreading With C-SMOTE
Title of Series
Number of Parts
30
Author
License
CC Attribution 4.0 International:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.
Identifiers
Publisher
Release Date
Language

Content Metadata

Subject Area
Genre
Abstract
Data continuously gathered monitoring the spreading of the COVID-19 pandemic form an unbounded flow of data. Accurately forecasting if the infections will increase or decrease has a high impact, but it is challenging because the pandemic spreads and contracts periodically. Technically, the flow of data is said to be imbalanced and subject to concept drifts because signs of decrements are the minority class during the spreading periods, while they become the majority class in the contraction periods and the other way round. In this paper, we propose a case study applying the Continuous Synthetic Minority Oversampling Technique (C-SMOTE), a novel meta-strategy to pipeline with Streaming Machine Learning (SML) classification algorithms, to forecast the COVID-19 pandemic trend. Benchmarking SML pipelines that use C-SMOTE against state-of-the-art methods on a COVID-19 dataset, we bring statistical evidence that models learned using C-SMOTE are better.
Transcript: English (auto-generated)
Good morning, everyone. I am Alessio Bernardo, and I am a PhD student at Politecnico di Milano. In this presentation, I'll present a case study applying a continuous SMOTE version, called C-SMOTE, to predict the spreading of the COVID-19 pandemic. This is the agenda of the presentation: I begin with the main challenges to face when applying machine learning to data streams, then I'll present the case study and the experimental evaluation that we did, and in the end I'll finish with the conclusions and some directions for future work. Traditionally, standard machine learning techniques use a static batch of data that does not evolve over time,
and they split it into training and testing sets: they use the training set to train a model and the testing set to test it. With data streams, instead, where the latency of each new sample is very low and a new sample becomes available one at a time, the only thing these techniques can do is to append the new sample to the end of the batch, re-split it, and retrain the model from scratch. So with data streams these approaches are unfeasible. Streaming machine learning models, by contrast, are able to continuously incorporate the new data without retraining their models from scratch. Moreover, there is another difference: they first use the new incoming sample to test the model and only then to update it. This approach is called prequential evaluation, and it solves the problem that a testing set is not available, because it is unfeasible to split the new data into training and testing sets. So the first difference between the two approaches is the stream unboundedness. In a batch, we have all the data available at the same time, so we can split it into training, testing, or evaluation sets, and we can also know all its characteristics: for example, the number of samples, the imbalance ratio, the class distribution, et cetera. With data streams, instead, we have only one sample available at a time, so everything is different: we can only know the characteristics of the data seen so far, not the final characteristics of the data.
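To make the test-then-train idea concrete, here is a minimal sketch of prequential evaluation using the river Python library; the synthetic SEA stream and the Naive Bayes learner are illustrative assumptions, not the setup used in this work.

```python
# Minimal prequential (test-then-train) loop with the river library.
# The SEA synthetic stream and Gaussian Naive Bayes are illustrative choices.
from river import datasets, metrics, naive_bayes

model = naive_bayes.GaussianNB()
accuracy = metrics.Accuracy()

for x, y in datasets.synth.SEA(seed=42).take(10_000):
    y_pred = model.predict_one(x)  # 1) test on the new sample first...
    accuracy.update(y, y_pred)     # 2) ...update the evaluation metric...
    model.learn_one(x, y)          # 3) ...then train on that same sample

print(accuracy)                    # running prequential accuracy
```

Note that no separate testing set is ever materialized: every sample is used first for testing, then for training, and can be discarded afterwards.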
For these reasons, streaming machine learning was introduced. It is able to maintain models online, which means, first, that since we don't have the entire dataset available, an online model needs to incorporate one sample at a time, on the fly. Moreover, the stream is unbounded, so it must be possible to update the model at any time: it is impossible to retrain it from the beginning for every new sample. Another consequence of the stream unboundedness is that the model needs to manage memory properly: it is unfeasible, indeed impossible, to keep all the samples in memory, otherwise an infinite amount of memory would be required. In fact, after the prequential evaluation phase, the samples are discarded. Finally, since streams can evolve over time, they can present some form of non-stationarity, so an online model must be dynamic and adapt to it.
In particular, this form of non-stationarity is called the concept drift phenomenon: the stream evolves over time, and the statistics of the data are subject to gradual or even sudden changes, without any warning. The risk is that a model trained up to a given point no longer fits the new incoming data. So it is not enough to build a model able to incorporate one sample at a time on the fly; we also need to adapt the model built so far to manage the concept drift. This is the reason why a streaming machine learning model must be dynamic. Here is an example of a concept drift occurrence: as long as the distribution doesn't change, the accuracy is high. When a drift occurs, if the model doesn't adapt to the concept drift, the accuracy drops; instead, if it is able to adapt and manage the drift, the accuracy stays high.
In order to manage the drift, streaming techniques use a change detector in their architecture. A common change detector is the adaptive sliding window, ADWIN for short. It is a variable-length window of recently seen items, and its length is recomputed online according to the changes observed.
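For illustration, here is a minimal change-detection sketch with river's ADWIN implementation (the drift_detected property assumes a recent river version); the value stream is made up so that its mean shifts halfway through.

```python
# Minimal ADWIN sketch with the river library; the stream is made up:
# its mean shifts from ~0 to ~1 after 500 samples.
import random
from river import drift

adwin = drift.ADWIN()
stream = [random.gauss(0, 0.2) for _ in range(500)] + \
         [random.gauss(1, 0.2) for _ in range(500)]

for i, value in enumerate(stream):
    adwin.update(value)       # feed one observation at a time
    if adwin.drift_detected:  # ADWIN cut its window: distribution changed
        print(f"Drift detected around sample {i}")
```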
Concept drift, unfortunately, is also related to data stream imbalance: streams evolve over time, and the data distribution can change as well. In the batch setting this is a well-known problem, and there are a lot of techniques to rebalance a batch before training a model.
But in data streams it is still a big challenge, due to the stream unboundedness. One of the most powerful and most famous techniques able to rebalance a dataset is SMOTE. For each minority class sample (in the figure, the orange squares), SMOTE finds its k nearest neighbours, randomly chooses one of them, and then uses it to create a synthetic sample. But, as already said, the drawback of this technique, and of all the others, is that they all work on a finite starting batch of data; data streams, being unbounded, make them unfeasible to apply.
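The interpolation step just described can be sketched as follows with NumPy and scikit-learn; the data and parameter values are illustrative.

```python
# Sketch of SMOTE's core step: for one minority sample, pick one of its
# k nearest minority neighbours and interpolate a synthetic sample.
import numpy as np
from sklearn.neighbors import NearestNeighbors

rng = np.random.default_rng(0)
X_min = rng.normal(size=(20, 2))  # minority class samples (illustrative)

k = 5
nn = NearestNeighbors(n_neighbors=k + 1).fit(X_min)  # +1: a point is its own neighbour

x = X_min[0]
_, idx = nn.kneighbors(x.reshape(1, -1))
neighbour = X_min[rng.choice(idx[0][1:])]  # random neighbour, skipping the point itself

gap = rng.random()                      # interpolation factor in [0, 1)
synthetic = x + gap * (neighbour - x)   # new synthetic minority sample
```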
For this reason, in our previous work, we proposed a solution called C-SMOTE. The intuition was to continuously rebalance the stream as soon as a new sample comes in input. Moreover, being a meta-strategy, it is a sort of module that can be prepended to any SML-minus learner, i.e., any algorithm that is not natively able to rebalance a stream in the presence of concept drift. To do so, C-SMOTE uses the ADWIN change detector to keep a window of samples belonging to the same concept, and then uses them to apply SMOTE.
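A simplified, self-contained sketch of this idea follows. It is not the authors' implementation: the smote_sample helper is hypothetical and, for brevity, interpolates a random pair of minority samples instead of using the k nearest neighbours.

```python
# Simplified sketch of the C-SMOTE meta-strategy described in the talk:
# keep an ADWIN-managed window of samples from the current concept and use
# SMOTE-style interpolation on it to rebalance the base learner's training.
# Features are assumed to be dicts of numbers, as in river.
import random
from collections import Counter, deque
from river import drift

def smote_sample(window, minority):
    """Hypothetical helper: interpolate a random pair of minority samples."""
    pts = [x for x, y in window if y == minority]
    a, b = random.choice(pts), random.choice(pts)
    gap = random.random()
    return {key: a[key] + gap * (b[key] - a[key]) for key in a}

class CSMOTESketch:
    """Simplified sketch, not the authors' code."""

    def __init__(self, base_learner, window_size=1000):
        self.base_learner = base_learner
        self.adwin = drift.ADWIN()               # detects concept changes
        self.window = deque(maxlen=window_size)  # samples of the current concept
        self.seen = Counter()                    # real + synthetic counts per class

    def learn_one(self, x, y):
        self.adwin.update(y)
        if self.adwin.drift_detected:  # concept changed:
            self.window.clear()        # drop the old-concept samples
            self.seen.clear()
        self.window.append((x, y))
        self.seen[y] += 1
        self.base_learner.learn_one(x, y)
        # Rebalance: synthesise minority samples until class counts roughly match.
        if len(self.seen) == 2:
            minority = min(self.seen, key=self.seen.get)
            majority = max(self.seen, key=self.seen.get)
            while self.seen[minority] < self.seen[majority]:
                x_syn = smote_sample(self.window, minority)
                self.base_learner.learn_one(x_syn, minority)
                self.seen[minority] += 1
```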
About the COVID-19 case study: we started from a dataset with 59 attributes, 54 numerical and 5 nominal. The most important attributes are, for example, the country, the date, the numbers of new and total cases and deaths, and also some population markers: for example, the population of each country, the number of hospitals, the number of hospital beds, and so on. On this dataset we applied a pre-processing phase.
We added some date attributes, for example the month, the year, the day of the week, the week of the year, and whether that day was a holiday or not. Most importantly, we added a label that is 0 if today's new cases are less than or equal to yesterday's, or 1 if today's new cases are greater than yesterday's. So the task is to predict whether there are more or fewer cases than yesterday, that is, whether the pandemic is spreading or contracting.
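For illustration, the label could be built with pandas as follows; the column names (country, date, new_cases) are assumptions, not necessarily those of the original dataset.

```python
# Illustrative pre-processing and label construction with pandas;
# the column names ("country", "date", "new_cases") are assumptions.
import pandas as pd

df = pd.read_csv("covid.csv", parse_dates=["date"])
df = df.sort_values(["country", "date"])

# Date attributes added in pre-processing.
df["month"] = df["date"].dt.month
df["year"] = df["date"].dt.year
df["day_of_week"] = df["date"].dt.dayofweek
df["week_of_year"] = df["date"].dt.isocalendar().week

# Label: 1 if today's new cases exceed yesterday's, else 0 (per country).
yesterday = df.groupby("country")["new_cases"].shift(1)
df["label"] = (df["new_cases"] > yesterday).astype(int)
```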
This is a challenging task due to the pandemic's non-stationarity. In general, when the pandemic spreads, we may simply forecast an increasing trend and ignore early signs of decrement; the other way around, when the pandemic contracts or is stable, we may ignore early signs of increment. Moreover, due to the concept drift occurrences, the classes can also swap: all the samples labeled as minority or majority class before a concept drift occurrence can become majority or minority class after it. Now I'll show you some examples. This, for example, is the COVID-19 spreading in Italy
from February 2020 to February 2021. In particular, the figure on the top shows the number of daily COVID-19 cases, while the other shows the increments or decrements of cases with respect to the day before. Here, the red lines represent the concept drift occurrences, i.e., the points where there is clearly a change in the distribution. We can notice that there are several points at which the distribution changes: it is not stationary. In particular, we can also see from this table that for each concept drift the ratio of increments to decrements changes and swaps, which clearly shows that this phenomenon is not stationary and is not easy to detect. I also show you some data about the most important countries in the world: the concept drifts are always different, and the ratio between increments and decrements differs as well. In general, considering all the data,
the minority class here is label 1, that is, the increment of the pandemic. About our research question: since models trained with traditional machine learning techniques on a balanced static batch can achieve higher performance than those trained on an imbalanced dataset, we wanted to know whether, on the COVID-19 stream, which as we saw is imbalanced and subject to concept drift, continuously rebalancing the stream before applying an SML-minus model could improve the performance or not.
To investigate this research question, we stated two hypotheses. The first one is that the minority class results of the SML-minus models pipelined with C-SMOTE are better than those obtained without it. The second one is that the minority class results of at least one SML-minus model pipelined with C-SMOTE are better than those achieved by the state-of-the-art techniques, i.e., the ones natively able to rebalance a stream in the presence of concept drift, which from now on we will call SML-plus models. We first compared C-SMOTE pipelined
with five different SML-minus models against the respective SML-minus models alone. In particular, we tested the Adaptive Random Forest, the Hoeffding Adaptive Tree, Naive Bayes, the k-nearest neighbour, and the Temporally Augmented Classifier techniques. Then we also selected four SML-plus models from the literature and compared them with C-SMOTE. In particular, we selected the Adaptive Random Forest with resampling, the RebalanceStream method, which uses multiple models trained in parallel to rebalance the stream, and the oversampling and undersampling online bagging methods, which combine random oversampling or undersampling with online bagging by tuning its Poisson λ parameter. To evaluate the results achieved in our tests, we used the recall and the F1 measure of both the minority and the majority class, and we also used the G-mean as an overall metric.
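For illustration, these metrics could be computed with scikit-learn as follows; the label arrays are made-up numbers.

```python
# Per-class recall, per-class F1, and G-mean (geometric mean of the
# per-class recalls) with scikit-learn; the arrays are made up.
import numpy as np
from sklearn.metrics import f1_score, recall_score

y_true = np.array([0, 0, 0, 1, 1, 0, 1, 0, 0, 1])
y_pred = np.array([0, 0, 1, 1, 0, 0, 1, 0, 0, 1])

recalls = recall_score(y_true, y_pred, average=None)  # ordered [class 0, class 1]
f1s = f1_score(y_true, y_pred, average=None)
g_mean = np.sqrt(recalls[0] * recalls[1])

print(recalls, f1s, g_mean)
```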
We performed a one-tailed Student's t-test with significance level 0.05 to determine whether there were significant differences between the results obtained with and without C-SMOTE. In particular, the null hypothesis (the light green one) checks whether the average results achieved by C-SMOTE pipelined with an SML-minus model are equal to the average results of the SML-minus model alone or of the SML-plus models. Then we defined the first alternative hypothesis (the green one), which checks whether the average results achieved by C-SMOTE pipelined with an SML-minus model are greater than the results achieved by the SML-minus model alone or by the SML-plus models. The last one is the second alternative hypothesis (the red one), which checks whether the results achieved by C-SMOTE pipelined with an SML-minus model are less than the others.
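A minimal sketch of such a one-tailed test with SciPy (the alternative argument assumes SciPy 1.6 or later); the two groups of results are made-up numbers.

```python
# Illustrative one-tailed Student's t-test (alpha = 0.05) with SciPy;
# the two groups of per-run results are made-up numbers.
from scipy import stats

with_csmote = [0.71, 0.69, 0.74, 0.72, 0.70]     # e.g. minority-class recall
without_csmote = [0.62, 0.65, 0.60, 0.63, 0.61]

# Alternative hypothesis: results with C-SMOTE are greater (one-tailed).
t_stat, p_value = stats.ttest_ind(with_csmote, without_csmote,
                                  alternative="greater")
if p_value < 0.05:
    print("Reject the null hypothesis: the C-SMOTE results are greater.")
```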
Starting from the comparison between C-SMOTE pipelined with the SML-minus models and the SML-minus models alone, we can see, in the minority class part of the table and also in the G-mean (the overall metric), that C-SMOTE is able to improve the performance of the baselines. So we can say that the first hypothesis is accepted. About the second one, there is a little more to say. For example, comparing C-SMOTE with the RebalanceStream algorithm (the first row), only the Hoeffding Adaptive Tree model pipelined with C-SMOTE is able to outperform RebalanceStream in the minority class performance and in the G-mean. The situation is similar with the undersampling online bagging technique (this row): again only the Hoeffding Adaptive Tree outperforms it, but with the positive difference that the G-mean is always improved with all the baselines. Comparing the two remaining algorithms, we can see that C-SMOTE is able to improve the minority class performance and the G-mean using all the baselines tested. So, finally, we can say that the second hypothesis is also accepted. About the conclusions,
the minority class results of the C-SMOTE pipelines are in most cases better than both those of the SML-minus models alone and those of the SML-plus algorithms. Moreover, we saw that the gain in recall on the minority class is bigger than the loss in recall on the majority class. This means that the improvement in the ability to correctly forecast the decrements or the increments when the infection is spreading or contracting is larger than the error introduced. This is, of course, relevant, because it means that C-SMOTE can identify signs of decrement during the spreading periods of the pandemic and signs of increment during the contraction periods. About future work, we aim at analyzing the C-SMOTE trade-off between the improvement in predictive performance and the amount of time and memory needed to train the models, and, even more, at trying to improve the C-SMOTE performance, in particular on the majority class. Thank you very much for your attention.