Abstract:
:A key factor in the fight against viral diseases such as the coronavirus (COVID-19) is the identification of virus carriers as early and quickly as possible, in a cheap and efficient manner. The application of deep learning for image classification of chest X-ray images of COVID-19 patients could become a useful pre-diagnostic detection methodology. However, deep learning architectures require large labelled datasets. This is often a limitation when the subject of research is relatively new as in the case of the virus outbreak, where dealing with small labelled datasets is a challenge. Moreover, in such context, the datasets are also highly imbalanced, with few observations from positive cases of the new disease. In this work we evaluate the performance of the semi-supervised deep learning architecture known as MixMatch with a very limited number of labelled observations and highly imbalanced labelled datasets. We demonstrate the critical impact of data imbalance to the model's accuracy. Therefore, we propose a simple approach for correcting data imbalance, by re-weighting each observation in the loss function, giving a higher weight to the observations corresponding to the under-represented class. For unlabelled observations, we use the pseudo and augmented labels calculated by MixMatch to choose the appropriate weight. The proposed method improved classification accuracy by up to 18%, with respect to the non balanced MixMatch algorithm. We tested our proposed approach with several available datasets using 10, 15 and 20 labelled observations, for binary classification (COVID-19 positive and normal cases). For multi-class classification (COVID-19 positive, pneumonia and normal cases), we tested 30, 50, 70 and 90 labelled observations. Additionally, a new dataset is included among the tested datasets, composed of chest X-ray images of Costa Rican adult patients.
journal_name
Appl Soft Computjournal_title
Applied soft computingauthors
Calderon-Ramirez S,Yang S,Moemeni A,Elizondo D,Colreavy-Donnelly S,Chavarría-Estrada LF,Molina-Cabello MAdoi
10.1016/j.asoc.2021.107692keywords:
["COVID-19","Computer aided diagnosis","Coronavirus","Data imbalance","Semi-supervised learning"]subject
Has Abstractpub_date
2021-11-01 00:00:00pages
107692eissn
1568-4946issn
1872-9681pii
S1568-4946(21)00613-Xjournal_volume
111pub_type
杂志文章abstract::In the aftermath of the COVID-19 pandemic, supply chains experienced an unprecedented challenge to fulfill consumers' demand. As a vital operational component, manual order picking operations are highly prone to infection spread among the workers, and thus, susceptible to interruption. This study revisits the well-kno...
journal_title:Applied soft computing
pub_type: 杂志文章
doi:10.1016/j.asoc.2020.106953
更新日期:2021-03-01 00:00:00
abstract::A pneumonia of unknown causes, which was detected in Wuhan, China, and spread rapidly throughout the world, was declared as Coronavirus disease 2019 (COVID-19). Thousands of people have lost their lives to this disease. Its negative effects on public health are ongoing. In this study, an intelligence computer-aided mo...
journal_title:Applied soft computing
pub_type: 杂志文章
doi:10.1016/j.asoc.2020.106580
更新日期:2020-12-01 00:00:00
abstract::To satisfy a user's need to find and understand the whole picture of an event effectively and efficiently, in this paper we formalize the problem of temporal event searches and propose a framework of event relationship analysis for search events based on user queries. We define three kinds of event relationships: temp...
journal_title:Applied soft computing
pub_type: 杂志文章
doi:10.1016/j.asoc.2019.105750
更新日期:2019-12-01 00:00:00
abstract::Simulation studies are useful in various disciplines for a number of reasons including the development and evaluation of new computational and statistical methods. This is particularly true in human genetics and genetic epidemiology where new analytical methods are needed for the detection and characterization of dise...
journal_title:Applied soft computing
pub_type: 杂志文章
doi:10.1016/j.asoc.2003.08.003
更新日期:2004-02-01 00:00:00
abstract::Because of government intervention, such as quarantine and cancellation of public events at the peak of the COVID-19 outbreak and donors' health scare of exposure to the virus in medical centers, the number of blood donors has considerably decreased. In some countries, the rate of blood donation has reached lower than...
journal_title:Applied soft computing
pub_type: 杂志文章
doi:10.1016/j.asoc.2021.107821
更新日期:2021-08-13 00:00:00