Data mining and machine learning approaches for prediction modelling of schistosomiasis disease vectors: Epidemic Disease Prediction Modelling

Research output: Contribution to journal (Article)

Abstract

This research presents viable solutions for prediction modelling of schistosomiasis disease based on vector density. The novel training models proposed in this work aim to address various aspects of interest in the artificial intelligence applications domain. Topics discussed include data imputation, semi-supervised learning and synthetic instance simulation when using sparse training data. This research applies Remote Sensing and Earth Observation sample data provided by European Space Agency satellites, as well as environment feature characteristics extracted by research partners at The Academy of Opto-Electronics in China. Innovative semi-supervised ensemble learning paradigms are proposed which focus on labelling threshold selection and stringency of classification confidence levels. A Regression-Correlation Combination (RCC) imputation method is also introduced for handling partially complete training data. Results presented in this work show that the proposed RCC method improves data imputation precision over benchmark value replacement. The proposed Incremental Transductive models have provided interesting findings based on threshold constraints that can be applied to alternative environment-based epidemic disease domains. The Synthetic Minority Over-Sampling Technique (SMOTE) Equilibrium approach has yielded subtle classification performance increases, which can be further interrogated to assess the relationships between classification performance, efficiency and synthetic instance generation.
Original language: English
Number of pages: 31
Journal: International Journal of Machine Learning and Cybernetics
Publication status: Accepted/In press - 28 Oct 2019

Keywords

  • Disease Prediction Modelling
  • Data Imputation
  • Synthetic Data Simulation

  • Profiles

    Yaxin Bi
