Abstract
Owing to the nature of biological and medical data, biostatistics plays an increasing role in a wide range of applications in biology and medicine. This article provides insight into some basic concepts and measurement procedures used in biostatistics. The emphasis is on the application of biostatistics to classification, with the aim of designing a good prediction model. Various statistical metrics and significance tests used to evaluate the performance of a predictor are discussed. The article highlights that the values of these metrics should be interpreted with caution in the biological domain, especially when dealing with highly imbalanced datasets.
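The caution about imbalanced datasets can be illustrated with a minimal sketch. It is not taken from the article: the dataset (10 positives among 1000 samples), the always-negative predictor, and the helper names `confusion_counts` and `safe_ratio` are all hypothetical, chosen only to show how accuracy alone may mislead.

```python
# Illustrative sketch only: hypothetical data and helper names, not the article's method.

def confusion_counts(y_true, y_pred):
    """Return (tp, tn, fp, fn) for binary labels encoded as 0/1."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn

def safe_ratio(numerator, denominator):
    """Divide, returning 0.0 when the denominator is zero."""
    return numerator / denominator if denominator else 0.0

# Assumed imbalanced dataset: 10 positives, 990 negatives.
y_true = [1] * 10 + [0] * 990
# Trivial predictor that labels every sample as negative.
y_pred = [0] * 1000

tp, tn, fp, fn = confusion_counts(y_true, y_pred)

accuracy = safe_ratio(tp + tn, tp + tn + fp + fn)
sensitivity = safe_ratio(tp, tp + fn)   # true positive rate
specificity = safe_ratio(tn, tn + fp)   # true negative rate
balanced_accuracy = (sensitivity + specificity) / 2

print(f"accuracy          = {accuracy:.3f}")
print(f"sensitivity       = {sensitivity:.3f}")
print(f"specificity       = {specificity:.3f}")
print(f"balanced accuracy = {balanced_accuracy:.3f}")
```

Under these assumptions the predictor reaches an accuracy of 0.99 while detecting none of the positive cases (sensitivity 0), which is why a single metric should be read alongside others when classes are highly imbalanced.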
Original language | English |
---|---|
Title of host publication | Encyclopedia of Bioinformatics and Computational Biology |
Editors | Shoba Ranganathan, Michael Gribskov, Kenta Nakai, Christian Schönbach |
Publisher | Elsevier |
Pages | 685-690 |
Number of pages | 6 |
Volume | 1 |
ISBN (Electronic) | 9780128114148 |
ISBN (Print) | 9780128114322 |
DOIs | |
Publication status | Published (in print/issue) - 6 Sept 2018 |
Keywords
- Accuracy
- Area under the ROC curve (AUC)
- Classification
- Receiver operating characteristic (ROC) curve
- Test of Significance