Abstract
FFLUX is a Machine Learning Force Field that uses the Maximum Expected Prediction Error (MEPE) active learning algorithm to improve the efficiency of model training. MEPE uses the predictive uncertainty of a Gaussian Process to balance exploration and exploitation when selecting the next training sample. However, the predictive uncertainty of a Gaussian Process is unlikely to be accurate or precise immediately after training. We hypothesize that calibrating the uncertainty quantification within MEPE will improve active learning performance. We develop and test two methods to improve uncertainty estimates: post-hoc calibration of predictive uncertainty using the CRUDE algorithm, and replacing the Gaussian Process with a Student-t Process. We investigate the impact of these methods on MEPE for single sample and batch sample active learning. Our findings suggest that post-hoc calibration does not improve the performance of active learning using the MEPE method. However, we do find that the Student-t Process can outperform active learning strategies and random sampling using a Gaussian Process if the training set is sufficiently large.
| Original language | English |
|---|---|
| Article number | 045034 |
| Pages (from-to) | 1-19 |
| Number of pages | 19 |
| Journal | Machine Learning: Science and Technology |
| Volume | 4 |
| Issue number | 4 |
| Early online date | 23 Nov 2023 |
| DOIs | |
| Publication status | Published online - 23 Nov 2023 |
Bibliographical note
Publisher Copyright:© 2023 The Author(s). Published by IOP Publishing Ltd.
Data Availability Statement
The data that support the findings of this study are openly available at the following URL/DOI: https:// github.com/AdamThomas-Mitchell/uqlab/tree/master/uqlab/data.Funding
We are grateful for the use of the computing resources from the Northern Ireland High Performance Computing (NI-HPC) service. We are also grateful to Yulian Manchev for providing the dataset. P L A P is grateful to the European Research Council (ERC) for the award of an Advanced Grant underwritten by the UKRI-funded Frontier Research grant EP/XO24393/1.
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 9 Industry, Innovation, and Infrastructure
-
SDG 12 Responsible Consumption and Production
Keywords
- machine learning force fields
- Gaussian process regression (GPR)
- calibration
- Active learning
- uncertainty quantification
- active learning
- Gaussian process
Fingerprint
Dive into the research topics of 'Calibration of uncertainty in the active learning of machine learning force fields'. Together they form a unique fingerprint.Datasets
-
Calibration of uncertainty in the active learning of machine learning force fields Dataset
Thomas-Mitchell, A. (Creator), Hawe, G. (Creator) & Popelier, P. L. A. (Creator), Oct 2022
https:// github.com/AdamThomas-Mitchell/uqlab/tree/master/uqlab/data
Dataset
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver