The use of supervised machine learning techniques to identify factors influencing vitamin D bio-enrichment of pork

Erika Rosbotham, Debbie Rankin, Chris IR Gill, EJ McDonald, W.C. McRoberts, Holly Neill, R Boland, L. Kirsty Pourshahidi

Research output: Contribution to journalMeeting Abstractpeer-review

Abstract

Vitamin D bio-enriched foods can complement traditional fortification measures to improve sub optimal vitamin D intakes across the population(Reference Cashman1). Recent attention has been given to bio-enrichment practices such as fortified livestock feed and/or animal UVB exposure to naturally enhance vitamin D content in meat(Reference Duffy2). This study aimed to use supervised machine learning techniques to identify features influencing vitamin D response in bio-enriched pork.

In this study, swine (n = 20) were exposed daily to narrowband UVB radiation (irradiance 440μW/cm2) (n = 10) or standard lighting (n = 10) for a period of 11 weeks prior to slaughter. Vitamin D3 and 25-hydroxyvitamin D3 (25(OH)D3) concentrations were analysed in cooked pork loin by liquid chromatography tandem mass spectrometry. A dataset capturing the features and outcomes of the on-farm study was collected. Rapidminer Studio (version 9.8) was used to identify the most important features influencing a higher vitamin D response in bio-enriched pork. Feature selection utilising information gain indicated eight features of significance for predicting vitamin D response to the bio-enrichment process: body weight gain (%), body weight at slaughter (kg), behaviour, sex, serum 25(OH)D3 (μg/l) at baseline, midpoint and endpoint, and cooking loss (%). These features were used to train five classification models: random forest, Naïve Bayes, decision trees (C4.5 and CART) and KNN. A train/test split of 60/40 was applied with 5-fold cross-validation.

The target variable was vitamin D activity, calculated as [vitamin D3 + (25(OH)D3 x 5)](Reference Cashman3) in bio-enriched pork, categorised as “high” (>19μg/kg) or “low” (≤19μg/kg). Models obtained F1-scores ranging 57.1–77.1%. KNN and C4.5 performed best (77.1% and 75%). Serum 25(OH)D3 levels at the midpoint and endpoint of the study, body weight gain and body weight at slaughter were commonly identified by Naïve Bayes, decision trees (C4.5 and CART) and KNN as the four most important factors influencing vitamin D activity in bio-enriched pork. Feature importance results determined by random forest differed, identifying serum 25(OH)D3 taken at the baseline and midpoint of the study, body weight gain and body weight at slaughter as the four most important features. Results from all five classification models determined sex to be the least important feature influencing the bio-enrichment process.

Supervised machine learning techniques were able to identify the most important features influencing vitamin D response in bio-enriched pork. Preliminary results from the current analysis will be built upon through future studies to enable more robust model development. This will in turn inform bio-enrichment process optimisation for industry to produce novel bio-enriched meat products for consumers.
Original languageEnglish
Article numberE141
JournalProceedings of the Nutrition Society.
Volume80
Issue numberOCE3
DOIs
Publication statusPublished (in print/issue) - 17 Aug 2021

Fingerprint

Dive into the research topics of 'The use of supervised machine learning techniques to identify factors influencing vitamin D bio-enrichment of pork'. Together they form a unique fingerprint.

Cite this