Abstract
| Original language | English |
|---|---|
| Article number | bbad382 |
| Pages (from-to) | 1-17 |
| Number of pages | 17 |
| Journal | Briefings in Bioinformatics |
| Volume | 24 |
| Issue number | 6 |
| Early online date | 26 Oct 2023 |
| DOIs | |
| Publication status | Published (in print/issue) - Nov 2023 |
Bibliographical note
Publisher Copyright:© The Author(s) 2023. Published by Oxford University Press.
Data Access Statement
This study employed four distinct datasets. The LYRIKS dataset, owned by the Institute for Mental Health, Singapore, is not publicly available due to privacy considerations and the absence of participant consent for public data sharing. Researchers interested in accessing this dataset for scientific purposes may reach out directly to the Institute for Mental Health, Singapore, to explore potential data access arrangements. The Bipolar dataset, on the other hand, is publicly accessible and can be downloaded from the following link: [Link]. The Lung Adenocarcinoma (LUAD) dataset, part of The Cancer Genome Atlas (TCGA) PanCancer Atlas study, can be downloaded from the following link: [Link]. The Pancreatic Ductal Adenocarcinoma (PDAC) dataset is publicly available at the following link: [Link] We encourage researchers to utilize these resources in accordance with the respective data use agreements and ethical guidelines.Funding
This research is supported by the MBIE Catalyst: Strategic–New Zealand-Singapore Data Science Research Program and the National Research Foundation, Singapore, under its Industry Alignment Fund–Pre-positioning (IAF-PP) Funding Initiative. The LYRIKS study was supported by the National Research Foundation Singapore under the National Medical Research Council Translational and Clinical Research Flagship Program (NMRC/TCR/003/2008). Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not reflect the views of National Research Foundation, Singapore.
Keywords
- feature selection
- biomarker discovery
- ensemble learning
- high-dimensional data
- genomics
- proteomics