Abstract
Big data classification problems have drawn great attention from diverse fields, and many classifiers have been developed. Among those classifiers, the extended belief rule-based system (EBRBS) has shown its potential in both big data and multiclass situations, while the time complexity and computing efficiency are two challenging issues to be handled in EBRBS. As such, three improvements of EBRBS are proposed first in this paper to decrease the time complexity and computing efficiency of EBRBS for multiclass classification under the assumption of large amount of data, including the strategy to skip rule weight calculation, a simplified evidential reasoning algorithm, and the domain division-based rule reduction method. This turns out to be a micro version of the EBRBS, called Micro-EBRBS. Moreover, one of commonly used cluster computing, named Apache Spark, is then applied to implement the parallel rule generation and inference schemes of the Micro-EBRBS for big data multiclass classification problems. The comparative analyses of experimental studies demonstrate that the Micro-EBRBS not only can obtain a desired accuracy but also has the comparatively better time complexity and computing efficiency than some popular classifiers, especially for multiclass classification problems.
Original language | English |
---|---|
Pages (from-to) | 420-440 |
Number of pages | 21 |
Journal | IEEE Transactions on Systems, Man, and Cybernetics: Systems |
Volume | 51 |
Issue number | 1 |
Early online date | 26 Oct 2018 |
DOIs | |
Publication status | Published (in print/issue) - 1 Jan 2021 |
Bibliographical note
Funding Information:Manuscript received July 1, 2018; accepted September 11, 2018. Date of publication October 26, 2018; date of current version January 12, 2021. This work was supported in part by the National Natural Science Foundation of China under Grant 72001043, Grant 61773123, Grant 71701050, Grant 71501047, and Grant 72001042; in part by the Humanities and Social Science Foundation of the Ministry of Education under Grant 20YJC630188; and in part by the Natural Science Foundation of Fujian Province, China, under Grant 2020J05122. This paper was recommended by Associate Editor L. Wang. (Corresponding author: Ying-Ming Wang.) L.-H. Yang is with the Decision Sciences Institute, Fuzhou University, Fuzhou 350108, China, and also with the Department of Computer Science, University of Jaén, 23008 Jaén, Spain (e-mail: [email protected]).
Publisher Copyright:
© 2013 IEEE.
Copyright:
Copyright 2021 Elsevier B.V., All rights reserved.
Keywords
- Apache spark
- big data
- extended belief rule-based system (EBRBS)
- multiclass