Big data classification problems have drawn great attention from diverse fields and many classifiers have been developed. Among those classifiers, the extended belief rule-based system (EBRBS) has shown its potential in both big data and multi-class situations while time complexity and computing efficiency are two challenging issues to be handled in EBRBS. As such, three improvements of EBRBS are proposed firstly in the present paper to decrease the time complexity and computing efficiency of EBRBS for multi-class classification under the assumption of large amount of data, including the strategy to skip rule weight calculation, a simplified evidential reasoning algorithm, and the domain division-based rule reduction method. This turns out to be a micro version of the EBRBS, classed Micro-EBRBS. Moreover, one of commonly used cluster computing, named Apache Spark, is then applied to implement the parallel rule generation and inference schemes of the Micro-EBRBS for big data multi- class classification problems. The comparative analyses of experimental studies demonstrate that the Micro-EBRBS not only can obtain a desired accuracy, but also has the comparatively better time complexity and computing efficiency than some popular classifiers, especially for multi-class classification problems.
|Number of pages||21|
|Journal||IEEE Transactions on Systems, Man, and Cybernetics: Systems|
|Publication status||Published - 26 Oct 2018|
- Apache spark
- big data
- extended belief rule-based system (EBRBS)