Multiple Sets of Rules for Text Categorization

Y Bi, TJ Anderson, SI McClean

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review


. In this paper, we present an investigation into the combination of rules for text categorization using Dempster’s rule of combination. We first propose a boosting-like technique for generating multiple sets of rules based on rough set theory, and then describe how to use Dempster’s rule of combination to combine the classification decisions produced by multiple sets of rules. We apply these methods to 10 out of the 20-newsgroups – a benchmark data collection, individually and in combination. Our experimental results show that the performance of the best combination of the multiple sets of rules on the 10 groups of the benchmark data can achieve 80.47% classification accuracy, which is 3.24% better than that of the best single set of rules.
Original languageEnglish
Title of host publicationUnknown Host Publication
Number of pages10
Publication statusPublished (in print/issue) - Oct 2004
EventAdvances in Information Systems 2004 - Izmir, Turkey
Duration: 1 Oct 2004 → …


ConferenceAdvances in Information Systems 2004
Period1/10/04 → …


Dive into the research topics of 'Multiple Sets of Rules for Text Categorization'. Together they form a unique fingerprint.

Cite this