Abstract
Data fusion in information retrieval has been investigated by many researchers and quite a few data fusion methods have been proposed. However, their impact on effectiveness has not been well understood. In this paper, we apply statistical principles to data fusion and present a statistical data fusion model, which specifies the algorithm for fusion and conditions to be satisfied. The statistical model can be used as a guideline for data fusion methods. Based on this analysis, we compare CombSum and CombMNZ, which are the two best-known data fusion methods. We explain why sometimes CombMNZ does outperform Comb- Sum and what can be done to make CombSum more effective. Experimental results with TREC data are reported to support the conclusion that our enhancements to the algorithm improve effectiveness.
Original language | English |
---|---|
Title of host publication | Unknown Host Publication |
Publisher | IEEE |
Pages | 313-319 |
Number of pages | 7 |
ISBN (Print) | 978-1-4244-0991-4 |
DOIs | |
Publication status | Published (in print/issue) - 2007 |
Event | IEEE International Conference on Systems, Man and Cybernetics, 2007 - Montreal, Canada Duration: 1 Jan 2007 → … |
Conference
Conference | IEEE International Conference on Systems, Man and Cybernetics, 2007 |
---|---|
Period | 1/01/07 → … |