Abstract
The Latent Dirichlet Allocation model is an unsupervised generative model that is widely used for topic modelling in text. We propose to add supervision to the model in the form of domain knowledge to direct the focus of topics to more relevant aspects than the topics produced by standard LDA. Experimental results demonstrate the effectiveness of our method. We also propose a novel Twofold-LDA model to improve the current output of LDA in order to visualize results in graphical form, which can ultimately be used by potential customers. Experiments show the benefit of this new output, with the ability to produce topics focused on our desired aspects in a user friendly chart.
Original language | English |
---|---|
Title of host publication | Unknown Host Publication |
Place of Publication | Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011 IEEE/WIC/ACM International Conference on (Volume:1 ) |
Publisher | IEEE |
Pages | 253-256 |
Number of pages | 4 |
Volume | 1 |
ISBN (Print) | 978-1-4577-1373-6 |
DOIs | |
Publication status | Published (in print/issue) - 2011 |
Event | 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Duration: 1 Jan 2011 → … |
Conference
Conference | 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology |
---|---|
Period | 1/01/11 → … |