Please use this identifier to cite or link to this item:
https://hdl.handle.net/10316/35717
Title: | Supervised Topic Models with Multiple Annotators | Authors: | Lourenço, Mariana Rodrigues | Orientador: | Ribeiro, Bernardete Martins | Keywords: | Annotators | Issue Date: | 17-Jul-2015 | Serial title, monograph or event: | Supervised Topic Models with Multiple Annotators | Place of publication or event: | Coimbra | Abstract: | We live in an era where information over ows. Yet, for this information to become knowledge, it has to be given meaning. This thesis focuses on a machine learning approach that evolved from probabilistic graphical models, which automatically extracts knowledge from vast amounts of data by assigning themes to documents: topic modeling. Topic models are an emergent technique used for both descriptive and predictive tasks. As a result, it was soon extended to other goals that do not only model topics, but also target variables. This work presents a supervised topic model that is able to learn from crowds. That is, we consider the case where the label set of the data was provided by multiple annotators. In the multi-annotator setting, the ground truth labels need to be modeled from several noisy versions of them given by the di erent annotators. To address this sort of problems, it is often assumed that all labelers are equally reliable through the use of voting techniques, which was proven to be an unrealistic conjecture. On the contrary, the proposed model takes into account the di erent levels of expertise and biases of annotators, by jointly modeling them together with the topics and the true labels. In order to make this process computationally tractable, a variational inference algorithm was developed, which provides an e cient approximate inference method. We nalize by showing how general supervised topic models can be used to predict demand in special events by correlating internet search query data with real measurements of transport usage, thus, motivating the usage of the topic models in real-world applications. | Description: | Dissertação de Mestrado em Engenharia Informática apresentada à Faculdade de Ciências e Tecnologia da Universidade de Coimbra | URI: | https://hdl.handle.net/10316/35717 | Rights: | openAccess |
Appears in Collections: | UC - Dissertações de Mestrado FCTUC Eng.Informática - Teses de Mestrado |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Supervised Topic Models with Multiple Annotators.pdf | 7.55 MB | Adobe PDF | View/Open |
Page view(s) 20
688
checked on Apr 16, 2024
Download(s)
276
checked on Apr 16, 2024
Google ScholarTM
Check
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.