Please use this identifier to cite or link to this item: https://hdl.handle.net/10316/35717
Title: Supervised Topic Models with Multiple Annotators
Authors: Lourenço, Mariana Rodrigues 
Orientador: Ribeiro, Bernardete Martins
Keywords: Annotators
Issue Date: 17-Jul-2015
Serial title, monograph or event: Supervised Topic Models with Multiple Annotators
Place of publication or event: Coimbra
Abstract: We live in an era where information over ows. Yet, for this information to become knowledge, it has to be given meaning. This thesis focuses on a machine learning approach that evolved from probabilistic graphical models, which automatically extracts knowledge from vast amounts of data by assigning themes to documents: topic modeling. Topic models are an emergent technique used for both descriptive and predictive tasks. As a result, it was soon extended to other goals that do not only model topics, but also target variables. This work presents a supervised topic model that is able to learn from crowds. That is, we consider the case where the label set of the data was provided by multiple annotators. In the multi-annotator setting, the ground truth labels need to be modeled from several noisy versions of them given by the di erent annotators. To address this sort of problems, it is often assumed that all labelers are equally reliable through the use of voting techniques, which was proven to be an unrealistic conjecture. On the contrary, the proposed model takes into account the di erent levels of expertise and biases of annotators, by jointly modeling them together with the topics and the true labels. In order to make this process computationally tractable, a variational inference algorithm was developed, which provides an e cient approximate inference method. We nalize by showing how general supervised topic models can be used to predict demand in special events by correlating internet search query data with real measurements of transport usage, thus, motivating the usage of the topic models in real-world applications.
Description: Dissertação de Mestrado em Engenharia Informática apresentada à Faculdade de Ciências e Tecnologia da Universidade de Coimbra
URI: https://hdl.handle.net/10316/35717
Rights: openAccess
Appears in Collections:UC - Dissertações de Mestrado
FCTUC Eng.Informática - Teses de Mestrado

Files in This Item:
File Description SizeFormat
Supervised Topic Models with Multiple Annotators.pdf7.55 MBAdobe PDFView/Open
Show full item record

Page view(s) 20

688
checked on Apr 16, 2024

Download(s)

276
checked on Apr 16, 2024

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.