Utilize este identificador para referenciar este registo: https://hdl.handle.net/10316/35717
Título: Supervised Topic Models with Multiple Annotators
Autor: Lourenço, Mariana Rodrigues 
Orientador: Ribeiro, Bernardete Martins
Palavras-chave: Annotators
Data: 17-Jul-2015
Título da revista, periódico, livro ou evento: Supervised Topic Models with Multiple Annotators
Local de edição ou do evento: Coimbra
Resumo: We live in an era where information over ows. Yet, for this information to become knowledge, it has to be given meaning. This thesis focuses on a machine learning approach that evolved from probabilistic graphical models, which automatically extracts knowledge from vast amounts of data by assigning themes to documents: topic modeling. Topic models are an emergent technique used for both descriptive and predictive tasks. As a result, it was soon extended to other goals that do not only model topics, but also target variables. This work presents a supervised topic model that is able to learn from crowds. That is, we consider the case where the label set of the data was provided by multiple annotators. In the multi-annotator setting, the ground truth labels need to be modeled from several noisy versions of them given by the di erent annotators. To address this sort of problems, it is often assumed that all labelers are equally reliable through the use of voting techniques, which was proven to be an unrealistic conjecture. On the contrary, the proposed model takes into account the di erent levels of expertise and biases of annotators, by jointly modeling them together with the topics and the true labels. In order to make this process computationally tractable, a variational inference algorithm was developed, which provides an e cient approximate inference method. We nalize by showing how general supervised topic models can be used to predict demand in special events by correlating internet search query data with real measurements of transport usage, thus, motivating the usage of the topic models in real-world applications.
Descrição: Dissertação de Mestrado em Engenharia Informática apresentada à Faculdade de Ciências e Tecnologia da Universidade de Coimbra
URI: https://hdl.handle.net/10316/35717
Direitos: openAccess
Aparece nas coleções:UC - Dissertações de Mestrado
FCTUC Eng.Informática - Teses de Mestrado

Ficheiros deste registo:
Ficheiro Descrição TamanhoFormato
Supervised Topic Models with Multiple Annotators.pdf7.55 MBAdobe PDFVer/Abrir
Mostrar registo em formato completo

Visualizações de página 20

689
Visto em 23/abr/2024

Downloads

276
Visto em 23/abr/2024

Google ScholarTM

Verificar


Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.