Please use this identifier to cite or link to this item: https://hdl.handle.net/10316/35724
DC FieldValueLanguage
dc.contributor.advisorOliveira, Hugo Ricardo Gonçalo-
dc.contributor.authorFerrugento, Adriana Figueiredo-
dc.date.accessioned2017-01-13T16:07:09Z-
dc.date.available2017-01-13T16:07:09Z-
dc.date.issued2015-07-15-
dc.identifier.urihttps://hdl.handle.net/10316/35724-
dc.descriptionDissertação de Mestrado em Engenharia Informática apresentada à Faculdade de Ciências e Tecnologia da Universidade de Coimbrapt
dc.description.abstractTopic models came to improve the way search, browse and summarization of large sets of texts is performed. These models are used for uncovering the main theme of the documents in a corpus, where topics are probability distributions over a collection of words that is representative of a document. The most widely used topic model is called Latent Dirichlet Allocation (LDA) and it enables for documents to be characterized by more than one topic. This allows for a more accurate representation of what happens with real documents, where a text may have more than one underlying theme. However, this popular model is still far from producing excellent topics, given that it does not account for the semantic relations between words. It may thus result in redundant topics that contain di erent words, but with the same meaning. This thesis o ers a way to improve the LDA algorithm and, hence, solve the problem of not considering the semantics of words. The model proposed here uses the LDA algorithm as a starting point, however some changes are made, since it is our interest to introduce semantic relations in this model. A main component of the proposed model is the use of a lexical database for English, WordNet, which enables the integration of semantics by accessing its content.pt
dc.language.isoengpt
dc.rightsopenAccesspt
dc.subjectSemantic Topic Modellingpt
dc.titleSemantic Topic Modellingpt
dc.typemasterThesispt
degois.publication.locationCoimbrapt
degois.publication.titleSemantic Topic Modellingpor
dc.date.embargo2015-07-15*
dc.identifier.tid201537966pt
thesis.degree.grantor00500::Universidade de Coimbrapt
thesis.degree.nameMestrado em Engenharia Informática-
uc.degree.grantorUnit0501 - Faculdade de Ciências e Tecnologiapor
uc.rechabilitacaoestrangeiranopt
uc.date.periodoEmbargo0pt
item.grantfulltextopen-
item.fulltextCom Texto completo-
item.openairetypemasterThesis-
item.languageiso639-1en-
item.openairecristypehttp://purl.org/coar/resource_type/c_18cf-
item.cerifentitytypePublications-
crisitem.advisor.researchunitCISUC - Centre for Informatics and Systems of the University of Coimbra-
crisitem.advisor.parentresearchunitFaculty of Sciences and Technology-
crisitem.advisor.orcid0000-0002-5779-8645-
Appears in Collections:UC - Dissertações de Mestrado
FCTUC Eng.Informática - Teses de Mestrado
Files in This Item:
File Description SizeFormat
Semantic Topic Modelling.pdf1.02 MBAdobe PDFView/Open
Show simple item record

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.