Utilize este identificador para referenciar este registo: https://hdl.handle.net/10316/107675
Título: Distributional and Knowledge-Based Approaches for Computing Portuguese Word Similarity
Autor: Oliveira, Hugo Gonçalo 
Palavras-chave: semantic similarity; word similarity; lexical knowledge bases; lexical semantics; word embeddings; distributional semantics
Data: 2018
Editora: MDPI
Título da revista, periódico, livro ou evento: Information (Switzerland)
Volume: 9
Número: 2
Resumo: Identifying similar and related words is not only key in natural language understanding but also a suitable task for assessing the quality of computational resources that organise words and meanings of a language, compiled by different means. This paper, which aims to be a reference for those interested in computing word similarity in Portuguese, presents several approaches for this task and is motivated by the recent availability of state-of-the-art distributional models of Portuguese words, which add to several lexical knowledge bases (LKBs) for this language, available for a longer time. The previous resources were exploited to answer word similarity tests, which also became recently available for Portuguese. We conclude that there are several valid approaches for this task, but not one that outperforms all the others in every single test. Distributional models seem to capture relatedness better, while LKBs are better suited for computing genuine similarity, but, in general, better results are obtained when knowledge from different sources is combined.
URI: https://hdl.handle.net/10316/107675
ISSN: 2078-2489
DOI: 10.3390/info9020035
Direitos: openAccess
Aparece nas coleções:I&D CISUC - Artigos em Revistas Internacionais
FCTUC Eng.Informática - Artigos em Revistas Internacionais

Mostrar registo em formato completo

Visualizações de página

61
Visto em 17/jul/2024

Downloads

21
Visto em 17/jul/2024

Google ScholarTM

Verificar

Altmetric

Altmetric


Este registo está protegido por Licença Creative Commons Creative Commons