Please use this identifier to cite or link to this item: https://hdl.handle.net/10316/99659
Title: Visual recognition for localization purposes using omnidirectional images maps
Authors: Pedro, Vítor Manuel Castela
Orientador: Barreto, João Pedro de Almeida
Issue Date: Sep-2011
Place of publication or event: Coimbra
Abstract: This thesis is an exploratory work with the objective of developing techniques for recognition using visual maps constituted by paracatadioptric images. This kind of images contain a more complete description of the scene, covering a wider area than e.g. perspective images. However, the radial distortion they present is high which difficult their usage. The main challenge of the thesis is establishing correspondences between a query image captured by a standard camera and the paracatadioptric images. Different types of rectification strategies are studied in order to correct the radial distortion present in the paracatadioptric images. Matching between perspective images and rectified images from the paracatadioptric images is performed, using SIFT algorithm [18]. Also, a simple procedure to roughly estimate the calibration matrix of the system is explained. Additionally, several modifications to the original SIFT algorithm are proposed, including a change in the initial scale for the Gaussian blurring and a novel approach for feature detection and description of stable local features, directly over the paracatadioptric images (based on [17]). With this approach, the detection is carried in a scale-space image representation built using an adaptive Gaussian filter that takes into account the geometry of the paracatadioptric system. As an additional result, a brief study of feature detection and matching between paracatadioptric images is performed. A new mapping function for the adaptive filtering is tested, outperforming the other approaches. Another important topic of the thesis is to perform image based localization using a database of omnidirectional images. A recognition scheme using an hierarchical vocabulary tree is built, based on [25]. Different visual vocabularies and training data are used. Additionally, several methods for feature extraction in the database images are analyzed, including the original SIFT algorithm and SIFT with implicit filtering. Finally, methods to improve the retrieval performance are studied. To encode more spatial information in the searching step, the concept of geometry-preserving visual phrases is used [33]. Additionally, to provide a more precise ranking of the retrieved images, a geometric consistency check (using RANSAC) is performed on the top-ranked images.
Description: Dissertação de Mestrado em Engenharia Electrotécnica e de Computadores apresentada à Faculdade de Ciências e Tecnologia da Universidade de Coimbra.
URI: https://hdl.handle.net/10316/99659
Rights: openAccess
Appears in Collections:FCTUC Eng.Electrotécnica - Teses de Mestrado

Files in This Item:
File Description SizeFormat
MsCthesis_VitorPedro.pdf70.87 MBAdobe PDFView/Open
Show full item record

Page view(s)

38
checked on Apr 23, 2024

Download(s)

10
checked on Apr 23, 2024

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.