2014
Authors
Hertzberg, J; Zhang, J; Zhang, L; Rockel, S; Neumann, B; Lehmann, J; Dubba, KSR; Cohn, AG; Saffiotti, A; Pecora, F; Mansouri, M; Konecný,; Günther, M; Stock, S; Lopes, LS; Oliveira, M; Lim, GH; Kasaei, H; Mokhtari, V; Hotz, L; Bohlken, W;
Publication
Künstl Intell - KI - Künstliche Intelligenz
Abstract
2016
Authors
Sappa, AD; Carvajal, JA; Aguilera, CA; Oliveira, M; Romero, D; Vintimilla, BX;
Publication
SENSORS
Abstract
This paper evaluates different wavelet-based cross-spectral image fusion strategies adopted to merge visible and infrared images. The objective is to find the best setup independently of the evaluation metric used to measure the performance. Quantitative performance results are obtained with state of the art approaches together with adaptations proposed in the current work. The options evaluated in the current work result from the combination of different setups in the wavelet image decomposition stage together with different fusion strategies for the final merging stage that generates the resulting representation. Most of the approaches evaluate results according to the application for which they are intended for. Sometimes a human observer is selected to judge the quality of the obtained results. In the current work, quantitative values are considered in order to find correlations between setups and performance of obtained results; these correlations can be used to define a criteria for selecting the best fusion strategy for a given pair of cross-spectral images. The whole procedure is evaluated with a large set of correctly registered visible and infrared image pairs, including both Near InfraRed (NIR) and Long Wave InfraRed (LWIR).
2014
Authors
Kasaei, SH; Oliveira, M; Lim, GH; Lopes, LS; Tome, AM;
Publication
2014 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC)
Abstract
Three-dimensional object detection and recognition is increasingly in manipulation and navigation applications in autonomous service robots. It involves clustering points of the point cloud from an unstructured scene into objects candidates and estimating features to recognize the objects under different circumstances such as occlusions and clutter. This paper presents an efficient approach capable of learning and recognizing object categories in an interactive and open-ended manner. In this paper, "open-ended" implies that the set of object categories to be learned is not known in advance. The training instances are extracted from actual experiences of a robot, and thus become gradually available, rather than being available at the beginning of the learning process. This paper focuses on two state-of-the-art questions: (1) How to automatically detect, conceptualize and recognize objects in 3D unstructured scenes in an open-ended manner? (2) How to acquire and utilize high-level knowledge obtained from the user (e. g. category label) to improve the system performance? This approach starts with a pre-processing phase to remove unnecessary information and prepare a suitable point cloud. Clustering is then applied to detect object candidates. Subsequently, all object candidates are described based on a 3D shape descriptor called spin-image. Finally, a nearest-neighbor classification rule is used to assign category labels to the detected objects. To examine the performance of the proposed approach, a leave-one-out cross validation algorithm is utilized to compute precision and recall. The experimental results show the fulfilling performance of this approach on different types of objects.
2014
Authors
Oliveira, M; Lim, GH; Lopes, LS; Kasaei, SH; Tome, AM; Chauhan, A;
Publication
2014 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2014)
Abstract
This paper addresses the problem of grounding semantic representations in intelligent service robots. In particular, this work contributes to addressing two important aspects, namely the anchoring of object symbols into the perception of the objects and the grounding of object category symbols into the perception of known instances of the categories. The paper discusses memory requirements for storing both semantic and perceptual data and, based on the analysis of these requirements, proposes an approach based on two memory components, namely a semantic memory and a perceptual memory. The perception, memory, learning and interaction capabilities, and the perceptual memory, are the main focus of the paper. Three main design options address the key computational issues involved in processing and storing perception data: a lightweight, NoSQL database, is used to implement the perceptual memory; a thread-based approach with zero copy transport of messages is used in implementing the modules; and a multiplexing scheme, for the processing of the different objects in the scene, enables parallelization. The system is designed to acquire new object categories in an incremental and open-ended way based on user-mediated experiences. The system is fully integrated in a broader robot system comprising low-level control and reactivity to high-level reasoning and learning.
2016
Authors
Kasaei, SH; Lopes, LS; Tome, AM; Oliveira, M;
Publication
2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016)
Abstract
Object representation is one of the most challenging tasks in robotics because it must provide reliable information in real-time to enable the robot to physically interact with the objects in its environment. To ensure reliability, a global object descriptor must be computed based on a unique and repeatable object reference frame. Moreover, the descriptor should contain enough information enabling to recognize the same or similar objects seen from different perspectives. This paper presents a new object descriptor named Global Orthographic Object Descriptor (GOOD) designed to be robust, descriptive and efficient to compute and use. The performance of the proposed object descriptor is compared with the main state-of-the-art descriptors. Experimental results show that the overall classification performance obtained with GOOD is comparable to the best performances obtained with the state-ofthe-art descriptors. Concerning memory and computation time, GOOD clearly outperforms the other descriptors. Therefore, GOOD is especially suited for real-time applications.
2016
Authors
Oliveira, M; Santos, V; Sappa, AD; Dias, P;
Publication
ROBOT 2015: SECOND IBERIAN ROBOTICS CONFERENCE: ADVANCES IN ROBOTICS, VOL 1
Abstract
In this paper, we present a novel methodology to compute a 3D scene representation. The algorithm uses macro scale polygonal primitives to model the scene. This means that the representation of the scene is given as a list of large scale polygons that describe the geometric structure of the environment. Results show that the approach is capable of producing accurate descriptions of the scene. In addition, the algorithm is very efficient when compared to other techniques.
The access to the final selection minute is only available to applicants.
Please check the confirmation e-mail of your application to obtain the access code.