2012
Authors
Oliveira, M; Sappa, AD; Santos, V;
Publication
2012 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV)
Abstract
The current paper proposes a novel color correction approach for onboard multi-camera systems. It works by segmenting the given images into several regions. A probabilistic segmentation framework, using 3D Gaussian Mixture Models, is proposed. Regions are used to compute local color correction functions, which are then combined to obtain the final corrected image. An image data set of road scenarios is used to compare the performance of the proposed method with seven other well-known color correction algorithms. Results show that the proposed approach is the highest scoring color correction method. Also, the proposed single-step 3D color space probabilistic segmentation reduces processing time over similar approaches.
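The idea of combining local correction functions through a probabilistic segmentation can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a single color channel with 1D Gaussian components (the paper uses 3D Gaussian Mixture Models), and it assumes each local correction function is a simple per-region gain. The names `memberships`, `correct_pixel`, `components` and `gains` are hypothetical.

```python
import math


def gaussian(x, mean, std):
    """Density of a 1D Gaussian at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def memberships(value, components):
    """Soft assignment of a pixel value to each mixture component.

    components: list of (weight, mean, std) tuples (assumed 1D for brevity).
    Returns one probability per component; the probabilities sum to 1.
    """
    likelihoods = [w * gaussian(value, m, s) for w, m, s in components]
    total = sum(likelihoods)
    if total == 0.0:
        # Value is far from every component: fall back to a uniform assignment.
        return [1.0 / len(components)] * len(components)
    return [l / total for l in likelihoods]


def correct_pixel(value, components, gains):
    """Blend the local corrections, weighted by region membership.

    gains: one multiplicative gain per component (an assumed, simplified
    form of a local color correction function).
    """
    probs = memberships(value, components)
    return sum(p * g * value for p, g in zip(probs, gains))


if __name__ == "__main__":
    # Two illustrative regions: a dark one to brighten, a bright one to darken.
    components = [(0.5, 50.0, 5.0), (0.5, 200.0, 5.0)]
    gains = [2.0, 0.5]
    for v in (50.0, 125.0, 200.0):
        print(v, correct_pixel(v, components, gains))
```

Weighting by soft memberships rather than hard labels means pixels near a region boundary receive a blend of neighboring corrections, which avoids visible seams between regions.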
2010
Authors
Santos, V; Almeida, J; Avila, E; Gameiro, D; Oliveira, M; Pascoal, R; Sabino, R; Stein, P;
Publication
IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC
Abstract
The future of intelligent vehicles will rely on robust information to provide proper feedback to the vehicle itself, to trigger several kinds of active safety measures and, above all, to inform the driver by calling his or her attention to potential instantaneous or mid-term risks associated with driving. Before true vehicle autonomy, safety and driver assistance are a priority. Sophisticated sensorial and perceptive mechanisms must be made available, first to assist the driver and, in a later phase, to contribute to greater autonomy. These mechanisms rely on sensors and algorithms that are mostly available nowadays, but many of them are still unsuited for critical situations. This paper presents a project in which engineering and scientific solutions have been devised to set up a full-featured, real-scale platform for the next generation of ITS vehicles, addressing the immediate issues of navigation and the challenges on the road. The car is now ready and running, and the data gathering has just begun. ©2010 IEEE.
2012
Authors
Almeida, M; Dias, P; Oliveira, M; Santos, V;
Publication
IMAGE ANALYSIS AND RECOGNITION, PT I
Abstract
The AtlasCar is a prototype that is being developed at the University of Aveiro to research advanced driver assistance systems. The car is equipped with several sensors: 3D and 2D laser scanners, a stereo camera, inertial sensors and GPS. Combining all this sensor data into useful representations is essential. Therefore, calibration is one of the first problems to tackle. This paper focuses on 3D/2D laser calibration. The proposed method uses a 3D Laser Range Finder (LRF) to produce a reference 3D point cloud containing a known calibration object. Manual input from the user and knowledge of the object geometry are used to register the 3D point cloud with the 2D lasers. Experimental results with simulated and real data demonstrate the effectiveness of the proposed calibration method.
2012
Authors
Oliveira, M; Sappa, AD; Santos, V;
Publication
IMAGE ANALYSIS AND RECOGNITION, PT I
Abstract
The current paper proposes a novel color correction approach based on a probabilistic segmentation framework using 3D Gaussian Mixture Models. Regions are used to compute local color correction functions, which are then combined to obtain the final corrected image. The proposed approach is evaluated using both a recently published metric and two large data sets composed of seventy images. The evaluation is performed by comparing our algorithm with eight well-known color correction algorithms. Results show that the proposed approach is the highest scoring color correction method. Also, the proposed single-step 3D color space probabilistic segmentation reduces processing time over similar approaches.
2014
Authors
Oliveira, M; Santos, V; Sappa, AD;
Publication
Information Fusion
Abstract
Over the past years, inverse perspective mapping has been successfully applied to several problems in the field of Intelligent Transportation Systems. In brief, the method consists of mapping images to a new coordinate system where perspective effects are removed. Removing perspective-related effects facilitates road and obstacle detection and also assists in free space estimation. There is, however, a significant limitation in inverse perspective mapping: the presence of obstacles on the road disrupts the effectiveness of the mapping. The current paper proposes a robust solution based on the use of multimodal sensor fusion. Data from a laser range finder is fused with images from the cameras, so that the mapping is not computed in the regions where obstacles are present. As shown in the results, this considerably improves the effectiveness of the algorithm and reduces computation time when compared with classical inverse perspective mapping. Furthermore, the proposed approach is also able to cope with several cameras with different lenses or image resolutions, as well as dynamic viewpoints. © 2014 Elsevier B.V.
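A minimal sketch of the masking idea described above follows. It assumes the road is a plane related to the image by a known 3x3 homography `H`, and that the laser data has already been converted into a boolean grid `obstacle_mask` over the top-view cells; both names, and the function names, are hypothetical and not taken from the paper. Images are plain nested lists to keep the example self-contained.

```python
def apply_homography(H, x, y):
    """Map a ground-plane point (x, y) to image coordinates (u, v).

    Returns None when the point maps to infinity (w == 0).
    """
    u = H[0][0] * x + H[0][1] * y + H[0][2]
    v = H[1][0] * x + H[1][1] * y + H[1][2]
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    if w == 0:
        return None
    return u / w, v / w


def inverse_perspective_map(image, H, grid_w, grid_h, obstacle_mask=None):
    """Build a top view by sampling the image at each ground cell.

    Cells flagged in obstacle_mask are skipped, so no mapping is computed
    where the flat-road assumption is known to be violated.
    """
    height, width = len(image), len(image[0])
    top_view = [[None] * grid_w for _ in range(grid_h)]
    for gy in range(grid_h):
        for gx in range(grid_w):
            if obstacle_mask is not None and obstacle_mask[gy][gx]:
                continue  # obstacle reported by the range sensor: leave empty
            mapped = apply_homography(H, gx, gy)
            if mapped is None:
                continue
            col, row = int(round(mapped[0])), int(round(mapped[1]))
            if 0 <= row < height and 0 <= col < width:
                top_view[gy][gx] = image[row][col]
    return top_view


if __name__ == "__main__":
    identity = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
    image = [[1, 2], [3, 4]]
    mask = [[False, True], [False, False]]
    print(inverse_perspective_map(image, identity, 2, 2, mask))
```

Skipping masked cells also saves the per-cell projection work, which is consistent with the reduced computation time reported in the abstract.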
2015
Authors
Kasaei, SH; Oliveira, M; Lim, GH; Lopes, LS; Tome, AM;
Publication
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS
Abstract
3D object detection and recognition is increasingly used for manipulation and navigation tasks in service robots. It involves segmenting the objects present in a scene, estimating a feature descriptor for the object view and, finally, recognizing the object view by comparing it to the known object categories. This paper presents an efficient approach capable of learning and recognizing object categories in an interactive and open-ended manner. In this paper, "open-ended" implies that the set of object categories to be learned is not known in advance. The training instances are extracted from on-line experiences of a robot, and thus become gradually available over time, rather than at the beginning of the learning process. This paper focuses on two state-of-the-art questions: (1) How to automatically detect, conceptualize and recognize objects in 3D scenes in an open-ended manner? (2) How to acquire and use high-level knowledge obtained from the interaction with human users, namely when they provide category labels, in order to improve the system performance? This approach starts with a pre-processing step to remove irrelevant data and prepare a suitable point cloud for the subsequent processing. Clustering is then applied to detect object candidates, and object views are described based on a 3D shape descriptor called spin-image. Finally, a nearest-neighbor classification rule is used to predict the categories of the detected objects. A leave-one-out cross validation algorithm is used to compute precision and recall, in a classical off-line evaluation setting, for different system parameters. Also, an on-line evaluation protocol is used to assess the performance of the system in an open-ended setting. Results show that the proposed system is able to interact with human users, learning new object categories continuously over time.
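The off-line evaluation step, nearest-neighbor classification scored with leave-one-out cross validation, can be sketched as follows. This is an illustration under simplifying assumptions, not the system described in the paper: descriptors are short lists of floats standing in for spin-images, distance is Euclidean, and the function names are hypothetical.

```python
import math


def nearest_neighbor(query, instances):
    """Return the label of the stored instance closest to the query."""
    best_label, best_dist = None, math.inf
    for features, label in instances:
        d = math.dist(query, features)  # Euclidean distance (Python 3.8+)
        if d < best_dist:
            best_label, best_dist = label, d
    return best_label


def leave_one_out(instances):
    """Classify each instance using all the others as training data.

    Returns a list of (true_label, predicted_label) pairs.
    """
    results = []
    for i, (features, label) in enumerate(instances):
        rest = instances[:i] + instances[i + 1:]
        results.append((label, nearest_neighbor(features, rest)))
    return results


def precision_recall(results, category):
    """Precision and recall for one category from (true, predicted) pairs."""
    tp = sum(1 for t, p in results if t == category and p == category)
    fp = sum(1 for t, p in results if t != category and p == category)
    fn = sum(1 for t, p in results if t == category and p != category)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


if __name__ == "__main__":
    # Toy descriptors; real object views would be described by spin-images.
    views = [
        ([0.0, 0.0], "mug"), ([0.0, 1.0], "mug"),
        ([5.0, 5.0], "box"), ([5.0, 6.0], "box"),
    ]
    results = leave_one_out(views)
    print(precision_recall(results, "mug"))
```

An instance-based rule like this suits the open-ended setting because adding a new category only requires storing its labeled views; nothing has to be retrained.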