Satwik Kottur bio photo

Satwik Kottur

Graduate Student
Carnegie Mellon University
Pittsburgh, PA.

Email Facebook Google+ LinkedIn Github

Static Vehicle Detection and Analysis in Aerial Imagery using Depth

Static Vehicle Detection and Analysis in Aerial Imagery using Depth

Internship at IRIS, University of Southern California, Summer 2013
Guide: Prof. Gerard Medioni
[Report] [Poster]

Vehicle detection and identification in urban scenarios has many civilian, security and surveillance application. The easy availability of aerial images along with its advantage of spanning wide areas due to large field of view, makes the problem of such identification in aerial images interesting. Primarily, this work proposes a detection algorithm using depth information for identification of static vehicles, particularly in parking lots. The scene depth is obtained from Digital Surface Model (DSM) of the urban landscape. Finally, the dependence of performance on the 3D model parameters is analyzed.

Original aerial image (left), detections based on depth (right).

From the approaches proposed by various researchers in the community, [1, 2, 3, 4] are closely related to the current work. [1] provides the 3D model of the landscape used in the proposed algorithm. Depth features have been explored for vehicle identification by B. Yang et al. in [2]. However, [2] uses LIDAR data unlike [1] which uses stream of aerial images for estimating scene depth.

In this work, depth features are obtained from [1], which itself is a novel technique to simultaneously solve for camera positions and dense depth estimation. For a given image, height is backprojected onto it using the 3D model. Haar-like features are used for a small portion of the image using sliding window, to estimate the presence or absence of a vehicle. In order to improve the accuracy, validation is incorporated using the edge-saliency of wire frame model as proposed in [4]. To understand the behavior of the system better, analysis of the performance is performed by varying the parameters of optical flow (regularization term, number of iterations, etc.) used in 3D model generation. A high negative correlation between the regularization term and accuracy of the system can be clearly concluded. This analysis can be used to better the overall system thereby an indicative of the scope for improvement.


  1. Zhouliang Kang, Gerard Medioni. 3D Urban Reconstruction from Wide Area Aerial Surveillance Video. Workshop on Applications for Aerial Video Exploitation (WAVE), 2015. [Paper]
  2. Bo Yang, Pramod Sharma, Ram Nevatia. Vehicle Detection from Low Quality Aerial LIDAR Data. [Paper]
  3. Y. Lin H. Liao and G. Medioni. Aerial 3D reconstruction with line-constrained dynamic programming. In ICCV,2011. [Paper]
  4. Stefan Hinz. Integrating local and global features for vehicle detection in high resolution aerial imagery. ISPRS Archives,2003, XXXIV(3/W8) [Paper]