CONFIDENTIAL materials. Contact the instructor for related claims.
Last updated: 05/10/2013 10:59 EST
Feature matching based 3D reconstruction is a standard technique in 3D Computer Vision. An natural extension is to reconstruct dynamic surfaces from videos, such as reconstructing sinus surfaces from endoscopic videos. However, since the camera is moving and the sinus surfaces are normally deformable and non-planar, the feature matching is usually unsatisfactory. We will employ a state-of-the-art feature matching strategy in the domain of minimally invasive image analysis. Instead of restricting inliers using a global affie transformation, multiple affine components are hierarchically clustered.
The main goal of this project is to prototypes the image matching and motion estimation and analyze their uncertainties. Specific tasks includes:
Figure 1. A full 3D reconstruction of a pediatric airway from video imagery acquired with a tracked endoscope. [Image from a NIH-funded project proposal with permission.]
[Descriptions in the following are cited from a NIH-funded project proposal with permission.]
The Hierarchical Multi-Afne (HMA) algorithm for fast and accurate feature matching is illustrated in the figure below. Its basic idea is to represent a plane or surface using multiple affine-transformation components.
Figure from G. A. Puerto-Souza and G. L. Mariottini. Hierarchical Multi-Affine (HMA) Algorithm for Fast and Accurate Feature Matching in Minimally-Invasive Surgical Images. 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems October 7-12, 2012. Vilamoura, Algarve, Portugal.
The comparison between SIFT raw matching and HMA matching is shown below.
(Patient data, distributed with permission.)
RANSAC detected outlier number vs. Total ‘matched’ feature number. Shown for both HMA and SIFT for a comparison.
The 3D reconstruction pipeline is shown in the figure followed.
Figure from [Mirota etal 2012]: D. Mirota, H. Wang, R. H. Taylor, M. Ishii, G. L. Gallia, and G. D. Hager. A System for Video-Based Navigation for Endoscopic Endonasal Skull Base Surgery. IEEE Trans. Med. Imaging, 31(4), 963-976 (2012).
The basic idea of empirical uncertainty analysis is to compute statistics such as variance and covariance from results in a number of experiments, either by cross validation or Monte Carlo simulation.
Estimated standard deviation of alpha, beta and gamma vs. feature number. Shown for both HMA and SIFT for a comparison.
Comparison of the estimated covariance matrix by HMA matching vs. SIFT matching. Images shown for the pair (frame 6, frame 7).
Projection error of the held-out query keypoint with HMA matching.
External libraries.
Camera calibration is performed by using Caltech Matlab calibration toolki.
SIFT features are extracted using VLfeat Matlab library.
HMA matching are performed using HMA Matlab toolbox.
RANSAC based essential matrix estimation is performed using OpenCVs findFundamentalMat.
Camera motion recovery from the essential matrix is done using Structure and Motion Matlab toolkit.
Patient Data
Patient data are collected at Johns Hopkins Hospital on December 19, 2012. The endoscopic video is hours long. A data collection system has been developed to simultaneously capture both the endoscopic video and external motion tracking data. However, data collection is out of the scope of this course project, which focuses on the algorithm design and testing.
The CT data is still unavailable till the end of this course project.
}
Here give list of other project files (e.g., source code) associated with the project. If these are online give a link to an appropriate external repository or to uploaded media files under this name space.