Contact Us
CiiS Lab
Johns Hopkins University
112 Hackerman Hall
3400 N. Charles Street
Baltimore, MD 21218
Directions
Lab Director
Russell Taylor
127 Hackerman Hall
rht@jhu.edu
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| courses:456:2023:projects:456-2023-12:project-12 [2023/05/11 03:47] – [Milestones and Status] jmangul1 | courses:456:2023:projects:456-2023-12:project-12 [2023/05/11 12:40] (current) – [Finalized Approach] jmangul1 | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| ======Evaluation of CT Registration for Image-Based Sinus Reconstruction====== | ======Evaluation of CT Registration for Image-Based Sinus Reconstruction====== | ||
| - | **Last updated: | + | **Last updated: |
| ======Summary====== | ======Summary====== | ||
| - | This project intends to evaluate the accuracy of an existing image-based 3D reconstruction pipeline of the sinus anatomy by implementing a framework for global and local registration to the ground-truth CT scan. The initial evaluation of the pipeline will then serve as a baseline for subsequent changes to account for uncertainties and integrate robot kinematics. | + | This project intends to evaluate the accuracy of an existing image-based 3D reconstruction pipeline of the sinus anatomy by implementing a framework for global and local registration to the ground-truth CT scan. The initial evaluation of the pipeline will then serve as a baseline for subsequent changes to investigate the influence of depth uncertainties and integrate robot kinematics. |
| * **Students: | * **Students: | ||
| Line 25: | Line 25: | ||
| The main goal of this project is to implement a quantitative framework to evaluate the accuracy of the dense reconstruction based on the ground truth CT. The development of this assessment and framework would enable further research towards the usage of the sinus reconstruction pipeline in clinical settings. The specific aims of this project are listed as follows: | The main goal of this project is to implement a quantitative framework to evaluate the accuracy of the dense reconstruction based on the ground truth CT. The development of this assessment and framework would enable further research towards the usage of the sinus reconstruction pipeline in clinical settings. The specific aims of this project are listed as follows: | ||
| - Implement a rigid registration framework to evaluate the image-based 3D reconstruction of the sinus anatomy with respect to the corresponding CT image. | - Implement a rigid registration framework to evaluate the image-based 3D reconstruction of the sinus anatomy with respect to the corresponding CT image. | ||
| - | - Evaluate global | + | - Integrate multiple |
| - | - Evaluate local registration of specific anatomical regions of interest in the reconstruction. | + | - Report error evaluation metrics |
| - | - Implement methods to report | + | |
| - Analyze the influence of uncertainty in the reconstruction pipeline, and evaluate the resulting reconstruction with respect to the CT. | - Analyze the influence of uncertainty in the reconstruction pipeline, and evaluate the resulting reconstruction with respect to the CT. | ||
| - | - Analyze the distribution of uncertainties in depth estimation for features present | + | - Adjust |
| - | - Integrate probabilistic model to adjust the influence | + | - Evaluation |
| - | - Integrate robot kinematics | + | - Transactions |
| + | - Process additional cadaveric data in dense reconstruction pipeline | ||
| + | - Ablation experiments of dense reconstruction pipeline. | ||
| ======Deliverables====== | ======Deliverables====== | ||
| Line 56: | Line 56: | ||
| - | ======Technical Approach====== | + | ====== Technical Approach ====== |
| This project requires input data of endoscopic video sequences and CT scans of the same sinus anatomy. The data utilized for the maximum goal requires sequences obtained using the Galen robot to retrieve corresponding robot kinematics at the time of video capture. The endoscopic sequence for the dense reconstruction pipeline and the resulting 3D structure will be used for the registration with the corresponding CT scan to report an accuracy evaluation. | This project requires input data of endoscopic video sequences and CT scans of the same sinus anatomy. The data utilized for the maximum goal requires sequences obtained using the Galen robot to retrieve corresponding robot kinematics at the time of video capture. The endoscopic sequence for the dense reconstruction pipeline and the resulting 3D structure will be used for the registration with the corresponding CT scan to report an accuracy evaluation. | ||
| Line 65: | Line 65: | ||
| {{ : | {{ : | ||
| //Figure 2. Proposed workflow of planned modifications and implementations shown in blue.// | //Figure 2. Proposed workflow of planned modifications and implementations shown in blue.// | ||
| + | |||
| + | Based on initial registration results and changes in the deliverables, | ||
| + | |||
| + | {{ : | ||
| + | // Figure 3. Final registration workflow. // | ||
| + | |||
| + | ====Data Preprocessing==== | ||
| + | Registration is dependent on input data of the dense reconstruction (DRECO) pipeline output (processed image sequence, estimated camera trajectories from SfM, and fused mesh of sinus anatomy), the ground-truth anatomical structure of the sinus from the corresponding CT scan and optical tracking data of the endoscope camera. The collected video sequence was preprocessed for use in the DRECO pipeline by curating the images to isolate the subsequence of frames that capture the sinus cavity. The input image frames were also downsampled and undistorted based on checkerboard camera calibration. The preprocessed images were then used as input to the DRECO pipeline. The CT scans were processed using 3D Slicer to segment both the attached marker spheres and sinus anatomy. Instructions for segmentation can be found on the GitHub repository [[https:// | ||
| + | |||
| + | |||
| ===== Registration Framework ===== | ===== Registration Framework ===== | ||
| + | |||
| + | ==== Proposed Plan ==== | ||
| I plan to integrate rigid registration methods including the iterative closest point (ICP) algorithm and iterative most likely point algorithm to register the dense reconstruction to the corresponding CT image. The iterative most likely point algorithm and variations will be integrated using the cisstICP library available on [[https:// | I plan to integrate rigid registration methods including the iterative closest point (ICP) algorithm and iterative most likely point algorithm to register the dense reconstruction to the corresponding CT image. The iterative most likely point algorithm and variations will be integrated using the cisstICP library available on [[https:// | ||
| Additionally, | Additionally, | ||
| + | |||
| + | ==== Finalized Approach ==== | ||
| + | Direct rigid registration of sampled point clouds from the DRECO and CT meshes fails since the dense reconstruction only constructs a portion of the segmented anatomy present in the CT mesh. Therefore, this framework required advanced options including camera pose, keypoint, and coherent point drift registration to align the meshes. | ||
| + | === Camera Pose Registration === | ||
| + | The segmented anatomy marker spheres are required to obtain the ground-truth positions of the endoscope camera in CT space. Based on the segmentation, | ||
| + | |||
| + | The tracked positions were also manually adjusted by comparing the recorded images and CT renderings when large errors were observed (camera position was outside of the anatomy). This alignment transformation was applied at the beginning of the chain in Equation 1. | ||
| + | The DRECO pipeline estimates the camera trajectories of each input image frame in the SfM algorithm which can be matched to the corresponding ground- truth position of the endoscope for rigid registration. The resulting transforma- tion was then used to transform the DRECO mesh to CT space and initialize the iterative closest point algorithm for Video-CT registration. | ||
| + | The estimated camera trajectories were observed to have large variations com- pared to the ground-truth, | ||
| + | {{ : | ||
| + | //Figure 4. Camera pose registration.// | ||
| + | |||
| + | === Coherent Point Drift Registration === | ||
| + | In addition to rigid registration methods, the Coherent Point Drift (CPD) reg- istration algorithm was also investigated. CPD is a probabilistic method integrated for rigid and affine point cloud registration between sampled points from the dense reconstruction and CT meshes. This method optimizes regis- tration based on the most likely shape of the DRECO mesh within the CT structure, considering that the computed mesh only represents the section of the sinus anatomy visible in the input video. | ||
| + | |||
| + | === Comparison of Registration Types === | ||
| + | |||
| + | Table 1 displays the mean errors of the various registration algorithms. Rigid camera pose registration using ICP had the lowest translational error, ranging from 1 to 3mm differences, | ||
| + | |||
| + | {{ : | ||
| + | //Table 1. Comparison of various registration types using the entire image se- quence (indexes 0 - 1059) and multiple sections for local registration reported as the mean across poses, pixels, and sampled points for camera pose, scale invariant depth, and mesh distance errors, respectively.// | ||
| + | |||
| + | The CPD rigid and affine registration algorithms have lower errors in the mean distance between meshes but based on visual inspection of the layered meshes and depth renderings, this does not seem to mean that the anatomy is more aligned. The smaller magnitudes may be a result of the intricate sinus anatomy as the closest points between the meshes do not necessarily correspond to the same points within the sinus cavity. | ||
| + | |||
| + | It is expected that the camera pose + ICP registrations have lower pose errors and the CPD registrations have lower mesh errors because these algorithms com- pute the transformation by minimizing those parameters. The scale invariant depth error serves as a metric independent of the registration. Since the camera pose + ICP registration type had the smallest error in the depth renderings, this algorithm was used to further investigate adjustments to the depth fusion step in the dense reconstruction pipeline. | ||
| ===== Influence of Uncertainties in Dense Reconstruction Pipeline ===== | ===== Influence of Uncertainties in Dense Reconstruction Pipeline ===== | ||
| - | The dense reconstruction pipeline utilizes depth estimations in addition the SfM point cloud and camera trajectories to generate the 3D structure. This information is integrated into a fusion method which resolves variation between the estimates of common points in multiple frames of the input sequence. The fusion method currently considers every estimate equally; however, the points in the sinus anatomy that are further away from the camera when the image is captured is shown to have more uncertainty as seen in Figure | + | The dense reconstruction pipeline utilizes depth estimations in addition the SfM point cloud and camera trajectories to generate the 3D structure. This information is integrated into a fusion method which resolves variation between the estimates of common points in multiple frames of the input sequence. The fusion method currently considers every estimate equally; however, the points in the sinus anatomy that are further away from the camera when the image is captured is shown to have more uncertainty as seen in Figure |
| {{ : | {{ : | ||
| - | // | + | // |
| - | We hypothesize that this uncertainty may be introducing errors which are propagated into the reconstruction. I plan to analyze these uncertainties by examining | + | We hypothesize that this uncertainty may be introducing errors which are propagated into the reconstruction. |
| + | outside of the 68th percentile (one standard deviation) | ||
| + | {{ : | ||
| + | //Figure 6. The resulting dense reconstructions with different adjustment schemes (from left to right): original, weighting by uncertainty, | ||
| + | // | ||
| ===== Integration of Robot Kinematics ===== | ===== Integration of Robot Kinematics ===== | ||
| Line 150: | Line 191: | ||
| * {{: | * {{: | ||
| * Project Final Presentation | * Project Final Presentation | ||
| - | * {{: | + | * {{: |
| * Project Final Report | * Project Final Report | ||
| * {{: | * {{: | ||
| - | | + | |
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | * [[https:// | ||
| + | |||
| ======Project Bibliography======= | ======Project Bibliography======= | ||
| Line 175: | Line 222: | ||
| **[10]** B. Curless and M. Levoy, "A volumetric method for building complex models from range images," | **[10]** B. Curless and M. Levoy, "A volumetric method for building complex models from range images," | ||
| + | |||
| ======Other Resources and Project Files====== | ======Other Resources and Project Files====== | ||
| Here give list of other project files (e.g., source code) associated with the project. | Here give list of other project files (e.g., source code) associated with the project. | ||