Summary
Aiming to Build a robust system to assist surgeons especially novice surgeons perform at expert-level by a simple query “Do something(task) somewhere(task context) as an expert”.
Motivations, Specific Aims
Motivations
Skill is among the strongest predictors for patient outcome:
The higher surgeon skill, the better the outcome
Empower novice surgeons:
“Translate” beginner-level commands to expert-level proficiency
Recent deep learning algorithm is powerful but vulnerable:
Deep learning algorithms dominate large amount of benchmarks but suffer from generalization issues and low interpretability.
Causality brings more robustness and interpretability:
With casual relations provided and causal inference mechanisms, we might be able to make the algorithm more robust and easier to interpret.
Specific Aims
Data:
Build a suitable dataset for causal inference study in surgery area.
Understandings:
Explore the difference between novice- and expert- level commands.
Algorithm:
Explore methods to incorporate causal inference mechanisms into deep learning algorithms of kinematic prediction for more robustness and interpretability.
Deliverables
Technical Approach
Overall Approach
data: prerequisites of the understandings and the algorithm
Understandings: gain understanding from data, and help build the algorithm
Algorithm: Train on the data.
Target: Temporally align JIGSAW samples for causal analysis
Works to be done:
Technical Approach – understanding
Do a statistical analysis for the motions performed by surgeons at different levels
Analyze the cause of failure and different performance.
Technical Approach – Kinematics prediction network
Architecture: Basic Transformer Network
Input:
Task information: task id and its context
Expected skill level: novice- or expert- level
Kinematics of previous frames
Output: Kinematics of future frames
Dependencies
Milestones and Status
Manually segment and pair videos. (Finished on March 14th)
Aligning videos segments (Finished on March 20th)
supplementary annotation (Finished on April 4th)
Statistical analysis for some properties of the trajectory (Finished on April 10th)
Kinematic Prediction Network (Finished on May 4th)
Incorporate Causal inference (Put into future plan)
Reports and presentations
Project Plan
Project Background Reading
Project Checkpoint
Paper Seminar Presentations
Project Final Presentation
Project Final Report
Project Bibliography
Reading List
Murat Kocaoglu, Christopher Snyder, Alexandros G. Dimakis, and Sriram Vishwanath. Causalgan: Learning causal implicit generative models with adversarial training. In 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Conference Track Proceedings. OpenReview.net, 2018
hristos Louizos, Uri Shalit, Joris M. Mooij, David A. Sontag, Richard S. Zemel, and Max Welling. Causal effect inference with deep latent-variable models. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, December 4-9, 2017, Long Beach, CA, USA, pages 6446–6456, 2017
Nick Pawlowski, Daniel Coelho de Castro, and Ben Glocker. Deep structural causal models for tractable counterfactual inference. In Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual, 2020.
Yu Kong and Yun Fu. Human action recognition and prediction: A survey. CoRR, abs/1806.11230, 2018
Reference
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Lukasz Kaiser, and Illia Polosukhin. Attention is all you need. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, December 4-9, 2017, Long Beach, CA, USA, pages 5998–6008, 2017.
Yixin Gao, S. Swaroop Vedula, Carol E. Reiley, Narges Ahmidi, Balakrishnan Varadarajan, Henry C. Lin, Lingling Tao, Luca Zappella, Benjam ́ın B ́ejar, David D. Yuh, Chi Chiung Grace Chen, Ren ́e Vidal, Sanjeev Khudanpur and Gregory D. Hager, The JHU-ISI Gesture and Skill Assessment Working Set (JIGSAWS): A Surgical Activity Dataset for Human Motion Modeling, In Modeling and Monitoring of Computer Assisted Interventions (M2CAI) – MICCAI Workshop, 2014.
Narges Ahmidi, Lingling Tao, Shahin Sefati, Yixin Gao, Colin Lea, Benjamın Bejar Haro, Luca Zappella, Sanjeev Khudanpur, Rene Vidal, Fellow, IEEE, Gregory D. Hager, A Dataset and Benchmarks for Segmentation and Recognition of Gestures in Robotic Surgery, Transaction of Biomedical Engineering, 2017.
Other Resources and Project Files