From generalization to precision: exploring SAM for tool segmentation in surgical environments
2 April 2024
Kanyifeechukwu J. Oguine, Roger D. Soberanis-Mukul, Nathan Drenkow, Mathias Unberath
Abstract
Purpose: Accurate tool segmentation is essential in computer-aided procedures. However, the task is challenging because of image artifacts and the limited training data available in medical scenarios. Methods that generalize to unseen data are a promising avenue, and zero-shot segmentation offers one way to cope with limited data. Initial exploratory work with the Segment Anything Model (SAM) shows that bounding-box-based prompting yields notable zero-shot generalization. Point-based prompting, however, performs worse, and its performance deteriorates further under image corruption. We argue that SAM drastically over-segments images with high corruption levels. This degrades performance when only a single segmentation mask is considered, whereas combining the masks that overlap the object of interest produces an accurate prediction.

Methods: We use SAM to generate over-segmented predictions of endoscopic frames. We then use the ground-truth tool mask to analyze SAM's results in two settings: when the best single mask is selected as the prediction, and when all individual masks overlapping the object of interest are combined into the final predicted mask. We analyze the Endovis18 and Endovis17 instrument segmentation datasets under synthetic corruptions of various strengths, as well as an in-house dataset featuring counterfactually created real-world corruptions.

Results: Combining the over-segmented masks improves the IoU. Furthermore, selecting the best single segmentation yields a competitive IoU score on clean images.

Conclusions: Combined SAM predictions show improved results and robustness up to a certain corruption level. However, appropriate prompting strategies are fundamental for implementing these models in the medical domain.
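
The two evaluation settings described in the abstract (best single mask versus the combination of all masks overlapping the tool) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The abstract does not state how "overlapping" is decided, so the `min_inside` threshold, the set-of-pixels mask representation, and all function names below are assumptions made for illustration only.

```python
from typing import List, Set, Tuple

Pixel = Tuple[int, int]
Mask = Set[Pixel]  # a binary mask stored as the set of its foreground (row, col) pixels


def iou(pred: Mask, target: Mask) -> float:
    """Intersection over union of two binary masks (0.0 if both are empty)."""
    union = len(pred | target)
    return len(pred & target) / union if union else 0.0


def best_single_mask(masks: List[Mask], gt: Mask) -> Mask:
    """Return the one candidate mask with the highest IoU against the ground truth."""
    return max(masks, key=lambda m: iou(m, gt), default=set())


def combine_overlapping(masks: List[Mask], gt: Mask, min_inside: float = 0.5) -> Mask:
    """Union of every candidate whose fraction of pixels inside gt is >= min_inside.

    min_inside is a hypothetical overlap criterion, not taken from the paper.
    """
    combined: Mask = set()
    for m in masks:
        if m and len(m & gt) / len(m) >= min_inside:
            combined |= m
    return combined


def rect(r0: int, r1: int, c0: int, c1: int) -> Mask:
    """Helper for the toy example: a filled rectangle of pixels."""
    return {(r, c) for r in range(r0, r1) for c in range(c0, c1)}


# Toy over-segmentation: the tool is split into two halves, plus one background region.
gt = rect(0, 4, 0, 4)        # 16-pixel ground-truth tool
masks = [
    rect(0, 4, 0, 2),        # left half of the tool (8 px)
    rect(0, 4, 2, 4),        # right half of the tool (8 px)
    rect(4, 8, 0, 8),        # background region, no overlap with the tool
]

best_iou = iou(best_single_mask(masks, gt), gt)
combined_iou = iou(combine_overlapping(masks, gt), gt)
print(best_iou, combined_iou)  # 0.5 1.0
```

In this toy case each half of the tool alone reaches an IoU of 0.5, while the union of the two overlapping masks recovers the full tool. Both functions rely on the ground-truth mask, as in the analysis described above, so they measure what SAM's over-segmentation could achieve rather than providing a deployable selection strategy.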
© (2024) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Kanyifeechukwu J. Oguine, Roger D. Soberanis-Mukul, Nathan Drenkow, and Mathias Unberath "From generalization to precision: exploring SAM for tool segmentation in surgical environments", Proc. SPIE 12926, Medical Imaging 2024: Image Processing, 1292603 (2 April 2024); https://doi.org/10.1117/12.3006981
KEYWORDS
Image segmentation
Medical imaging
Binary data
Contour modeling
Endoscopy
