NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

3D Occupancy Reconstruction in Dynamic and Deforming Surgical Environments

https://doi.org/10.1109/ISMR63436.2024.10585941

Shah, Om; Su, Yun-Hsuan (June 2024, IEEE 2024 International Symposium on Medical Robotics (ISMR))

Vision dimensionality during minimally invasive surgery is a critical contributor to patient success. Traditional visualizations of the surgical scene are 2D camera streams that obfuscate depth perception inside the abdominal cavity. A lack of depth in surgical views cause surgeons to miss tissue targets, induce blood loss, and incorrectly assess deformation. 3D sensors, while offering key depth information, are expensive and often incompatible with current sterilization techniques. Furthermore, methods inferring a 3D space from stereoscopic video struggle with the inherent lack of unique features in the biological domain. We present an application of deep learning models that can assess simple binary occupancy from a single camera perspective to recreate the surgical scene in high-fidelity. Our quantitative results (IoU=O.82, log loss=0.346) indicate a strong representational capability for structure in surgical scenes, enabling surgeons to reduce patient injury during minimally invasive surgery.
more » « less
Full Text Available
Reducing annotating load: Active learning with synthetic images in surgical instrument segmentation

https://doi.org/10.1016/j.media.2024.103246

Peng, Haonan; Lin, Shan; King, Daniel; Su, Yun-Hsuan; Abuzeid, Waleed M; Bly, Randall A; Moe, Kris S; Hannaford, Blake (October 2024, Medical Image Analysis)

Accurate instrument segmentation in the endoscopic vision of minimally invasive surgery is challenging due to complex instruments and environments. Deep learning techniques have shown competitive performance in recent years. However, deep learning usually requires a large amount of labeled data to achieve accurate prediction, which poses a significant workload. To alleviate this workload, we propose an active learning-based framework to generate synthetic images for efficient neural network training. In each active learning iteration, a small number of informative unlabeled images are first queried by active learning and manually labeled. Next, synthetic images are generated based on these selected images. The instruments and backgrounds are cropped out and randomly combined with blending and fusion near the boundary. The proposed method leverages the advantage of both active learning and synthetic images. The effectiveness of the proposed method is validated on two sinus surgery datasets and one intraabdominal surgery dataset. The results indicate a considerable performance improvement, especially when the size of the annotated dataset is small. All the code is open-sourced at: https://github.com/HaonanPeng/active_syn_generator
more » « less
Full Text Available
Iterative Morphological Training Set Decomposition for Endoscopic Tool Segmentation

https://doi.org/10.1109/AIM55361.2024.10637195

Zhu, Yicheng; Wu, Xiaoyi; Tan, Sylvia; Sun, Cuiling; Saha, Sulagna; Su, Yun-Hsuan; Huang, Kevin (July 2024, 2024 IEEE International Conference on Advanced Intelligent Mechatronics (AIM))

This paper proposes a modified method for training tool segmentation networks for endoscopic images by parsing training images into two disjoint sets: one for rectangular representations of endoscopic images and one for polar. Previous work [1], [2] demonstrated that certain endoscopic images may be better segmented by a U-Net network trained on the original rectangular representation of images alone, and others performed better with polar representations. This work extends that observation to the training images and seeks to intelligently decompose the aggregate training data into disjoint image sets — one ideal for training a network to segment original, rectangular endoscopic images and the other for training a polar segmentation network. The training set decomposition consists of three stages: (1) initial data split and models, (2) image reallocation and transition mechanisms with retraining, and (3) evaluation. In (2), two separate frameworks for parsing polar vs. rectangular training images were investigated, with three switching metrics utilized in both. Experiments comparatively evaluated the segmentation performance (via Sørenson Dice coefficient) of the in-group and out-of-group images between the set-decomposed models. Results are encouraging, showing improved aggregate in-group Dice scores as well as image sets trending towards convergence.
more » « less
Full Text Available
Expanding the Surgical Robotics Community: An Intuitive Sim-to-Real Control Framework for Raven-II with a Budget-Friendly Gamepad Controller

https://doi.org/10.1109/ISMR63436.2024.10585789

Bui, Mai; Chalfant, Natalie; Sun, Cuiling; Fabrega, Sean; Peng, Haonan; Huang, Kevin; Su, Yun-Hsuan (June 2024, IEEE 2024 International Symposium on Medical Robotics (ISMR))

This paper proposes a low-cost interface and refined digital twin for the Raven-II surgical robot. Previous simulations of the Raven-II, e.g. via the Asynchronous Multibody Framework (AMBF), presented salient drawbacks, including control inputs inconsistent with Raven-II software, and lack of stable, high-fidelity physical contact simulations. This work bridges both of these gaps, both (1) enabling robust, simulated contact mechanics for dynamic physical interactions with the Raven-II, and (2) developing a universal input format for both simulated and physical platforms. The method furthermore proposes a low cost, commodity game-controller interface for controlling both virtual and real realizations of Raven-II, thus greatly reducing the barrier to access for Raven-II research and collaboration. Overall, this work aims to eliminate the inconsistencies between simulated and real representations of the Raven-II. Such a development can expand the reach of surgical robotics research. Namely, providing end-to-end transparency between the simulated AMBF and physical Raven-II platforms enables a software testbed previously unavailable, e.g. for training real surgeons, for creating digital synthetic datasets, or for prototyping novel architectures like shared control strategies. Experiments validate this transparency by comparing joint trajectories between digital twin and physical testbed given identical inputs. This work may be extended and incorporated into recent efforts in developing modular or common software infrastructures for both simulation and control of real robotic devices, such as the Collaborative Robotics Toolkit (CRTK).
more » « less
Full Text Available
Biometric Signature Authentication with Low Cost Embedded Stylus

https://doi.org/10.1109/AIM46323.2023.10196285

Subedi, Divas; Chitrakar, Digesh; Yung, Isabella; Zhu, Yicheng; Su, Yun-Hsuan; Huang, Kevin (June 2023, IEEE)
Smoothness Constrained Curiosity Driven Multicamera Trajectory Optimization for Robot-Assisted Minimally Invasive Surgery

https://doi.org/10.1109/ISMR57123.2023.10130237

Subedi, Divas; Jiang, Wenfan; Rahman, Ramisa Tahsin; Zhang, Heidi; Huang, Kevin; Su, Yun-Hsuan (April 2023, IEEE)
A Deep-Learning Approach to Marble-Burying Quantification: Image Segmentation of Marbles and Bedding

https://doi.org/10.1109/SII55687.2023.10039320

Zhu, Yicheng; Hudson, Brandon; Chakraborttii, Chandranil; Su, Yun-Hsuan; Huang, Kevin (January 2023, 2023 IEEE/SICE International Symposium on System Integration (SII))

Full Text Available
Surgical Tool Segmentation with Pose-Informed Morphological Polar Transform of Endoscopic Images

https://doi.org/10.1142/S2424905X22410033

Huang, Kevin; Chitrakar, Digesh; Jiang, Wenfan; Yung, Isabella; Su, Yun-Hsuan (June 2022, Journal of Medical Robotics Research)

This paper presents a tool-pose-informed variable center morphological polar transform to enhance segmentation of endoscopic images. The representation, while not loss-less, transforms rigid tool shapes into morphologies consistently more rectangular that may be more amenable to image segmentation networks. The proposed method was evaluated using the U-Net convolutional neural network, and the input images from endoscopy were represented in one of the four different coordinate formats (1) the original rectangular image representation, (2) the morphological polar coordinate transform, (3) the proposed variable center transform about the tool-tip pixel and (4) the proposed variable center transform about the tool vanishing point pixel. Previous work relied on the observations that endoscopic images typically exhibit unused border regions with content in the shape of a circle (since the image sensor is designed to be larger than the image circle to maximize available visual information in the constrained environment) and that the region of interest (ROI) was most ideally near the endoscopic image center. That work sought an intelligent method for, given an input image, carefully selecting between methods (1) and (2) for best image segmentation prediction. In this extension, the image center reference constraint for polar transformation in method (2) is relaxed via the development of a variable center morphological transformation. Transform center selection leads to different spatial distributions of image loss, and the transform-center location can be informed by robot kinematic model and endoscopic image data. In particular, this work is examined using the tool-tip and tool vanishing point on the image plane as candidate centers. The experiments were conducted for each of the four image representations using a data set of 8360 endoscopic images from real sinus surgery. The segmentation performance was evaluated with standard metrics, and some insight about loss and tool location effects on performance are provided. Overall, the results are promising, showing that selecting a transform center based on tool shape features using the proposed method can improve segmentation performance.
more » « less
Full Text Available
Camera Configuration Models for Machine Vision Based Force Estimation in Robot-Assisted Soft Body Manipulation

https://doi.org/10.1109/ISMR48347.2022.9807587

Liu, Wenjun; Pickett, Anna; Huang, Kevin; Su, Yun-Hsuan (April 2022, IEEE)
Deep Curiosity Driven Multicamera 3D Viewpoint Adjustment for Robot-Assisted Minimally Invasive Surgery

https://doi.org/10.1109/ICRA46639.2022.9812413

Su, Yun-Hsuan; Zhang, Heidi; Jiang, Wenfan; Ngo, Khanh; Huang, Kevin (May 2022, IEEE)

« Prev Next »

Search for: All records