NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

More isn’t always better: Technology in the intensive care unit

https://doi.org/10.1097/HMR.0000000000000398

Olsen, Esther; Novikov, Zhanna; Sakata, Theadora; Lambert, Monique H; Lorenzo, Javier; Bohn, Roger; Singer, Sara J (February 2024, Health Care Management Review)

BackgroundClinical care in modern intensive care units (ICUs) combines multidisciplinary expertise and a complex array of technologies. These technologies have clearly advanced the ability of clinicians to do more for patients, yet so much equipment also presents the possibility for cognitive overload. PurposeThe aim of this study was to investigate clinicians’ experiences with and perceptions of technology in ICUs. Methodology/ApproachWe analyzed qualitative data from 30 interviews with ICU clinicians and frontline managers within four ICUs. ResultsOur interviews identified three main challenges associated with technology in the ICU: (a) too many technologies and too much data; (b) inconsistent and inaccurate technologies; and (c) not enough integration among technologies, alignment with clinical workflows, and support for clinician identities. To address these challenges, interviewees highlighted mitigation strategies to address both social and technical systems and to achieve joint optimization. ConclusionWhen new technologies are added to the ICU, they have potential both to improve and to disrupt patient care. To successfully implement technologies in the ICU, clinicians’ perspectives are crucial. Understanding clinicians’ perspectives can help limit the disruptive effects of new technologies, so clinicians can focus their time and attention on providing care to patients. Practice ImplicationsAs technology and data continue to play an increasingly important role in ICU care, everyone involved in the design, development, approval, implementation, and use of technology should work together to apply a sociotechnical systems approach to reduce possible negative effects on clinical care for critically ill patients.
more » « less
Full Text Available
A human mesh-centered approach to action recognition in the operating room

https://doi.org/10.20517/ais.2024.19

Liu, Benjamin; Soenens, Gilles; Villarreal, Joshua; Jopling, Jeffrey; Van_Herzeele, Isabelle; Rau, Anita; Yeung-Levy, Serena (June 2024, Artificial Intelligence Surgery)

Aim: Video review programs in hospitals play a crucial role in optimizing operating room workflows. In scenarios where split-seconds can change the outcome of a surgery, the potential of such programs to improve safety and efficiency is profound. However, leveraging this potential requires a systematic and automated analysis of human actions. Existing methods predominantly employ manual methods, which are labor-intensive, inconsistent, and difficult to scale. Here, we present an AI-based approach to systematically analyze the behavior and actions of individuals from operating rooms (OR) videos. Methods: We designed a novel framework for human mesh recovery from long-duration surgical videos by integrating existing human detection, tracking, and mesh recovery models. We then trained an action recognition model to predict surgical actions from the predicted temporal mesh sequences. To train and evaluate our approach, we annotated an in-house dataset of 864 five-second clips from simulated surgical videos with their corresponding actions. Results: Our best model achieves an F1 score and the area under the precision-recall curve (AUPRC) of 0.81 and 0.85, respectively, demonstrating that human mesh sequences can be successfully used to recover surgical actions from operating room videos. Model ablation studies suggest that action recognition performance is enhanced by composing human mesh representations with lower arm, pelvic, and cranial joints. Conclusion: Our work presents promising opportunities for OR video review programs to study human behavior in a systematic, scalable manner.
more » « less
Full Text Available
Exploring human–artificial intelligence interactions in a negative pragmatic trial of computer-aided polyp detection

https://doi.org/10.1016/j.igie.2024.04.016

Watkins, Kate; Ladabaum, Uri; Olsen, Esther; Hoogerbrug, Jonathan; Mannalithara, Ajitha; Weng, Yingjie; Shaw, Blake; Bohn, Roger; Singer, Sara (June 2024, iGIE)

Full Text Available
Addressing 6 challenges in generative AI for digital health: A scoping review

Templin, T; Perez, M; Sylvia, S; Leek, J; Sinnott-Armstrong, N (May 2024, PLOS digital health)

Full Text Available
VideoAgent: Long-form Video Understanding with Large Language Model as Agent

Wang, X; Zhang, Y; Zohar, O; Yeung-Levy, S (March 2024, European Conference on Computer Vision)

Full Text Available
Artificial Intelligence Identifies Factors Associated with Blood Loss and Surgical Experience in Cholecystectomy

https://doi.org/10.1056/AIoa2300088

Aklilu, Josiah G; Sun, Min Woo; Goel, Shelly; Bartoletti, Sebastiano; Rau, Anita; Olsen, Griffin; Hung, Kay S; Mintz, Sophie L; Luong, Vicki; Milstein, Arnold; et al (January 2024, NEJM AI)

Full Text Available
Diffusion-HPC: Synthetic data generation for human mesh recovery in challenging domains.

Weng, Z; Sánchez, L B; Yeung-Levy, S (December 2023, International Conference on 3D Vision)

Full Text Available
Awakening and breathing coordination: A mixed-methods analysis of determinants of implementation

https://doi.org/10.1513/AnnalsATS.202212-1048OC

Olsen, Griffin H; Gee, Perry M; Wolfe, Doug; Winberg, Carrie; Carpenter, Lori; Jones, Chris; Jacobs, Jason R; Leither, Lindsay; Peltan, Ithan D; Singer, Sara J; et al (July 2023, Annals of the American Thoracic Society)

Full Text Available
Computer-aided Detection of Polyps Does Not Improve Colonoscopist Performance in a Pragmatic Implementation Trial

https://doi.org/10.1053/j.gastro.2022.12.004

Ladabaum, Uri; Shepard, John; Weng, Yingjie; Desai, Manisha; Singer, Sara J.; Mannalithara, Ajitha (March 2023, Gastroenterology)

Full Text Available
MOMA-LRG: Language-Refined Graphs for Multi-Object Multi-Actor Activity Parsing

Luo, Zelun; Durante, Zane; Li, Linden; Xie, Wanze; Liu, Ruochen; Jin, Emily; Huang, Zhuoyi; Li, Lun Yu; Wu, Jiajun; Niebles, Juan Carlos; et al (January 2022, Neural Information Processing Systems (NeurIPS) Datasets and Benchmarks Track)

Video-language models (VLMs), large models pre-trained on numerous but noisy video-text pairs from the internet, have revolutionized activity recognition through their remarkable generalization and open-vocabulary capabilities. While complex human activities are often hierarchical and compositional, most existing tasks for evaluating VLMs focus only on high-level video understanding, making it difficult to accurately assess and interpret the ability of VLMs to understand complex and fine-grained human activities. Inspired by the recently proposed MOMA framework, we define activity graphs as a single universal representation of human activities that encompasses video understanding at the activity, sub10 activity, and atomic action level. We redefine activity parsing as the overarching task of activity graph generation, requiring understanding human activities across all three levels. To facilitate the evaluation of models on activity parsing, we introduce MOMA-LRG (Multi-Object Multi-Actor Language-Refined Graphs), a large dataset of complex human activities with activity graph annotations that can be readily transformed into natural language sentences. Lastly, we present a model-agnostic and lightweight approach to adapting and evaluating VLMs by incorporating structured knowledge from activity graphs into VLMs, addressing the individual limitations of language and graphical models. We demonstrate a strong performance on activity parsing and few-shot video classification, and our framework is intended to foster future research in the joint modeling of videos, graphs, and language.
more » « less
Full Text Available

« Prev Next »

Search for: All records