skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Award ID contains: 2007661

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. ObjectivesPatients with primary muscle tension dysphonia (pMTD) commonly report paralaryngeal pain and discomfort, and extrinsic laryngeal muscle (ELM) tension and hyperfunction are commonly implicated. However, quantitative physiological metrics to study ELM movement patterns for the characterization of pMTD diagnosis and monitoring of treatment progress are lacking. The objectives of this study were to validate motion capture (MoCap) technology to study ELM kinematics, determine whether MoCap could distinguish ELM tension and hyperfunction between individuals with and without pMTD, and investigate relationships between common clinical voice metrics and ELM kinematics. MethodsThirty subjects (15 with pMTD and 15 controls) were recruited for the study. Sixteen markers were placed on different anatomical landmarks on the chin and anterior neck. Movements across these regions were tracked during four voice and speech tasks using two three‐dimensional cameras. Movement displacement and variability were determined based on 16 key‐points and 53 edges. ResultsIntraclass correlation coefficients demonstrated high intra‐ and inter‐rater reliability (p's < 0.001). Other than greater movement displacements around the thyrohyoid space during longer phrasing (reading passage, 30‐s diadochokinetics) and more movement variability in patients with pMTD, kinematic patterns between groups were similar across the 53 edges for the four voice and speech tasks. There were also no significant correlations between ELM kinematics and standard voice metrics. ConclusionResults demonstrate the feasibility and reliability of MoCap for the study of ELM kinematics. Level of Evidence3Laryngoscope, 133:3472–3481, 2023 
    more » « less
  2. Abstract It remains an interesting and challenging problem to synthesize a vivid and realistic singing face driven by music. In this paper, we present a method for this task with natural motions for the lips, facial expression, head pose, and eyes. Due to the coupling of mixed information for the human voice and backing music in common music audio signals, we design a decouple-and-fuse strategy to tackle the challenge. We first decompose the input music audio into a human voice stream and a backing music stream. Due to the implicit and complicated correlation between the two-stream input signals and the dynamics of the facial expressions, head motions, and eye states, we model their relationship with an attention scheme, where the effects of the two streams are fused seamlessly. Furthermore, to improve the expressivenes of the generated results, we decompose head movement generation in terms of speed and direction, and decompose eye state generation into short-term blinking and long-term eye closing, modeling them separately. We have also built a novel dataset, SingingFace, to support training and evaluation of models for this task, including future work on this topic. Extensive experiments and a user study show that our proposed method is capable of synthesizing vivid singing faces, qualitatively and quantitatively better than the prior state-of-the-art. 
    more » « less
  3. We present a novel topology-preserving 3D medial axis computation framework based on volumetric restricted power diagram (RPD), while preserving the medial features and geometric convergence simultaneously, for both 3D CAD and organic shapes. The volumetric RPD discretizes the input 3D volume into sub-regions given a set of medial spheres. With this intermediate structure, we convert the homotopy equivalency between the generated medial mesh and the input 3D shape into a localized contractibility checking for each restricted element (power cell, power face, power edge), by checking their connected components and Euler characteristics. We further propose a fractional Euler characteristic algorithm for efficient GPU-based computation of Euler characteristic for each restricted element on the fly while computing the volumetric RPD. Compared with existing voxel-based or point-cloud-based methods, our approach is the first to adaptively and directly revise the medial mesh without globally modifying the dependent structure, such as voxel size or sampling density, while preserving its topology and medial features. In comparison with the feature preservation method MATFP [Wang et al. 2022], our method provides geometrically comparable results with fewer spheres and more robustly captures the topology of the input 3D shape. 
    more » « less
    Free, publicly-accessible full text available December 19, 2025
  4. Free, publicly-accessible full text available December 3, 2025
  5. In mesh simplification, common requirements like accuracy, triangle quality, and feature alignment are often considered as a trade-off. Existing algorithms concentrate on just one or a few specific aspects of these requirements. For example, the well-known Quadric Error Metrics (QEM) approach [Garland and Heckbert 1997] prioritizes accuracy and can preserve strong feature lines/points as well, but falls short in ensuring high triangle quality and may degrade weak features that are not as distinctive as strong ones. In this paper, we propose a smooth functional that simultaneously considers all of these requirements. The functional comprises a normal anisotropy term and a Centroidal Voronoi Tessellation (CVT) [Du et al. 1999] energy term, with the variables being a set of movable points lying on the surface. The former inherits the spirit of QEM but operates in a continuous setting, while the latter encourages even point distribution, allowing various surface metrics. We further introduce a decaying weight to automatically balance the two terms. We selected 100 CAD models from the ABC dataset [Koch et al. 2019], along with 21 organic models, to compare the existing mesh simplification algorithms with ours. Experimental results reveal an important observation: the introduction of a decaying weight effectively reduces the conflict between the two terms and enables the alignment of weak features. This distinctive feature sets our approach apart from most existing mesh simplification methods and demonstrates significant potential in shape understanding. Please refer to the teaser figure for illustration. 
    more » « less