Abstract PurposeTo demonstrate speech‐production real‐time MRI (RT‐MRI) using a contemporary 0.55T system, and to identify opportunities for improved performance compared with conventional field strengths. MethodsExperiments were performed on healthy adult volunteers using a 0.55T MRI system with high‐performance gradients and a custom 8‐channel upper airway coil. Imaging was performed using spiral‐based balancedSSFPand gradient‐recalled echo (GRE) pulse sequences using a temporal finite‐difference constrained reconstruction. Speech‐production RT‐MRI was performed with three spiral readout durations (8.90, 5.58, and 3.48 ms) to determine trade‐offs with respect to articulator contrast, blurring, banding artifacts, and overall image quality. ResultsBoth spiral GRE and bSSFP captured tongue boundary dynamics during rapid consonant‐vowel syllables. Although bSSFP provided substantially higher SNR in all vocal tract articulators than GRE, it suffered from banding artifacts at TR > 10.9 ms. Spiral bSSFP with the shortest readout duration (3.48 ms, TR = 5.30 ms) had the best image quality, with a 1.54‐times boost in SNR compared with an equivalent GRE sequence. Longer readout durations led to increased SNR efficiency and blurring in both bSSFP and GRE. ConclusionHigh‐performance 0.55T MRI systems can be used for speech‐production RT‐MRI. Spiral bSSFP can be used without suffering from banding artifacts in vocal tract articulators, provide better SNR efficiency, and have better image quality than what is typically achieved at 1.5 T or 3 T.
more »
« less
Localizing Concurrent Sound Sources with Binaural Microphones: A Simulation Study
The human auditory system can localize multiple sound sources using time, intensity, and frequency cues in the sound received by the two ears. Being able to spatially segregate the sources helps perception in a challenging condition when multiple sounds coexist. This study used model simulations to explore an algorithm for localizing multiple sources in azimuth with binaural (i.e., two) microphones. The algorithm relies on the “sparseness” property of daily signals in the time-frequency domain, and sound coming from different locations carrying unique spatial features will form clusters. Based on an interaural normalization procedure, the model generated spiral patterns for sound sources in the frontal hemifield. The model itself was created using broadband noise for better accuracy, because speech typically has sporadic energy at high frequencies. The model at an arbitrary frequency can be used to predict locations of speech and music that occurred alone or concurrently, and a classification algorithm was applied to measure the localization error. Under anechoic conditions, averaged errors in azimuth increased from 4.5° to 19° with RMS errors ranging from 6.4° to 26.7° as model frequency increased from 300 to 3000 Hz. The low-frequency model performance using short speech sound was notably better than the generalized cross-correlation model. Two types of room reverberations were then introduced to simulate difficult listening conditions. Model performance under reverberation was more resilient at low frequencies than at high frequencies. Overall, our study presented a spiral model for rapidly predicting horizontal locations of concurrent sound that is suitable for real-world scenarios.
more »
« less
- Award ID(s):
- 2217032
- PAR ID:
- 10463906
- Editor(s):
- Canlon Barbara
- Date Published:
- Journal Name:
- Hearing research
- ISSN:
- 0378-5955
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
The temperature dependence of the thermal conductivity is linked to the nature of the energy transport at a frequency ω , which is quantified by thermal diffusivity d ( ω ). Here we study d ( ω ) for a poorly annealed glass and a highly stable glass prepared using the swap Monte Carlo algorithm. To calculate d ( ω ), we excite wave packets and find that the energy moves diffusively for high frequencies up to a maximum frequency, beyond which the energy stays localized. At intermediate frequencies, we find a linear increase of the square of the width of the wave packet with time, which allows for a robust calculation of d ( ω ), but the wave packet is no longer well described by a Gaussian as for high frequencies. In this intermediate regime, there is a transition from a nearly frequency independent thermal diffusivity at high frequencies to d ( ω ) ∼ ω −4 at low frequencies. For low frequencies the sound waves are responsible for energy transport and the energy moves ballistically. The low frequency behavior can be predicted using sound attenuation coefficients.more » « less
-
There are a variety of common situations in which specification of a onedimensional modulation transfer function (MTF) or two orthogonal profiles of the 2D MTF are not adequate descriptions of the image quality performance of an optical system. These include systems with an asymmetric on-axis impulse response, systems with off-axis aberrations, systems with surfaces that include mid-spatial frequency errors, and freeform systems. In this paper, we develop the concept of the Minimum Modulation Curve (MMC). Starting with the two-dimensional MTF in polar form, the minimum MTF for any azimuth angle is plotted as a function of the radial spatial frequency. This can be presented in a familiar form similar to an MTF curve and is useful in the context of guaranteeing that a given MTF specification is met for any possible orientation of spatial frequencies in the image. In this way, an MMC may be of value in specifying the required performance of an optical system. We illustrate application of the MMC using profile data for surfaces with midspatial frequency errors. © 2019 Optical Society of America under the terms of the OSA Open Access Publishing Agreementmore » « less
-
Multiple mechanized ocean vessels, including both surface ships and submerged vehicles, can be simultaneously monitored over instantaneous continental-shelf scale regions >10,000 km 2 via passive ocean acoustic waveguide remote sensing. A large-aperture densely-sampled coherent hydrophone array system is employed in the Norwegian Sea in Spring 2014 to provide directional sensing in 360 degree horizontal azimuth and to significantly enhance the signal-to-noise ratio (SNR) of ship-radiated underwater sound, which improves ship detection ranges by roughly two orders of magnitude over that of a single hydrophone. Here, 30 mechanized ocean vessels spanning ranges from nearby to over 150 km from the coherent hydrophone array, are detected, localized and classified. The vessels are comprised of 20 identified commercial ships and 10 unidentified vehicles present in 8 h/day of Passive Ocean Acoustic Waveguide Remote Sensing (POAWRS) observation for two days. The underwater sounds from each of these ocean vessels received by the coherent hydrophone array are dominated by narrowband signals that are either constant frequency tonals or have frequencies that waver or oscillate slightly in time. The estimated bearing-time trajectory of a sequence of detections obtained from coherent beamforming are employed to determine the horizontal location of each vessel using the Moving Array Triangulation (MAT) technique. For commercial ships present in the region, the estimated horizontal positions obtained from passive acoustic sensing are verified by Global Positioning System (GPS) measurements of the ship locations found in a historical Automatic Identification System (AIS) database. We provide time-frequency characterizations of the underwater sounds radiated from the commercial ships and the unidentified vessels. The time-frequency features along with the bearing-time trajectory of the detected signals are applied to simultaneously track and distinguish these vessels.more » « less
-
We perform empirical fits to the Chandra and XMM-Newton spectra of three ultraluminous X-ray sources (ULXs) in the edge-on spiral galaxy NGC 891, monitoring the region over a 17-year time window. One of these sources was visible since the early 1990s with ROSAT and was observed multiple times with Chandra and XMM-Newton. Another was visible since 2011. We build upon prior analyses of these sources by analyzing all available data at all epochs. Where possible Chandra data is used, since its superior spatial resolution allows for more effective isolation of the emission from each individual source, thus providing a better determination of their spectral properties. We also identify a new transient ULX, CXOU J022230.1+421937, which faded from view over the course of a two month period from Nov 2016 to Jan 2017. Modeling of each source at every epoch was conducted using six different models ranging from thermal bremsstrahlung to accretion disk models. Unfortunately, but as is common with many ULXs, no single model yielded a much better fit than the others. The two known sources had unabsorbed luminosities that remained fairly consistent over five or more years. Various possibilities for the new transient ULX are explored.more » « less