

Title: Metamorph: Injecting Inaudible Commands into Over-the-air Voice Controlled Systems
This paper presents Metamorph, a system that generates imperceptible audio that can survive over-the-air transmission to attack the neural network of a speech recognition system. The key challenge is ensuring that the perturbation added to the original audio in advance at the sender side is immune to unknown signal distortions during transmission. Our empirical study reveals that signal distortion is mainly due to device and channel frequency selectivity, but with different characteristics. This offers a chance to capture and pre-code this impact to generate adversarial examples that are robust to over-the-air transmission. We leverage this opportunity in Metamorph and obtain an initial perturbation that captures the core distortion’s impact from only a small set of prior measurements, and then take advantage of a domain adaptation algorithm to refine the perturbation to further improve the attack distance and reliability. We also consider reducing the human perceptibility of the added perturbation. Evaluation achieves a high attack success rate (90%) over an attack distance of up to 6 m. Within a moderate distance, e.g., 3 m, Metamorph maintains this high success rate, yet can be further adapted to largely improve the audio quality, as confirmed by a human perceptibility study.
Award ID(s):
1617161
NSF-PAR ID:
10212193
Journal Name:
Network and Distributed Systems Security (NDSS) Symposium
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Despite remarkable improvements, automatic speech recognition is susceptible to adversarial perturbations. Compared to standard machine learning architectures, these attacks are significantly more challenging, especially since the inputs to a speech recognition system are time series that contain both acoustic and linguistic properties of speech. Extracting all recognition-relevant information requires more complex pipelines and an ensemble of specialized components. Consequently, an attacker needs to consider the entire pipeline. In this paper, we present VENOMAVE, the first training-time poisoning attack against speech recognition. Similar to the predominantly studied evasion attacks, we pursue the same goal: leading the system to an incorrect and attacker-chosen transcription of a target audio waveform. In contrast to evasion attacks, however, we assume that the attacker can only manipulate a small part of the training data without altering the target audio waveform at runtime. We evaluate our attack on two datasets: TIDIGITS and Speech Commands. When poisoning less than 0.17% of the dataset, VENOMAVE achieves attack success rates of more than 80.0%, without access to the victim’s network architecture or hyperparameters. In a more realistic scenario, when the target audio waveform is played over the air in different rooms, VENOMAVE maintains a success rate of up to 73.3%. Finally, VENOMAVE achieves an attack transferability rate of 36.4% between two different model architectures.
  2. Excessive phosphorus (P) applications to croplands can contribute to eutrophication of surface waters through surface runoff and subsurface (leaching) losses. We analyzed leaching losses of total dissolved P (TDP) from no-till corn, hybrid poplar (Populus nigra X P. maximowiczii), switchgrass (Panicum virgatum), miscanthus (Miscanthus giganteus), native grasses, and restored prairie, all planted in 2008 on former cropland in Michigan, USA. All crops except corn (13 kg P ha−1 year−1) were grown without P fertilization. Biomass was harvested at the end of each growing season except for poplar. Soil water at 1.2 m depth was sampled weekly to biweekly for TDP determination during March–November 2009–2016 using tension lysimeters. Soil test P (0–25 cm depth) was measured every autumn. Soil water TDP concentrations were usually below levels where eutrophication of surface waters is frequently observed (> 0.02 mg L−1) but often higher than in deep groundwater or nearby streams and lakes. Rates of P leaching, estimated from measured concentrations and modeled drainage, did not differ statistically among cropping systems across years; 7-year cropping system means ranged from 0.035 to 0.072 kg P ha−1 year−1 with large interannual variation. Leached P was positively related to STP, which decreased over the 7 years in all systems. These results indicate that both P-fertilized and unfertilized cropping systems may leach legacy P from past cropland management.
    Experimental details
    The Biofuel Cropping System Experiment (BCSE) is located at the W.K. Kellogg Biological Station (KBS) (42.3956° N, 85.3749° W; elevation 288 m asl) in southwestern Michigan, USA. This site is a part of the Great Lakes Bioenergy Research Center (www.glbrc.org) and is a Long-term Ecological Research site (www.lter.kbs.msu.edu). Soils are mesic Typic Hapludalfs developed on glacial outwash [54] with high sand content (76% in the upper 150 cm) intermixed with silt-rich loess in the upper 50 cm [55]. The water table lies approximately 12–14 m below the surface. The climate is humid temperate with a mean annual air temperature of 9.1 °C and annual precipitation of 1005 mm, 511 mm of which falls between May and September (1981–2010) [56,57]. The BCSE was established as a randomized complete block design in 2008 on preexisting farmland. Prior to BCSE establishment, the field was used for grain crop and alfalfa (Medicago sativa L.) production for several decades. Between 2003 and 2007, the field received a total of ~300 kg P ha−1 as manure, and the southern half, which contains one of four replicate plots, received an additional 206 kg P ha−1 as inorganic fertilizer. The experimental design consists of five randomized blocks each containing one replicate plot (28 by 40 m) of 10 cropping systems (treatments) (Supplementary Fig. S1; also see Sanford et al. [58]). Block 5 is not included in the present study. Details on experimental design and site history are provided in Robertson and Hamilton [57] and Gelfand et al. [59]. Leaching of P is analyzed in six of the cropping systems: (i) continuous no-till corn, (ii) switchgrass, (iii) miscanthus, (iv) a mixture of five species of native grasses, (v) a restored native prairie containing 18 plant species (Supplementary Table S1), and (vi) hybrid poplar.
    Agronomic management
    Phenological cameras and field observations indicated that the perennial herbaceous crops emerged each year between mid-April and mid-May. Corn was planted each year in early May. Herbaceous crops were harvested at the end of each growing season with the timing depending on weather: between October and November for corn and between November and December for herbaceous perennial crops. Corn stover was harvested shortly after corn grain, leaving approximately 10 cm height of stubble above the ground. The poplar was harvested only once, as the culmination of a 6-year rotation, in the winter of 2013–2014. Leaf emergence and senescence based on daily phenological images indicated the beginning and end of the poplar growing season, respectively, in each year. Application of inorganic fertilizers to the different crops followed a management approach typical for the region (Table 1). Corn was fertilized with 13 kg P ha−1 year−1 as starter fertilizer (N-P-K of 19-17-0) at the time of planting and an additional 33 kg P ha−1 year−1 was added as superphosphate in spring 2015. Corn also received N fertilizer around the time of planting and in mid-June at typical rates for the region (Table 1). No P fertilizer was applied to the perennial grassland or poplar systems (Table 1). All perennial grasses (except restored prairie) were provided 56 kg N ha−1 year−1 of N fertilizer in early summer between 2010 and 2016; an additional 77 kg N ha−1 was applied to miscanthus in 2009. Poplar was fertilized once with 157 kg N ha−1 in 2010 after the canopy had closed.
    Sampling of subsurface soil water and soil for P determination
    Subsurface soil water samples were collected beneath the root zone (1.2 m depth) using samplers installed at approximately 20 cm into the unconsolidated sand of 2Bt2 and 2E/Bt horizons (soils at the site are described in Crum and Collins [54]). Soil water was collected from two kinds of samplers: Prenart samplers constructed of Teflon and silica (http://www.prenart.dk/soil-water-samplers/) in replicate blocks 1 and 2 and Eijkelkamp ceramic samplers (http://www.eijkelkamp.com) in blocks 3 and 4 (Supplementary Fig. S1). The samplers were installed in 2008 at an angle using a hydraulic corer, with the sampling tubes buried underground within the plots and the sampler located about 9 m from the plot edge. There were no consistent differences in TDP concentrations between the two sampler types. Beginning in the 2009 growing season, subsurface soil water was sampled at weekly to biweekly intervals during non-frozen periods (April–November) by applying 50 kPa of vacuum to each sampler for 24 h, during which the extracted water was collected in glass bottles. Samples were filtered using different filter types (all 0.45 µm pore size) depending on the volume of leachate collected: 33-mm dia. cellulose acetate membrane filters when volumes were less than 50 mL; and 47-mm dia. Supor 450 polyethersulfone membrane filters for larger volumes. Total dissolved phosphorus (TDP) in water samples was analyzed by persulfate digestion of filtered samples to convert all phosphorus forms to soluble reactive phosphorus, followed by colorimetric analysis by long-pathlength spectrophotometry (UV-1800 Shimadzu, Japan) using the molybdate blue method [60], for which the method detection limit was ~0.005 mg P L−1. Between 2009 and 2016, soil samples (0–25 cm depth) were collected each autumn from all plots for determination of soil test P (STP) by the Bray-1 method [61], using as an extractant a dilute hydrochloric acid and ammonium fluoride solution, as is recommended for neutral to slightly acidic soils. The measured STP concentration in mg P kg−1 was converted to kg P ha−1 based on soil sampling depth and soil bulk density (mean, 1.5 g cm−3).
    Sampling of water samples from lakes, streams and wells for P determination
    In addition to chemistry of soil and subsurface soil water in the BCSE, waters from lakes, streams, and residential water supply wells were also sampled during 2009–2016 for TDP analysis using Supor 450 membrane filters and the same analytical method as for soil water. These water bodies are within 15 km of the study site, within a landscape mosaic of row crops, grasslands, deciduous forest, and wetlands, with some residential development (Supplementary Fig. S2, Supplementary Table S2). Details of land use and cover change in the vicinity of KBS are given in Hamilton et al. [48], and patterns in nutrient concentrations in local surface waters are further discussed in Hamilton [62].
    Leaching estimates, modeled drainage, and data analysis
    Leaching was estimated at daily time steps and summarized as total leaching on a crop-year basis, defined from the date of planting or leaf emergence in a given year to the day prior to planting or emergence in the following year. TDP concentrations (mg L−1) of subsurface soil water were linearly interpolated between sampling dates during non-freezing periods (April–November) and over non-sampling periods (December–March) based on the preceding November and subsequent April samples. Daily rates of TDP leaching (kg ha−1) were calculated by multiplying concentration (mg L−1) by drainage rates (m3 ha−1 day−1) modeled by the Systems Approach for Land Use Sustainability (SALUS) model, a crop growth model that is well calibrated for KBS soil and environmental conditions. SALUS simulates yield and environmental outcomes in response to weather, soil, management (planting dates, plant population, irrigation, N fertilizer application, and tillage), and genetics [63]. The SALUS water balance sub-model simulates surface runoff, saturated and unsaturated water flow, drainage, root water uptake, and evapotranspiration during growing and non-growing seasons [63]. The SALUS model has been used in studies of evapotranspiration [48,51,64] and nutrient leaching [20,65,66,67] from KBS soils, and its predictions of growing-season evapotranspiration are consistent with independent measurements based on growing-season soil water drawdown [53] and evapotranspiration measured by eddy covariance [68]. Phosphorus leaching was assumed insignificant on days when SALUS predicted no drainage.
    Volume-weighted mean TDP concentrations in leachate for each crop-year and for the entire 7-year study period were calculated as the total dissolved P leaching flux (kg ha−1) divided by the total drainage (m3 ha−1). One-way ANOVA with time (crop-year) as the fixed factor was conducted to compare total annual drainage rates, P leaching rates, volume-weighted mean TDP concentrations, and maximum aboveground biomass among the cropping systems over all seven crop-years as well as with TDP concentrations from local lakes, streams, and groundwater wells. When a significant (α = 0.05) difference was detected among the groups, we used the Tukey honest significant difference (HSD) post-hoc test to make pairwise comparisons among the groups. In the case of maximum aboveground biomass, we used the Tukey–Kramer method to make pairwise comparisons among the groups because the absence of poplar data after the 2013 harvest resulted in unequal sample sizes. We also used the Tukey–Kramer method to compare the frequency distributions of TDP concentrations in all of the soil leachate samples with concentrations in lakes, streams, and groundwater wells, since each sample category had very different numbers of measurements.
    Individual spreadsheets in “data table_leaching_dissolved organic carbon and nitrogen.xls”
      1. annual precip_drainage
      2. biomass_corn, perennial grasses
      3. biomass_poplar
      4. annual N leaching _vol-wtd conc
      5. Summary_N leached
      6. annual DOC leachin_vol-wtd conc
      7. growing season length
      8. correlation_nh4 VS no3
      9. correlations_don VS no3_doc VS don
    Each spreadsheet is described below along with an explanation of variates. Note that ‘nan’ indicates that data are missing or not available. The first row is the header; the second row gives the units.
    1. Spreadsheet: annual precip_drainage
      Description: Precipitation measured at the nearby Kellogg Biological Station (KBS) Long Term Ecological Research (LTER) weather station over the 2009-2016 study period. Data shown in Figure 1; original data source for precipitation (https://lter.kbs.msu.edu/datatables/7). Drainage estimated from the SALUS crop model. Note that drainage is percolation out of the root zone (0-125 cm). Annual precipitation and drainage values shown here are calculated for growing and non-growing crop periods.
      Variate: Description
        year: year of the observation
        crop: “corn” “switchgrass” “miscanthus” “nativegrass” “restored prairie” “poplar”
        precip_G: precipitation during growing period (milliMeter)
        precip_NG: precipitation during non-growing period (milliMeter)
        drainage_G: drainage during growing period (milliMeter)
        drainage_NG: drainage during non-growing period (milliMeter)
    2. Spreadsheet: biomass_corn, perennial grasses
      Description: Maximum aboveground biomass measurements from corn, switchgrass, miscanthus, native grass and restored prairie plots in the Great Lakes Bioenergy Research Center (GLBRC) Biomass Cropping System Experiment (BCSE) during 2009-2015. Data shown in Figure 2.
      Variate: Description
        year: year of the observation
        date: day of the observation (mm/dd/yyyy)
        crop: “corn” “switchgrass” “miscanthus” “nativegrass” “restored prairie” “poplar”
        replicate: each crop has four replicated plots, R1, R2, R3 and R4
        station: stations (S1, S2 and S3) of samplings within the plot; for more details, see https://data.sustainability.glbrc.org/protocols/156
        species: plant species rooted within the quadrat at the time of maximum biomass harvest; for more information, see the protocol (http://lter.kbs.msu.edu/datatables/36). For maize, grain and whole biomass are reported in the paper (weed biomass and surface litter are excluded). Surface litter biomass is not included for any crop; weed biomass is not included for switchgrass and miscanthus, but is included for the grass mixture and prairie.
        fraction: fraction of biomass
        biomass_plot: biomass per plot on dry-weight basis (Grams_Per_SquareMeter)
        biomass_ha: biomass (megaGrams_Per_Hectare), obtained by multiplying biomass_plot by 0.01
    3. Spreadsheet: biomass_poplar
      Description: Maximum aboveground biomass measurements from poplar plots in the Great Lakes Bioenergy Research Center (GLBRC) Biomass Cropping System Experiment (BCSE) during 2009-2015. Data shown in Figure 2. Note that poplar biomass was estimated from crop growth curves until the poplar was harvested in the winter of 2013-14.
      Variate: Description
        year: year of the observation
        method: methods of poplar biomass sampling
        date: day of the observation (mm/dd/yyyy)
        replicate: each crop has four replicated plots, R1, R2, R3 and R4
        diameter_at_ground: poplar diameter (milliMeter) at the ground
        diameter_at_15cm: poplar diameter (milliMeter) at 15 cm height
        biomass_tree: biomass per tree (Grams_Per_Tree)
        biomass_ha: biomass (megaGrams_Per_Hectare), obtained by multiplying biomass per tree by 0.01
    4. Spreadsheet: annual N leaching_vol-wtd conc
      Description: Annual leaching rate (kiloGrams_N_Per_Hectare) and volume-weighted mean N concentrations (milliGrams_N_Per_Liter) of nitrate (no3) and dissolved organic nitrogen (don) in the leachate samples collected from corn, switchgrass, miscanthus, native grass, restored prairie and poplar plots in the Great Lakes Bioenergy Research Center (GLBRC) Biomass Cropping System Experiment (BCSE) during 2009-2016. Data for nitrogen leached and volume-weighted mean N concentration shown in Figure 3a and Figure 3b, respectively. Note that ammonium (nh4) concentrations were much lower and often undetectable (<0.07 milliGrams_N_Per_Liter). Also note that in the 2009 and 2010 crop-years, data from some replicates are missing.
      Variate: Description
        crop: “corn” “switchgrass” “miscanthus” “nativegrass” “restored prairie” “poplar”
        crop-year: year of the observation
        replicate: each crop has four replicated plots, R1, R2, R3 and R4
        no3 leached: annual leaching rates of nitrate (kiloGrams_N_Per_Hectare)
        don leached: annual leaching rates of don (kiloGrams_N_Per_Hectare)
        vol-wtd no3 conc.: volume-weighted mean no3 concentration (milliGrams_N_Per_Liter)
        vol-wtd don conc.: volume-weighted mean don concentration (milliGrams_N_Per_Liter)
    5. Spreadsheet: summary_N leached
      Description: Summary of total amount and forms of N leached (kiloGrams_N_Per_Hectare) and the percent of applied N lost to leaching over the seven years for corn, switchgrass, miscanthus, native grass, restored prairie and poplar plots in the Great Lakes Bioenergy Research Center (GLBRC) Biomass Cropping System Experiment (BCSE) during 2009-2016. Data for nitrogen amount leached shown in Figure 4a and percent of applied N lost shown in Figure 4b. Note that the fates of unleached N (removal in harvest, accumulation in root biomass or soil organic matter, or gaseous N emissions) were not measured in the study.
      Variate: Description
        crop: “corn” “switchgrass” “miscanthus” “nativegrass” “restored prairie” “poplar”
        no3 leached: annual leaching rates of nitrate (kiloGrams_N_Per_Hectare)
        don leached: annual leaching rates of don (kiloGrams_N_Per_Hectare)
        N unleached: N not leached (kiloGrams_N_Per_Hectare); its other fates were not studied
        % of N applied N lost to leaching: percent of applied N lost to leaching
    6. Spreadsheet: annual DOC leachin_vol-wtd conc
      Description: Annual leaching rate (kiloGrams_Per_Hectare) and volume-weighted mean concentrations (milliGrams_Per_Liter) of dissolved organic carbon (DOC) in the leachate samples collected from corn, switchgrass, miscanthus, native grass, restored prairie and poplar plots in the Great Lakes Bioenergy Research Center (GLBRC) Biomass Cropping System Experiment (BCSE) during 2009-2016. Data for DOC leached and volume-weighted mean DOC concentration shown in Figure 5a and Figure 5b, respectively. Note that in the 2009 and 2010 crop-years, water samples were not available for DOC measurements.
      Variate: Description
        crop: “corn” “switchgrass” “miscanthus” “nativegrass” “restored prairie” “poplar”
        crop-year: year of the observation
        replicate: each crop has four replicated plots, R1, R2, R3 and R4
        doc leached: annual leaching rates of doc (kiloGrams_Per_Hectare)
        vol-wtd doc conc.: volume-weighted mean doc concentration (milliGrams_Per_Liter)
    7. Spreadsheet: growing season length
      Description: Growing season length (days) of corn, switchgrass, miscanthus, native grass, restored prairie and poplar plots in the Great Lakes Bioenergy Research Center (GLBRC) Biomass Cropping System Experiment (BCSE) during 2009-2015. Data shown in Figure S2. Note that the growing season runs from the date of planting or emergence to the date of harvest (or leaf senescence in the case of poplar).
      Variate: Description
        crop: “corn” “switchgrass” “miscanthus” “nativegrass” “restored prairie” “poplar”
        year: year of the observation
        growing season length: growing season length (days)
    8. Spreadsheet: correlation_nh4 VS no3
      Description: Correlation of ammonium (nh4+) and nitrate (no3-) concentrations (milliGrams_N_Per_Liter) in the leachate samples from corn, switchgrass, miscanthus, native grass, restored prairie and poplar plots in the Great Lakes Bioenergy Research Center (GLBRC) Biomass Cropping System Experiment (BCSE) during 2013-2015. Data shown in Figure S3. Note that the nh4+ concentration in the leachates was very low compared to no3- and don concentrations and often undetectable in the three crop-years (2013-2015) for which measurements are available.
      Variate: Description
        crop: “corn” “switchgrass” “miscanthus” “nativegrass” “restored prairie” “poplar”
        date: date of the observation (mm/dd/yyyy)
        replicate: each crop has four replicated plots, R1, R2, R3 and R4
        nh4 conc: nh4 concentration (milliGrams_N_Per_Liter)
        no3 conc: no3 concentration (milliGrams_N_Per_Liter)
    9. Spreadsheet: correlations_don VS no3_doc VS don
      Description: Correlations of don and nitrate concentrations (milliGrams_N_Per_Liter), and of doc (milliGrams_Per_Liter) and don concentrations (milliGrams_N_Per_Liter), in the leachate samples of corn, switchgrass, miscanthus, native grass, restored prairie and poplar plots in the Great Lakes Bioenergy Research Center (GLBRC) Biomass Cropping System Experiment (BCSE) during 2013-2015. The correlation of don and nitrate concentrations is shown in Figure S4 a, and that of doc and don concentrations in Figure S4 b.
      Variate: Description
        crop: “corn” “switchgrass” “miscanthus” “nativegrass” “restored prairie” “poplar”
        year: year of the observation
        don: don concentration (milliGrams_N_Per_Liter)
        no3: no3 concentration (milliGrams_N_Per_Liter)
        doc: doc concentration (milliGrams_Per_Liter)
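The leaching estimate described in this entry (linear interpolation of TDP concentrations between sampling dates, a daily flux equal to concentration times modeled drainage, and a volume-weighted mean equal to total flux divided by total drainage) can be illustrated with a minimal Python sketch. It is not the authors' code and does not run SALUS: daily drainage is simply passed in, the function names and sample values are hypothetical, and the factor of 1000 is the unit conversion implied by mg L−1 × m3 ha−1 = g ha−1.

```python
from datetime import date, timedelta


def interpolate_daily(samples):
    """Linearly interpolate (date, mg/L) samples to one concentration per day."""
    samples = sorted(samples)
    daily = {}
    for (d0, c0), (d1, c1) in zip(samples, samples[1:]):
        span = (d1 - d0).days
        for k in range(span):
            daily[d0 + timedelta(days=k)] = c0 + (c1 - c0) * k / span
    last_day, last_conc = samples[-1]
    daily[last_day] = last_conc
    return daily


def summarize_leaching(daily_conc, daily_drainage):
    """Return (total flux in kg/ha, volume-weighted mean concentration in mg/L).

    daily_conc: {date: mg/L}; daily_drainage: {date: m3/ha/day} (assumed input,
    standing in for modeled drainage).
    """
    total_flux = 0.0      # kg/ha
    total_drainage = 0.0  # m3/ha
    for day, drainage in daily_drainage.items():
        if drainage <= 0 or day not in daily_conc:
            continue  # no drainage on this day, so no leaching is counted
        # 1 mg/L equals 1 g/m3, so mg/L * m3/ha gives g/ha; /1000 gives kg/ha
        total_flux += daily_conc[day] * drainage / 1000.0
        total_drainage += drainage
    if total_drainage == 0:
        return 0.0, float("nan")
    # kg/m3 * 1000 converts back to g/m3, i.e. mg/L
    vwm = total_flux / total_drainage * 1000.0
    return total_flux, vwm


# Hypothetical example: two samples four days apart, drainage on two days
conc = interpolate_daily([(date(2010, 4, 1), 0.02), (date(2010, 4, 5), 0.04)])
drainage = {date(2010, 4, 1): 10.0, date(2010, 4, 2): 0.0, date(2010, 4, 3): 20.0}
flux, vwm = summarize_leaching(conc, drainage)
print(f"flux = {flux:.4f} kg/ha, volume-weighted mean = {vwm:.4f} mg/L")
```

Weighting by drainage means that days with more percolation count more toward the mean concentration, which is why the volume-weighted mean can differ from a simple average of the samples.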
  3. Drilling and milling operations are material removal processes involved in everyday conventional production, especially in the high-speed metal cutting industry. Monitoring tool information (wear, dynamic behavior, deformation, etc.) is essential to guarantee the success of product fabrication. Many methods have been applied to monitor cutting tools using information from cutting force, spindle motor current, vibration, and acoustic emission. However, those methods are indirect and sensitive to environmental noise. Here, an in-process imaging technique that can capture cutting tool information while cutting metal was studied. As machinists judge whether a tool is worn out by the naked eye, a vision system can directly present the performance of the machine tools. We proposed a phase-shifted strobo-stereoscopic method (Figure 1) for three-dimensional (3D) imaging. A stroboscopic instrument is usually applied to measure fast-moving objects. The operating principle is as follows: when the frequency of the light source illumination is synchronized with the motion of the object, the object appears to be stationary. The motion frequency of the target is derived from the count of encoder signals from the working rotary spindle. If a small difference is added to the frequency, the object appears to move or rotate slowly. This effect serves as the source of the phase shift; with this phase information, the target can be reconstructed in 3D over a whole 360-degree view. The stereoscopic technique uses two CCD cameras, located bilaterally symmetrically with respect to the target, to capture images. The 3D scene is reconstructed from the locations of the same object points in the left and right images. In the proposed system, an air spindle was used to secure the motion accuracy and drilling/milling speed.
    As shown in Figure 2, two CCDs with 10X objective lenses were installed on a linear rail with rotary stages to capture raw pictures of the machine tool bit for further 3D reconstruction. The overall measurement process is summarized in the flow chart (Figure 3). As the count of encoder signals is related to the rotary speed, the input speed (in RPM) was set as the reference signal to control the frequency (f0) of the LED illumination. When the frequency matched the reference signal, both CCDs started to gather pictures. With the mismatched frequency (Δf) information, a sequence of images was gathered under the phase-shifted process for a whole-view 3D reconstruction. The study in this paper was based on performance monitoring of a 3/8-inch drilling tool. This paper presents the principle of the phase-shifted strobo-stereoscopic 3D imaging process. A hardware set-up is introduced, as well as the 3D imaging algorithm. The reconstructed image analysis under different working speeds is discussed, including the reconstruction resolution. The uncertainty of the imaging process and the built-up system are also analyzed. As the input signal is the working speed, no information from other sources is required. The proposed method can be applied as on-machine or even in-process metrology. As a direct method, the 3D imaging machine vision system can directly offer machine tool surface and fatigue information. The presented method can fill the gap in determining the performance status of machine tools, which further guarantees the fabrication process.
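The stroboscopic relation summarized in this entry (a matched flash frequency freezes the tool, and a small mismatch Δf makes it appear to rotate slowly) can be illustrated with a short Python sketch. This is the generic textbook relation, not the authors' imaging algorithm; it assumes one LED flash per strobe period, a spindle frequency f0 = RPM / 60, a strobe frequency f0 + Δf with |Δf| much smaller than f0, and hypothetical function names.

```python
def spindle_frequency_hz(rpm):
    """Rotation frequency f0 in Hz for a spindle speed given in RPM."""
    return rpm / 60.0


def phase_step_deg(rpm, delta_f_hz):
    """Apparent rotation of the tool, in degrees, between consecutive flashes.

    The strobe is assumed to flash at f0 + delta_f_hz. Between two flashes the
    tool turns f0 / (f0 + delta_f_hz) revolutions; the whole revolution seen at
    perfect synchrony is subtracted, leaving the visible phase shift.
    """
    f0 = spindle_frequency_hz(rpm)
    f_strobe = f0 + delta_f_hz
    if f_strobe <= 0:
        raise ValueError("strobe frequency must be positive")
    return 360.0 * (f0 / f_strobe - 1.0)


def flashes_per_full_view(rpm, delta_f_hz):
    """Number of flashes needed for the apparent rotation to cover 360 degrees."""
    step = abs(phase_step_deg(rpm, delta_f_hz))
    if step == 0:
        return float("inf")  # perfectly synchronized: the tool looks stationary
    return 360.0 / step


# Hypothetical example: 6000 RPM spindle, strobe running 1 Hz slower than f0
print(phase_step_deg(6000, -1.0))
print(flashes_per_full_view(6000, -1.0))
```

A smaller mismatch gives a finer phase step per image, so more frames are collected for one apparent revolution.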
  4. INTRODUCTION: Apollo-11 (A-11) was the first manned space mission to successfully bring astronauts to the moon and return them safely. Effective team-based communication is required for mission specialists to work collaboratively to learn, engage, and solve complex problems. As part of NASA’s goal of assessing team and mission success, all vital speech communications between these personnel were recorded using the multi-track SoundScriber system onto analog tapes, preserving their contribution to the success of one of the greatest achievements in human history. More than 400 personnel served as mission specialists/support and communicated across 30 audio loops, resulting in more than 9,000 hours of data for A-11. To ensure the success of this mission, it was necessary for teams to communicate, learn, and address problems in a timely manner. Previous research has found that compatibility of individual personalities within teams is important for effective team collaboration. Hence, it is essential to identify each speaker’s role during an Apollo mission and analyze group communications for knowledge exchange and problem solving to achieve a common goal. Assessing and analyzing speaker roles during the mission can allow for exploring engagement analysis for multi-party speaker situations.
    METHOD: The UTDallas Fearless Steps Apollo data comprises 19,000 hours (A-11, A-13, A-1) and poses unique and multiple challenges, as it is characterized by severe noise and degradation as well as overlap instances over the 30 channels. For our study, we have selected a subset of 100 hours manually transcribed by professional annotators for speaker labels. The 100 hours are obtained from three mission-critical events: 1. Lift-Off (25 hours) 2. Lunar-Landing (50 hours) 3. Lunar-Walking (25 hours). Five channels of interest, those with the most speech activity, were selected out of the 30 channels; the primary speakers operating these five channels are the command/owners of those channels. For our analysis, we select five speaker roles: Flight Director (FD), Capsule Communicator (CAPCOM), Guidance, Navigation, and Control (GNC), Electrical, Environmental, and Consumables Manager (EECOM), and Network (NTWK). To track and tag individual speakers across our Fearless Steps audio dataset, we use the concept of ‘where’s Waldo’ to identify all instances of our speakers of interest across a cluster of other speakers. Also, to understand the roles of our speakers of interest, we use the speaking duration of the primary speaker versus the secondary speaker, together with speaker turns, as our metrics to determine the role of each speaker and to understand their responsibility during the three critical phases of the mission. This enables a content linking capability and provides a pathway to analyzing group engagement, group dynamics of people working together in an enclosed space, psychological effects, and cognitive analysis in such individuals.
    IMPACT: NASA’s Apollo Program stands as one of the most significant contributions to humankind. This collection opens new research options for recognizing team communication, group dynamics, and human engagement/psychology for future deep space missions. Analyzing team communications to achieve such goals would allow for the formulation of educational and training technologies for assessment of STEM knowledge, task learning, and educational feedback. Also, identifying these personnel can help pay tribute and yield personal recognition to the hundreds of notable engineers and scientists who made this feat possible.
    ILLUSTRATION: In this work, we propose to illustrate how a pre-trained speech/language network can be used to obtain the powerful speaker embeddings needed for speaker diarization. This framework is used to build these learned embeddings to label unique speakers over sustained audio streams. To train and test our system, we will make use of the Fearless Steps Apollo corpus, allowing us to effectively leverage a limited label information resource (100 hours of labeled data out of more than 9,000 hours). Furthermore, we use the concept of 'Finding Waldo' to identify key speakers of interest (SOI) throughout the Apollo-11 mission audio across multiple channel audio streams.
  5. The past few years have witnessed a growing interest in wireless and batteryless implants, due to their potential in long-term biomedical monitoring of in-body conditions such as internal organ movements, bladder pressure, and gastrointestinal health. Early proposals for batteryless implants relied on inductive near-field coupling and ultrasound harvesting, which require direct contact between the external power source and the human body. To overcome this near-field challenge, recent research has investigated the use of RF backscatter in wireless micro-implants because of its ability to communicate with wireless receivers that are placed at a distance outside the body (∼0.5 m), allowing a more seamless user experience. Unfortunately, existing far-field backscatter designs remain limited in their functionality: they cannot perform biometric sensing or secure data transmission; they also suffer from degraded harvesting efficiency and backscatter range due to the impact of variations in the surrounding tissues. In this paper, we present the design of a batteryless, wireless and secure system-on-chip (SoC) implant for in-body strain sensing. The SoC relies on four features: 1) employing a reconfigurable in-body rectenna which can operate across tissues adapting its backscatter bandwidth and center frequency; 2) designing an energy efficient 1.37 mmHg strain sensing front-end with an efficiency of 5.9 mmHg·nJ/conversion; 3) incorporating an AES-GCM security engine to ensure the authenticity and confidentiality of sensed data while sharing the ADC with the sensor interface for an area efficient random number generation; 4) implementing an over-the-air closed-loop wireless programming scheme to reprogram the RF front-end to adapt for surrounding tissues and the sensor front-end to achieve faster settling times below 2 s. 