Clustering analysis of inputs to a geospatial model of outdoor ambient sound

Butler, Brooks; Pedersen, Katrina; Gee, Kent; Transtrum, Mark; Gaza, Casie

Citation Details

Outdoor ambient acoustical environments may be predicted through supervised machine learning using geospatial features as inputs. However, collecting sufficient training data is an expensive process, particularly when attempting to improve the accuracy of models based on supervised learning methods over large, geospatially diverse regions. Unsupervised machine learning methods, such as K-Means clustering analysis, enable a statistical comparison between the geospatial diversity represented in the current training dataset versus the predictor locations. In this case, the geospatial features that represent the regions of western North Carolina and Utah have been simultaneously clustered to examine the common clusters between the two locations. Initial results show that most geospatial clusters group themselves according to a relatively small number of prominent geospatial features, and that Utah requires appreciably more clusters to represent its geospace. Additionally, the training dataset has a relatively low geospatial diversity because most of the current training data sites reside in a small number of clusters. This analysis informs a choice of new site locations for data acquisition that maximize the statistical similarity of the training and input datasets. more »

Award ID(s):: 1757998

PAR ID:: 10106067

Author(s) / Creator(s):: Butler, Brooks; Pedersen, Katrina; Gee, Kent; Transtrum, Mark; Gaza, Casie

Date Published:: 2018-10-01

Journal Name:: Bulletin of the American Physical Society

Volume:: 63

Issue:: 16

ISSN:: 0003-0503

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Journal Article:
The DOI is not currently available.

More Like this