skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Simple and Effective Augmentation Methods for CSI Based Indoor Localization
Indoor localization is a challenging task. Compared to outdoor environments where GPS is dominant, there is no robust and almost-universal approach. Recently, machine learning (ML) has emerged as the most promising approach for achieving accurate indoor localization. Nevertheless, its main challenge is requiring large datasets to train the neural networks. The data collection procedure is costly and laborious, requiring extensive measurements and labeling processes for different indoor environments. The situation can be improved by Data Augmentation (DA), a general framework to enlarge the datasets for ML, making ML systems more robust and increasing their generalization capabilities. This paper proposes two simple yet surprisingly effective DA algorithms for channel state information (CSI) based indoor localization motivated by physical considerations. We show that the number of measurements for a given accuracy requirement may be decreased by an order of magnitude. Specifically, we demonstrate the algorithms’ effectiveness by experiments conducted with a measured indoor WiFi measurement dataset: As little as 10% of the original dataset size is enough to get the same performance as the original dataset. We also showed that if we further augment the dataset with the proposed techniques, test accuracy is improved more than three-fold.  more » « less
Award ID(s):
2008443
PAR ID:
10526819
Author(s) / Creator(s):
; ; ;
Publisher / Repository:
IEEE
Date Published:
ISBN:
979-8-3503-1090-0
Page Range / eLocation ID:
3947 to 3952
Format(s):
Medium: X
Location:
Kuala Lumpur, Malaysia
Sponsoring Org:
National Science Foundation
More Like this
  1. Indoor localization plays a vital role in applications such as emergency response, warehouse management, and augmented reality experiences. By deploying machine learning (ML) based indoor localization frameworks on their mobile devices, users can localize themselves in a variety of indoor and subterranean environments. However, achieving accurate indoor localization can be challenging due to heterogeneity in the hardware and software stacks of mobile devices, which can result in inconsistent and inaccurate location estimates. Traditional ML models also heavily rely on initial training data, making them vulnerable to degradation in performance with dynamic changes across indoor environments. To address the challenges due to device heterogeneity and lack of adaptivity, we propose a novel embedded ML framework calledFedHIL. Our framework combines indoor localization and federated learning (FL) to improve indoor localization accuracy in device-heterogeneous environments while also preserving user data privacy.FedHILintegrates a domain-specific selective weight adjustment approach to preserve the ML model's performance for indoor localization during FL, even in the presence of extremely noisy data. Experimental evaluations in diverse real-world indoor environments and with heterogeneous mobile devices show thatFedHILoutperforms state-of-the-art FL and non-FL indoor localization frameworks.FedHILis able to achieve 1.62 × better localization accuracy on average than the best performing FL-based indoor localization framework from prior work. 
    more » « less
  2. This work introduces an novel approach to improving cybersecurity systems to focus on spam email-based cyberattacks. The proposed technique tackles the challenge of training Machine Learning (ML) models with limited data samples by leveraging Bidirectional Encoder Representations from Transformers (BERT) for contextualized embeddings. Unlike traditional embedding methods, BERT offers a nuanced representation of smaller datasets, enabling more effective ML model training. The methodology will use several pre-trained BERT models for generating contextualized embeddings using data samples, and these embeddings will be fed to various ML algorithms for effective training. This approach demonstrates that even with scarce data, BERT embeddings significantly enhance model performance compared to conventional embedding approaches like Word2Vec. The technique proves especially advantageous for insufficient instances of high-quality dataset. The result of this proposed work outperforms traditional techniques to mitigate phishing attacks with few data samples. This work provides a robust accuracy of 99.25% when we use multilingual BERT (M-BERT) to embed dataset. 
    more » « less
  3. This paper presents SVIn2, a novel tightly-coupled keyframe-based Simultaneous Localization and Mapping (SLAM) system, which fuses Scanning Profiling Sonar, Visual, Inertial, and water-pressure information in a non-linear optimization framework for small and large scale challenging underwater environments. The developed real-time system features robust initialization, loop-closing, and relocalization capabilities, which make the system reliable in the presence of haze, blurriness, low light, and lighting variations, typically observed in underwater scenarios. Over the last decade, Visual-Inertial Odometry and SLAM systems have shown excellent performance for mobile robots in indoor and outdoor environments, but often fail underwater due to the inherent difficulties in such environments. Our approach combats the weaknesses of previous approaches by utilizing additional sensors and exploiting their complementary characteristics. In particular, we use (1) acoustic range information for improved reconstruction and localization, thanks to the reliable distance measurement; (2) depth information from water-pressure sensor for robust initialization, refining the scale, and assisting to limit the drift in the tightly-coupled integration. The developed software—made open source—has been successfully used to test and validate the proposed system in both benchmark datasets and numerous real world underwater scenarios, including datasets collected with a custom-made underwater sensor suite and an autonomous underwater vehicle Aqua2. SVIn2 demonstrated outstanding performance in terms of accuracy and robustness on those datasets and enabled other robotic tasks, for example, planning for underwater robots in presence of obstacles. 
    more » « less
  4. Location awareness is vital in next generation (xG) wireless networks to enable different use cases, including location-based services (LBSs) and efficient network management. However, achieving the service level requirements specified by the 3rd Generation Partnership Project (3GPP) is challenging. This calls for new localization algorithms as well as for 3GPP-standardized scenarios to support their systematic development and testing. In this context, the availability of public datasets with 3GPP-compliant configurations is essential to advance the evolution of xG networks. xG-Loc is the first open dataset for localization algorithms and services fully compliant with 3GPP technical reports and specifications. xG-Loc includes received localization signals, measurements, and analytics for different network and signal configurations in indoor and outdoor scenarios with center frequencies from micro-waves in frequency range 1 (FR1) to millimeter-waves in frequency range 2 (FR2). Position estimates obtained via soft information-based localization and wireless channel quality via blockage intelligence are also included in xG-Loc. The rich set of data provided by xG-Loc enables the characterization of localization algorithms and services under common 3GPP-standardized scenarios in xG networks. 
    more » « less
  5. Corrosion of materials impacts critical economic sectors from infrastructure, transportation, defense, health, to the environment. The development of safe anti-corrosive materials is thus an important area of study in materials science. Corrosion science of preparing materials and then monitoring their corrosion under adverse conditions is labor intensive, time consuming, and extremely costly. While deep learning has become popular in automating various engineering tasks, the development of deep models for corrosion assessment is lacking. We are the first to study deep domain adaptation (DA) models for the automated assessment of the corrosion status of anti-corrosive materials. Corrosion data, i.e., photographic images of treated corroding materials, is abundant when produced in artificially controlled laboratory settings, while corrosion image data sets from rich natural outdoor environments are more challenging to produce and thus much smaller. We leverage the more readily available indoor corrosion data to train a classifier and then transfer it via deep domain adaptation to also perform well on the small yet more realistic outdoor corrosion image data set – without requiring target labels. We empirically compare 5 popular domain adaptation models on real-world corrosion image data sets. Our study finds that DA achieves 27% improvement in test accuracy compared to the performance of the no-DA baseline for classifying real-world outdoor corrosion data. 
    more » « less