
This content will become publicly available on September 11, 2024

Title: A Deep Reinforcement Learning-Based Resource Scheduler for Massive MIMO Networks
The large number of antennas in massive MIMO systems allows the base station to communicate with multiple users on the same time and frequency resource using multi-user beamforming. However, highly correlated user channels can drastically reduce the spectral efficiency that multi-user beamforming achieves. It is therefore critical for the base station to schedule a suitable group of users in each time and frequency resource block to maximize spectral efficiency while adhering to fairness constraints among the users. In this paper, we consider the resource scheduling problem for massive MIMO systems, for which finding the optimal solution is known to be NP-hard. Inspired by recent achievements of deep reinforcement learning (DRL) in solving problems with large action sets, we propose SMART, a dynamic scheduler for massive MIMO based on the state-of-the-art Soft Actor-Critic (SAC) DRL model and the K-Nearest Neighbors (KNN) algorithm. Through comprehensive simulations using realistic massive MIMO channel models as well as real-world datasets from channel measurement experiments, we demonstrate the effectiveness of our proposed model in various channel conditions. Our results show that our proposed model performs very close to the optimal proportionally fair (Opt-PF) scheduler in terms of spectral efficiency and fairness, with more than one order of magnitude lower computational complexity in medium network sizes where Opt-PF is computationally feasible. Our results also show the feasibility and high performance of our proposed scheduler in networks with a large number of users and resource blocks.
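The effect of channel correlation on proportionally fair scheduling can be illustrated with a small sketch. This is not the SMART scheduler or the Opt-PF implementation from the paper. It is a toy exhaustive search over two-user groups, assuming zero-forcing beamforming with an equal power split, for which the effective gain of user k is ||h_k||^2 (1 - |rho|^2), where rho is the normalized correlation between the two channel vectors. The function names, the example channels, and the SNR value are all hypothetical.

```python
import itertools
import math


def correlation(h1, h2):
    """Normalized correlation |h1^H h2| / (||h1|| ||h2||) of two channel vectors."""
    inner = sum(a.conjugate() * b for a, b in zip(h1, h2))
    n1 = math.sqrt(sum(abs(a) ** 2 for a in h1))
    n2 = math.sqrt(sum(abs(b) ** 2 for b in h2))
    return abs(inner) / (n1 * n2)


def pair_rates(h1, h2, snr):
    """Per-user rates (bit/s/Hz) of a two-user zero-forcing group.

    Assumes equal power split, so each user sees half of the total SNR.
    """
    rho_sq = correlation(h1, h2) ** 2
    rates = []
    for h in (h1, h2):
        # Zero-forcing keeps only the part of h orthogonal to the other user.
        gain = sum(abs(a) ** 2 for a in h) * (1.0 - rho_sq)
        rates.append(math.log2(1.0 + 0.5 * snr * gain))
    return rates


def best_pf_pair(channels, avg_rates, snr):
    """Exhaustively pick the pair maximizing the proportional-fair metric.

    The metric is the sum of instantaneous rate divided by each user's
    long-term average rate, so poorly served users are weighted up.
    """
    best, best_metric = None, -math.inf
    for i, j in itertools.combinations(range(len(channels)), 2):
        r_i, r_j = pair_rates(channels[i], channels[j], snr)
        metric = r_i / avg_rates[i] + r_j / avg_rates[j]
        if metric > best_metric:
            best, best_metric = (i, j), metric
    return best, best_metric


# Hypothetical two-antenna channels: user 1 is nearly collinear with user 0,
# while user 2 is orthogonal to user 0.
channels = [
    [1 + 0j, 0j],
    [1 + 0j, 0.1 + 0j],
    [0j, 1 + 0j],
]
equal_history = best_pf_pair(channels, [1.0, 1.0, 1.0], snr=10.0)
starved_user_1 = best_pf_pair(channels, [4.0, 0.5, 4.0], snr=10.0)
print(equal_history[0], starved_user_1[0])
```

With equal average rates the search avoids pairing the two nearly collinear users. Once user 1 has a low average rate, the fairness weighting pulls it into the selected pair. Exhaustive search of this kind grows combinatorially with the number of users and the group size, which is the cost that motivates learned schedulers.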
Journal Name: IEEE Transactions on Machine Learning in Communications and Networking
Medium: X
Sponsoring Org: National Science Foundation
More Like this
  1. User subset selection requires full downlink channel state information to realize effective multi-user beamforming in frequency-division duplexing (FDD) massive multi-input multi-output (MIMO) systems. However, the channel estimation overhead scales with the number of users in FDD systems. In this paper, we propose a novel propagation domain-based user selection scheme, labeled as zero-measurement selection, for FDD massive MIMO systems with the aim of reducing the channel estimation overhead that scales with the number of users. The key idea is to infer the downlink user channel norm and inter-user channel correlation from the uplink channel in the propagation domain. In zero-measurement selection, the base station performs downlink user selection before any downlink channel estimation. As a result, the downlink channel estimation overhead for both user selection and beamforming is independent of the total number of users. We then evaluate zero-measurement selection with both measured and simulated channels. The results show that zero-measurement selection achieves up to 92.5% of the weighted sum rate of genie-aided user selection on average and scales well with both the number of base-station antennas and the number of users. We also employ simulated channels for further performance validation, and the numerical results yield similar observations to the experimental findings.
  2. Massive multi-user (MU) multiple-input multiple-output (MIMO) provides high spectral efficiency by means of spatial multiplexing and fine-grained beamforming. However, conventional base-station (BS) architectures for systems with hundreds of antennas that rely on centralized baseband processing inevitably suffer from (i) excessive interconnect data rates between radio-frequency circuitry and processing fabrics, and (ii) prohibitive complexity at the centralized baseband processor. Recently, decentralized baseband processing (DBP) architectures and algorithms have been proposed, which mitigate the interconnect bandwidth and complexity bottlenecks. This paper systematically explores the design trade-offs between error-rate performance, computational complexity, and data transfer latency of DBP architectures under different system configurations and channel conditions. Considering architecture, algorithm, and numerical precision aspects, we provide practical guidelines to select the DBP architecture and algorithm that are able to realize the full benefits of massive MU-MIMO in the uplink and downlink.
  3. Multiuser MIMO (MU-MIMO) technologies can help meet the rapidly growing demand for high data rates in modern wireless networks. Co-channel interference (CCI) among users in the same resource-sharing group (RSG) presents a serious user scheduling challenge to achieving high overall MU-MIMO capacity. Since CCI is closely related to correlation among spatial user channels, it is natural to schedule co-channel user groups with low inter-user channel correlation. Yet, establishing RSGs with low co-channel correlations for large user populations is an NP-hard problem. More practically, user scheduling for wideband channels exhibiting distinct channel characteristics in each frequency band remains an open question. In this work, we propose a novel wideband user grouping and scheduling algorithm named SC-MS. The proposed SC-MS algorithm first leverages spectral clustering to obtain a preliminary set of user groups. Next, we apply a post-processing step to identify user cliques from the preliminary groups to further mitigate CCI. Our last step groups users into RSGs for scheduling such that the sum of user clique sizes across the multiple frequency bands is maximized. Simulation results demonstrate network performance gain over benchmark methods in terms of sum rate and fairness.
  4. In a multi-user system with multiple antennas at the base station, precoding techniques in the downlink broadcast channel allow users to detect their respective data in a non-cooperative manner. Vector Perturbation Precoding (VPP) is a non-linear variant of transmit-side channel inversion that perturbs user data to achieve full diversity order. While promising, finding an optimal perturbation in VPP is known to be an NP-hard problem, demanding heavy computational support at the base station and limiting the feasibility of the approach to small MIMO systems. This work proposes a radically different processing architecture for the downlink VPP problem, one based on Quantum Annealing (QA), to enable the applicability of VPP to large MIMO systems. Our design reduces VPP to a quadratic polynomial form amenable to QA, then refines the problem coefficients to mitigate the adverse effects of QA hardware noise. We evaluate our proposed QA-based VPP (QAVP) technique on a real Quantum Annealing device over a variety of design and machine parameter settings. With existing hardware, QAVP can achieve a BER of 10⁻⁴ with 100 µs compute time, for a 6 × 6 MIMO system using 64-QAM modulation at 32 dB SNR.
  5. Wireless charging coupled with computation offloading in edge networks offers a promising solution for realizing power-hungry and computation-intensive applications on user devices. We consider a multi-access edge computing (MEC) system with a collocated MEC server and base station/access point (AP), each equipped with a massive MIMO antenna array, supporting multiple users requesting data computation and wireless charging. The goal is to minimize the energy consumption for computation offloading and maximize the received energy at the user from wireless charging. The proposed solution is a novel two-stage algorithm employing a nested descent algorithm, primal-dual subgradient, and linear programming techniques to perform data partitioning and time allocation for computation offloading and to design the optimal energy beamforming for wireless charging, all within MEC-AP transmit power and latency constraints. Algorithm results show that optimal energy beamforming significantly outperforms other schemes such as isotropic or directed charging without beam power allocation. Compared to binary offloading, data partitioning in partial offloading leads to lower energy consumption and more charging time, and hence to better wireless charging performance. The charged energy over an extended period of multiple time slots, both with and without computation offloading, can be substantial. Wireless charging from the MEC-AP thus offers a viable untethered approach for supplying energy to user devices.