<?xml-model href='http://www.tei-c.org/release/xml/tei/custom/schema/relaxng/tei_all.rng' schematypens='http://relaxng.org/ns/structure/1.0'?><TEI xmlns="http://www.tei-c.org/ns/1.0">
	<teiHeader>
		<fileDesc>
			<titleStmt><title level='a'>Modulation and Coding for NOMA and RSMA</title></titleStmt>
			<publicationStmt>
				<publisher>IEEE</publisher>
				<date>01/01/2024</date>
			</publicationStmt>
			<sourceDesc>
				<bibl> 
					<idno type="par_id">10549319</idno>
					<idno type="doi">10.1109/JPROC.2024.3472501</idno>
					<title level='j'>Proceedings of the IEEE</title>
<idno>0018-9219</idno>
<biblScope unit="volume"></biblScope>
<biblScope unit="issue"></biblScope>					

					<author>Hamid Jafarkhani</author><author>Hossein Maleki</author><author>Mojtaba Vaezi</author>
				</bibl>
			</sourceDesc>
		</fileDesc>
		<profileDesc>
			<abstract><ab><![CDATA[Next generation multiple access (NGMA) serves as an umbrella term encompassing transmission schemes distinct from conventional orthogonal methods. As a prominent candidate of NGMA, non-orthogonal multiple access (NOMA) emerges as a promising solution, enhancing connectivity by allowing multiple users to concurrently share time, frequency, and space. However, NOMA faces challenges in practical implementation, particularly in canceling inter-user interference. In this paper, first, we discuss the principles behind NOMA and review the conventional NOMA methods and results. Then, to address the above challenges, we present asynchronous transmission and interference-aware modulation techniques, leading to decoding free from successive interference cancellation. The goal is to design constellations that dynamically adapt to interference, minimizing bit error rates (BERs) and enhancing user throughput in the presence of interuser, inter-carrier, and inter-cell interference. The traditional linkage between minimizing BER and increasing spectral efficiency is addressed, with the exploration of deep autoencoders for end-to-end communication as a new concept with significant potential for improving BERs. Interference-aware modulation techniques can revolutionize constellation design and communication over non-orthogonal channels. Rate-splitting multiple access (RSMA) is another promising interference management technique in multi-user systems. Beyond addressing existing challenges and misconceptions in finite-alphabet NOMA, this paper offers fresh insights to the field and provides an overview of code-domain NOMA schemes, trellis-coded NOMA, and RSMA as other potential candidates for NGMA. Additionally, we discuss the evolution of channel coding towards low-latency communication and examine the modulation and coding schemes in fifth-generation cellular networks. Finally, we examine future research avenues and challenges, highlighting the importance of addressing them for the practical realization of NOMA from a theoretical concept to a functional technology.]]></ab></abstract>
		</profileDesc>
	</teiHeader>
	<text><body xmlns="http://www.tei-c.org/ns/1.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink">
<div xmlns="http://www.tei-c.org/ns/1.0"><head>I. INTRODUCTION AND HISTORICAL NOTES</head><p>The next generation of communication systems is expected to deliver improved end-user experience by offering new applications and services such as industry automation, smart cities, virtual and augmented reality, remote medical surgery, self-driving cars, and uncrewed aerial vehicles (UAVs). These envisioned services pose many challenging requirements, such as low latency, high data rates, massive connectivity, high reliability, and support of diverse quality of service. The need for massive connectivity in fifth-generation (5G) wireless networks and beyond is mainly pushed by the explosion of the Internet of things (IoT) devices, as projected by leading industry including Cisco and Ericsson <ref type="bibr">[1]</ref>. Particularly, 6G wireless networks require a connection density of 10 7 devices/km 2 , which is 1000 times higher than that of 4G and 10 times higher than that of 5G networks <ref type="bibr">[1]</ref>- <ref type="bibr">[3]</ref>. The requirements in terms of improving reliability, spectral efficiency, and energy efficiency are also stringent. In this context, the roles of multiple access in general and modulation and coding in particular are crucial toward achieving these goals.</p><p>Non-orthogonal multiple access (NOMA) <ref type="bibr">[4]</ref> is perhaps the most prominent candidate for next generation multiple access (NGMA). NOMA increases the number of connected devices and enhances the spectral efficiency of communication by enabling multiple users to share time, frequency, and space, thus accommodating a larger number of users compared to conventional orthogonal multiple access (OMA) schemes. NOMA facilitates massive connectivity by allowing the concurrent service of multiple users within the same resource block, such as a time slot, sub-carrier, or spreading code. It has actively been considered by academia <ref type="bibr">[4]</ref>- <ref type="bibr">[9]</ref>, standardization bodies, and industry <ref type="bibr">[10]</ref>- <ref type="bibr">[13]</ref>. An intriguing aspect of NOMA is its flexibility in integration with various technologies, including orthogonal frequency division multiplexing (OFDM) which is the multiple access method in 4G and 5G. That means a NOMA user has the capability to share a single resource block of OFDM with one or more additional users.</p><p>Despite its significant promise and immense academic work in this field, NOMA has not been incorporated into any standards yet. Several factors contribute to this. A primary reason is that the theoretical gains of NOMA were not achieved in practical implementations <ref type="bibr">[14]</ref>. There are primary challenges that make it difficult to achieve NOMA's theoretical gains in practice. They include the difficulty of canceling inter-user interference introduced by NOMA (such as the complexity of successive interference cancellation), sensitivity of NOMA to channel state information (CSI), and non-synchronous nature of multi-user communication. Another reason for this shortfall is the absence of novel modulation schemes addressing inter-user interference introduced by NOMA. Overall, there has been limited research and innovation on finite-alphabet NOMA. A third contributing factor is the rise of competitive solutions, such as massive multiple-input multiple-output (MIMO), millimeter-wave (mmWave), and narrow-band IoT, which have effectively tackled some of the requirements for massive connectivity and spectrum efficiency through innovative solutions and the fact that many theoretical NOMA results are limited to single antenna cases.</p><p>Nonetheless, inheriting the rich theoretical background of the broadcast channel (BC) <ref type="bibr">[15]</ref>, NOMA still holds great promise as a future multi-user transmission technique, referred to as NGMA. Besides, emerging methods like rate-splitting multiple access (RSMA) have presented themselves as new potential candidates for NGMA. RSMA utilizes successive interference cancellation (SIC) to decode a portion of the interference and treats the remaining interference as noise. Thus, RSMA is in between space division multiple access (SDMA), which treats interference as noise, and NOMA, which decodes the interference of the users with weaker channels and removes it from the received signal.</p><p>With a specific focus on NOMA and RSMA, this article delves into current and future modulation and coding schemes for NGMA. Modulation and coding techniques play a critical role in achieving the ultimate goal of digital communication, which is transmitting a maximum number of bits reliably, i.e., with a small bit error rate (BER). Moreover, the emphasis here is on finite-alphabet and asynchronous NOMA, representing a crucial step in evaluating NOMA's gains in more practical settings and advancing research toward the integration of NOMA into wireless standards.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Motivation and Objectives</head><p>Modulation techniques, like quadrature amplitude modulation (QAM), are employed to increase the bit rate (spectral efficiency) while resulting in an acceptable BER. Current modulation techniques are, however, designed several decades ago with point-to-point communication in mind <ref type="bibr">[16]</ref>- <ref type="bibr">[19]</ref>. They have predefined, inflexible symbols and their constellation shaping is oblivious to interference, whereas modern communication systems are limited by interference more than by any other single effect <ref type="bibr">[20]</ref>. Interference appears in these networks in different forms such as inter-user interference (IUI), intercell interference (ICI), and inter-symbol interference (ISI). These all distort the received constellations in one way or another and thus reduce the reliability of communications by increasing the BER. The typical practical solution is then to sacrifice the spectral efficiency and limit the number of users by allocating orthogonalized resources to each user or by using low-rate and high-energy constellations.</p><p>The described interference scenarios share a common issue: they can displace a constellation symbol from its predefined decoding region (boundary), leading to decoding errors and, consequently, symbol and bit errors. This challenge arises because current modulation techniques, such as QAM, were originally designed for point-to-point systems without interference. These modulation techniques feature predefined constellation symbols and their shape is insensitive to interference. The rigidity of these constellations poses a significant hurdle to improving the BER and spectral efficiency in today's interference-limited communication systems. Therefore, there is a need for innovative interference-aware modulation techniques to meet key performance indicators in future wireless communication networks, including spectral efficiency, the number of supported devices, and high reliability.</p><p>A key goal of this paper is to present NGMA modulation and coding methods that reduce decoding BERs and increase the number of users and their throughput for a given number of resource blocks in the presence of inter-user and intercell interference. Minimizing BER and increasing spectral efficiency are linked together <ref type="bibr">[21,</ref><ref type="bibr">Fig. 1]</ref> and are traditionally optimized by designing modulation and coding schemes. We present a comprehensive survey of modulation schemes, both with and without coding, for both OMA and NOMA scenarios. In the context of NOMA, the survey critically evaluates the implications of superimposed signals on symbol overlapping and BER. Then, we propose using deep autoencoders for end-toend (E2E) NOMA. Deep learning-based E2E communication is a novel concept with significant potential. As a concrete example, this approach outperforms the MIMO precoder in terms of BER with/without the channel's knowledge <ref type="bibr">[22]</ref>- <ref type="bibr">[25]</ref>.</p><p>In line with the fundamental objective of digital communication, which revolves around the reliable transmission of a maximal number of bits, our focus in this work is on exploring various avenues that pave the way for reliable and feasible nonorthogonal transmission. Instead of integrating NOMA with emerging and existing communication technologies, our work centers around understanding the essential steps required to transform non-orthogonal transmission, specifically NOMA, from a theoretical research topic into a practical and feasible technology. Toward this goal, our discussion encompasses practical considerations, including asynchronous transmission, SIC-free decoding, trellis-coded NOMA, interference-aware constellation design, end-to-end communications, and numerous other aspects detailed in the following section.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Contributions and Insights</head><p>This paper comprehensively explores existing and emerging solutions related to synchronous and asynchronous NOMA, code-domain NOMA, trellis-coded NOMA, uniform, nonuniform, and interference-aware constellation design, bitinterleaved coded modulation, SIC-free decoding, and endto-end deep learning-based NOMA. Additionally, the paper provides a state-of-the-art overview of RSMA and discusses open problems and future directions in NGMA. Particularly, we contribute to</p><p>&#8226; Exploring the utilization of asynchronous transmission to tackle synchronization challenges inherent in multi-user and distributed systems, such as those found in uplink and downlink NOMA. &#8226; Reviewing the structure of transmitters and receivers in code-domain NOMA and categorizing various codedomain NOMA schemes accordingly. &#8226; Investigating the effectiveness of non-uniform modulation, trellis-coded NOMA, and bit-interleaved coded modulation with iterative decoding to approach the capacity region of downlink NOMA. <ref type="foot">1</ref>&#8226; Debunking misconceptions surrounding power allocation in NOMA, which may have arisen from assumptions tied to a specific type of modulation and decoding. &#8226; Reviewing modulation and coding schemes utilized in 5G and discussing the road to reduce latency in 6G. &#8226; Introducing the innovative concept of interference-aware constellation design and end-to-end NOMA and demonstrating its merit. &#8226; Conducting a review of uplink and downlink RSMA and exploring its interconnections with NOMA. &#8226; Highlighting research directions and open problems related to the above topics, with a specific emphasis on advancing NOMA as a feasible and practical technique.</p><p>In light of the above contributions, we gain diverse insights that propel NOMA from a theoretical concept towards a practical technology. We hope that these insights, as listed below, may trigger advancements to pave the way for NOMA's inclusion in wireless standards in the near future.</p><p>1) Contrary to conventional wisdom, which states that asynchronous transmission results in increased overall interference and performance degradation due to extra ISI, we show that asynchronous transmission decreases the overall interference. This unexpected outcome occurs because the reduction in IUI outweighs the impact of the added ISI. 2) We demonstrate that drawing general conclusions about NOMA power allocation solely based on a specific type of modulation and decoding may lead to misconceptions. 3) Interference-aware constellation design is a practical approach to realize non-overlapping super-constellations in NOMA. Attaining this objective becomes challenging, if not impossible, when utilizing established constellations like QAM for NOMA users. 4) For MIMO-NOMA systems, the joint design of constellations and precoding for all MIMO sub-channels holds great potential.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>C. Structure</head><p>Figure <ref type="figure">1</ref> illustrates the organization of the paper, providing an overview of the topics covered in each section.</p><p>In Section II, we discuss different techniques under the umbrella of NOMA. These could be primarily categorized into power-domain NOMA (P-NOMA) and code-domain NOMA uniform constellations and their advantages, particularly with bit-interleaved coded modulation.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>Introduction</head><p>Section IV discusses the channel codes utilized in different generations of cellular networks, with a particular emphasis on modulation and coding schemes in 5G networks. We explore the evolution of channel coding techniques to meet the growing demands of ultra-reliable low-latency communications (URLLC). Additionally, we cover trellis-coded modulation for both OMA and NOMA scenarios, emphasizing the role of joint detection methods in enhancing performance.</p><p>Section V introduces interference-aware constellation design, incorporating SIC-free NOMA and autoencoder-based E2E communication. Autoencoders are utilized to create super-constellations with distinguishable symbols, transitioning from block-by-block to E2E communication. The section includes IUI-aware NOMA, ICI-resilient NOMA, and modulation strategies for MIMO-NOMA.</p><p>In Section VI, we review RSMA, discussing its origin and advantages in both downlink and uplink scenarios. Uplink RSMA achieves the capacity region of the Gaussian multiple access channel (MAC) without time sharing, while downlink RSMA is advantageous with imperfect CSI at the transmitter. We briefly explore the use of RSMA in connection with different technologies, including integrated communication and sensing, reconfigurable intelligent surfaces, and UAVs.</p><p>Section VII highlights research areas for NGMA, addressing non-uniform modulation in NOMA with bit-interleaved coded modulation. Challenges in E2E NOMA and MIMO-NOMA modulation are discussed. Limited feedback and CSI present open problems, including robust MIMO-NOMA system design with estimated and quantized CSI.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>D. Related Works</head><p>Numerous survey papers have appeared on NOMA and</p><p>TABLE I: List of Key Abbreviations. Acronym Description 3GPP 3rd Generation Partnership Project AE Autoencoder AI Artificial intelligence A-NOMA Asynchronous NOMA AWGN Additive white Gaussian noise BC Broadcast channel BER Bit error rate BICM Bit-interleaved coded modulation CDMA Code division multiple access C-NOMA Code-domain NOMA CSI Channel state information CSIT Channel state information at the transmitter E2E End-to-end ICI Inter-cell interference IDMA Interleave division multiple access IGMA Interleave-grid multiple access IoT Internet of things ISAC Integrated sensing and communications ISI Inter-symbol interference IUI Inter-user interference LDS Low-density signature/spreading LPMA Lattice partition multiple access LTE Long Term Evolution MAC Multiple access channel MCS Modulation and coding scheme MIMO Multiple-input multiple-output ML Machine learning NGMA Next generation multiple access NOMA Non-orthogonal multiple access NR New Radio OFDM Orthogonal frequency division multiplexing OMA Orthogonal multiple access PDMA Pattern division multiple access P-NOMA Power-domain NOMA PSK Phase-shift keying QAM Quadrature amplitude modulation QPSK Quadrature phase-shift keying RB Resource block RIS Reconfigurable intelligent surface RSMA Rate-splitting multiple access SCMA Sparse code multiple access SDMA Space division multiple access SER Symbol error rate SIC Successive interference cancellation SISO Single-input single-output SNR Signal-to-noise ratio SVD Singular value decomposition TCM Trellis-coded modulation TC-NOMA Trellis-coded NOMA UAV Uncrewed aerial vehicle URLLC Ultra-reliable low-latency communications</p><p>RSMA in recent years. For example, see <ref type="bibr">[7]</ref>- <ref type="bibr">[9]</ref> for NOMA and <ref type="bibr">[29]</ref> for RSMA. However, many of these survey papers are not directly related to this paper as their focus is not on modulation, finite-alphabet NOMA, or RSMA. The most related works are <ref type="bibr">[30]</ref>- <ref type="bibr">[32]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>II. P-NOMA &amp; C-NOMA</head><p>4G cellular networks have been architected around orthogonal radio resource allocation techniques not allowing for overlapping resource allocation. For instance, a resource block (RB) in Long Term Evolution (LTE), which spans 180kHz, cannot be shared among multiple users; it must be exclusively assigned to one user. This resource allocation approach has two limitations in the context of massive IoT:</p><p>&#8226; With the rapid growth of massive IoT devices, there would not be enough spectrum to allocate a dedicated RB to each device. 2 &#8226; Massive IoT users typically do not exhaust an entire RB, rendering such resource allocation inefficient. Due to the above challenges, the communication system design has recently undergone a transformation, shifting from an orthogonal resource allocation to a non-orthogonal one <ref type="bibr">[5]</ref>. This paradigm shift encompasses various aspects including waveform design, multiple access, and random access <ref type="bibr">[5]</ref>. Particularly, NOMA has attracted significant attention as a promising multiple access technique.</p><p>It should be highlighted that the proposed NOMA techniques for the uplink and downlink are distinct, rooted in the inherent differences between communication requirements for each direction. The downlink predominantly serves humancentric communications, characterized by larger packets and higher data rates. Conversely, the uplink involves an extensive array of uncoordinated devices transmitting small packets at low data rates. Consequently, addressing the demands of massive, low-rate IoT devices in the uplink mandates a unique set of techniques compared to P-NOMA which is good for the downlink channels.</p><p>It is worth noting that, P-NOMA has been studied by the 3rd Generation Partnership Project (3GPP) for LTE in TR 36.859, Release 13 <ref type="bibr">[35]</ref>. It has also been studied for New Radio (NR) in TR 38.812, Release 16 <ref type="bibr">[36]</ref>. Also, there are several P-NOMA-related survey papers worth mentioning, including <ref type="bibr">[32]</ref>, <ref type="bibr">[37]</ref>- <ref type="bibr">[39]</ref>. These works cover various topics including 2 Interestingly, the concept of allocating more than one user to one RB has already been successfully implemented in 2G under the name of voice services over adaptive multi-user channels on one slot (VAMOS) which is used to increase the capacity of voice services by allowing multiple users to share the same time slot. In the simplest case, it is known as the adaptive QPSK (AQPSK) modulation scheme. It enables scheduling two users on inphase (I) and quadrature-phase (Q) channels, thus doubling the number of users served by a single radio resource <ref type="bibr">[33]</ref>, <ref type="bibr">[34]</ref>. Additionally, allocation of different power levels for each user is possible <ref type="bibr">[33]</ref>, <ref type="bibr">[34]</ref>.</p><p>With the above definition of NOMA, which involves using one RB for multiple users' signals, VAMOS is a NOMA scheme. There is, however, a subtle difference. The constellation symbols of users try to be orthogonal (one is mapped to the I channel and the other to the Q channel). In other words, half of the constellation points are for the first user and the other half are for the second user. However, in the NOMA modulation schemes considered in <ref type="bibr">[35]</ref> and this paper, by receiving every symbol, we can reconstruct information for both users. Therefore, the two schemes are different in terms of transmission rates.</p><p>Coarse Points: 4 PSK Fine Points: 32 QAM Fig. <ref type="figure">2</ref>: An example of superposition coding. The constellation for the user with weaker channel is 4-PSK while the constellation for the user with stronger channel is 8-QAM. The overall superposed constellation is called 32-QAM. grant-free NOMA, resource management mechanisms, and others.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. P-NOMA</head><p>To increase the efficiency of wireless networks, the concurrent transmission of different users is unavoidable. Such a concurrent transmission in a wireless multi-user network results in interference among different signals. Therefore, one of the most distinctive features of multi-user wireless networks is the interference phenomenon. As a result, dealing with interference is very important when shifting from the single-user paradigm to a multi-user paradigm. To deal with interference, currently, communication standards rely on assigning separate time, frequency, and code resources to different users such that each user utilizes only one such a given resource in an orthogonal multiple access framework. This is a huge waste of resources and to curtail it cellular networks utilize frequency reuse in cells that are far from each other. Nevertheless, in each cell, and at each time, frequency, and code, only one user is served using a resource that is orthogonal to other resources. Therefore, the main source of interference will be the leak from users of adjacent resources due to the imperfection of the filters or the inter-cell interference due to the frequency reuse. However, NOMA relies on assigning more than one user to each resource, for example by employing superposition coding at the transmitter and SIC at the receiver. In fact, superposition coding is not new and has been used in achieving the capacity of the degraded BC <ref type="bibr">[15]</ref> as well as the single-antenna Gaussian BC which is always degraded. In addition, combined with an appropriate transmitter, SIC at the receiver is capable of approaching the boundaries of the capacity region of both the degraded BC and MAC <ref type="bibr">[40]</ref>.</p><p>To study the main principles behind P-NOMA, let us take a downlink P-NOMA system as an example. It consists of one base station (BS) and K users. Let us assume the transmit power of User k's signal is P k and K k=1 P k = P , where P is the total transmit power of the BS. The transmitted signal s is defined as</p><p>where s k is the transmitted symbol for User k. Let us denote the channel coefficient between the BS and User k by h k . Then, the received signal at User k is given by</p><p>where &#951; k is the additive Gaussian noise, &#951; k &#8764; CN (0, &#963; 2 k ). For simplicity of the notation, we assume the same noise power for all users, i.e., &#963; 2 k = &#963; 2 . Obviously, different users generate interference for each other and their decoders should manage the interference. P-NOMA manages the interference by decoding the signal of users with weaker channels and canceling it from the received signal. As such, in P-NOMA, it is important to know the relative strength of the users' channels to understand which signals can be decoded for interference cancellation. The main principle behind P-NOMA decoding is that stronger channels, i.e., channels with larger channel magnitudes |h k |, will have larger capacities. Therefore, they can support higher rates. As a result, they can successfully decode the symbols of the weaker users with lower rates.</p><p>Without loss of generality, let us assume that</p><p>, a lower index represents a stronger user. For any k &lt; K, the capacity of User k is higher than that of User l, for l = k + 1, &#8226; &#8226; &#8226; , K. Therefore, using SIC, User k can decode the signal for Users k + 1, &#8226; &#8226; &#8226; , K and remove their interference before decoding its own signal. Under the assumption of perfect SIC, the throughput of User k for a case with K users can be calculated by</p><p>It is clear from (2) and (3) that the throughput values depend heavily on the power allocation and the channels. The cochannel interference makes the resource allocation problem in P-NOMA a non-convex optimization problem. The power allocation for a two-user P-NOMA system is studied in <ref type="bibr">[41]</ref>.</p><p>Power allocation for multiple users sharing one channel, i.e., multi-user NOMA, is investigated in <ref type="bibr">[42]</ref>, <ref type="bibr">[43]</ref>. The solution to the power minimization problem can be obtained using the uplink-downlink duality. In general, resource management is an important aspect of the transmitter design in different NOMA scenarios. In addition, user grouping is essential for balancing spectral efficiency, fairness, and system throughput.</p><p>While having a single group is ideal in theory <ref type="bibr">[44]</ref>, practical limitations like the complexity of SIC necessitate multiple groups <ref type="bibr">[45]</ref>. As group size increases, the SIC complexity rises. Typically, users are grouped by channel quality, pairing those with strong conditions with those having weaker conditions to enhance performance. While the above analysis is based on capacity formulas, i.e., error-free decoding, similar principles can be applied to a practical modulation scheme that can cause errors. In what  follows, we present an example of superposition coding and SIC decoding for traditional modulation schemes.</p><p>Example: An example of superposition coding using 4-phaseshift keying (PSK) and 8-QAM constellations for two users is shown in Fig. <ref type="figure">2</ref>. In this example, the input bits of the user with weaker channel, i.e., smaller |h k |, are modulated with a 4-PSK modulation as shown by the coarse points in Fig. <ref type="figure">2</ref>. The input bits for the user with stronger channel are modulated with an 8-QAM modulation. As described in (1), the transmitted symbol is the weighted sum of the two modulated symbols. In this example, we consider equal power allocation that results in a 32-QAM constellation shown by the fine points in Fig. <ref type="figure">2</ref>.</p><p>Figure <ref type="figure">3</ref> shows the SIC technique for decoding the superposed signal of the two users in Fig. <ref type="figure">2</ref> at the receiver. As shown in Fig. <ref type="figure">3a</ref>, the receiver of the weaker user only decodes the coarse points by mapping the received signal to the nearest point in the corresponding PSK constellation. Practically, this means that the weaker user is considering the symbol of the stronger user as noise. The stronger user is also able to decode the coarse points since its channel is stronger than that of the weak user. Therefore, the stronger user decodes the symbol of the weaker user, using the same approach, and subtracts the effects of the decoded coarse symbol from the received signal. This will cancel the weaker user's interference from the stronger user's received signal. The remaining signal is decoded using the corresponding constellation (8-QAM) as shown in Fig. <ref type="figure">3b</ref>. The modulation constellations in this example are chosen to make the example visually more appealing. However, the example works for any choice of constellations. A more practical example may choose a 4-QAM constellation for the stronger user, instead of 8-QAM, that results in an overall 16-QAM modulation.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Asynchronous NOMA</head><p>Traditional modulation and coding schemes assume perfect symbol synchronization. The performance of many existing modulation and coding schemes will severely degrade without symbol-level synchronization. Because of the distributed nature of multi-user networks and the effects of multipath and propagation delays, signals from different users experience different time delays to the receiver <ref type="bibr">[46]</ref>. Perfect synchronization requires feedback and coordination, which complicates the system greatly. Having multiple antennas at the receiver makes the matter more complicated. For example, assuming a multiple antenna receiver in uplink NOMA, one can synchronize the received signal perfectly at one of the receive antennas but other antennas may experience imperfect timing synchronization among received signals. Even if such a complete synchronization is possible, it is not clear if it is desirable. The possible advantages along with the difficulties in achieving perfect synchronization motivate a thorough analysis and design of NOMA systems under imperfect timing synchronization. In fact, symbol-level asynchrony has been advantageous in managing interference in some communication systems as discussed in <ref type="bibr">[46]</ref>- <ref type="bibr">[48]</ref>. Similar principles can be applied to NOMA to design asynchronous NOMA (A-NOMA) <ref type="bibr">[49]</ref>- <ref type="bibr">[52]</ref>. In this section, we discuss the principles behind asynchronous NOMA for both uplink and downlink.</p><p>Uplink A-NOMA: Let us assume User k transmits</p><p>, where s k [n] is the nth element of vector s k denoting the nth normalized transmitted symbol and P k denotes the transmit power. Assuming the pulse-shaping filter p(.) and a frame length of N , the transmitted signal from User k will be</p><p>Considering the relative time delay of &#964; k for User k, i.e., a time delay of &#964; k T , and the corresponding channel h k , the signal from User k arrives at the receiver as</p><p>Then, the received signal at the BS is given by</p><p>where &#951;(t) denotes the additive white Gaussian noise (AWGN) with a power spectral density of &#963; 2 . Unlike the case with perfect synchronization, in addition to IUI, the received signal includes ISI as well. The set of sufficient statistics is found by proper filtering at the receiver, i.e., a matched filter with the impulse response p(t), and over-sampling K times, each time synched with one of the users <ref type="bibr">[46]</ref>. In other words, the receiver samples at <ref type="figure">5</ref> illustrates one possible structure of the decoder including oversampling. As shown in the figure, the sampling rate is still f s = 1/T , i.e., the same as that of the synchronous case, but there are K parallel branches of sampling.</p><p>When sampled synched with User k's signal, the relative delay of User l's signal is &#964; kl = &#964; l -&#964; k . The resulting samples can be collected in an N &#215; 1 vector y k as</p><p>where the (n, m)th element of the</p><p>in which g(t) = p(t) * p(t), where * denotes the convolution, and the noise vector &#951; k has the co-variance matrix</p><p>Using a square-root Nyquist pulse, p(t), like the practically common root raised cosine (RRC) pulse shape, R kk = I N and R T kl = R lk . An example of the received</p><p>&#119910;</p><p>2 [&#119899; -1] &#119910; 2 [&#119899;] &#119910; 2 [&#119899; + 1] &#119962; 1 = &#8462; 1 &#119875; 1 &#119920; &#119873; &#119956; 1 + &#8462; 2 &#119875; 2 &#119929; 12 &#119956; 2 + &#120636; 1 &#119962; 2 = &#8462; 1 &#119875; 1 &#119929; 21 &#119956; 1 + &#8462; 2 &#119875; 2 &#119920; &#119873; &#119956; 2 + &#120636; 2 (b) Symbol Asynchronous signals and the corresponding sufficient statistics for two users is shown in Fig. <ref type="figure">4</ref>. To collect sufficient statistics in A-NOMA, the receiver should perform oversampling, compared with synchronous NOMA. For the example of two users in Fig. <ref type="figure">4</ref>, the number of samples in A-NOMA is twice that of the synchronous NOMA. Note that for the case with perfect synchronization, when all delays &#964; k are the same, we have R kl = I N , i.e., there is no ISI. Therefore, the perfect synchronization results in the conventional uplink P-NOMA system model of</p><p>where compared to (5), the index k is dropped as all resulting K equations are identical. As shown in <ref type="bibr">(7)</ref>, in a perfectly synchronized NOMA system, at each time instant, only the IUI, from the same time-instant, degrades the performance. Therefore, SIC can provide the optimal performance. However, in A-NOMA, not only IUI but also ISI degrades the performance. Therefore, new challenges arise with asynchronous transmission.</p><p>The conventional wisdom suggests that because of the additional ISI, asynchronous transmission increases the overall interference and the overall performance is degraded. However, surprisingly, asynchronous transmission in fact decreases the overall interference, as the reduction in IUI outweighs the addition of ISI <ref type="bibr">[51]</ref>. On the other hand, because of the ISI, the conventional SIC is not optimal anymore and the design of efficient sequence detection methods is required <ref type="bibr">[52]</ref>. More specifically, to remove the stronger users' signals, all the symbols in a frame need to be decoded which adds delay and complexity to the system. In addition, because of the timing asynchrony, sufficient statistics results in over-sampling and the corresponding sampling diversity <ref type="bibr">[46]</ref> that improves the overall performance.</p><p>To manage the above issues, one can collect the K vector equations ( <ref type="formula">5</ref>) in a matrix format. The resulting input-output matrix equation can be represented as a virtual MIMO system</p><p>where y, R, H, s and &#951; represent the set of samples, the timing offsets matrix, the effective channel matrix, the transmitted symbols (including the assigned power), and the noise vector, respectively. The formulation of the problem with a virtual MIMO system enables the use of various interference mitigation methods and decoder designs developed in the literature for MIMO systems <ref type="bibr">[53]</ref>.</p><p>To quantify the effects of asynchronous transmission, let us focus on a two-user uplink P-NOMA system. Figure <ref type="figure">4</ref> shows the received signals of such a system. It can be modeled as an asynchronous MAC with a typical rate-region shown in Fig. <ref type="figure">6</ref>. As proved in <ref type="bibr">[52]</ref> and depicted in Fig. <ref type="figure">6</ref>, not only does A-NOMA outperform synchronous NOMA, but also SIC is not optimal for A-NOMA. There are several conclusions proved in <ref type="bibr">[52]</ref> and shown in Fig. <ref type="figure">6:</ref> &#8226; Intentionally creating a &#964; = 0.5 symbol timing mismatch between the two signals with double time oversampling can enlarge the rate-region. &#8226; While the maximum sum-rate using SIC for uplink NOMA is independent of the decoding order, the maximum sum-rate of A-NOMA depends on the decoding order. For example, in Fig. <ref type="figure">6</ref>, the SIC pentagon rateregion vertices corresponding to the decoding orders {1, 2} and {2, 1} for A-NOMA provide sum-rates of 1.302 and 1.254, respectively. On the other hand, the corresponding vertices for NOMA provide the same sumrate of 1.233. &#8226; When perfect synchronization is lacking, SIC is not optimal anymore.</p><p>The above observations can guide the principles behind designing practical transceivers for asynchronous multiple access</p><p>Matched Filter &#119910;(&#119905;) Delay &#120591; &#119870; &#119879; Delay &#120591; 1 &#119879; Delay &#120591; 2 &#119879; . . . Decode based on K sets of samples Sampling at &#119891; &#119904; = 1/&#119879; Sampling at &#119891; &#119904; = 1/&#119879; Sampling at &#119891; &#119904; = 1/&#119879; . . . Decoding Order: {1,2} &#119877; &#119904;&#119906;&#119898; = 1.233 Decoding Order: {1,2} &#119877; &#119904;&#119906;&#119898; = 1.302 Decoding Order: {2,1} &#119877; &#119904;&#119906;&#119898; = 1.254 Decoding Order: {2,1} &#119877; &#119904;&#119906;&#119898; = 1.233 Fig. 6: Different rate-regions for P 1 = P 2 = 10 dBm, |h 1 | 2 = &#963; 2 , and |h 2 | 2 = 0.2&#963; 2 , using root raised cosine pulse shaping with &#946; = 0.5. Different decoding orders in SIC result in different sum rates in asynchronous NOMA. 0 30 60 90 120 150 180 R 1 [Mbps] 0 15 30 45 60 75 90 R 2 [Mbps] = 0.5, Asynchronous = 0, Synchronous R 1 = 120, R 2 = 75</p><p>Fig. <ref type="figure">7</ref>: An example of asynchronous NOMA operational point that is not achievable by synchronous NOMA. systems. For example, a practical A-NOMA transceiver that creates an intentional &#964; = 0.5 symbol asynchrony between the two users' signals has been designed in <ref type="bibr">[52]</ref>. The operational point achieved by such an A-NOMA transceiver, shown in Fig. <ref type="figure">7</ref>, is outside of the synchronous NOMA's capacity region and therefore is not achievable by any synchronous NOMA transceiver. Downlink A-NOMA: In a downlink NOMA system, the superposition is performed at the transmitter and the transmitted signal is received by the intended users. Therefore, unlike uplink NOMA, the transmitter has full control of the time asynchrony and can intentionally add any desired set of time delays. In fact, adding intentional time delays at the transmitter is beneficial and can improve the performance <ref type="bibr">[51]</ref>. Using the same notation developed for the uplink A-NOMA, the transmitted signal, including the added intentional time delays, can be written as</p><p>and the received signal by User k is</p><p>Similar to the case of uplink A-NOMA, the set of sufficient statistics includes over-sampling K times, each time synched with the signal of one of the users. After proper match filtering and over-sampling, the set of samples at the kth receiver can be represented in the following vector <ref type="bibr">[54]</ref> </p><p>Note that the perfect synchronization results in the conventional system model of</p><p>Using SIC, each user decodes all the signals from weaker users and removes them from the received signal. Then, it considers the remaining interference from the stronger users as noise and decodes its own symbols. This system is called asynchronous P-NOMA (AP-NOMA) in Fig. <ref type="figure">8</ref> and Fig. <ref type="figure">9</ref>.</p><p>In downlink NOMA, the virtual MIMO system in (11) resembles a multiuser system. As such, apart from SIC, other techniques for multiuser communications can be applied as well. For example, since the transmitter has access to all signals, after superposition coding, it can perform precoding The transmitter sends the signal toward the eigenvectors of the effective virtual MIMO system. The receiver uses post-processing, in the direction of the corresponding eigenvector, to recover its signal.</p><p>in the direction of matrix R's eigenvectors to further improve the performance. The decoder of such a system is named asynchronous time-domain NOMA (AT-NOMA). At User k, AT-NOMA uses post-processing, in the direction of the corresponding eigenvector, to recover its signal. Figure <ref type="figure">8</ref> shows the block diagram of such a system that does not include a SIC block for decoding <ref type="bibr">[51]</ref>. The corresponding rate regions are shown for two users in Fig. <ref type="figure">9</ref>. As shown in Fig. <ref type="figure">9</ref>, AT-NOMA's rate-region is larger than that of AP-NOMA and both of them enlarge the achievable rate-region of P-NOMA.</p><p>In addition, for systems with M transmit antennas, transmit beamforming can be implemented as well. Considering an M &#215; 1 beamforming vector W k for User k, the asynchronous transmitted signal, including intentional time delays, is <ref type="bibr">[54]</ref> </p><p>Then, the received signal at User k is</p><p>where h k is User k's channel. As before, the set of sufficient statistics after match filtering and over-sampling can be collected in the following vector <ref type="bibr">[54]</ref> </p><p>The above input-output relationship is very similar to that of a MIMO system and the corresponding receiver designs from the MIMO literature can be used for decoding <ref type="bibr">[53]</ref>, <ref type="bibr">[54]</ref>. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>C. C-NOMA</head><p>In the uplink, C-NOMA refers to a diverse range of nonorthogonal transmission techniques <ref type="bibr">[5]</ref>, <ref type="bibr">[55]</ref>, <ref type="bibr">[56]</ref>. C-NOMA techniques revolve around the concept of allowing more than one user to share the same resource block, marking a distinct departure from OMA techniques. On the other hand, C-NOMA distinguishes itself from P-NOMA by requiring a signature, such as a spreading code, for the differentiation of users</p><p>Channel Encoder Interleaving/ Scrambling Modulator Spreading Interleaving/ Scrambling Resource Mapping Bit-level operations Symbol-level operations and the cancellation of inter-user interference. In contrast, P-NOMA achieves this through the variation in power allocation to each user. Given the predominant emphasis on uplink communication in IoT, C-NOMA emerges as a promising way to accommodate IoT users, especially those with limited resources such as simple sensors.</p><p>The range of C-NOMA techniques is extensive. A major category within C-NOMA schemes is notably influenced by code division multiple access (CDMA). CDMA is a multiple access technique where data symbols are spread over a set of user-specific, mutually-orthogonal codes. An adaptation of CDMA is LDS-CDMA, where spreading codes exhibit low density, meaning only a small fraction of code elements are non-zero <ref type="bibr">[57]</ref>. Resource overloading is the common theme of many non-orthogonal access methods which allows the number of supportable users to be more than the number of available resources, i.e., K users share N resources in a nonorthogonal fashion (K &gt; N ). With this, each scheme needs a multiple access signature to differentiate users and cancel inter-user interference.</p><p>We next explore the transmitter and receiver structures and classification of C-NOMA techniques.</p><p>1) Transmitter Structure: A simplified, generic structure of a C-NOMA transmitter is depicted in Fig. <ref type="figure">10</ref> [55], <ref type="bibr">[56]</ref>, <ref type="bibr">[58]</ref>. Different NOMA schemes apply their signatures in one or more of the dashed blocks in Fig. <ref type="figure">10</ref>. For example, the LDS-CDMA signature is a short spreading sequence applied in the 'spreading' block followed by a unique interleaver for each user, resulting in a sparse signature matrix. Thus, LDS-CDMA is a symbol-based C-NOMA applying both spreading and interleaving after modulation. Sparse resource element (RE) mapping has emerged as a category of multiple access signatures within several C-NOMA schemes, such as SCMA, PDMA, and interleave-grid multiple access (IGMA). In these schemes, the intentional transmission of zeros occurs in specific REs to make resource mapping sparse. An illustration of such a sparse resource mapping is provided in the matrix below:</p><p>.</p><p>According to the above resource allocation matrix, a total of Detection Decoding</p><p>Interference Cancellation Fig. <ref type="figure">11</ref>: A high-level structure of a C-NOMA receiver.</p><p>9 user equipments (UEs) are mapped to 6 resource elements.</p><p>Each UE is assigned access to two REs, and each IGMA is accessed by three different UEs. For instance, RE1 accommodates UE2, UE5, and UE7 while UE4 utilizes RE3 and RE5. Further, it is noteworthy that UE-specific signatures exhibit, at most, a one-position overlap, because a pair of columns do not overlap in more than one position.</p><p>2) Receiver Structure: The fundamental components of a high-level C-NOMA receiver consist of three key building blocks: a detector, a channel decoder, and interference cancellation <ref type="bibr">[55]</ref>, as depicted in Fig. <ref type="figure">11</ref>. In the following, we briefly elaborate on the functionality of each block.</p><p>The detection block, more precisely, the multi-user detection block, addresses challenges arising from multiple users sharing the same channel. It could employ various techniques <ref type="bibr">[55]</ref>, including adaptive filtering and optimization algorithms like minimum mean squared error (MMSE) <ref type="bibr">[59]</ref>, matched filter, maximum a posteriori <ref type="bibr">[60]</ref>, and message passing algorithm <ref type="bibr">[61]</ref>, to separate signals from different users.</p><p>While the detection block handles scenarios where multiple users transmit simultaneously, and is commonly used in multiple access systems, the decoding block serves as a fundamental process in communication systems. Its applicability extends to both single-user and multi-user environments, encompassing error correction and the reconstruction of the original message or data. The decoding block ensures the accurate recovery of information from received signals.</p><p>Interference cancellation block may or may not exist. As an example, to decode a UE's data packet, only MMSE detection and channel decoding could be executed. Nonetheless, interference cancellation is commonly used. This cancellation process can occur successively, in parallel, or through a hybrid approach. MMSE-SIC is a well-known scheme in this context where interference cancellation is performed successively. This technique is commonly utilized in symbol-level spreading schemes, where interference cancellation can take either a 'hard' or 'soft' form. In the hard interference cancellation approach, interference is subtracted once a user's signal is successfully decoded. On the other hand, in the soft interference cancellation method, the output of the decoder includes soft information. This soft information is then used to reconstruct symbols, involving the consideration and processing of probabilistic or continuous-valued information, rather than relying on discrete, hard decisions.</p><p>3) Categories: C-NOMA schemes can be classified in different ways. As can be seen in Fig. <ref type="figure">12</ref>, C-NOMA related operations can be done in the bit-level (before modulator), symbol-level (after modulator), and hybrid. For this reason, as also shown in Fig. <ref type="figure">10</ref>, <ref type="figure">C</ref>-NOMA schemes may be categorized as follows:</p><p>&#8226; Symbol-level spreading-based: Most NOMA schemes fall into this category where a spreading sequence serves as a signature to distinguish users. Earlier, we discussed examples of such signatures in the context of a resource allocation matrix. Notable instances of this category include SCMA <ref type="bibr">[62]</ref> proposed by Huawei, non-orthogonal coded access (NOCA) <ref type="bibr">[63]</ref> proposed by Nokia, multi-user shared access (MUSA) <ref type="bibr">[64]</ref> proposed by ZTE, PDMA <ref type="bibr">[65]</ref> suggested by CATT, and LDS-CDMA, as well as LDS-OFDM <ref type="bibr">[57]</ref>, <ref type="bibr">[66]</ref>. &#8226; Symbol-level interleaving/scrambling-based: In this category, a symbol-level interleaver/scrambler is used to distinguish the users. Resource spread multiple access (ReSMA) <ref type="bibr">[67]</ref> and repetition division multiple access (RDMA) <ref type="bibr">[68]</ref> are examples of this category. &#8226; Bit-level interleaving/scrambling-based: This category involves bit-level operations, specifically utilizing an interleaver or a scrambler at the bit-level to distinguish users <ref type="bibr">[69]</ref>. A bit-level scrambler is advantageous in terms of lower processing delay and memory requirements compared to a bit-level interleaver <ref type="bibr">[36]</ref>. Examples of this category include interleave division multiple access (IDMA) <ref type="bibr">[69]</ref> suggested by InterDigital and low-code-rate spreading (LCRS) <ref type="bibr">[70]</ref> proposed by Intel. Release 15 NR already supports a bit-level scrambler for randomization, which can be leveraged for C-NOMA as well. &#8226; Hybrid: Certain NOMA schemes may combine multiple methods. For instance, IGMA <ref type="bibr">[60]</ref>, proposed by Samsung, incorporates both bit-level interleaving and sparse mapping in its design. Lattice partition multiple access (LPMA) is another example in this category <ref type="bibr">[71]</ref>.</p><p>C-NOMA schemes can be categorized differently, as illustrated in Fig. <ref type="figure">13</ref>. In this representation, the classification is based on the utilization of scrambling/interleaving and spreading operations. Thus, methods that use spreading are in one group, while methods that use interleaving-whether at the bit or symbol level-are in another group.</p><p>4) Prominent Schemes: Here, we explore several prominent C-NOMA schemes in more details.</p><p>&#8226; LDS-CDMA: This C-NOMA scheme represents a nonorthogonal variant of CDMA. While in CDMA, data symbols are spread over user-specific, mutually-orthogonal codes, LDS-CDMA deviates from this norm by utilizing non-orthogonal codes. LDS underscores that spreading codes exhibit low density, i.e., only a small fraction of code elements are non-zero <ref type="bibr">[57]</ref>. This low-density feature allows for the utilization of near-optimal message passing algorithms with practical complexity. However, despite its moderate detection complexity, LDS-CDMA may encounter performance degradation for constellation sizes larger than quadrature phase-shift keying (QPSK).</p><p>&#8226; SCMA: As one of the most renowned C-NOMA schemes, SCMA uses a multi-dimensional codebook where incoming data bits are directly mapped to codewords selected from a layer-specific codebook <ref type="bibr">[72]</ref>. Each codeword represents a spread transmission layer. In contrast to LDS-CDMA (and CDMA), where spreading and bit-tosymbol mapping are conducted separately, SCMA integrates these two steps by directly mapping incoming bits to a spread codeword within the SCMA codebook sets. Similar to LDS, the sparsity of codewords in SCMA allows for the implementation of low-complexity reception techniques <ref type="bibr">[73]</ref>. SCMA also leverages additional degrees of freedom in the design of multi-dimensional constellations, outperforming LDS <ref type="bibr">[72]</ref>. Importantly, the sparser SCMA codewords can tolerate more overloading, facilitating massive IoT connectivity in the uplink. However, it is worth noting that a sparser code results in lower coding gain. While SCMA can be used for the downlink as well <ref type="bibr">[62]</ref>, <ref type="bibr">[74]</ref>, <ref type="bibr">[75]</ref>, the decoding complexity remains high for low-cost devices, limiting SCMA's ability to efficiently support massive IoT connectivity in the downlink <ref type="bibr">[75]</ref>.</p><p>5) SDMA: SDMA is a technique used to exploit the spatial dimension for improving system capacity and performance. Multi-user transmit beamforming or SDMA allows simultaneous communication with different users by directing radio frequency signals towards specific users. In SDMA, multiple users are served simultaneously on the same frequency channel but are separated spatially using MIMO techniques. A careful design of beamforming vectors, utilizing antenna arrays, results in minimizing the average transmit power while maintaining a desired quality of service for each user <ref type="bibr">[76]</ref>. Each user has a distinct spatial signature, which is leveraged to create separate beams or spatial channels. By employing beamforming techniques, the base station can direct the signal energy to specific users, thereby reducing inter-user interference and enhancing signal quality. SDMA systems are usually designed to beam directly towards a user while reducing the interference experienced by other users. For example, SDMA uses linear precoding to separate users in the spatial domain and any interference from the other users will be treated as noise. Suppose H represents the channel matrix between the base station with M antennas and K users, where H &#8712; C K&#215;M . The received signal y can be expressed as</p><p>where W is the beamforming matrix, x is the transmitted signal vector, and &#951; is the noise vector. The beamforming matrix W is designed to maximize the SNR for each user while minimizing interference. Typically, the beamforming vector w k for User k is chosen to be the dominant eigenvector of the user's channel covariance matrix. This approach ensures that the signal intended for User k is maximized at the user's location while minimizing the leakage to other users. In a broadcast channel, multi-user linear precoding is often useful when users experience similar channel strengths and the channels are semi-orthogonal or orthogonal. SDMA based on MU-LP is a well-established multiple access technique that builds up the core principle behind many spatial techniques in 4G and 5G such as multi-user MIMO CoMP, coordinated beamforming, network MIMO, millimeter-wave MIMO, and massive MIMO <ref type="bibr">[77]</ref>. SDMA's effectiveness heavily depends on accurate CSI at the transmitter. With perfect CSI, the base station can precisely steer beams, achieving significant spatial multiplexing gains. However, in practical scenarios, obtaining perfect CSI is challenging due to factors like channel estimation errors, limited feedback, and feedback delays. To mitigate these issues, robust beamforming techniques and adaptive algorithms are employed, allowing the system to dynamically adjust to varying channel conditions and maintain high performance. Therefore, SDMA, represents a powerful method to enhance the capacity and efficiency of modern wireless communication systems, particularly in high-density user environments.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>D. Integration with OFDM</head><p>NOMA schemes can be seamlessly integrated into OFDMbased systems to enhance spectral efficiency and support more users <ref type="bibr">[78]</ref>- <ref type="bibr">[80]</ref>. OFDM's fine-grained frequency division and ability to handle frequency-selective fading and multipath interference make it a suitable platform for NOMA, as subcarriers can be dynamically allocated based on channel conditions. However, the complexity increases due to the need for SIC at the receiver. In what follows, we briefly explain how different NOMA schemes fit into the OFDM structure:</p><p>1) PD-NOMA in OFDM: In PD-NOMA, each OFDM subcarrier carries a superimposed signal from multiple users, instead of a symbol from an individual constellation <ref type="bibr">[78]</ref>- <ref type="bibr">[80]</ref>. This is achieved by using different power levels for each user.</p><p>2) CD-NOMA in OFDM: Instead of using power levels to differentiate users, CD-NOMA assigns specific code sequences to different users <ref type="bibr">[81]</ref>- <ref type="bibr">[83]</ref>, allowing them to share the same OFDM subcarrier. At the receiver, the design of efficient multi-user detection schemes is crucial to separate and decode each user's signal based on the codes.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>III. TRADITIONAL MODULATION DESIGN</head><p>This section investigates the design and evolution of modulation techniques for OMA communication and their extension to NOMA communication. Modulation design, or constellation design, deals with the ways symbols can be arranged in constellations, with a particular focus on achieving a desirable BER. This process can be approached independently of channel coding or in conjunction with it. We will discuss both approaches in this section.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Modulation Design Evolution</head><p>Consider a complex-input, complex-output point-to-point AWGN channel. The channel input s and output y are complex random variables related by</p><p>where &#951; is a complex random variable whose real and imaginary parts are independently and identically distributed (i.i.d.) Gaussian random variables. The capacity of this channel is given by the well-known Shannon formula C = log(1 + SNR) where SNR denotes the signal power over the noise power and C is the capacity measured in nats per channel use.</p><p>The proof for achieving the capacity of this channel utilizes a random codeword formed with i.i.d. Gaussian components. However, employing a Gaussian codeword in practice is impractical, as the decoding would require an exhaustive search throughout all codewords in the codebook to determine the most probable candidate. This has resulted in the adoption of signaling constellations, or digital modulation techniques, such as PSK and QAM, comprised of a finite number of points in the complex plane. Extensive literature exists on the problem of choosing a set of M symbols with in-phase and quadrature components for transmission. Foschini et al. were among the first researchers who studied the signal constellation design that minimizes the probability of error on the AWGN channel under an average power constraint <ref type="bibr">[16]</ref>. Other notable contributions to constellation design can be found in works such as <ref type="bibr">[17]</ref>- <ref type="bibr">[19]</ref>, <ref type="bibr">[84]</ref>.</p><p>A constellation with M symbols can carry a maximum of log 2 M information bits per symbol. The average power of a constellation is defined by the mean of the squares of all symbol amplitudes, i.e., E{|s| 2 }. Consequently, constellations with larger M must position their points in closer proximity to each other <ref type="bibr">[84]</ref>. As such, distinct constellations exhibit different BER versus SNR performance for the same channel. The primary objective of constellation design is to identify configurations with the smallest BER. Another highly relevant parameter in this context is the symbol error rate (SER).</p><p>Constellation design for modulation schemes has a long, rich history. Initial works primarily concentrated on developing constellations that demonstrated both spectral and power efficiency independent of channel coding <ref type="bibr">[84]</ref>, <ref type="bibr">[85]</ref>. While the pursuit of spectral and power-efficient modulation schemes remains as relevant today as it was then, there has been notable improvement and evolution. The notion of treating coding and modulation as a unified entity <ref type="bibr">[86]</ref>, called coded modulation, led to the development of trellis-coded modulation (TCM)</p><p>[87]- <ref type="bibr">[89]</ref> and bit-interleaved coded modulation (BICM) <ref type="bibr">[90]</ref>- <ref type="bibr">[93]</ref> as two prominent examples. Due to its efficiency, BICM is now a standard in various modern communication systems, including WiMax and 4G/5G cellular networks.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Uniform Modulation Schemes</head><p>1) OMA: By adopting OMA as the multiple access scheme, modulation techniques originally designed for point-to-point communication can be used without necessitating major modifications. These modulation methods, such as PSK and QAM, are designed to enhance spectral efficiency (bit per symbol) while maintaining an acceptable BER for a specified power constraint <ref type="bibr">[16]</ref>- <ref type="bibr">[19]</ref>. Various ways that symbols can be arranged in constellations, with a particular focus on achieving a desirable BER, in the absence of channel coding are discussed in <ref type="bibr">[84]</ref>. The paper also studies capacity of those constellations on the AWGN channel assuming optimum coding.</p><p>2) NOMA: Non-orthogonal transmission is known to be optimal for Gaussian BC <ref type="bibr">[40]</ref>, <ref type="bibr">[94]</ref>, <ref type="bibr">[95]</ref>, which is also referred to as NOMA in this paper. We consider the two-user NOMA for illustration. Assume s 1 and s 2 are the signals for User 1 and User 2 and h 1 and h 2 are their corresponding complex channel gains, respectively. The BS broadcasts superimposed signal</p><p>, where P is the BS power and &#945;, 0 &#8804; &#945; &#8804; 1, and &#8113; &#8796; 1 -&#945; are the fractions of total power allocated to the signals of User 1 and User 2, respectively. This is a special case of (1), with K = 2 users where P 1 = &#945;P and P 2 = &#8113;P . The received signal at User k, k &#8712; {1, 2}, is given by</p><p>where &#951; k is the complex noise at User k. For decoding, assuming |h 1 | &#8805; |h 2 |, User 1 (the user with a stronger channel gain) first decodes the other user's message and then uses SIC to decode its message free of interference, whereas User 2 (which has a weaker channel gain) treats the signal of the stronger user as noise. Similar to OMA, achieving the capacity region of NOMA involves the use of random Gaussian codewords, whose decoding is impractical as it requires an exhaustive search over the entire codebook to identify the most probable candidate. In practice, s 1 and s 2 are chosen from a discrete and finitealphabet set like QPSK modulation. This simplification comes with its costs. For example, for certain values of power allocation coefficient &#945;, the constellations of the two users may overlap. In such cases, the mapping is non-bijective and decoding (SIC or maximum-likelihood decoding) with zero error may not be possible. Figure <ref type="figure">14</ref> represents the noiseless superimposed signal when both users use a QPSK constellation. While for &#945; = 0.2, we can draw distinctive detection boundaries (bijective mapping), this is not possible for &#945; = 0.5. The issue stems from predefined constellations (in this case, QPSK) being individually designed for each user, rather than for the transmitted signal, which is a superposition of the signals from both users. Consequently, the overlap of the superimposed constellation has not been taken into account in the constellation design. Existing literature extensively relies on the above adoption. Especially, BER and SER analysis for two-user NOMA over AWGN channel with regular constellations is studied by various researchers. To denote some, exact SER expressions for pulse amplitude modulation and QAM are derived in <ref type="bibr">[96]</ref>. Analytical bit error probability expressions are derived in <ref type="bibr">[97]</ref> when both users employ a QPSK constellation and <ref type="bibr">[98]</ref> for QAM. Similarly, BER expressions for binary phaseshift keying (BPSK) modulation are studied in <ref type="bibr">[99]</ref>. SER and BER in NOMA with rotated constellations are studied in <ref type="bibr">[100]</ref>. BER of NOMA with BPSK and QPSK modulations over fading channels was studied in <ref type="bibr">[101]</ref>. A list of papers evaluating NOMA can be found in <ref type="bibr">[31]</ref>.</p><p>Despite their findings, these studies have also contributed to one of the most prevalent misconceptions in NOMA literature. This is the myth that a user with a smaller channel gain should receive higher power. Specifically, many papers have assumed that in a two-user NOMA, &#945; should be smaller than 0.5, where &#945; is the fraction of total power allocated to the user with a stronger channel. While this misconception has been debunked in <ref type="bibr">[102]</ref> based on Gaussian inputs, this has also been shown to be incorrect even in the finite-alphabet case by several independent works <ref type="bibr">[103]</ref>- <ref type="bibr">[105]</ref>.</p><p>The goal of the above approach is to prevent super-symbol overlapping when utilizing uniform constellations by adjusting the power allocation coefficient and decoding. However, it has long been known that even using uniform constellations that result in overlapping (non-bijective) or partially overlapping super-symbols, it is still possible to decode both NOMA users' information with carefully designed turbo channel coding and iterative decoding <ref type="bibr">[106]</ref>. Specifically, when employing BICM with iterative decoding, non-bijective mapping may even outperform bijective ones, as elucidated in Myth 1 in <ref type="bibr">[106]</ref> and other works <ref type="bibr">[107]</ref>- <ref type="bibr">[109]</ref>.</p><p>Before proceeding to non-uniform constellation design, we would like to emphasize the above discussions in the following remarks.</p><p>Remark 1: Drawing general conclusions about NOMA power allocation solely based on a specific type of modulation and decoding is not appropriate and may lead to misconceptions. For instance, as discussed in <ref type="bibr">[105]</ref>, even within a given modulation scheme like QAM, successive interference cancellation and maximum likelihood decoding suggest different acceptable values for the power allocation coefficient &#945;.</p><p>Remark 2: By leveraging bit-interleaved coded modulation with iterative decoding, it is not necessary, or even favorable, to design non-overlapping super-symbols, as detailed in [106, Myth 1] and other references therein. This clarification indicates that there may not be a strict constraint on the power allocation coefficient in NOMA, even within the framework of finite-alphabet inputs. This fact is well-established theoretically and has been noted independently in [102, <ref type="bibr">Myth 1]</ref>.</p><p>So far, we have established that within the framework of finite-alphabet NOMA, it is not mandatory to allocate power in the reverse order of the users' channel gains, contrary to the common belief in many NOMA papers. Specifically, in a two-user NOMA scenario, &#945; &#8804; 0.5 is not a necessity. Such misconceptions arise from limiting assumptions, such as using uniform constellations with SIC decoding, without exploring other established techniques like BICM and maximum likelihood decoding.</p><p>With this clarification in mind, in the next subsection, we introduce a less-discussed topic in NOMA literature. Namely, we shift the focus from employing predefined, uniform constellation schemes for NOMA users to exploring the possibility of designing/employing non-uniform constellations for NOMA users. The goal of designing non-uniform constellations for NOMA users is to ensure that the resulting super-constellation provides a bijective mapping from the beginning.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>C. Non-Uniform Modulation</head><p>The constellations discussed in the previous subsection exhibit regular shapes. In such a design, the emphasis is almost exclusively on constructing large set of symbols in 2D space with the objective of maximizing the minimum Euclidean distance between them. 3 For example, the conventional uniform QAM employs signal points on a regular orthogonal grid. In contrast, non-uniform constellations, by relaxing this constraint, provide an added shaping gain that facilitates reception, even under lower SNRs <ref type="bibr">[109]</ref>, <ref type="bibr">[117]</ref>, <ref type="bibr">[118]</ref>. Nonuniform constellations have found recent applications in digital broadcasting systems.</p><p>1) OMA: Much of the work in non-uniform constellation is related to point-to-point channels trying to improve the performance at lower SNRs <ref type="bibr">[109]</ref>, <ref type="bibr">[117]</ref>, <ref type="bibr">[118]</ref>. Nonuniform constellation has been particularly important in the context of video btoadcasing and is included in standards like digital video broadcasting terrestrial (DVB-T) and advanced television systems committee (ATSC). The utilization of nonuniform constellations in MIMO channels, along with two signal processing algorithms for MIMO precoding, can be found in <ref type="bibr">[119]</ref>. Using a different approach, guessing random additive noise decoding is recently used to design non-uniform 3 It is worth noting that the concept of 2D modulation can be extended to multi-dimensional modulation in various ways <ref type="bibr">[110]</ref>- <ref type="bibr">[114]</ref>. In MIMO systems, spatial modulation utilizes both the antenna index and complex symbols to form a 3D constellation for efficient information transmission <ref type="bibr">[111]</ref>. Other forms of multi-dimensional modulation are explored in <ref type="bibr">[112]</ref>- <ref type="bibr">[116]</ref>.</p><p>constellations based on channel statistics <ref type="bibr">[120]</ref>. This approach is code-agnostic, focusing on identifying the impact of noise on the data.</p><p>2) NOMA: As highlighted in Remark 2, non-uniform constellations combined with BICM hold potential advantages in NOMA. While it has been noted that superposition of individual uniform constellations generally leads to non-uniform constellations <ref type="bibr">[100]</ref>, <ref type="bibr">[101]</ref>, <ref type="bibr">[121]</ref>- <ref type="bibr">[123]</ref>, there has not been an explicit exploration of this opportunity in the existing literature of NOMA. The related topic of hierarchical modulation, which involves constellations with non-uniformly spaced signal points, holds a more prominent presence <ref type="bibr">[124]</ref>- <ref type="bibr">[126]</ref>. Constellation rotation <ref type="bibr">[127]</ref>- <ref type="bibr">[129]</ref> is another related topic in this area. In fact, the idea of rotating constellations goes back to the design of &#960;/4-QPSK for differential coherent detection in <ref type="bibr">[130]</ref>. To avoid higher dimensional constellation points collapsing on top of each other and losing diversity, the idea of constellation rotation has been used extensively in the spacetime coding literature <ref type="bibr">[131]</ref>- <ref type="bibr">[133]</ref>. Although the hierarchical modulation and constellation rotation methods may help overcoming rate losses observed in NOMA implementation <ref type="bibr">[14]</ref>, the design and implementation of non-uniform constellations with BICM could be transformative.</p><p>It should be noted that BICM improves error performance of uncoded modulation but presents implementation challenges, especially in low-resource IoT devices. The increased receiver complexity, including de-interleaving and soft-decision decoding, along with higher computational demands and power consumption-particularly when using iterative decoding methods like turbo codes-are significant hurdles. These processes also introduce processing delays and potential latency, crucial considerations in real-time communication systems.</p><p>Despite these hurdles, BICM implementation in IoT devices is achievable with a focus on efficient algorithms, hardware support, and complexity-performance tread-off. Optimized algorithms for interleaving and decoding, energy-efficient errorcorrecting codes tailored for low-power operation, and hardware accelerators for key functions can effectively mitigate these challenges. For example, narrowband IoT demonstrates a practical application of BICM for low-resource devices, employing simplified modulation and coding strategies to enhance reliability while efficiently managing complexity <ref type="bibr">[134]</ref>, <ref type="bibr">[135]</ref>. We will discuss such possible avenues in Section VII-A.</p><p>IV. CHANNEL CODING Channel coding, also referred to as forward error correction, is an indispensable component of today's wireless communications. These coding techniques introduce redundancy into transmitted data using error correction and error detection codes. The redundancy enables receivers to detect and correct errors that may occur during transmission, thereby enhancing the robustness of communication systems and approaching the capacity limits of wireless channels <ref type="bibr">[89]</ref>. It improves cellular network performance metrics such as reliability, throughput, coverage, spectral efficiency, and energy efficiency <ref type="bibr">[136]</ref>. In addition, channel coding ensures reliable data transmission by detecting and correcting errors caused by noise and interference, reducing the need for re-transmission and as a result improving the throughput and latency. It also extends coverage by enhancing data transmission over longer distances and under challenging radio conditions.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Channel Coding Evolution from 2G to 5G</head><p>Channel coding has evolved significantly since its inception. In early digital communications, simple error detection methods like parity checks were used, while Hamming codes were among the first error-correcting codes applied in computer systems and data protocols. With the advent of more sophisticated coding schemes like Bose-Chaudhuri-Hocquenghem (BCH) and Reed-Solomon codes <ref type="bibr">[137]</ref> in the 1960s and convolutional codes <ref type="bibr">[138]</ref> in the 1970s, error correction capabilities improved substantially. The development of turbo codes <ref type="bibr">[139]</ref> and low-density parity-check (LDPC) codes <ref type="bibr">[140]</ref>, <ref type="bibr">[141]</ref> in the 1990s represented a significant advancement, achieving performance close to the theoretical limits derived by Shannon. Polar codes <ref type="bibr">[142]</ref>, introduced in 2009, further advanced the field by offering provably capacity-achieving performance with low-complexity encoding and decoding algorithms.</p><p>Channel coding has been integral to digital communication in cellular networks since the emergence of 2G technology. With each generation of cellular technology, channel coding has advanced and evolved to meet the increasing demands for reliability and efficiency in data transmission. In addition, different channel coding techniques have been adopted for control channels and data/traffic channels because of their distinct requirements. Specifically, channel coding for control channels emphasizes robust error detection and correction to ensure reliable transmission of critical signaling information with minimal latency, while coding for data/traffic channels prioritizes maximizing throughput and spectral efficiency.</p><p>The tree diagram in Fig. <ref type="figure">15</ref> illustrates the channel codes employed in each generation of cellular systems for both control and traffic channels. A more detailed explanations follows.</p><p>&#8226; 2G: Convolutional codes were introduced to enhance the reliability of control signaling, improve the quality of digital voice, and support text messaging. &#8226; 3G: More advanced coding techniques like turbo codes were introduced for data channels, providing better error correction capabilities and supporting higher data rates for mobile internet access. Despite the above change, 3G systems use convolutional codes for control channels. &#8226; 4G: Turbo codes with enhancements like hybrid automatic repeat request (HARQ) and flexible rate matching were used. By combining error correction coding with re-transmissions, 4G LTE improves data reliability and efficiency. Further, coding schemes in 4G are more adaptable to varying channel conditions and user requirements, allowing for better utilization of available bandwidth and more efficient data transmission. &#8226; 5G: LDPC codes and polar codes are used for data channels and control channels, respectively, offering enhanced performance for a wide range of applications, including IoT, autonomous driving, and URLLC communications. LDPC codes were adopted for data channels due to their superior performance in error correction and lower decoding complexity at high data rates.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Channel Coding and Modulation in 5G &amp; 6G</head><p>Here, we first present some details of the channel coding and modulation techniques used in 5G systems. We then discuss possible road maps for the channel coding in 6G.</p><p>1) Channel Coding and Modulation in 5G: In wireless systems, a physical channel refers to the actual medium via which data is transmitted between the network and the UE. In total, there are six physical channels in 5G NR, as described below:</p><p>-Physical downlink control channel (PDCCH): Transmits downlink control information. -Physical uplink control channel (PUCCH): Carries uplink control information. -Physical broadband channel (PBCH): Handles UE synchronization and broadcasts essential system information. -Physical random access channel (PRACH): Ensures initial network access and resource allocation. -Physical downlink shared channel (PDSCH): Transports downlink data payloads. -Physical uplink shared channel (PUSCH): Transmits uplink data payloads. Two of these channels, PDCCH and PUCCH, carry most of the control information in the downlink and uplink, respectively. PBCH and PRACH are responsible for tasks such as UE synchronization initial access to the network. As their names suggest, PBCH and PRACH are downlink and uplink channels, respectively. Actual information payloads, including text messages, audio/video call data, and web streams, are transmitted over shared channels PDSCH and PUSCH, for downlink and uplink, respectively.</p><p>TABLE II: Channel Coding and Modulation in 5G Physical Channels Channel Coding Modulation Adaptive PBCH Polar QPSK &#10007; Downlink PDCCH Polar QPSK &#10007; PDSCH LDPC M -QAM * &#10003; PRACH --&#10007; Uplink PUCCH Polar BPSK, QPSK &#10007; PUSCH LDPC M -QAM * &#10003; * M &#8712; {4, 16, 64, 256, 1024}</p><p>Table <ref type="table">II</ref> summarizes the channel coding and modulation used in each channel. PRACH does not employ channel coding or modulation; instead, it utilizes Zadoff-Chu sequences for tasks such as initial access synchronization, random access, uplink control information, and channel sounding <ref type="bibr">[143]</ref>. The table also indicates whether adaptive coding and modulation is used in each channel. An index, called modulation and coding scheme (MCS) index, determines how data is encoded and modulated before transmission and thus decides on the number of useful bits per symbol based on radio signal quality. A higher signal quality allows sending more data per symbol. MCS index selection depends on radio conditions and block error rate (BLER). This is determined by a quantity called channel quality indicator and is adjusted dynamically by the BS. 5G NR supports M -QAM modulation with M &#8712; {4, 16, 64, 256, 1024} for the PDSCH <ref type="bibr">[144]</ref>. There are 32 MCS indices (0-31), with indices 29-31 reserved for re-transmissions. 3GPP Specification 38.214 <ref type="bibr">[144]</ref> has provided four tables for PDSCH MCS indices. With M -QAM modulation and target coding rate R, the spectral efficiency of transmission is given by r log 2 M . The highest spectral efficiency is achieved with 1024-QAM and a coding rate r = 948 1024 , resulting in 9.2578 bits per symbol. The lowest spectral efficiency is achieved with 4-QAM (QPSK) and a coding rate r = 30 1024 , resulting in 0.0586 bits per symbol. It also worth noting that the NR LDPC coding process encompasses several stages <ref type="bibr">[143]</ref> such as code block segmentation, cyclic redundancy check attachment, LDPC encoding, rate matching, and bit interleaving. These steps collectively ensure robust and efficient data transmission in 5G networks.</p><p>2) Channel Coding Road to 6G: As wireless systems evolve toward 6G, latency requirements become increasingly stringent. Particularly, URLLC is one of the main use cases of 5G and beyond, envisioned to bridge the digital and physical realms. This ensures that a given data packet will be delivered within a very short time frame, such as in the order of 1 ms, and with a very high reliability, e.g., 99.999% <ref type="bibr">[1]</ref>. End-to-end delay consists of three components: 1) the access delay, 2) the computation delay, and 3) the transmission delay <ref type="bibr">[145]</ref>, <ref type="bibr">[146]</ref>. These tasks involve transmitting essential data and performing computations on both ends, such as compressing data at one end and decompressing it at the other end as well as channel encoding and decoding. These elements form a latency budget that must meet strict real-time requirements.</p><p>This has led to the exploration of channel coding techniques with shorter block lengths and reduced complexity. Ar&#305;kan <ref type="bibr">[147]</ref> made significant progress in this direction by introducing polarization-adjusted convolutional codes and demonstrating that a convolutional pre-transformation can effectively enhance BLER for short codes with sequential decoding <ref type="bibr">[148]</ref>. There has been several recent improvements including deep learningbased polar codes <ref type="bibr">[149]</ref>. Polar codes are not the only family of codes being studied for low-delay communication. Various other channel codes, including LDPC, turbo, and convolutional codes, have been considered <ref type="bibr">[150]</ref>. Other channel codes, such as analog fountain codes, analog BCH codes, and Raptor codes could also be considered toward this goal.</p><p>Finally, we should emphasize that channel codes are developed independently of whether multiple access techniques are orthogonal or non-orthogonal. Current research in the intersection of NOMA and channel coding predominantly focuses on evaluating NOMA performance using specific channel codes tailored for this purpose. Additionally, joint source-channel coding represents another approach for achieving low-latency communication, as explored in studies such as <ref type="bibr">[151]</ref>- <ref type="bibr">[153]</ref>. </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>C. Trellis-Coded Modulations</head><p>1) OMA: The main idea behind TCM is to combine coding and modulation to increase the coding gain <ref type="bibr">[154]</ref>. To achieve this, a given constellation is expanded and then partitioned into a hierarchy of subsets with increasing minimum Euclidean distances. For encoding, a trellis, representing a finite-state machine, decides which subset should be used to maximize the coding gain at each time slot. TCM's finite-state machine and underlying modulation can be represented using convolutional codes as well. The main design challenge is how to assign subsets to trellis paths to maximize the coding gain. A TCM codeword includes a sequence of transmitted symbols chosen through the finite-state machine and the corresponding set partitioning. The free distance between two possible codewords is defined as the squared Euclidean distance between the two coded sequences. Analogous to how the minimum Euclidean distance determines the performance of a modulation scheme, the minimum free distance specifies the performance of a TCM <ref type="bibr">[154]</ref>.</p><p>Figures <ref type="figure">16</ref> and <ref type="figure">17</ref> show an example of a 4-state TCM using the 8-PSK constellation. For maximum-likelihood decoding, the Viterbi algorithm is utilized to find the most likely valid path, a path starting at State 0 and merging to State 0, and the corresponding transmitted bits <ref type="bibr">[154]</ref>.</p><p>2) NOMA: In this section, we discuss trellis-coded multiple access (TCMA) <ref type="bibr">[155]</ref> and trellis-coded NOMA (TC-NOMA) <ref type="bibr">[156]</ref>. While the main principles work for any number of users, for the sake of brevity, we focus on a downlink system with two users. Also, while the choice of TCM for each user can be different, we utilize the TCM encoder in Fig. <ref type="figure">17</ref> for both users. The BS modulates the input bits using TCM to generate two sets of symbols, a 1 (n) and a 2 (n) for Users 1 and 2, respectively. To improve the performance, a 2 (n)'s constellation can be rotated, for example by the optimal rotation &#960; 8 <ref type="bibr">[156]</ref>. In TCMA, the encoder superimposes the two outputs a 1 (n) and a 2 (n) by transmitting a 1 (n) + a 2 (n). On the other hand, similar to P-NOMA, TC-NOMA superimposes the symbols of different users on different power levels and transmits &#8730; P 1 a 1 (n)+ &#8730; P 2 a 2 (n). Optimal power allocation can be done to maximize the resulting minimum free distance <ref type="bibr">[87]</ref> and maintain a constraint on the total power, i.e., P 1 + P 2 &#8804; P To recover the transmitted bits, the receiver can employ</p><p>Signal Mapping T T &#119904; 1 (&#119899;) &#119904; 0 (&#119899;) &#119911; 0 (&#119899;) &#119909; 1 (&#119899;) &#119911; 1 (&#119899;) &#119909; 2 (&#119899;) &#119909; 2 (&#119899;) &#119886;(&#119899;) 00 01 10 11 00 01 10 11 00/000 10/100 00/001 10/101 Time n n+1 &#119904; 1 &#119904; 0 &#119909; 2 &#119909; 1 /&#119909; 2 &#119911; 1 &#119911; 0 (a) 1 000 001 010 011 100 101 101 111 (b) Fig. 17: Encoder and bit-mapping for an 8-PSK 4-state TCM. (a) Trellis representation of 8-PSK 4-state TCM, (b) The mapping of 8-PSK constellation.</p><p>SIC to separately decode the two bit streams, as discussed for P-NOMA before. In other words, the stronger user, User 1, decodes the symbols of the weaker user, User 2, and cancels the corresponding interference before decoding its own bits. The weaker user decodes its signal by assuming the symbols of the other user as noise.</p><p>In addition, joint detection using tensor product of trellises is also possible and greatly improves the performance <ref type="bibr">[156]</ref>. First, we describe the underlying tensor product of trellises <ref type="bibr">[157]</ref>, <ref type="bibr">[158]</ref> which is essential for the joint detection. Let us assume User 1 in TC-NOMA utilizes trellis T 1 with r 1 states S (1)</p><p>Then, the tensor product T 1 &#8855;T 2 is defined as a (r 1 &#215;r 2 )-state trellis with states</p><p>and only if transitions 0000 0001 0010 0011 0100 0101 0110 0111 1000 1001 1010 1011 1100 1101 1110 1111 0000 0001 0010 0011 0100 0101 0110 0111 1000 1001 1010 1011 1100 1101 1110 1111 &#119878; ( <ref type="formula">1</ref>) &#119878; (2)   Fig. <ref type="figure">18:</ref> The underlying 16-state trellis generated by the tensor product of two 4-state trellises. from S</p><p>(1) i to S (1) k and from S</p><p>(2) j to S</p><p>(2) l exist in T 1 and T 2 , respectively. A similar definition for the tensor product of more than two trellises works as well. Figure <ref type="figure">18</ref> shows the tensor product of the trellis in Fig. <ref type="figure">17</ref>  TC-NOMA is equivalent to a TCM using the trellis in Fig. <ref type="figure">18</ref> including transitions from S</p><p>(1) i S (2) j to S (1) k S</p><p>(2) l with the superimposed symbol &#8730; P 1 a 1 + &#8730; P 2 a 2 . Therefore, the Viterbi algorithm can be applied to the equivalent TCM for joint detection. Overall performance of the TC-NOMA can be optimized by appropriate power allocation. Additional fairness criteria can be included as well <ref type="bibr">[156]</ref>. Figure <ref type="figure">19</ref> depicts BER vs SNR of different NOMA systems using 8-PSK and the 4-state TCM encoder in Figs. <ref type="bibr">16 17</ref> for P 1 = 0.3, </p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>V. MACHINE LEARNING-BASED MODULATION DESIGN</head><p>A common characteristic of modulation techniques, as discussed in Section III, is their initial design with point-topoint communication in mind. These techniques are characterized by predefined, inflexible symbols, and their constellation shaping is oblivious to interference. In contrast, one notable shift in today's communication systems, especially in cellular networks, is the move from point-to-point to multi-user communication with multiple transmitters. With this paradigm shift, additional challenges are presented because inter-user and inter-cell interference are becoming crucial factors in modern communication system design. Despite this evolution, constellations designed decades ago are still employed and interference is often addressed through orthogonal resources or by treating interference as noise. Machine learning, particularly its rapidly advancing subset, deep learning <ref type="bibr">[159]</ref>, is becoming a cornerstone of communication systems <ref type="bibr">[160]</ref>. In general, integrating artificial intelligence (AI) into cellular networks has already been initiated with 5G Advanced, and AI is anticipated to play a pivotal role in shaping 6G networks <ref type="bibr">[161]</ref>. Machine learning has found different applications across various settings in modulationrelated challenges. For instance, machine learning techniques have been effectively applied in automatic modulation classification <ref type="bibr">[162]</ref>- <ref type="bibr">[165]</ref> and constellation design <ref type="bibr">[166]</ref>- <ref type="bibr">[170]</ref>, among other applications.</p><p>In this section, we introduce a novel approach to constellation design for NOMA, termed interference-aware constellation design. We aim to design NOMA super-constellations that inherently account for inter-user interference, as opposed to relying on traditional interference-oblivious constellations and attempting to eliminate inter-user interference at the receiver. The objective is to achieve the most distinguishable superimposed symbols for any power allocation such that decoding can be completed without needing SIC. The main performance metric is BER but simplicity of the decoding is as critical.</p><p>Designing interference-aware super-constellations using traditional methods is very challenging because the constellation shape at the transmitters needs to be adjusted depending on the interfering signal. Otherwise, symbols of superimposed constellations may overlap, as shown in Fig. <ref type="figure">14</ref>, which is not desired. AI-based approaches appear as an alternative. Our interference-aware constellation design is essentially an end-to-end autoencoder (AE)-based communication. This is because unlike noise, interference has a structure and autoencoders are useful in exploiting structures in data <ref type="bibr">[171]</ref>, <ref type="bibr">[172]</ref>. In addition AE-based E2E communications simplifies blockby-block communication. Before delving into the details of the design, we will motivate it by explaining the significance of SIC-free decoding in NOMA and provide an introduction to AE-based E2E communication in the following two subsections.</p><p>A. SIC-Free NOMA Superposition coding with successive interference cancellation (SC-SIC) is an optimal approach for achieving the capacity region of the downlink NOMA. In the proof, Gaussian inputs are employed as optimal codewords. Inspired by this theoretical result, SC-SIC is then applied to scenarios where inputs have finite-length and uses finite-alphabet modulations, such as QAM <ref type="bibr">[12]</ref>, <ref type="bibr">[14]</ref>, <ref type="bibr">[96]</ref>, <ref type="bibr">[98]</ref>. However, as discussed earlier, it is important to note that such an application may not necessarily uphold the same theoretical assertions <ref type="bibr">[103]</ref>, <ref type="bibr">[173]</ref>. At times, this discrepancy has contributed to misconceptions such as the belief that users with smaller channel gains should receive higher power allocations <ref type="bibr">[96]</ref>, <ref type="bibr">[98]</ref>, <ref type="bibr">[174]</ref>. Although these misconceptions have been disproved in <ref type="bibr">[102]</ref>, the subtle distinctions between these theoretical premises and the practical finite-alphabet inputs are of significant importance <ref type="bibr">[105]</ref>. We use the distinctions between the two premises-theoretical and finite-alphabet-to motivate SIC-free decoding in the following.</p><p>Particularly, while theoretically feasible to achieve successful decoding for both users at any &#945; &#8712; [0, 1] [102], the use of a finite-alphabet input, such as a QAM constellation, requires careful selection of the &#945; value, as noted in Remark 1 in Section III-B. Thus, using SIC decoding with finite-alphabet inputs limits power allocation choices, and thereby the possibility of achieving the entire capacity region.</p><p>In scenarios involving finite-alphabet inputs, the research community has explored alternative approaches to bypass the need for SIC. Examples of these approaches are the utilization of lattice-based techniques <ref type="bibr">[175]</ref>, index modulations <ref type="bibr">[104]</ref>, and maximum-likelihood decoding <ref type="bibr">[105]</ref>. These approaches may collectively be referred to as SIC-free decoding. They may offer several advantages such as outperforming SIC with finitealphabet inputs succeed in cases where SIC falls short. In our experimental work <ref type="bibr">[14]</ref>, we conclude that addressing constellation overlapping or finding a better way to implement or bypass SIC is necessary for advancing NOMA as a practical technology. The end-to-end NOMA introduced in this paper is an effective SIC-free NOMA, empowered by autoencoders.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Autoencoder-Based E2E Communication</head><p>1) A Primer to Autoencoders: An AE is a learning technique employed to discover a low-dimensional representation of the input data. In words, it creates a layer that has less features than the input layer. It first compresses (encodes) its input data into a lower dimension and then makes use of this lower dimensional representation to recreate (decode) the original data <ref type="bibr">[176]</ref>. While autoencoders compress the input via unsupervised learning, autoencoders are used to improve system performance through a training process that tries to minimize the reconstruction error-the difference between the input and reconstructed data.</p><p>The loss function quantifies the difference between actual and predicted values. Binary cross-entropy is the most common loss function used in classification problems. It treats each element of the AE output as a zero/one classification task. For a training sample y, the cross-entropy loss is expressed</p><p>, where y i is the ith element of the training sample y, n is the length of the vector y, and p i is the predicted probability corresponding to y i . The value of p i is obtained by passing &#375;i through a sigmoid activation function <ref type="bibr">[159]</ref>, where &#375;i is the predicted value of y i . To go from the loss for one training sample to the batch loss L, the average over a batch of samples is calculated. Alternatively, one may use the mean squared error loss function, which calculates the squared differences between the actual and predicted values, i.e., n i=1 (y i -&#375;i ) 2 , where y i and &#375;i are respectively the actual and predicted values for the i-th training sample. Again, we average the above value over multiple training samples to get the batch loss L.</p><p>2) Related Works: Utilizing autoencoders for end-to-end communication is a novel concept with significant potential for improving BERs. This approach provides a fresh alternative to the prevalent block-by-block design philosophy in contemporary communication systems <ref type="bibr">[22]</ref>- <ref type="bibr">[24]</ref>, <ref type="bibr">[171]</ref>, <ref type="bibr">[177]</ref>- <ref type="bibr">[179]</ref>. As illustrated in Fig. <ref type="figure">20</ref>, the concept is simple, mirroring the fundamental objective of digital communication, i.e, reliably transmitting a maximum number of bits. Notably, this approach outperforms state-of-the-art MIMO precoders in terms of BER both with and without the channel's knowledge <ref type="bibr">[22]</ref>- <ref type="bibr">[25]</ref>. In <ref type="bibr">[25]</ref>, an end-to-end communication is designed for the point-to-point MIMO channel when the autoencoder learns from the CSI and the transmitted symbols to eliminate the interference at the receiver and estimate the transmitted symbols with small errors. The result is remarkable as the autoencoder system exceeds the performance of the well-known singular value decomposition (SVD)-based MIMO for practical SNRs. The gain is attributed to two factors: 1) AE optimizes the transmission based on finitealphabet, finite-length inputs whereas SVD is designed for infinite-length Gaussian inputs, and 2) AE enjoys the freedom of non-uniform constellation shapes and is not limited to regular constellations.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>C. Interference-Aware Constellation Design</head><p>Leveraging autoencoders is a promising approach for crafting a super-constellation with distinguishable symbols, regardless of power allocation coefficients among NOMA users. This contrasts with the traditional methods where each NOMA user is assigned a predefined constellation (e.g., QPSK), and the superposition of these constellations forms the superconstellation. As mentioned earlier, in the conventional approach, the effectiveness of NOMA may be limited due to potential symbol overlap in the super-constellation, depending on power allocation. In contrast, AE-based approach anticipates interference during the design phase, resulting in constellations with a robust minimum distance, regardless of power allocation. Crucially, this enables SIC-free decoding, particularly vital for resource-limited devices.</p><p>1) IUI-Aware NOMA: Shifting from the traditional blockby-block approach to end-to-end communication allows for the simultaneous design and optimization of all components, encompassing constellation design. This innovative approach has the potential to substantially reduce BERs. While prior studies <ref type="bibr">[22]</ref>- <ref type="bibr">[25]</ref> establish a foundation for autoencoder-based endto-end communication, they primarily concentrate on pointto-point transmissions.</p><p>Extending these findings to point-to-multipoint scenarios like NOMA poses challenges. The involvement of multiple receivers in NOMA immediately makes the problem more involved as each receiver is only interested in its own message and will have its own loss. Hence, in the training process, each receiver will only feed back the loss of its own message to the transmitter. The first difficulty here is that the transmitter needs to process the losses and adjust its transmission accordingly. The two NOMA users have conflicting interests as they both want to exploit the shared environment for their own benefits. Using autoencoders for communication over NOMA channels in different settings is studied in <ref type="bibr">[177]</ref>- <ref type="bibr">[179]</ref>.</p><p>A conceptual architecture of the two-user NOMA network implemented autoencoder is depicted in Fig. <ref type="figure">21</ref>. The encoder and decoders include batch normalization and fully-connected neural networks (FCNNs). The channel state information (h 1 and h 2 ) are given to the encoder but they are known only in the corresponding decoders. This end-to-end systems will be trained and tuned to minimize the difference between transmitted (s k ) and received symbols (&#349; k ), k = 1, 2. As a result, a super-constellation (containing the superposition of the desired and interfering symbols) will be formed. A welldesigned and well-trained system will get separable superconstellation symbols.</p><p>AE designs typically consist of stacked FCNNs, which may not be suitable for structures resembling SIC. While constructing SIC-like AE structures is feasible, our specific design, as illustrated in Fig. <ref type="figure">21</ref>, aims to achieve SIC-free decoding for NOMA. The key objective is to devise an AE structure capable of creating a super-constellation with distinguishable symbols for any given power allocation coefficient &#945;.</p><p>Assume that both users want to transmit 2 bits/symbol, i.e., s k has two bits each, as illustrated in Fig. <ref type="figure">21</ref>. Then, unlike Fig. <ref type="figure">14</ref>, where each user wants to make its own maximum separable 4-symbol constellation and superimpose them, the AE will be trained to build a maximum separable 16-symbol superconstellation. The primary focus is on the shape of the superconstellation rather than that of the individual constellations. An illustrative instance of such AE-generated constellations is presented in Fig. <ref type="figure">22</ref>. It is noteworthy that these AE-generated constellations vary for each power allocation coefficient &#945;, thereby introducing a dynamic and intelligent aspect to the system. Also, in contrast to traditional constellations such as a fixed 16-QAM constellation, which has inflexible and predefined shapes, the AE has the capability to generate an extensive array of constellations. This brings a new shaping gain and lowers BERs at both users, thereby optimizing performance in diverse scenarios.</p><p>The performance of the AE-generated NOMA constellation is evaluated next. The AE-NOMA network is trained with h 1 = 1, h 2 = 2, SNR 1 = 10dB, and a loss weight of 10. The noise powers at both receivers are the same. The testing results are shown in Fig. <ref type="figure">23</ref>. Not surprisingly, the E b /N 0 requirement for this scenario is much higher than that of QPSK modulation transmitted over a point-to-point AWGN channel where the BER is given by P e = 1 2 erfc( E b /N 0 ) <ref type="bibr">[180]</ref>. However, comparing the BERs obtained in Fig. <ref type="figure">23</ref> with those obtained using QPSK constellations for a two-user NOMA channel, as seen in the literature such as in <ref type="bibr">[181,</ref><ref type="bibr">Fig. 7]</ref>, reveals that the AE-designed constellations demonstrate much lower BERs at the same E b /N 0 . This improvement arises because the combination of two QPSK constellations can lead to overlapping symbols, which degrades the BER, as previously discussed. In contrast, our constellations are specifically designed to have distinct symbols, thereby reducing the likelihood of symbol overlap and improving BER performance.</p><p>We should also emphasize that, similar to traditional constellations, the learned modulations use a finite set of symbols during each transmission. The key difference is that the symbols may vary from one transmission to another depending on the channel gains. Such a change in constellation symbols poses a significant challenge for traditional decoders, as they need to be informed about the constellation's position each time. However, in the AE-NOMA, this process is handled internally, eliminating the need to inform the decoder about the transmitters' constellation. This is because the transmitter and receivers (Tx, Rx1, and Rx2 autoencoders in Fig. <ref type="figure">21</ref>) are trained jointly and can handle those variations.</p><p>Numerous research questions revolve around the network's construction and training and the definition of the overall loss function. One possible loss function in Fig. <ref type="figure">21 is L</ref> </p><p>where L 1 and L 2 are the losses at Rx1 and Rx2, respectively, and w 1 and w 2 are their weights <ref type="bibr">[177]</ref>. In a previous work on a point-to-point channel <ref type="bibr">[25]</ref>, we trained the network for SNR 1 = 10dB and tested it across various SNRs. Given the complexity of the current problem, training multiple AEs, each tailored for distinct SNR or &#945; ranges, may be necessary. Since this problem is more involved, we may need to train multiple AEs each for a certain range of SNR or &#945;. We note that despite the time-consuming nature of training, it is performed offline and the developed model can be used for real-time over-the-fly tests, thereby minimizing computational demands during operational use.</p><p>It is worth noting that the goal of the above network is to achieve the most distinguishable super-constellation. This approach may not be the best as emphasized in Remark 2, in Section III-B. Specifically, a potentially improved BER could be achieved by adopting BICM with iterative decoding which allows for overlapping super-symbols. An example of such a design can be found in <ref type="bibr">[182]</ref> within the context of pointto-point channels. It is also worth indicating that AE-based NOMA is expected to outperform traditional designs in terms of latency, as E2E communication is generally faster than the block-by-block approach. Fig. 22: An example illustrating AE-generated constellations. (top) constellations of the two transmitters. (bottom) superconstellation at the receivers (representing a noisy version of the superimposed constellation at the transmitter). Each color represents a combination of one symbol of user 1 with four distinct symbols from user 2. 2) ICI-Resilient NOMA: Thus far, our discussion has centered on single-cell NOMA transmission where the spectrum is distributed among multiple users within a single cell. However, contemporary cellular networks operate in a multi-cell environment, aiming to reuse the same frequency band across many or all cells to enhance spectral efficiency. This shared frequency allocation leads to inter-cell interference, causing outages at cell edges and posing a substantial challenge to achieving high throughput <ref type="bibr">[183]</ref>- <ref type="bibr">[185]</ref>. The issue is worsened by the three-dimensional expansion driven by UAVs <ref type="bibr">[185]</ref>. The design of inter-cell interference-aware NOMA becomes notably complex in light of these evolving complexities within contemporary cellular networks.</p><p>While multi-cell NOMA has been studied in many papers <ref type="bibr">[186]</ref>- <ref type="bibr">[190]</ref>, these are based on Shannon-theoretic principles and are not directly applicable to finite-alphabet inputs. Recent works on end-to-end communication in interference channels focus on OMA <ref type="bibr">[191]</ref>- <ref type="bibr">[193]</ref>, often comparing their results with basic baselines like QPSK. However, as discussed before, it is known that when both users employ QPSK constellations, BER performance is notably inferior to that where one user employs rotated QPSK. Thus, those comparisons are less competitive in terms of BER. In contrast, the approach in <ref type="bibr">[194]</ref>, <ref type="bibr">[195]</ref> leverages the structure of co-channel interference through interference-resilient constellations adaptable to various interference regimes and topologies, resulting in minimized BER. Indeed, the above works also focus on addressing Z-interference channel <ref type="bibr">[196]</ref>, also known as onesided interference channel, in which only one of the users suffers from interference. Therefore, a significant gap exists in the literature regarding constellation design for multi-cell NOMA.</p><p>3) Modulation for MIMO Channels: MIMO techniques can be broadly divided into open-loop and closed-loop systems <ref type="bibr">[53]</ref>. In an open-loop system, there is no feedback from the receiver to the transmitter regarding the channel conditions. Hence, modulation symbols can be directly assigned to different transmit antennas without CSI at the transmitter. We have already covered the modulation techniques used for data channels in 5G NR in Section IV-B and summarized them in Table <ref type="table">II</ref>. Advanced open-loop MIMO techniques, such as space-time block coding <ref type="bibr">[197]</ref>- <ref type="bibr">[199]</ref>, have been used as modulation schemes for MIMO channels in various scenarios <ref type="bibr">[200,</ref><ref type="bibr">Chapter 4]</ref>. These techniques are particularly useful for achieving spatial diversity and improving signal reliability. Space-time block coding is the main block in constructing multiple-antenna differential modulation schemes <ref type="bibr">[201]</ref>, <ref type="bibr">[202]</ref> and trellis codes for MIMO, like super-orthogonal space-time trellis codes <ref type="bibr">[131]</ref>. Original goal of space-time block coding was to achieve diversity. However, it is also a building block in closed-loop MIMO systems with limited feedback, like the beamforming/precoding codebooks in WiFi and 5G <ref type="bibr">[203]</ref>, <ref type="bibr">[204]</ref>.</p><p>Closed-loop MIMO uses feedback from the receiver to inform the transmitter about channel conditions, enabling techniques like beamforming or precoding to adapt. As shown in Fig. <ref type="figure">24</ref>, modulation symbols are mapped to layers before precoding. Assuming a channel rank of two, the layer mapper takes two symbols (s 1 , s 2 ) and creates two data streams. For a rank of one, it selects one symbol (s 1 ) and sends it to the precoding block or antenna mapper. With precoding and postprocessing, the channel is converted to parallel channels so that two independent data steams, without any coupling between them, are sent. This simplifies the design of the receiver.</p><p>As explained above, and shown in Fig. <ref type="figure">24</ref>, separating precoding and post-processing blocks, modulation schemes used in MIMO channels are the same as those developed for single-input single-output (SISO) channels. The main rational behind such a separation is the optimality of the precoding and post-processing matrices, obtained from SVD decomposition, to convert the MIMO channel into parallel SISO channels. In such a system, either symbols from a single modulation scheme, such as QPSK, are used across all channels, or symbols from different modulation schemes are assigned based on channel characteristics <ref type="bibr">[205]</ref>- <ref type="bibr">[209]</ref>. Such techniques separate modulation and precoding schemes, as shown in Fig. <ref type="figure">24</ref>. These techniques and their representative works can be summarized as, adaptive modulation with SVD precoding, along with optimal bit and power allocation <ref type="bibr">[23]</ref>, linear precoding for finitealphabet <ref type="bibr">[205]</ref>, bit allocation with SVD precoding and waterfilling power allocation, bit allocation with SVD precoding and equal power allocation, and SVD-based deep autoencoder <ref type="bibr">[25]</ref>. With a few exceptions, for example <ref type="bibr">[206]</ref>, the above line of work assumes perfect CSI at the transmitter.</p><p>Recent studies have emphasized the advantages of joint modulation and precoding design strategies compared to their separate counterparts. In <ref type="bibr">[112]</ref>, lattice-based symbol layouts have been proposed to enhance spectral efficiency although they face limitations in fully leveraging MIMO multiplexing gain. The multi-dimensional constellation concept, introduced in <ref type="bibr">[113]</ref>, is designed to fully leverage MIMO multiplexing gain. This new method designs constellations and precoding by simultaneously optimizing the in-phase and quadrature components for all sub-channels within a MIMO channel. It exhibits superior performance compared to existing finitealphabet MIMO communication techniques, including current AE-based constellations, as illustrated in <ref type="bibr">[113,</ref><ref type="bibr">Fig. 5]</ref>. The BER curves obtained by this method can serve as a lower bound for AE-based constellation design, indicating potential for further enhancement in AE-based end-to-end MIMO systems. This multi-dimensional constellation approach also holds promise for the development of even more sophisticated AEbased end-to-end MIMO-NOMA systems. Several DAE-based finite-alphabet MIMO schemes are introduced in <ref type="bibr">[22]</ref>, <ref type="bibr">[23]</ref>, <ref type="bibr">[25]</ref>.</p><p>Lastly, it is crucial to note that a significant portion of downlink MIMO-NOMA literature attempts to adapt SC-SIC decoding of SISO channel to MIMO ones <ref type="bibr">[8]</ref>, <ref type="bibr">[210]</ref>, <ref type="bibr">[211]</ref>. However, such strategies are strictly sub-optimal for MIMO-NOMA with and without secrecy <ref type="bibr">[95]</ref>, <ref type="bibr">[212]</ref>- <ref type="bibr">[216]</ref>. Despite the common extension of SC-SIC (the optimal strategy of SISO-NOMA) to MIMO-NOMA, it is well-established that SC-SIC cannot achieve the capacity region in MIMO-BC (MIMO-NOMA); instead, dirty-paper coding (DPC) is the optimal choice. This point has been highlighted in a few recent MIMO-NOMA works <ref type="bibr">[215]</ref>- <ref type="bibr">[217]</ref>. In addition, in these works, various linear precoding and power allocation strategies are developed to achieve the DPC-based capacity region of MIMO-NOMA channels. More specifically, Table <ref type="table">1</ref> in <ref type="bibr">[215]</ref> lists capacity-achieving signaling design for several related problems. The fact that linear preceding approaches the DPC region is known from various other works <ref type="bibr">[215]</ref>, <ref type="bibr">[218]</ref>- <ref type="bibr">[220]</ref>. Further, in <ref type="bibr">[221]</ref>, the notion of quasi-degraded channels was introduced as a mean to achieve the DPC region using linear precoding. This concept has been applied to other settings, such as network NOMA in <ref type="bibr">[222]</ref>. Therefore, when designing modulation and coding for MIMO-NOMA, particularly, when joint design is considered <ref type="bibr">[112]</ref>, <ref type="bibr">[113]</ref>, prioritizing the established optimal approach over suboptimal ones is imperative.</p><p>The capacity region of uplink MIMO-NOMA, also referred to as MIMO-MAC, is also well-established <ref type="bibr">[223]</ref>. This capacity region is the union of pentagons, each corresponding to different transmit covariance matrices, and its boundary is curved, except at the sum-rate point, where it becomes a straight line <ref type="bibr">[223]</ref>. Several low-complexity linear methods, such as simultaneous triangularization <ref type="bibr">[224]</ref>, zero forcing, and linear MMSE with practical codes <ref type="bibr">[225]</ref>, <ref type="bibr">[226]</ref> have been developed to approach the boundary of this region. If the rank is two, both symbols s 1 and s 2 are transmitted, enhancing data throughput. This scheme can be generalized to an arbitrary number of antennas. Here, modulation and precoding are designed separately.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>VI. RSMA</head><p>RSMA is a promising interference management technique in multi-user systems. While SDMA treats interference as noise and NOMA decodes interference of the users with weaker channels, RSMA uses SIC to decode a portion of the interference and treats the remaining interference as noise. This unique feature of RSMA allows for a gradual transition between decoding interference in NOMA and treating it as noise in SDMA, enabling a more flexible approach to interference management <ref type="bibr">[227]</ref>. The idea of rate splitting was first proposed long ago for a two-user SISO interference channel <ref type="bibr">[228]</ref>. The term RSMA was first used around 20 years later in <ref type="bibr">[229]</ref> where rate splitting was proposed for a multiple-access channel. Research on downlink RSMA <ref type="bibr">[230]</ref>- <ref type="bibr">[232]</ref> and its advantages resulted in the revival of the idea.</p><p>In what follows, we investigate RSMA under two broad umbrellas of downlink and uplink.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Downlink RSMA</head><p>To illustrate the framework of downlink RSMA, let us assume a single-layer scheme <ref type="bibr">[233]</ref>, <ref type="bibr">[234]</ref>. Consider a system with K single-antenna users where the BS is equipped with M antennas. Let W 1 , . . . , W K denote the message of Users 1 to K and h H 1 , . . . , h H K be the channels from the transmitter to Users 1 to K, respectively. In a single-layer scheme, the message of the kth user, W k , will be split into two parts, namely, common message W c,k and private message W p,k . The common parts {W c,1 , . . . , W c,K } are jointly encoded into the common stream s c and the private parts </p><p>The received signal at User k will be</p><p>where &#951; k is the complex AWGN with variance &#963; 2 . To recover User k's intended message, User k should decode the common message and its own private message. The idea behind downlink RSMA is to decode the common message at each user and then subtract it from the received signal, to cancel its interference, and then decode the private message by treating the private messages of all other users as noise. This approach can be considered as partial interference cancellation. For k = 1, . . . , K, the rates R k,c and R k,p can be calculated as</p><p>where symbols are assumed to be unit-power, i.e., E[|s k | 2 ] = 1. To make sure that the common message is decodable by</p><p>&#119882; &#119901;,&#119896; Messages of all users &#119882; 1 &#119882; &#119896; &#119882; &#119870; &#119882; &#119888;,1 &#119882; &#119888;,&#119896; &#119882; &#119888;,&#119870; Message combiner &#119882; &#119888; &#119882; &#119901;,1 &#119882; &#119901;,&#119896; &#119882; &#119901;,&#119870; . . . . . . . . . . . . &#119904; &#119888; &#119904; 1 &#119904; &#119896; &#119904; &#119870; Precoder . . . &#119909; 1 &#119909; 2 &#119909; &#119873; &#119879; a) Transmitter Receiver 1 &#119910; 1 &#119882; 1 Receiver K &#119910; &#119870; &#119882; &#119870; &#119910; &#119896; Decoder &#119882; &#119888; Encoder Precoder + Channel + -Splitter &#119882; &#119888;,&#119896; Decoder Message combiner &#119882; &#119896; Receiver k Message splitter Encoder b) Receivers every user, the common rate, R c , should be at most equal to the minimum of rates given in <ref type="bibr">(19)</ref>. That is</p><p>The total rate of User k will be the summation of its private message rate and the corresponding portion of the common message rate. The architecture of transmitter and receiver for this system is shown in Fig. <ref type="figure">25</ref>. More layers can be added to the system by considering common messages that will be decoded by a group of users and treated as noise by the rest <ref type="bibr">[235]</ref>. The transmitted signal in ( <ref type="formula">17</ref>) is a linear combination of precoded data streams, but nonlinear precoding is also an option <ref type="bibr">[236]</ref>. RSMA includes NOMA and SDMA as special cases. When there is no common message, RSMA transforms into SDMA. In a K-user system, where the first user has the strongest channel and the Kth user has the weakest channel, designing an RSMA system such that the Kth common message is decoded by Users 1 to K, the (K-1)th common message is decoded by Users 1 to K-1, . . . , the second common message is decoded by Users 1 and 2, and a private message decoded by User 1 turns RSMA into NOMA. Therefore, performance of RSMA, as a general framework, is never worse than those of SDMA and NOMA.</p><p>As a candidate for the NGMA, it is insightful to take a look at the advantages and disadvantages of RSMA. In comparison to other schemes, RSMA provides robust performance with imperfect channel state information at the transmitter (CSIT) <ref type="bibr">[231]</ref>, <ref type="bibr">[237]</ref>. Having NOMA and SDMA as special cases, RSMA results in better spectral efficiency and energy efficiency <ref type="bibr">[227]</ref>, <ref type="bibr">[233]</ref>. Achieving URLLC stands out as a key feature in the context of NR. A viable solution to enhance latency performance involves the reduction of packet size. Studies in <ref type="bibr">[238]</ref>, <ref type="bibr">[239]</ref> reveal that RSMA can achieve transmission rates similar to those of SDMA and NOMA, yet with shorter block-lengths, resulting in reduced latency. Naturally, these enhancements are accompanied by some associated costs. As one example, the decoding complexity of RSMA is much higher because of SIC and the need to decode common messages. This extra complexity is also present in NOMA; however, SDMA, in contrast, does not require SIC. On the other hand, SDMA lacks control over interference. Dividing the message at the transmitter introduces the challenge of optimizing message splitting, which becomes an additional task at the transmitter. Moreover, every receiver needs to be aware of the splitting/decoding rule to extract its intended message, adding to the complexity of downlink signaling. It is evident that a larger number of streams resulting from message splitting introduces more challenging optimization problems for beamforming and power allocation in RSMA.</p><p>To illustrate the concept, so far, we have discussed RSMA for multi-user systems with single-antenna users, i.e., only the transmitter is equipped with multiple antennas. MIMO has become an essential component of communication systems to significantly enhance their performance. Having multiple antennas at the receiver can further improve the performance of RSMA systems. While some works, such as <ref type="bibr">[240]</ref>, use the term MIMO to refer to systems with multiple singleantenna users and a multiple-antenna transmitter, we apply it specifically to systems where both the transmitter and receivers are equipped with multiple antennas. <ref type="bibr">[241]</ref> introduces practical stream combining methods along with regularized block diagonalization precoding for rate-splitting in MIMO broadcast channels, focusing on a single common stream and excluding precoding optimization. In <ref type="bibr">[242]</ref>, the authors examine a single-layer MIMO-RSMA system. To manage the inter-user interference at the receivers, the system employs linear combinations of the null-space basis vectors from the successively augmented MIMO channel matrices of the users as precoding vectors. <ref type="bibr">[243]</ref> studies the precoder optimization problem in MIMO-RSMA with the goal of maximizing the weighted ergodic sum-rate. Precoder design for MIMO-RSMA has been studied in several other papers such as <ref type="bibr">[244]</ref>, <ref type="bibr">[245]</ref>. Uplink MIMO-RSMA has been studied in <ref type="bibr">[246]</ref> where the authors focus on increasing energy efficiency by optimizing the transmit covariance matrices and decoding order using statistical CSI. In conclusion, these studies demonstrate that while MIMO systems are highly sensitive to CSI, RSMA proves to be much less affected by channel errors.</p><p>The benefits and promising features of RSMA have provoked a surge of research in this area. Therefore, effectiveness of RSMA on many existing problems have been studied. Performance of RSMA with OFDM has been studied in <ref type="bibr">[247]</ref> and compared with that of orthogonal frequency division multiple access and OFDM-NOMA. OFDM is a modulation technique that is widely used in modern wireless communication systems, including 4G (LTE) and 5G networks. It is anticipated that OFDM will continue to be a central component in NGMA. Therefore, ensuring compatibility of new multiple access techniques with OFDM and enhancing overall performance are crucial factors. Application of RSMA to integrated sensing and communications (ISAC) has been studied in several works like <ref type="bibr">[248]</ref>- <ref type="bibr">[251]</ref>. ISAC is expected to emerge as a pivotal feature in the next generation, seamlessly combining two technologies within a single device. This integration entails the shared utilization of hardware and spectrum for both functionalities. Particularly crucial for smart cities, ISAC enhances continuous environmental monitoring by integrating sensing capabilities into the extensive network of communication base stations. Another rising technology is the reconfigurable intelligent surface (RIS), which enhances communication by establishing new paths between the transmitter and the receiver. Every RIS consists of a large number of elements, often in the thousands, capable of altering the phase of the incident wave. Through careful control and design of these phase changes, an RIS can enhance the received power at the receiver. The large number of elements and passive characteristics of these components pose several challenges that need to be addressed. The integration of RIS in RSMA systems is an intriguing problem that has been explored in various studies, including <ref type="bibr">[252]</ref>- <ref type="bibr">[256]</ref>. UAV is another enabling technology for the next generation of wireless communications. These aerial platforms, commonly known as drones, offer a versatile and dynamic solution to address various challenges in wireless communications. Equipped with communication systems, UAVs can be deployed to enhance network coverage, particularly in distant or disaster-affected regions where conventional infrastructure may be constrained <ref type="bibr">[257]</ref>. Their mobility makes them suitable for duties such as data collection and even acting as relays to extend the network coverage <ref type="bibr">[258]</ref>- <ref type="bibr">[260]</ref>. As the need for high data rates and low-latency communication increases, incorporating UAVs into communication networks offers a creative and adaptable approach to fulfill these evolving demands <ref type="bibr">[261]</ref>, <ref type="bibr">[262]</ref>. Integration of RSMA into UAV-assisted communication has been studied in several works such as <ref type="bibr">[263]</ref>- <ref type="bibr">[267]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Uplink RSMA</head><p>One of the advantages of uplink RSMA over other multiple access techniques such as NOMA is its ability to achieve the capacity region of the Gaussian MAC without time sharing. Fig. <ref type="figure">26</ref> shows the rate region of a two-user Gaussian MAC. Conceptually, this is the same as the black pentagon in Fig. <ref type="figure">6</ref>. The horizontal and vertical lines up to points A and B are achievable by SIC and changing the order of decoding. As shown in Fig. <ref type="figure">26</ref>, NOMA can achieve the line AB by time sharing, while RSMA can achieve the line AB without time sharing. If we continue the plot beyond point A without time sharing (i.e., maintaining the decoding order as {1,2}), we obtain a line extending from point A to intersect the horizontal axis. The same occurs for point B and its extension. The resulting pentagon is the convex hull of these plots, which is achieved through time sharing. This involves decoding in the {1,2} order for a portion of the time and switching to the {2,1} order for the remaining time. Such a switching may not be desirable.</p><p>To illustrate the uplink RSMA's framework, let us consider an uplink system with K single-antenna users and a receiver with N r antennas. For K users, it is sufficient to split the messages of K-1 users to avoid time sharing <ref type="bibr">[29]</ref>, <ref type="bibr">[229]</ref>. Without loss of generality, the message of User k for k &#8712; {1, . . . , K -1}, denoted by W k , is split into two parts, W k,1 and W k,2 and the message of User K remains intact. The messages W k,1 and W k,2 will be encoded into streams s k,1 and s k,2 with powers P k,1 and P k,2 , respectively. The message of User K will be encoded to the stream s K . We assume unit-power constraint on the symbols, i.e.,</p><p>The transmit signals will be x k = P k,1 s k,1 + P k,2 s k,2 for User k &#8712; {1, . . . , K -1} and x K = &#8730; P K s K for User K. The received signal at the receiver will be</p><p>where h k is the channel between User k and the receiver and &#951; is the complex additive white Gaussian noise with the covariance matrix &#963; 2 I. The receiver uses filters w k,1 and w k,2 to decode the messages of Users 1 to K-1, and uses w K to decode the message of User K. Single-layer downlink RSMA, discussed in the previous section, does not require ordering for decoding, because each user simply decodes the common message first and then its own private message. However, ordering is required for downlink RSMA with more than one layer <ref type="bibr">[235]</ref>. Similarly, uplink RSMA requires the receiver to decide which message should be decoded first to achieve the optimal performance. This problem has been studied in <ref type="bibr">[268]</ref>. Assuming a decoding order &#960; such that &#960;</p><p>for User K we only &#119882; &#119896; &#119882; &#119870; Message Splitter &#119882; &#119896;,1 &#119882; &#119896;,2 Encoder &#119904; &#119896;,1 &#119904; &#119896;,2 Power Allocation &#119875; &#119896;,1 &#119904; &#119896;,1 &#119875; &#119896;,2 &#119904; &#119896;,2 Superposition Coding User k Encoder &#119904; &#119870; Power Allocation &#119875; &#119870; &#119904; &#119870; User K &#119909; &#119896; &#119909; &#119870; &#119910; 1 &#119910; &#119873; &#119903; &#119910; &#119896; Decode According to &#120587; SIC Decode According to &#120587; SIC &#119882; 1,1 &#119882; 1,2 &#119882; &#119896;,1 &#119882; &#119896;,2 &#119882; &#119870;-1,1 &#119882; &#119870;-1,2 &#119882; &#119870; Combiner Combiner Combiner &#119882; &#119870;-1 &#119882; &#119896; &#119882; 1 Structure of User &#119896; for &#119896; &#8712; {1, &#8230; , &#119870; -1} Structure of User &#119870; a) Transmitters b) Receiver The receiver determines the optimal decoding order, decodes each user's message, and employs SIC to remove interference.</p><p>have &#960; K ), the achievable rates will be</p><p>i, i &#8242; = 1, 2, k, k &#8242; = 1, . . . , K -1.</p><p>Fig. <ref type="figure">27</ref> shows the architecture of the uplink RSMA system. While RSMA for uplink communication was proposed prior to the downlink, the body of literature in this area is very small <ref type="bibr">[269]</ref>- <ref type="bibr">[274]</ref>. This may be viewed as a drawback of uplink RSMA; however, it also represents a relatively unexplored research area with many open problems. In particular, existing works focus on uplink systems with a single antenna. The next generation is set to deploy multiple antennas for both uplink and downlink. Consequently, tackling multi-antenna uplink RSMA problems can establish a significant and valuable research domain.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>VII. OPEN PROBLEMS &amp; FUTURE DIRECTIONS</head><p>In this section, we briefly discuss some open problems and future research directions in designing coding and modulation methods for NGMA. While NGMA systems offer numerous advantages, they may introduce new security and privacy challenges, compared to existing communication systems, because of the shared symbols and models.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>A. Non-Uniform Modulation for NOMA</head><p>It is known that superposition of two uniform constellations generally leads to a non-uniform super-constellations. As this is inherent in NOMA, non-uniform constellations with BICM are transformative shifts for addressing the rate losses observed when implemented with uniform constellations <ref type="bibr">[14]</ref>, rendering NOMA a practically viable scheme. One possible approach is to have non-uniform constellations for each user and superimpose them to create a non-uniform super-constellation, but this approach may be overly complicated, as the resulting superconstellations would be distinct for each values of the power allocation coefficient &#945;. On the other hand, the direct design of non-uniform constellations for different ranges of &#945; appears to be more promising. In addition, the integration of nonuniform constellations with BICM represents an intriguing and promising avenue for enhancing the practical performance of NOMA. BICM implementation in low-resource IoT devices is challenging due to increased receiver complexity. Therefore, efficient algorithms and hardware are essential to make BICM feasible for IoT. Exploring deep learning-based BICM presents a promising new research avenue <ref type="bibr">[275]</ref>.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>B. Modulation for MIMO-NOMA</head><p>As discussed earlier, modulation schemes for MIMO channels often borrow concepts from SISO channels, employing methods like SVD decomposition to treat each channel individually <ref type="bibr">[205]</ref>, <ref type="bibr">[206]</ref>. These techniques treat modulation and precoding separately using various bit allocation strategies. Recent studies advocate for joint modulation and precoding design <ref type="bibr">[112]</ref>, <ref type="bibr">[113]</ref>. The joint multi-dimensional constellation and precoding design proposed in <ref type="bibr">[113]</ref> optimizes constellations and precoding simultaneously for all MIMO sub-channels, demonstrating superior performance compared to existing techniques. This approach is based on two key factors: 1) the collaborative design of modulation and precoding, and 2) leveraging the Mahalanobis distance <ref type="bibr">[276]</ref> to effectively utilize sub-channels with varying gains. By treating the MIMO system as a unified multi-dimensional space, rather than separate 2D spaces for in-phase and quadrature-phase, this strategy enhances overall performance. The potential application of these techniques to MIMO-NOMA systems raises intriguing questions with substantial possibilities. Specifically, the design of MIMO-NOMA constellations can draw inspiration in the aforementioned multi-dimensional constellation concept presented in <ref type="bibr">[113]</ref>.</p><p>Additionally, designing modulation schemes based on imperfect or quantized CSI is another promising research direction <ref type="bibr">[206]</ref>. Also, like regular MIMO systems, highly correlated channels may result in rank-deficient matrices that require their own studies in MIMO-NOMA systems.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>C. AE-Based NOMA</head><p>There are several research questions to explore in AEbased constellation design for NOMA. Particularly, future research in this domain should address key challenges, including refining network construction, training, and loss function design. Additionally, a more effective BER may be achieved by exploring bit-interleaved coded modulation with iterative decoding, allowing for overlapping super-symbols. The multidimensional constellation design offers a promising avenue for any MIMO problem including MIMO-NOMA. Besides, addressing the complex challenges of inter-cell interference in multi-cell environments is another crucial future research direction for finite-alphabet NOMA. In addition, evaluating the above questions with imperfect CSI and channels with severe multipath and Doppler effects is crucial for understanding the potential improvement in BERs achievable in real-world NOMA transmission. Lastly, non-Gaussian noise channels, such as impulsive noise, can lead to diverse outcomes and offer promising directions for future research. Researchers have investigated NOMA in the presence of impulsive noise <ref type="bibr">[277]</ref>. Extending these studies to AE-based NOMA would be useful.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>D. Limited Feedback/CSI</head><p>Acquiring and distributing CSI to adapt the precoding/beamforming and modulation have been used in different wireless communication systems. For example, 3GPP Release 15 explains the role of CSI in its description of "5G Phase 1 Specifications" in Section 5.2.2 of TS 138.214 <ref type="bibr">[204]</ref>. The UE reports CSI parameters to the base station using limited feedback, i.e., a few bits. In most of the discussions so far, it has been assumed that the CSI is known perfectly. Obviously, the number of available feedback bits affects the performance of the NGMA systems, similar to the case of single-user systems <ref type="bibr">[53]</ref>. Channel values can be estimated at the receiver by transmitting pilots. The accuracy of the CSI estimation is limited and its error is usually modeled by a Guassian distribution. In frequency division duplexing (FDD) scenarios, to use CSI at the transmitter, the receiver should quantize and send back the estimated CSI. Since the channel values are real numbers and only a limited number of feedback bits are available, there will be a quantization error in addition to the estimation error. In NOMA systems that use SIC, the benefits of NOMA heavily depend on the knowledge of channel orders for decoding. As such, it is very important to design the limited/quantized feedback systems to maintain the channel orders. Recently, there have been some studies on the effects of limited/quantized feedback on NOMA systems <ref type="bibr">[211]</ref>, <ref type="bibr">[278]</ref>- <ref type="bibr">[283]</ref>. In addition to designing appropriate uniform quantizers to maximize the minimum rate in SISO-NOMA systems, <ref type="bibr">[278]</ref> analyzes the performance and demonstrates the catastrophic effects of an incorrect channel order estimation. A more general optimal scalar quantizer that works for both NOMA and A-NOMA is designed in <ref type="bibr">[279]</ref>. Designing NOMA systems with very small number of feedback bits that can achieve outage probabilities close to those of full-CSI NOMA systems is possible if the bits are mainly utilized to preserve the user ordering <ref type="bibr">[211]</ref>, <ref type="bibr">[280]</ref>- <ref type="bibr">[282]</ref>. An RIS-aided NOMA system with limited feedback is designed in <ref type="bibr">[283]</ref> and the rate loss due to quantization is analyzed. Also, there has been some limited research on RSMA systems with limited feedback <ref type="bibr">[230]</ref>, <ref type="bibr">[284]</ref>, <ref type="bibr">[285]</ref>.</p><p>Nevertheless, there are still many open problems in this area. For example, the capacity region of NOMA for a given number of feedback bits is not known. Future research directions include the design of the feedback link with optimal quantizers, studying the trade-off between the number of feedback bits and the performance, and analyzing the rate loss due to the limited feedback. One important challenge is to keep the transmitted throughput less than the capacity of the channel. Since the transmitter has only access to a quantized version of the channel gains, its estimate of the possible throughput rates may exceed the real capacity of the channel. This is more sever for downlink NOMA where channels are estimated at different nodes without knowing other channels. If the transmitter of downlink NOMA does not know the correct ordering, because of feedback error or quantization noise, a receiver may try to decode at a rate which is higher than its capacity, resulting in catastrophic outcomes and error propagation. Another challenge is the robust design of MIMO-NGMA systems for estimated and quantized CSI, especially taking into account the impact of precoding/beamforming vectors in designing user grouping methods and decoding order algorithms for each group. When RIS components exist in NOMA systems, estimating the corresponding cascaded channels is a major challenge, especially since RIS elements are usually passive and cannot transmit pilots <ref type="bibr">[286]</ref>. Designing appropriate limited feedback mechanisms for such RIS-assisted NOMA systems is another interesting open problem.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head>E. Effects of mmWave and Terahertz (THz) Channels</head><p>By enabling ultra-high data rates up to terabits per second, and massive connectivity, mmWave and THz frequencies are critical for next-generation wireless systems. However, they face challenges like limited coverage, limitations, and susceptibility to blockages <ref type="bibr">[287]</ref>. These channels are sparse, with few significant propagation paths from transmitter to receiver. Their smaller wavelengths allow highly directional antennas and beamforming, focusing energy in specific directions. Thus, communication in these bands is more directional, with signals traveling in defined paths rather than spreading widely.</p><p>Both NOMA and RSMA are used with these bands <ref type="bibr">[288]</ref>- <ref type="bibr">[292]</ref>. The potential for highly directional beamforming in these bands makes NOMA user grouping different from sub-6 GHz channels. Additionally, beamforming will shift from digital to analog or hybrid to reduce power consumption when the number of antennas is high <ref type="bibr">[293]</ref>- <ref type="bibr">[295]</ref>. Similarly, challenges related to modulation and coding in mmWave and THz bands are mostly hardware-related. High-speed analogto-digital converters (ADCs) and digital-to-analog converters (DACs) are power-hungry and costly <ref type="bibr">[296]</ref>, <ref type="bibr">[297]</ref>. As an alternative, QAM demodulation in the analog domain has been proposed <ref type="bibr">[296]</ref>. Power dissipation of ADCs and DACs reduces by lowering their resolution bits. While low-resolution ADCs and DACs, such as one-bit converters, can simplify hardware design and reduce power consumption, they also increase quantization noise, introducing challenges in signal quality, beamforming precision, and overall performance <ref type="bibr">[298]</ref>. NOMA and RSMA are more sensitive to channel imperfections, compared to OMA, since they require CSI for decoding. It is crucial to study the impact of low-resolution converters on both NOMA and RSMA decoders, particularly on SIC. Designing signal processing and machine learning techniques to mitigate these effects is essential for achieving acceptable performance levels.</p></div><note xmlns="http://www.tei-c.org/ns/1.0" place="foot" n="1" xml:id="foot_0"><p>Unless otherwise stated, downlink NOMA refers to the Gaussian broadcast channel.</p></note>
		</body>
		</text>
</TEI>
