Abstract: In this paper, we introduce an approach based on Deep Reinforcement Learning to optimize the RIS-NOMA scheme for the 6G wireless network. Some other parameters considered as actions in the proposed algorithm are clustering of users, power distribution, and RIS phase optimization. The two types of deep reinforcement learning algorithms, namely, SAC and MADDPG, were implemented on the proposed architecture. The performance of the proposed framework has been examined based on some parameters including sum rate, SINR, throughput, delay, energy efficiency, and Jain's fairness index. The suggested MADDPG and SAC models have shown better performance than existing models, based on the obtained results..
Key Word: Deep Reinforcement Learning (DRL), Non-Orthogonal Multiple Access (NOMA), Soft Actor Critic (SAC), Reconfigurable Intelligent Surface (RIS).
[1] Basar, E., Di Renzo, M., de Rosny, J., Debbah, M., Alouini, M. S., and Zhang, R., “Wireless Communications Through Reconfigurable Intelligent Surfaces,” IEEE Access, vol. 7, pp. 116753–116773, 2019.
[2] Chen, J., Zhang, H., and Letaief, K. B., “Multi-Agent Learning for Resource Management in 6G Heterogeneous Networks,” IEEE Journal on Selected Areas in Communications, vol. 42, no. 1, pp. 1–15, Jan. 2024.
[3] Gevez. Y, Y. I. Tek, and E. Basar, “Dynamic RIS partitioning in NOMA systems using deep reinforcement learning,” Frontiers in Antennas and Propagation, vol. 2, Art. no. 1418412, 2024, doi: 10.3389/fanpr.2024.1418412.
[4] Hou, Z., Sun, Y., Song, M., and Zhang, R., “Reconfigurable Intelligent Surface Assisted Non-Orthogonal Multiple Access,” IEEE Transactions on Wireless Communications, vol. 19, no. 11, pp. 6889–6903, Nov. 2020..