Learning to Schedule Network Resources Throughput and Delay Optimally Using Q?-Learning
مقال من تأليف: Bae, Jeongmin ; Lee, Joohyun ; Chong, Song ;
ملخص: As network architecture becomes complex and the user requirement gets diverse, the role of efficient network resource management becomes more important. However, existing throughput-optimal scheduling algorithms such as the max-weight algorithm suffer from poor delay performance. In this paper, we present reinforcement learning-based network scheduling algorithms for a single-hop downlink scenario which achieve throughput-optimality and converge to minimal delay. To this end, we first formulate the network optimization problem as a Markov decision process (MDP) problem. Then, we introduce a new state-action value function called
لغة:
إنجليزية