cs.NIcs.LGEmpirical

Temporally Consistent Graph Q-Networks for Intelligent Network Control

Jun 11, 2026

AI Summaryllama-3.1-8b-instruct

A novel multi-agent reinforcement learning algorithm, TC-GQN, is proposed for high-level control and orchestration of mobile networks, enabling energy savings while maintaining QoS.

Introduces TC-GQN algorithm for mobile network orchestration using self-predicting network representation and graph neural network

Keywords

Mobile networks Multi-agent reinforcement learning Graph neural network Energy savings Quality of service

Before reading this…

Reinforcement learning Graph neural networks

Applications

→Mobile network orchestration
→Energy savings

Skill Ladder

To understand this paper, make sure you know these concepts first:

Reinforcement learningfind papers →
Graph neural networksfind papers →

Abstract

More Like This

Mobile networks continue to grow in complexity and next generation networks are expected to support both increasing traffic loads and more diverse services. As network complexity rises, optimizing antenna parameters under dynamic or changing objectives becomes increasingly challenging. We propose a novel multi-agent reinforcement learning (MARL) algorithm for high-level control and orchestration of mobile networks. The Temporally Consistent Graph Q-Network (TC-GQN) algorithm learns a self-predicting representation of the whole network that is task-independent and aggregates information from all base-stations. A graph neural network is trained using a global reward function to assign coordinated local actions based on the learned encoding of the global network state. We evaluate the algorithm in a simulated environment to orchestrate an energy-saving feature across multiple sectors and multiple carriers under different quality of service (QoS) constraints. The proposed algorithm outperforms state-of-the-art graph-based baselines and a competitive rule-based controller by improving hardware sleep time while maintaining QoS. Moreover, the learned representation enables rapid adaptation to changing intents.