~ similar to 2606.13848· 20 results
The paper proposes a Network Distributed Multi-Agent Reinforcement Learning (ND-MARL) framework that enables stable, scalable consensus control for large swarms of quadcopters using only local neighbo…
The paper introduces Coordination Graphs for Constrained Multi-Agent Reinforcement Learning (CG-CMARL), a scalable framework that decomposes complex joint action spaces into pairwise regions to handle…
The paper proposes AgentxGCore, an Agentic AI-Native layer that extends the 3GPP core network to enable self-organizing, self-adapting, and continuously optimized network management for 6G.
The paper proposes a Digital Twin-assisted Adaptive Multi-Agent Deep Reinforcement Learning framework to intelligently manage spectrum and resources in complex, dynamic Open-RAN 6G networks utilizing…
The paper presents ACD$^3$-GAT, a safety-contract graph MARL framework for network security response systems, which adds budget context, CVaR estimation, opponent-belief state, and Graph Counterfactua…
The paper proposes a communication-centric 6G-LLM architecture for tactical autonomous defense vehicles, demonstrating significant improvements in coordination and communication efficiency over conven…
The paper proposes a scalable, distributed approach for constrained Multi-Agent Reinforcement Learning by using local consensus over dual variables to ensure global constraint satisfaction without cen…
The paper proposes DNQ, a scalable solver-in-the-loop framework for training agents in multi-turn simultaneous bidding games by leveraging pairwise payoff estimation to approximate complex equilibrium…
Saeid Jamshidi, Negar Shahabi, Foutse Khomh, Carol Fung +1 more
The paper proposes a two-timescale governance framework using a multi-agent LLM to safely update and guide RL agents for SDN-IoT defense, significantly improving performance and stability under advers…
CARE-RL introduces a framework combining protocol-aware reward generation and capability-aware optimization to effectively mitigate cross-domain conflicts in multi-domain reinforcement learning for LL…
Jiazhen Lei, Tianze Cao, Yuxin Sha, Sihan Wang +4 more
The paper introduces RadioMaster, a novel multi-agent system that successfully translates high-level user intents into physically viable, real-world radio signals, significantly outperforming existing…
This paper demonstrates that using a communication algorithm (CommFormer) with heterogeneous agents significantly improves the speed and performance of multi-agent reinforcement learning for autonomou…
Chris Hicks, Elizabeth Bates, Shae McFadden, Isaac Symes Thompson +11 more
This paper synthesizes expert knowledge from a workshop to provide a comprehensive framework and best-practice guidelines for developing high-quality reinforcement learning environments for autonomous…
The paper introduces AgenticRL, a self-refining reinforcement learning framework that uses a multimodal GPT agent to automatically design, refine, and deploy reward functions for complex UAV navigatio…
The paper introduces the Markov decision contest, a new framework for reinforcement learning using pairwise preferences, and proves that stationary Markov policies are optimal and solvable efficiently…
Yi Ding, Zijie Xuan, Haowei Zhou, Zhenyu Ju +5 more
The paper proposes TCP-MCP, a co-evolution framework that jointly optimizes agent prompts and communication topologies to design highly efficient and effective multi-agent systems.
Rudolf Krecht, Tamas Budai, Erno Horvath, Akos Kovacs +2 more
This paper provides a comprehensive review of network optimization aspects for Connected and Autonomous Vehicles (CAVs), aiming to clarify misconceptions and outline future research directions.
The paper introduces the Terminal Representation (TR), a novel, lower-dimensional, and structurally distinct formulation for encoding reward-weighted trajectories in RL that bypasses the need for eige…
Azal Ahmad Khan, Ammar Ahmed, Zeshan Fayyaz, Sheng Di +2 more
The paper introduces Straggler-Aware Group Control (SAGC), a dynamic group-size controller that optimizes synchronous on-policy RL training by adapting group size to minimize delays caused by slow rol…