ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:

~ similar to 2606.13848· 20 results

cs.ROcs.AIcs.LGRecentJun 1, 2026

Network Distributed Multi-Agent Reinforcement Learning for Consensus Control of Quadcopters

Youssef Mahran, Zeyad Gamal, Aamir Ahmad, Ayman El-Badawy

The paper proposes a Network Distributed Multi-Agent Reinforcement Learning (ND-MARL) framework that enables stable, scalable consensus control for large swarms of quadcopters using only local neighbo…

View →
cs.AIRecentJun 1, 2026

Coordination Graphs for Constrained Multi-Agent Reinforcement Learning

Santiago Amaya-Corredor, Miguel Calvo-Fullana, Anders Jonsson

The paper introduces Coordination Graphs for Constrained Multi-Agent Reinforcement Learning (CG-CMARL), a scalable framework that decomposes complex joint action spaces into pairwise regions to handle…

View →
cs.NIcs.AIRecentMay 29, 2026

AgentxGCore: Agentic AI for Next-Generation Mobile Core Network

Maria Katarine Santana Barbosa, Kelvin L. Dias

The paper proposes AgentxGCore, an Agentic AI-Native layer that extends the 3GPP core network to enable self-organizing, self-adapting, and continuously optimized network management for 6G.

View →
cs.ITcs.AIRecentMay 31, 2026

Digital Twin-Assisted Adaptive Multi-Agent DRL for Intelligent Spectrum and Resource Management in Open-RAN UAV-Enabled 6G Networks

Marwan Dhuheir, Thang X. Vu, Symeon Chatzinotas

The paper proposes a Digital Twin-assisted Adaptive Multi-Agent Deep Reinforcement Learning framework to intelligently manage spectrum and resources in complex, dynamic Open-RAN 6G networks utilizing…

View →
cs.MAcs.AIcs.CREmpiricalRecentJun 11, 2026

Safety-Contract Graph Multi-Agent Reinforcement Learning for Autonomous Network Security Response

Jose Luis Lima de Jesus Silva

The paper presents ACD$^3$-GAT, a safety-contract graph MARL framework for network security response systems, which adds budget context, CVaR estimation, opponent-belief state, and Graph Counterfactua…

View →
eess.SPcs.AIcs.NIRecentMay 31, 2026

A Communication-Centric 6G-LLM Architecture for Scalable Tactical Autonomous Defense Vehicle Networks

Kiran Khurshid, Shumaila Javaid, Nasir Saeed

The paper proposes a communication-centric 6G-LLM architecture for tactical autonomous defense vehicles, demonstrating significant improvements in coordination and communication efficiency over conven…

View →
cs.LGcs.AIRecentMay 28, 2026

Scalable Constrained Multi-Agent Reinforcement Learning via State Augmentation and Consensus for Separable Dynamics

Santiago Amaya-Corredor, Miguel Calvo-Fullana, Anders Jonsson

The paper proposes a scalable, distributed approach for constrained Multi-Agent Reinforcement Learning by using local consensus over dual variables to ensure global constraint satisfaction without cen…

View →
cs.GTcs.LGRecentJun 4, 2026

DNQ: Deep Nash Q-Network for Partially Observable n-Player Games

Qintong Xie, Edward Koh, Xavier Cadet, Peter Chin

The paper proposes DNQ, a scalable solver-in-the-loop framework for training agents in multi-turn simultaneous bidding games by leveraging pairwise payoff estimation to approximate complex equilibrium…

View →
cs.CRRecentApr 1, 2026

Multi-Agent LLM Governance for Safe Two-Timescale Reinforcement Learning in SDN-IoT Defense

Saeid Jamshidi, Negar Shahabi, Foutse Khomh, Carol Fung +1 more

The paper proposes a two-timescale governance framework using a multi-agent LLM to safely update and guide RL agents for SDN-IoT defense, significantly improving performance and stability under advers…

View →
cs.LGcs.AIRecentMay 30, 2026

CARE-RL: Capability-Aware Reinforcement Learning for Mitigating Cross-Domain Conflicts

Rui Zhang, Xinle Wu, Yao Lu

CARE-RL introduces a framework combining protocol-aware reward generation and capability-aware optimization to effectively mitigate cross-domain conflicts in multi-domain reinforcement learning for LL…

View →
cs.MAcs.AIcs.NIRecentJun 1, 2026

RadioMaster: Multi-Agent System for Autonomous Radio Signal Generation

Jiazhen Lei, Tianze Cao, Yuxin Sha, Sihan Wang +4 more

The paper introduces RadioMaster, a novel multi-agent system that successfully translates high-level user intents into physically viable, real-world radio signals, significantly outperforming existing…

View →
cs.CRcs.AIcs.LGRecentMar 17, 2026

Learning Communication Between Heterogeneous Agents in Multi-Agent Reinforcement Learning for Autonomous Cyber Defence

Alex Popa, Adrian Taylor, Ranwa Al Mallah

This paper demonstrates that using a communication algorithm (CommFormer) with heterogeneous agents significantly improves the speed and performance of multi-agent reinforcement learning for autonomou…

View →
cs.CRcs.AIRecentApr 9, 2026

Building Better Environments for Autonomous Cyber Defence

Chris Hicks, Elizabeth Bates, Shae McFadden, Isaac Symes Thompson +11 more

This paper synthesizes expert knowledge from a workshop to provide a comprehensive framework and best-practice guidelines for developing high-quality reinforcement learning environments for autonomous…

View →
cs.ROcs.AIRecentJun 2, 2026

Self-Refining Agentic Reinforcement Learning for Vision-Conditioned UAV Navigation

Roohan Ahmed Khan, Yasheerah Yaqoot, Muhammad Ahsan Mustafa, Dzmitry Tsetserukou

The paper introduces AgenticRL, a self-refining reinforcement learning framework that uses a multimodal GPT agent to automatically design, refine, and deploy reward functions for complex UAV navigatio…

View →
cs.LGcs.AIRecentMay 29, 2026

Reinforcement Learning with Pairwise Preferences in Long-Term Decision Problems

Jonathan Colaço Carr, Prakash Panangaden, Doina Precup, Benjamin Van Roy

The paper introduces the Markov decision contest, a new framework for reinforcement learning using pairwise preferences, and proves that stationary Markov policies are optimal and solvable efficiently…

View →
cs.AIRecentMay 27, 2026

TCP-MCP: Landscape-Guided Co-Evolution of Prompts and Communication Topologies for Multi-Agent Systems

Yi Ding, Zijie Xuan, Haowei Zhou, Zhenyu Ju +5 more

The paper proposes TCP-MCP, a co-evolution framework that jointly optimizes agent prompts and communication topologies to design highly efficient and effective multi-agent systems.

View →
cs.NIcs.AIRecentMay 28, 2026

Network Optimization Aspects of Autonomous Vehicles: Challenges and Future Directions

Rudolf Krecht, Tamas Budai, Erno Horvath, Akos Kovacs +2 more

This paper provides a comprehensive review of network optimization aspects for Connected and Autonomous Vehicles (CAVs), aiming to clarify misconceptions and outline future research directions.

View →
cs.LGcs.AIRecentMay 29, 2026

The Terminal Representation in Reinforcement Learning

Amir Esterhuysen, Anders Jonsson

The paper introduces the Terminal Representation (TR), a novel, lower-dimensional, and structurally distinct formulation for encoding reward-weighted trajectories in RL that bypasses the need for eige…

View →
cs.LGcs.AIRecentJun 1, 2026

Faster Synchronous On-Policy RL via Straggler-Aware Group Sizing

Azal Ahmad Khan, Ammar Ahmed, Zeshan Fayyaz, Sheng Di +2 more

The paper introduces Straggler-Aware Group Control (SAGC), a dynamic group-size controller that optimizes synchronous on-policy RL training by adapting group size to minimize delays caused by slow rol…

View →