Papers similar to 2606.02049

~ similar to 2606.02049· 20 results

cs.LGcs.AIRecentMay 30, 2026

Interpretable Policy Distillation for Power Grid Topology Control

This paper demonstrates that a complex deep reinforcement learning policy for power grid control can be successfully distilled into a lightweight, auditable decision tree and random forest surrogate t…

View →

cs.AIRecentMay 27, 2026

PIRS: Physics-Informed Reward Shaping for SAC-Based Building Energy Management

Shadmehr Zaregarizi, Khashayar Yavari

The paper introduces PIRS, a physics-informed reward shaping method that replaces ad-hoc comfort proxies with the ISO 7730 PMV formulation, enabling deep reinforcement learning agents to achieve energ…

View →

cs.AIRecentMay 27, 2026

OccuReward: LLM-Guided Occupant-Centric Reward Shaping for Demographic Equity in Grid-Interactive Buildings

Shadmehr Zaregarizi, Khashayar Yavari

OccuReward introduces an LLM-guided framework and a Comfort Equity Index (CEI) to shape building energy rewards, demonstrating that iterative refinement significantly improves occupant comfort equity…

View →

econ.EMcs.AIRecentMay 30, 2026

Certificates without Electrons? Theory and Evidence on Impacts from AI-Driven Power Demand

Dana Golden, Aruna Balasubramanian, Niranjan Balasubramanian

The paper models how AI-driven data center demand stresses the electrical grid, finding that relying solely on renewable energy certificates (RECs) is insufficient and that on-site storage and spatial…

View →

cs.AIRecentMay 28, 2026

Uncertainty-Aware Transfer Learning for Cross-Building Energy Forecasting: Toward Robust and Scalable District-Level Energy Management

Shadmehr Zaregarizi, Khashayar Yavari

The paper proposes an uncertainty-aware transfer learning framework using the Temporal Fusion Transformer (TFT) to achieve robust and scalable energy forecasting across different buildings, demonstrat…

View →

cs.AIcs.LGRecentMay 30, 2026

EnergyMamba: An Uncertainty-Aware Graph-Enhanced Selective State Space Model for Energy Consumption Prediction

Dahai Yu, Rongchao Xu, Lin Jiang, Guang Wang

EnergyMamba proposes an uncertainty-aware, graph-enhanced selective state space model to significantly improve both the accuracy and reliability of energy consumption prediction by explicitly modeling…

View →

cs.AIeess.SYRecentJun 1, 2026

S3TS: Stochastic Scenario-Structured Tree Search for Advanced Planning Under Uncertainty

Fabio Pavirani, Bert Claessens, Pierre Pinson, Chris Develder

The paper proposes S3TS, a novel tree search algorithm that simultaneously handles both non-linear system models and explicit uncertainties (scenarios) for advanced energy planning, achieving near-opt…

View →

cs.LGcs.AIphysics.flu-dynRecentMay 31, 2026

Explainable deep reinforcement learning reveals energy-efficient control strategies for turbulent drag reduction

Federica Tonti, Ricardo Vinuesa

The paper proposes an energy-efficient drag reduction strategy for turbulent flows by combining Multi-Agent Deep Reinforcement Learning with SHAP-guided explainable deep learning, achieving superior p…

View →

cs.AIRecentMay 28, 2026

Battery-Sim-Agent: Leveraging LLM-Agent for Inverse Battery Parameter Estimation

Jiawei Chen, Xiaofan Gui, Shikai Fang, Shengyu Tao +3 more

The paper introduces Battery-Sim-Agent, an LLM-based framework that reframes the difficult inverse problem of battery parameter estimation as a reasoning task, significantly outperforming traditional…

View →

cs.LGcs.AIRecentMay 28, 2026

Scalable Constrained Multi-Agent Reinforcement Learning via State Augmentation and Consensus for Separable Dynamics

Santiago Amaya-Corredor, Miguel Calvo-Fullana, Anders Jonsson

The paper proposes a scalable, distributed approach for constrained Multi-Agent Reinforcement Learning by using local consensus over dual variables to ensure global constraint satisfaction without cen…

View →

cs.CYcs.CRcs.DCRecentMay 22, 2026

SolarChain: Bridging Physical Law, Verifiable Trust, and Sustainable Markets for Urban Energy Resilience

Shilin Ou, Yifan Xu, Zhenshan Zhang, Luyao Zhang +1 more

SolarChain is a platform that ensures verifiable trust in decentralized solar energy markets by anchoring digital energy credits to the hard physical limits of solar yield, thereby preventing data man…

View →

cs.SEcs.CLeess.SYRecentMay 29, 2026

Knowledge Boundary Probing and Demand-Guided Intervention for LLM-Based Power System Code Generation

Hui Wu, Xiaoyang Wang, Zhong Fan

The paper addresses the reliability of open-weight LLMs for power system code generation by identifying structured API-knowledge boundary errors and proposing a boundary-aware intervention that signif…

View →

cs.CLcs.LGRecentMay 30, 2026

Internalize the Temperature: On-Policy Self-Distillation as Policy Reheater for Reinforcement Learning

Xuewei Yang, Jiachen Yu, Jie Wu, Shaoning Sun +2 more

The paper introduces Temperature-Scaled On-Policy Self-Distillation (TS-OPSD), a novel method that internalizes temperature-based policy reheating into model parameters to combat entropy collapse in r…

View →

cs.CRcs.LGcs.MARecentApr 6, 2026

Explainable Autonomous Cyber Defense using Adversarial Multi-Agent Reinforcement Learning

Yiyao Zhang, Diksha Goel, Hussain Ahmad

The paper introduces C-MADF, a causally constrained multi-agent framework that significantly reduces false positives in autonomous cyber defense by restricting response actions to structurally consist…

View →

cs.CRcs.AIRecentApr 27, 2026

X-NegoBox: An Explainable Privacy-Budget Negotiation Framework for Secure Peer-to-Peer Energy Data Exchange

Poushali Sengupta, Sabita Maharjan, Frank Eliassen, Yan Zhang

X-NegoBox introduces an explainable negotiation framework that adaptively manages privacy budgets for secure peer-to-peer energy data exchange, improving trust and reducing leakage.

View →

cs.GTcs.LGRecentJun 4, 2026

DNQ: Deep Nash Q-Network for Partially Observable n-Player Games

Qintong Xie, Edward Koh, Xavier Cadet, Peter Chin

The paper proposes DNQ, a scalable solver-in-the-loop framework for training agents in multi-turn simultaneous bidding games by leveraging pairwise payoff estimation to approximate complex equilibrium…

View →

cs.CRRecentMay 10, 2026

Operationalizing Cybersecurity Governance for Mitigation Planning with Attack-Path Modeling and Reinforcement Learning

Philip Huff, Dakota Dale, Harshith Guduru, Rohan Singh +1 more

The paper proposes a system that operationalizes cybersecurity governance frameworks by integrating them with attack-path modeling and Deep Reinforcement Learning to generate practical, resource-const…

View →

cs.AIcs.CLcs.CYRecentJun 1, 2026

SafeMCP: Proactive Power Regulation for LLM Agent Defense via Environment-Grounded Look-Ahead Reasoning

Lichao Wang, Zhaoxing Ren, Tianzhuo Yang, Jiaming Ji +3 more

SafeMCP is a server-side defense plugin that uses look-ahead reasoning to proactively filter and constrain tool acquisition for LLM agents, thereby mitigating catastrophic risks associated with expand…

View →

cs.LGcs.NEq-fin.STRecentJun 3, 2026

Dynamic Multi-Pair Trading Strategy in Cryptocurrency Markets with Deep Reinforcement Learning

Damian Lebiedź, Robert Ślepaczuk

The paper develops and validates a novel Deep Reinforcement Learning (DRL) framework to enhance pair trading in volatile cryptocurrency markets, demonstrating statistically significant outperformance…

View →

cs.AIcs.LGRecentMay 30, 2026

Regularized Offline Policy Optimization with Posterior Hybrid Bayesian Belief

Hongqiang Lin, Pengfei Wang, Nenggan Zheng

The paper introduces Posterior Hybrid Bayesian Belief (PhyB), a novel framework that reformulates policy optimization in Bayesian Offline RL by approximating expectations as a convex combination over…

View →