~ similar to 2606.02049· 20 results
This paper demonstrates that a complex deep reinforcement learning policy for power grid control can be successfully distilled into a lightweight, auditable decision tree and random forest surrogate t…
The paper introduces PIRS, a physics-informed reward shaping method that replaces ad-hoc comfort proxies with the ISO 7730 PMV formulation, enabling deep reinforcement learning agents to achieve energ…
OccuReward introduces an LLM-guided framework and a Comfort Equity Index (CEI) to shape building energy rewards, demonstrating that iterative refinement significantly improves occupant comfort equity…
The paper models how AI-driven data center demand stresses the electrical grid, finding that relying solely on renewable energy certificates (RECs) is insufficient and that on-site storage and spatial…
The paper proposes an uncertainty-aware transfer learning framework using the Temporal Fusion Transformer (TFT) to achieve robust and scalable energy forecasting across different buildings, demonstrat…
EnergyMamba proposes an uncertainty-aware, graph-enhanced selective state space model to significantly improve both the accuracy and reliability of energy consumption prediction by explicitly modeling…
The paper proposes S3TS, a novel tree search algorithm that simultaneously handles both non-linear system models and explicit uncertainties (scenarios) for advanced energy planning, achieving near-opt…
The paper proposes an energy-efficient drag reduction strategy for turbulent flows by combining Multi-Agent Deep Reinforcement Learning with SHAP-guided explainable deep learning, achieving superior p…
Jiawei Chen, Xiaofan Gui, Shikai Fang, Shengyu Tao +3 more
The paper introduces Battery-Sim-Agent, an LLM-based framework that reframes the difficult inverse problem of battery parameter estimation as a reasoning task, significantly outperforming traditional…
The paper proposes a scalable, distributed approach for constrained Multi-Agent Reinforcement Learning by using local consensus over dual variables to ensure global constraint satisfaction without cen…
Shilin Ou, Yifan Xu, Zhenshan Zhang, Luyao Zhang +1 more
SolarChain is a platform that ensures verifiable trust in decentralized solar energy markets by anchoring digital energy credits to the hard physical limits of solar yield, thereby preventing data man…
The paper addresses the reliability of open-weight LLMs for power system code generation by identifying structured API-knowledge boundary errors and proposing a boundary-aware intervention that signif…
Xuewei Yang, Jiachen Yu, Jie Wu, Shaoning Sun +2 more
The paper introduces Temperature-Scaled On-Policy Self-Distillation (TS-OPSD), a novel method that internalizes temperature-based policy reheating into model parameters to combat entropy collapse in r…
The paper introduces C-MADF, a causally constrained multi-agent framework that significantly reduces false positives in autonomous cyber defense by restricting response actions to structurally consist…
X-NegoBox introduces an explainable negotiation framework that adaptively manages privacy budgets for secure peer-to-peer energy data exchange, improving trust and reducing leakage.
The paper proposes DNQ, a scalable solver-in-the-loop framework for training agents in multi-turn simultaneous bidding games by leveraging pairwise payoff estimation to approximate complex equilibrium…
Philip Huff, Dakota Dale, Harshith Guduru, Rohan Singh +1 more
The paper proposes a system that operationalizes cybersecurity governance frameworks by integrating them with attack-path modeling and Deep Reinforcement Learning to generate practical, resource-const…
Lichao Wang, Zhaoxing Ren, Tianzhuo Yang, Jiaming Ji +3 more
SafeMCP is a server-side defense plugin that uses look-ahead reasoning to proactively filter and constrain tool acquisition for LLM agents, thereby mitigating catastrophic risks associated with expand…
The paper develops and validates a novel Deep Reinforcement Learning (DRL) framework to enhance pair trading in volatile cryptocurrency markets, demonstrating statistically significant outperformance…
The paper introduces Posterior Hybrid Bayesian Belief (PhyB), a novel framework that reformulates policy optimization in Bayesian Offline RL by approximating expectations as a convex combination over…