Abstract
As electricity consumption in commercial and residential buildings continues to rise, reducing energy costs presents an increasing challenge. Heating, ventilating, and air-conditioning (HVAC) systems, which typically account for 40%-50% of a building's energy use, are prime targets for energy savings. Intelligent control of HVAC temperature through the exploitation of HVAC load flexibility brings significant potential to reduce energy consumption and electricity expenses. The nonlinear models of HVAC systems challenge traditional control methods, while the uncertainty introduced by HVAC load flexibility complicates distributed energy resource (DER) management using conventional optimal dispatch techniques. In response to these challenges, we propose a hierarchical multi-agent deep reinforcement learning (DRL) approach. The lower-level agents focus on balancing comfort and energy conservation, while the upper-level DRL agents optimize the use of DERs to reduce peak demand based on the control outcomes of the HVAC by the lower-level agents. In the upper-level agents, we incorporate a multi-agent structure based on ensemble learning, which acts based on historical and current data without relying on precise load forecasting to address the delayed rewarding issue in DRL. This allows for the effective reduction of energy costs. The proposed method is tested using a real-world microgrid comprising 413 buildings in Southern California, and the results demonstrate that our approach can significantly reduce overall electricity bills while ensuring the comfort of consumers and residents.
| Original language | American English |
|---|---|
| Pages (from-to) | 5589-5601 |
| Number of pages | 13 |
| Journal | IEEE Transactions on Smart Grid |
| Volume | 16 |
| Issue number | 6 |
| DOIs | |
| State | Published - 2025 |
NLR Publication Number
- NREL/JA-5D00-91137
Keywords
- DERs
- HVAC
- load flexibility
- optimal dispatch
- reinforcement learning
Fingerprint
Dive into the research topics of 'Multi-Agent Hierarchical Deep Reinforcement Learning for HVAC Control With Flexible DERs'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver