强化学习驱动的能源互联网中多目标能源管理

138 浏览量更新于2024-08-28 收藏 388KB PDF 举报

"这篇研究论文探讨了在能源互联网中利用强化学习进行多目标能量管理的策略，特别是针对We-Energy这种新型的能源生产、存储和消费模式。论文由东北大学信息科学与工程学院的作者撰写，旨在通过优化模型构建一个环保且运行成本低的能源消耗结构，并采用强化学习方法解决多目标优化问题。" 正文: 在能源互联网的背景下，We-Energy作为一种创新的能源生产、储存和消费模式，旨在最大化可再生能源的利用率。本文的核心关注点在于如何在由联合热电联产（CHP）、光伏单元、供热单元和储能设备组成的能源互联网系统中实现有效的能量管理。首先，论文提出了一种多目标优化模型。这个模型的目标是平衡多个相互冲突的指标，如提高能源效率、降低运营成本、确保供电稳定性以及减少对环境的影响。这样的多目标优化对于复杂能源系统的管理至关重要，因为它需要同时考虑到经济效益和环境保护。其次，为了应对多目标优化的挑战，作者引入了强化学习的方法。强化学习是一种机器学习技术，通过智能体与环境的交互来学习最优策略。在这个场景中，智能体代表能量管理系统，它会根据当前状态（如能源供需、环境条件等）选择行动，并通过环境的反馈（奖励或惩罚）来不断调整其决策策略，以达到长期目标的最大化。在实际应用中，强化学习可以处理不确定性和动态变化的环境，这在能源管理中是非常关键的，因为能源供应（如太阳能和风能）和需求都是时间和天气敏感的。通过不断试错和学习，智能体能够适应这些变化并优化其操作策略，从而在满足不同目标之间找到最佳平衡点。此外，论文可能还讨论了如何设计合适的奖励函数来引导智能体的学习过程，以及如何处理多目标优化中的目标冲突问题。可能还包括了实验结果，展示强化学习算法在模拟和真实环境中的性能，以及与传统控制方法的比较。这篇研究论文通过将强化学习应用于多目标能量管理，为We-Energy在能源互联网中的高效运行提供了一种新的解决方案，有助于推动绿色、可持续的能源系统的发展。

Multi-objective Energy Management for We-Energy

in Energy Internet using Reinforcement Learning

Qiuye Sun

School of Information Science

and Engineering

Northeastern University

Shenyang, China

sunqiuye@ise.neu.edu.cn

Danlu Wang

School of Information Science

and Engineering

Northeastern University

Shenyang, China

1653224713@qq.com

Dazhong Ma

School of Information Science

and Engineering

Northeastern University

Shenyang, China

madazhong@ise.neu.edu.cn

Bonan Huang

School of Information Science

and Engineering

Northeastern University

Shenyang, China

huangbonan@ise.neu.edu.cn

Abstract—We-Energy is a novel energy production-storage-

consumption mode proposed for Energy Internet, where more

renewable energy can be utilized. This paper mainly focuses on

the energy management of We-Energy in Energy Internet

consisting of combined heat and power unit (CHP), photovoltaic

unit, heating only unit and storage device. To construct an

environmental-friendly and low-operating cost energy

consumption structure, a multi-objective optimization model is

proposed in this paper. Furthermore, in order to satisfy the

power and heat demands of the We-Energy simultaneously as

well as realizing minimum operating cost and pollutant emission,

an intelligent energy management system (IEMS) is presented. In

particular, reinforcement learning method has been implemented

to formulate the optimal operating strategy. Eligibility trace

theory is also been introduced to accelerate the computational

process. Finally, simulation results are given to prove the

effectiveness of the proposed optimization strategy.

Keywords—Energy management, reinforcement learning, We-

Energy, multi-objective optimization

I. INTRODUCTION

Due to the upcoming shortage of fossil fuels and increasing

concerns for environmental pollution, the current power grid is

facing the challenges of increasing power demand and

environmental protection which calls for a sustainable power

system to realize low-carbon energy consumption. As a

consequence, the concept of “Energy Internet” is proposed as a

potential solution to these issues, the conventional fuels are

replaced by renewable energy and the centralized generation is

transformed into distributed generations.

The combined of multiple types of energy is one of the

specific characteristics of Energy Internet. The Energy Internet

can be assumed as a cluster of distributed energy resources and

loads, which contains various types of energy resources such as

electricity, gas, heat and so on [1]. The use of different kind of

energy brings great benefit to Energy Internet, which allows

multiple end users to make options according to their own

power demands, hence increasing the flexibility of the power

system and weakening the impact of traditional energy supplier.

However, using distributed generations indiscriminately may

also impose undesirable effects on power system. Therefore,

issues on optimal energy management come into play. A lot of

researches concerning control and operation of power system

have been done in recent years. Several common optimization

objectives including lower cost of carbon and minimum

operating cost have been discussed in [2]. In [3], authors

proposed a smart energy management system in order to

minimize the operating cost of the micro-grid. Only electricity

is discussed during optimization process while other types of

energy resources are not considered in the paper. The authors

in [4] proposed a micro-grid scenario consists of combined

heat and power generation, as well as power and thermal

energy storage devices. And an online algorithm has been put

forward to optimize the cost of whole system.

However, the optimized economic dispatch does not always

satisfy the demands when taking pollutants emission into

account. So, multi-objective energy management has drawn

attention from researchers so as to realize optimization both

economically and environmentally. The authors in [5]

proposed an intelligent energy management system (IEMS) for

a CHP-based micro-grid, and minimized the operation cost and

the net emission simultaneously. An efficient modified

bacterial foraging optimization algorithm was used to find the

optimal set points of the system. Reference [6] proposed a

Stackelberg game-based optimization model, and a differential

evolution-based heuristic algorithm was designed to reach the

Stackelberg equilibrium. But in the previous studies, there is a

lack of consideration of specific characteristics of Energy

Internet, such as openness, sharing and peer-to-peer integration.

As there is a series of complex problems in Energy Internet

like energy management, dynamic pricing and trading and

information interaction need to be solved, a novel energy

integration mode called “We-Energy” (WE) is proposed.

According to [7], We-Energy is a fusion of energy producers,

energy storage and consumers, which breaks the traditional

energy supply pattern led by major energy supplier and realizes

open operation and energy supply complementarity.

In this paper, an optimization model based on We-Energy

in Energy Internet is proposed considering both operating cost

and environmental pollutant. We stress that recent work has

shown reinforcement learning to be very effective and suitable

for making decisions and finding optimal strategies. Thus, we

provided an intelligent energy management system (IEMS)

based on Q-learning method to find the optimal operating

strategy of a WE.

下载后可阅读完整内容，剩余5页未读，立即下载

weixin_38611796

粉丝: 8
资源: 943

强化学习驱动的能源互联网中多目标能源管理

NIPS 2020强化学习：基于模型方法的最新论文研究

多智能体强化学习研究文献翻译与综述

轻量级多智能体gridworld环境gym-multigrid分析

PhD-Thesis-Multi-agent-deep-reinforcement-learning-in-mobile-robotics

Multi-Agent-Reinforcement-Learning-Environment_强化学习_multi-agent_

Multi-Agent Cooperative Bidding Games for Multi-Objective

Multi-Agent Transfer Learning in Reinforcement Learning-Based R

multi-agent-system-with-reinforcement-learning:MAS与RL的实施

Game-based-deep-reinforcement-learning-for-target-tracking

ConnectedQ-Multi-agent-Reinforcement-Learning_M?n_q学习_强化学习_

最新资源