Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. Journal articles
  4. Finding Nash equilibrium based on reinforcement learning for bidding strategy and distributed algorithm for ISO in imperfect electricity market
 
research article

Finding Nash equilibrium based on reinforcement learning for bidding strategy and distributed algorithm for ISO in imperfect electricity market

Yu, Liying
•
Wang, Peng
•
Chen, Zhe
Show more
November 15, 2023
Applied Energy

Finding Nash equilibrium in an imperfect market with active and strategic resources is key to the successful operation of the electricity market. A hierarchical Nash distributed reinforcement learning (HNDRL) framework is proposed to realize the interaction between the market participants and independent system operator (ISO). In the bidding stage, multi-agent Nash policy reinforcement learning is utilized to update the bidding strategies for market participants, i.e., generation company (GenCo) and distributed company (DisCo), which maximizes their own profit with imperfect information. The bidding action is updated by the probability iteration to deal with the continuous bidding strategy. And in the market clearing stage, the distributed dynamic-average consensus optimization algorithm is proposed at the ISO level to obtain the locational marginal price (LMP) and transaction quantities. With the participation of the demand-side in the electricity market, the proposed HNDRL framework can reach a Nash equilibrium with the optimal bidding strategy and handle global resource constraints in a distributed way. In addition, theoretical analysis is provided to ensure that the participants' strategy exponentially converges to the Nash equilibrium for both convex and non-convex problems. Finally, the IEEE 30-bus system is employed to illustrate the efficiency. Compared with the existing method, the simulation results show that the proposed HNDRL framework has a faster convergence rate with the range of 2.667-3.333 times and can obtain higher profit with the range of 1.4286%-7.1429% at the Nash equilibrium point.

  • Details
  • Metrics
Type
research article
DOI
10.1016/j.apenergy.2023.121704
Web of Science ID

WOS:001061774600001

Author(s)
Yu, Liying
Wang, Peng
Chen, Zhe
Li, Dewen
Li, Ning
Cherkaoui, Rachid  
Date Issued

2023-11-15

Publisher

ELSEVIER SCI LTD

Published in
Applied Energy
Volume

350

Article Number

121704

Subjects

Energy & Fuels

•

Engineering, Chemical

•

Energy & Fuels

•

Engineering

•

bidding strategy

•

imperfect electricity market

•

nash equilibrium

•

distributed optimization

•

reinforcement learning

•

multi-agent

•

game approach

•

energy

Editorial or Peer reviewed

REVIEWED

Written at

EPFL

Available on Infoscience
September 25, 2023
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/200984
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés