This paper presents a Q-learning algorithm for developing bidding strategies for market participants in financial transmission right (FTR) auctions. Each market participant is represented by an autonomous adaptive agent that develops its own bidding behavior using Q-learning. First, a bilevel optimization problem is formulated. At the first level, a market participant tries to maximize its expected profit, subject to the constraint that, at the second level, an independent system operator tries to maximize the revenues from the FTR auction. It is assumed that each FTR market participant chooses its bidding strategy for holding an FTR based on a probabilistic estimate of the locational marginal price (LMP) differences between withdrawal and injection points. The market participant's expected profit is calculated, and a Q-learning algorithm is employed to find the optimal bidding strategy. A two-bus and a five-bus test system are used to illustrate the presented method.
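As a rough illustration of the idea, and not the formulation used in the paper, the following sketch shows a single agent learning a bid price for one FTR with a single-state Q-learning update. Everything specific here is an assumption made for the example: the function name `train_bidder`, the discrete bid grid, the normal distribution for the LMP difference, and the uniformly drawn clearing price that stands in for the operator's second-level auction problem.

```python
import random


def train_bidder(bids, episodes=5000, alpha=0.1, gamma=0.0, epsilon=0.1, seed=0):
    """Learn a Q-value for each candidate bid price (hypothetical toy model)."""
    rng = random.Random(seed)
    q = [0.0] * len(bids)  # one Q-value per bid; the agent has a single state

    for _ in range(episodes):
        # Epsilon-greedy action selection over the discrete bid prices.
        if rng.random() < epsilon:
            a = rng.randrange(len(bids))
        else:
            a = max(range(len(bids)), key=q.__getitem__)

        # Assumed probabilistic estimate of the LMP difference between the
        # withdrawal and injection points (normal, mean 10, std 3).
        lmp_diff = rng.gauss(10.0, 3.0)

        # Assumed stand-in for the auction outcome: a random clearing price.
        # The bid is accepted if it is at least the clearing price, and the
        # holder then pays the clearing price for the FTR.
        clearing_price = rng.uniform(0.0, 20.0)
        if bids[a] >= clearing_price:
            reward = lmp_diff - clearing_price
        else:
            reward = 0.0

        # Q-learning update; with one state, the next-state value is max(q).
        q[a] += alpha * (reward + gamma * max(q) - q[a])

    return q


bids = [2.0, 6.0, 10.0, 14.0, 18.0]
q_values = train_bidder(bids)
best_bid = bids[max(range(len(bids)), key=q_values.__getitem__)]
print("learned Q-values:", [round(v, 2) for v in q_values])
print("greedy bid:", best_bid)
```

In this toy setting the reward is noisy, so the greedy bid can vary with the seed and the learning rate. A fuller treatment would replace the random clearing price with the solution of the operator's auction-clearing problem on the test network.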