With the advent of the Distributed Energy Resources within smart grid systems, traditional demand response management (DRM) models need to be redesigned to capture prosumers’ energy consumption requests and dynamic behavior within the energy market. In this paper, a coalitional DRM model is introduced based on the principles of Game Theory and reinforcement learning to dynamically determine prosumers’ formation in local energy trading communities and their optimal energy consumption. A hedonic game-theoretic model is introduced to enable prosumers to autonomously and dynamically select an energy trading community considering the partially available information regarding prosumers’ energy generation and consumption characteristics and utility companies’ provided rewards per community. Then, a log-linear reinforcement learning model is proposed to enable each prosumer to distributedly determine their optimal energy consumption. A detailed evaluation of the proposed coalitional DRM model is performed based on real data collected from the southwest area of the USA.