As renewable power generation increases in distribution networks, the real‐time power balance is becoming a tough challenge. Unlike simple peak‐load shedding or demand turn‐down scenarios, generation following (GF) requires persistent and precise control due to the temporal response performance of controlled resources. This motivates a comprehensive control design considering the temporal response limits and execution performance of air conditioner clusters (ACCs) when providing GF. Accordingly, this paper proposes a self‐constraint model predictive control (SMPC) that properly allocates the generation following task among different ACCs, consisting of three main parts: response rehearsal, distributed consistency‐based power allocation, and real‐time task execution. Specifically, the rehearsal knowledge of ACCs is evaluated by introducing model predictive control (MPC) to track power signals with different values and thus obtain prior factors, including the upward/downward limits and control cost function. On this basis, the coherence of the incremental response costs of different clusters is achieved to allocate the GF signals. Once the optimised following signals (OFS) are obtained, a real‐time MPC for generation following task execution is employed, where the OFS are used as reference and the upward/downward limits are used as constraints. Simulations are conducted to verify the feasibility and effectiveness of the proposed method.