Cognitive flexibility plays a key role in ensuring an individual’s survival, and its deficit is a key symptom in many mental conditions and neurodegenerative diseases. The prefrontal cortex and striatum are both essential to cognitive flexibility. However, how the prefrontal cortex and striatum communicate with each other to enable flexible decision-making is not well understood. Competing theories are raised, debating on which structure among these two leads the role in detecting and representing the new circumstances for a change, giving largely opposing predictions on neural activities in the prefrontal cortex and striatum during flexible behavior.To address this question, we trained head-restrained mice to perform an action-outcome based dynamic foraging task and simultaneously recorded single-neuron activities in the medial prefrontal cortex (mPFC) and dorsomedial striatum (DMS). In this task, the animal chooses one of two actions to obtain reward. The animal is guided only by previous reward outcomes. We report that mPFC but not DMS activity stores information about prior reward history. A large fraction of both mPFC and DMS neurons’ activity represents the difference in reward probability between two alternative options, namely the perceived reward probability difference (PRPD), a key decision variable that prescribes which subsequent choice to make. We find that mPFC neural activities track the change of PRPD earlier and faster than those in the DMS, and functional connectivity between mPFC and DMS increases with reducing overall reward proportion.