Neuromorphic visual sensors (NVS) based on photonic synapses hold a significant promise to emulate the human visual system. However, current photonic synapses rely on exquisite engineering of the complex heterogeneous interface to realize learning and memory functions, resulting in high fabrication cost, reduced reliability, high energy consumption and uncompact architecture, severely limiting the up‐scaled manufacture and on‐chip integration. Here we innovate a photo‐memory fundamental based on ion‐exciton coupling to simplify synaptic structure and minimize energy consumption. Due to the intrinsic organic/inorganic interface within the crystal, the photodetector based on monolithic 2D perovskite exhibits a persistent photocurrent lasting about 90 s, enabling versatile synaptic functions. The electrical power consumption per synaptic event is estimated to be ca. 1.45×10−16 J, one order of magnitude lower than that in a natural biological system. Proof‐of‐concept image preprocessing using the neuromorphic vision sensors enabled by photonic synapse demonstrates 4 times enhancement of classification accuracy. Furthermore, getting rid of the artificial neural network, an expectation‐based thresholding model is put forward to mimic the human visual system for facial recognition. This conceptual device unveils a new mechanism to simplify synaptic structure, promising the transformation of the NVS and fostering the emergence of next generation neural networks.This article is protected by copyright. All rights reserved