Frequency-modulated (FM) signals, prevalent across various applied disciplines, exhibit time-dependent frequencies and a multicomponent nature necessitating the utilization of time-frequency methods. Accurately determining the number of components in such signals is crucial for various applications reliant on this metric. However, this poses a challenge, particularly amidst interfering components of varying amplitudes in noisy environments. While the localized Rényi entropy (LRE) method is effective for component counting, its accuracy significantly diminishes when analyzing signals with intersecting components, components that deviate from the time axis, and components with different amplitudes. This paper addresses these limitations and proposes a convolutional neural network-based (CNN) approach for determining the local number of components using a time–frequency distribution of a signal as input. A comprehensive training set comprising single and multicomponent linear and quadratic FM components with diverse time and frequency supports has been constructed, emphasizing special cases of noisy signals with intersecting components and differing amplitudes. The results demonstrate that the estimated component numbers outperform those obtained using the LRE method for considered noisy multicomponent synthetic signals. Furthermore, we validate the efficacy of the proposed CNN approach on real-world gravitational and electroencephalogram signals, underscoring its robustness and applicability across different signal types and conditions.