Digital modulation classification is important for many civilian as well as military applications. In this paper, we propose a simple and robust feature to distinguish continuous-phase FSK from QAM and PSK modulations. The feature is based on product of two consecutive signal values and on time averaging of imaginary part of the product. Conditional probability density functions of the feature given modulation type are determined. In order to overcome the complexity of calculating probability density functions, central limit theorem for strictly stationary m-dependent sequences is used to obtain Gaussian approximations. After calculating probability density functions, thresholds are determined based on minimization of total probability of misclassification. Since threshold-based results are valid for special cases requiring knowledge of some parameters, we resort to usage of support vector machines for classification, which require little training and no a priori information except for carrier frequency. Following that joint effects on the performance of carrier offset, fast fading, and non-synchronized sampling are studied in the presence of additive white Gaussian noise. For comparison purposes, rectangular pulse shape is used. To prove practical usefulness, not only the performance is analyzed for root-raised cosine pulses but also for quite less oversampling of symbols than what is found in other approaches. In the course of doing that, the performance is compared with wavelet-based feature that uses support vector machines for modulation separation.