The existing classification of time-series data has difficulties that traditional methodologies struggle to address, such as complexity and dynamic variation. Difficulty with pattern recognition and longterm dependency modeling, high dimensionality and complex interactions between variables, and incompleteness of irregular intervals, missing values, and noise are the main causes for the degradation of model performance. Therefore, it is necessary to develop new classification methodologies to effectively process time-series data and make real-world applications. Accordingly, this paper proposes ViT-Based Multi-Scale Classification Using Digital Signal Processing and Image Transformation. It comprises feature extraction through digital signal processing (DSP), image transformation, and Vision Transformer (ViT) based classification. In the digital signal processing stage, a total of five features are extracted through sampling, quantization, and Discrete Fourier Transform (DFT), which are sampling time, sampled signal, quantized signal, and magnitudes and phases extracted through DFT processing. Subsequently, the extracted multi-scale features are used to generate new images. Finally, based on the generated images, a ViT model is applied to make multi-class classification. This paper confirms the superiority of the proposed approach by comparing traditional models with Vision Transformer and CNN models. Particularly, by showing excellent classification performance even for the most challenging classes, it proves effective data processing in terms of data diversity. Ultimately, this paper suggests a methodology for the analysis and classification of timeseries data and shows that it has the potential to be applied to a wide range of data analysis problems.