Music visualization is the transformation from music to images, which is a process presentation method and provides a brand-new interpretation and deduction way for music appreciation. If the music effect can be expressed in the form of images, and the audio-visual combination can be achieved, it will be more vivid and can better understand the artistic conception of the author. Because music itself has rich and subtle emotional information, it is difficult to mechanically transform it into vision by a single rule in the process of music visualization. In this article, a visual expression method of music based on multi-audio features and CAD is proposed. Firstly, multiple features of music are extracted, and then these features are comprehensively visualized, so that images can express more music information, thus improving the interactive experience of music appreciators. The results show that the proposed convolutional neural network (CNN) model achieves higher accuracy of original pitch than the traditional method. The intelligent recognition method of notes based on CAD uses hierarchical filtering and segmentation method for a musical melody to complete the operation, thus reducing the task of music feature and improving the efficiency of music feature recognition.