This paper explores sparse time-frequency distribution (TFD) using overcomplete discrete wavelet transform (DWT) and sparse representation techniques. This distribution is discovered for characterizing the periodic transient information embedded in rolling element bearings and extracting effective features that can discriminate different fault conditions. Based on the sparse TFD, a new sparse wavelet energy (SWE) feature is obtained by three main steps: first, an overcomplete discrete DWT is employed to decompose the fault signal and construct a redundant dictionary; second, the redundant dictionary is optimized by basis pursuit to obtain the sparsest TFD; finally, SWE is calculated from the new TFD to produce a feature vector for each signal. SWE features that combine the merits of overcomplete DWT and sparse representation techniques can precisely reveal fault-induced information, thereby exhibiting valuable properties for automatic fault identification by intelligent classifiers. The effectiveness and advantages of the proposed features are confirmed by simulation and the practical fault pattern recognition of rolling bearings.