In frequency-division duplexing (FDD) massive multiple-input multiple-output (MIMO), deep learning (DL)-based superimposed channel state information (CSI) feedback has presented promising performance. However, it is still facing many challenges, such as the high complexity of parameter tuning, large number of training parameters, and long training time, etc. To overcome these challenges, an extreme learning machine (ELM)-based superimposed CSI feedback is proposed in this paper, in which the downlink CSI is spread and then superimposed on uplink user data sequence (UL-US) to feed back to base station (BS). At BS, an ELM-based network is constructed to recover both downlink CSI and UL-US. In the constructed ELM-based network, we employ the simplified versions of ELM-based subnets to replace the subnets of DL-based superimposed feedback, yielding less training parameters. Besides, the input weights and hidden biases of each ELM-based subnet are loaded from the same matrix by using its full or partial entries, which significantly reduces the memory requirement. With similar or better recovery performances of downlink CSI and UL-US, the proposed ELM-based method has less training parameters, storage space, offline training and online running time than those of DL-based superimposed CSI feedback.INDEX TERMS massive multiple-input multiple-output (MIMO), channel state information (CSI), superimposed feedback, extreme learning machine (ELM). He is the author of more than 40 papers and more than 20 chinese inventions. His research interests include detection and estimation, massive MIMO systems, and deep learning in physical layer of wireless communications.