Live streaming services enable the audience to interact with one another and the streamer over live content. The surging popularity of live streaming platforms has created a competitive environment. To retain existing viewers and attract newcomers, streamers and fans often create a well-condensed summary of the streamed content. However, this process is manual and costly due to the length of online live streaming events. The current study identifies enjoyable moments in user-generated live video content by examining the audiences’ collective evaluation of its epicness. We characterize what features “epic” moments and present a deep learning model to extract them based on analyzing two million user-recommended clips and the associated chat conversations. The evaluation shows that our data-driven approach can identify epic moments from user-generated streamed content that cover various contexts (e.g., victory, funny, awkward, embarrassing). Our user study further demonstrates that the proposed automatic model performs comparably to expert suggestions. We discuss implications of the collective decision-driven extraction in identifying diverse epic moments in a scalable way.