Abstract-Episode creation is the task of classifying medical events and related clinical data to high-level concepts, such as diseases. Challenges in episode creation result in part because of data, in the patient record, only implicitly being associated with their respective episodes. Furthermore, traditional approaches have been limited to using feature-poor claims records to generate episodes. The accurate correlation of data to their episodes is valuable in health outcomes research to discern resource utilization with respect to medical conditions. This paper describes a combinatorial optimization approach for constructing episodes, which supports the incorporation of heterogeneous data types. Aspects of this approach include an episode model for characterizing the generation of data elements as a result of a process, a methodology for identifying the relationships between implicit processes and the data elements generated by the processes, a measure for evaluating candidate episode configurations, and an energy-minimization methodology for addressing episode creation. An implementation of this work, called Episode Creation Version 2 (EC2), has been applied on patient records with various episode types, which present with knee pain. EC2 demonstrated data element classification precision and recall scores of 78% and 82%, respectively. Significant improvements in precision and recall were observed over a traditional healthcare services approach.