in Wiley Online Library (wileyonlinelibrary.com).In principle, optimization-based ''superstructure'' methods for process synthesis can be more powerful than sequential-conceptual methods as they account for all complex interactions between design decisions. However, these methods have not been widely adopted because they lead to mixed-integer nonlinear programs that are hard to solve, especially when realistic unit operation models are used. To address this challenge, we develop a superstructure-based strategy where complex unit models are replaced with surrogate models built from data generated via commercial process simulators. In developing this strategy, we study aspects such as the systematic design of process unit surrogate models, the generation of simulation data, the selection of the surrogate's structure, and the required model fitting. We also present how these models can be reformulated and incorporated into mathematical programming superstructure formulations. Finally, we discuss the application of the proposed strategy to a number of applications.