Semi-supervised learning (SSL) offers a way to use the structural characteristics of an entire data set, regardless of label availability, for specific task learning. In this paper, structure refers to the salient waveform characteristics imparted by underlying physical processes that persist through the nonlinear transforms inherent to deep neural networks. Although an ideal feature set would exhibit structure dominated by target characteristics, in practice features can inherit dependencies which are not causally related to the prediction task. For seismic event data sets, where segmented input data spans the duration of the transient energy from a specific and discrete seismogenic process, inherited structure can come from processes including travel path, geologic site conditions, source properties, nonstationary seismic background noise, and various data artifacts. Although persistent seismic monitoring networks generate data-rich event catalogs that enable the development of deep neural network (DNN) models capable of making accurate decisions about source attributes, they do so with a complex set of features only some of which may relate to the seismogenic processes that matter for predictive modeling tasks. Yet despite imperfect feature learning, performance on seismic processing tasks using stable network geometries can be superior to traditional methods (Linville et al., 2019;Ross et al., 2019;Tibi et al., 2019) and provide capabilities that scale as data volumes increase (Nguyen et al., 2019). Therefore, even when models generalize poorly to new regions, they can add consequential value to the specific event processing pipelines they were developed for. While some authors have achieved meaningful generalization on seismic processing tasks beyond the regional networks their models were trained on (e.g., Ross et al., 2018), this work is focused on scenarios in the seismic domain where we can expect that we have collected data that imperfectly represents the set of possible states our system is capable of expressing, and we have, or can obtain at some cost, ground truth for a limited number of the observations that we have collected.