Elastic wavefield tomography faces challenging pitfalls due to its multiparameter and multicomponent characters, which are absent in the acoustic case. Inter-parameter crosstalk and the absence of petrophysical constraints may cause elastic inversion to fail, delivering unphysical and artifact-contaminated models. In addition, one of the goals of wavefield tomography is to deliver an earth model that generates accurate and high-quality images; however, this might not be the case for conventional data-domain tomography methods that exclude image optimization during inversion. Therefore, we propose to use the elastic reflection waveform inversion (ERWI) methodology, which inverts both for the background velocity model and for the reflectivity image, coupled with a petrophysical constraint term in the objective function. We demonstrate that constraining ERWI is successful in delivering more plausible models with fewer artifacts and that satisfy the imposed constraints. We alternate between smooth model and reflectivity updates, keeping both data fitting, image focusing and petrophysical constraints consistently satisfied in a common objective function. Compared to unconstrained inversion, our numerical examples show less-contaminated models and higher-quality images, as well as improved convergence and accuracy.