Fully developed turbulent free shear layers exhibit a high degree of order, characterized by large-scale coherent structures in the form of spanwise vortex rollers. Extensive experimental investigations show that such organised motions bear remarkable resemblance to instability waves, and their main characteristics, including the length scales, propagation speeds and transverse structures, are reasonably well predicted by linear stability analysis of the mean flow. In this paper, we present a mathematical theory to describe the nonlinear dynamics of coherent structures. The formulation is based on the triple decomposition of the instantaneous flow into a mean field, coherent fluctuations and small-scale turbulence but with the mean-flow distortion induced by nonlinear interactions of coherent fluctuations being treated as part of the organised motion. The system is closed by employing gradient type of models for the time-and phase-averaged Reynolds stresses of fine-scale turbulence. In the high-Reynolds-number limit, the nonlinear non-equilibrium critical-layer theory for laminar-flow instabilities is adapted to turbulent shear layers by accounting for (a) the enhanced non-parallelism associated with fast spreading of the mean flow, and (b) the influence of small-scale turbulence on coherent structures. The combination of these factors with nonlinearity leads to an interesting evolution system, consisting of the coupled amplitude and vorticity equations, in which non-parallelism contributes the so-called translating critical-layer effect. Numerical solutions of the evolution system capture vortex roll-up, which is the hallmark of turbulent mixing layer, and the predicted amplitude development mimics the qualitative feature of oscillatory saturation that has been observed in a number of experiments. A fair degree of quantitative agreement is obtained with one set of experimental data.