Intrinsically
disordered proteins (IDPs) populate a range of conformations
that are best described by a heterogeneous ensemble. Grouping an IDP
ensemble into “structurally similar” clusters for visualization,
interpretation, and analysis purposes is a much-desired but formidable
task, as the conformational space of IDPs is inherently high-dimensional
and reduction techniques often result in ambiguous classifications.
Here, we employ the t-distributed stochastic neighbor embedding (t-SNE)
technique to generate homogeneous clusters of IDP conformations from
the full heterogeneous ensemble. We illustrate the utility of t-SNE
by clustering conformations of two disordered proteins, Aβ42,
and α-synuclein, in their APO states and when bound to small
molecule ligands. Our results shed light on ordered substates within
disordered ensembles and provide structural and mechanistic insights
into binding modes that confer specificity and affinity in IDP ligand
binding. t-SNE projections preserve the local neighborhood information,
provide interpretable visualizations of the conformational heterogeneity
within each ensemble, and enable the quantification of cluster populations
and their relative shifts upon ligand binding. Our approach provides
a new framework for detailed investigations of the thermodynamics
and kinetics of IDP ligand binding and will aid rational drug design
for IDPs.