We introduce a novel statistical method, PyClone, for inference of clonal population structures in cancers. PyClone is a Bayesian clustering method for grouping sets of deeply sequenced somatic mutations into putative clonal clusters while estimating their cellular prevalences and accounting for allelic imbalances introduced by segmental copy number changes and normal cell contamination. Single cell sequencing validation demonstrates that PyClone infers accurate clustering of mutations that co-occur in individual cells.Human cancer progresses under Darwinian evolution where (epi)genetic variation alters molecular phenotypes in individual cells 1 . Consequently, tumours at diagnosis often consist of multiple, genotypically distinct cell populations ( Supplementary Fig. 1) 2 . These populations, referred to as clones, are related through a phylogeny and act as substrates for selection in tumour micro-environments or with therapeutic intervention 2, 3 . The prevalence of a particular clone measured over time or in anatomic space is a reflection of its growth and proliferative fitness. Thus, ascertaining the dynamic prevalence of clones can identify precise genetic determinants of phenotypes such as acquisition of metastatic potential or chemotherapeutic resistance.In this contribution, we provide a statistical model for analysis of deeply sequenced (coverage >1000x) mutations to identify and quantify clonal populations in tumours, which extends to modelling mutations measured in multiple samples from the same patient. Our approach uses the measurement of allelic prevalence to estimate the proportion of tumour ♣