Background
The taxonomic classification of the Cannabis genus has been delineated through three main types: sativa (tall, less branched plants with long, narrow leaves), indica (short but highly branched plants with broader leaves) and ruderalis (wild type with short stature, less branching and small, thick leaves). While still under discussion, particularly as to whether the genus is polytypic or monotypic, this broad classification reflects the putative geographical origin of each group and its putative chemotype and pharmacology.

Methods
Here we describe a thorough investigation of cannabis accessions using a set of 22 highly informative and polymorphic SNP markers associated with important traits such as cannabinoid and terpenoid expression as well as fibre and resin production. The assay offers insight into cannabis population structure, phylogenetic relationships, population genetics and correlation with secondary metabolite concentrations. We demonstrate the utility of this assay for rapid, repeatable and cost-efficient genotyping of commercial and industrial cannabis accessions for use in product traceability, breeding programs, regulatory compliance and consumer education.

Results
The main outcome is the identification of five clusters in the available sample set. These include industrial hemp and resin hemp, which likely underwent a bottleneck to stabilize CBDA accumulation (Type II and III). THC resin (Type I) accessions make up the other three clusters: terpinolene (colloquial “sativa” or “NLD”), myrcene/pinene, and myrcene/limonene (colloquial “indica” or “BLD”), which also putatively harbour an active CBCAS.

Conclusion
The functional basis of key traits is also discussed, as recently enabled by the NCBI Cannabis sativa Annotation Release 100. This annotation allows hypothesis testing with regard to secondary metabolite production as well as other key traits of importance for adaptable and compliant large-scale seed production under the new US Domestic Hemp Production Program.