PurposeThis paper aims to demonstrate how skills taxonomies can be used in combination with machine learning to integrate diverse online datasets and reveal skills gaps. The purpose of this study is then to show how the skills gaps revealed by the integrated datasets can be used to achieve better labour market alignment, keep educational offerings up to date and assist graduates to communicate the value of their qualifications.Design/methodology/approachUsing the ESCO taxonomy and natural language processing, this study captures skills data from three types of online data (job ads, course descriptions and resumes), allowing us to compare demand for skills and supply of skills for three different occupations.FindingsThis study illustrates three practical applications for the integrated data, showing how they can be used to help workers who are disrupted by technology to identify alternative career pathways, assist educators to identify gaps in their course offerings and support students to communicate the value of their training to employers.Originality/valueThis study builds upon existing applications of machine learning (detecting skills from a single dataset) by using the skills taxonomy to integrate three datasets. This study shows how these complementary, big datasets can be integrated to support greater alignment between the needs and offerings of educators, employers and job seekers.