In the years since the COMPASS dataset initiative was begun, many important research questions have been investigated using its large amount of health information pertaining to high school students across Canada, with findings guiding many decisions made by policy makers [1]. However, to use traditional statistical methods, specific data points must be selected by researchers to include in the analysis, leading to possible unexpected relationships and connections across the study's 280 data points being missed. As well, most analysis is done on a per-student basis, while policies are often implemented at the school level, so understanding behaviours across a school's population can make it easier for school decision makers to interpret findings. Motivated by these goals, this study introduces a novel deep learning-driven aggregate embedding method to determine group-level representations for individual schools from student-level survey responses based on architecture introduced in Variational Autoencoders [2]. This study aims to produce a method which allows for new patterns to be identified in the COMPASS data and for the resulting embedded representations to be applied in future analysis.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.