The dynamics and diversity of human gut microbiota that can remarkably influence the wellbeing and health of the host are constantly changing through the host’s lifetime in response to various factors. The aim of the present study was to determine a set of parameters that could have a major impact on classifying subjects into a single cluster regarding gut bacteria composition. Therefore, a set of demographical, environmental, and clinical data of healthy adults aged 25–50 years (117 female and 83 men) was collected. Fecal microbiota composition was characterized using Illumina MiSeq 16S rRNA gene amplicon sequencing. Hierarchical clustering was performed to analyze the microbiota data set, and a supervised machine learning model (SVM; Support Vector Machines) was applied for classification. Seventy variables from collected data were included in machine learning analysis. The agglomerative clustering algorithm suggested the presence of four distinct community types of most abundant bacterial phyla. Each cluster harbored a statistically significant different proportion of bacterial phyla. Regarding prediction, the most important features classifying subjects into clusters were measures of obesity (waist to hip ratio, BMI, and visceral fat index), total body water, blood pressure, energy intake, total fat, olive oil intake, total fiber intake, and water intake. In conclusion, the SVM model was shown as a valuable tool to classify healthy individuals based on their gut microbiota composition.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.