Increased application of multivariate data in many scientific areas has considerably raised the complexity of analysis and interpretation. Although quite a few approaches have been suggested to address this issue, there is still a gap between the most efficient proposed methods and available software. muvis is an R package (core team ( 2017)) which is a toolkit for analyzing multivariate datasets. Several tools are implemented for common analyses of multivariate datasets, including preprocessing, dimensionality reduction, statistical analysis, Probabilistic Graphical Modeling, hypothesis testing, and visualization. Furthermore, we have implemented two novel methods-Variable-wise Kullback-Leibler Divergence (VKL) and Violating Variable-wise Kullback-Leibler Divergence (VVKL)-in muvis, which are proposed to find the features with most different probability distributions between two groups of samples. The main aim of the package is to provide a wide range of users with different levels of expertise in R with a set of tools for comprehensive analysis of multivariate datasets. We exploited the NHANES dataset to declare the functionality of muvis in practice.
Throughout time, as medical and epidemiological studies have grown larger in scale, the challenges associated with extracting useful and relevant information from these data has mounted. General health surveys provide a good example for such studies as they usually cover large populations and are conducted throughout long periods in multiple locations. The challenges associated with interpreting the results of such studies include: the presence of both categorical and continuous variables and the need to compare them within a single statistical framework; the presence of variations in data resulting from the technical limitations in data collection; the danger of selection and information biases in hypothesis-directed study design and implementation; and the complete inadequacy of p values in identifying significant relationships. As a solution to these challenges, we propose an end-to-end analysis workflow using the MUltivariate analysis and VISualization (MUVIS) package within R statistical software. MUVIS consists of a comprehensive set of statistical tools that follow the basic tenet of unbiased exploration of associations within a dataset. We validate its performance by applying MUVIS to data from the Yazd Health Study (YaHS). YaHS is a prospective cohort study consisting of a general health survey of more than 30 health-related measurements and a questionnaire with over 300 questions acquired from 10050 participants. Given the nature of the YaHS dataset, most of the identified associations are corroborated by a large body of medical literature. Nevertheless, some more interesting and less investigated connections were also found which are presented here. We conclude that MUVIS provides a robust statistical framework for extraction of useful and relevant information from medical datasets and their visualization in easily comprehensible ways.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.