We present an experimental study of the preferential concentration of sub-Kolmogorov inertial particles in active-grid-generated homogeneous and isotropic turbulence, characterized via Voronoï tessellations.We show that the detection and quantification of clusters and voids is influenced by the intensity of the laser and high values of particles volume fraction φ v . Different biases on the statistics of Voronoï cells are analyzed to improve the reliability of the detection and the robustness in the characterization of clusters and voids. We do this by adapting Big-Data techniques that allow to process the particle images up to 10 times faster than standard algorithms.Finally, as preferential concentration is known to depend on multiple parameters, we performed experiments where one parameter was varied and all others were kept constant (φ v , Reynolds number based on the Taylor length scale Re λ , and residence time of the particles interacting with the turbulence). Our results confirm, in agreement with published work, that clustering increases with both φ v and Re λ . On the other hand, we find new evidence that the mean size of clusters increases with Re λ but decreases with φ v and that the cluster settling velocity is strongly affected by Re λ up to the maximum value studied here, Re
We show how the analysis of very large amounts of drug prescription data make it possible to detect, on the day of hospital admission, patients at risk of developing complications during their hospital stay. We explore, for the first time, to which extent volume and variety of big prescription data help in constructing predictive models for the automatic detection of at-risk profiles.Our methodology is designed to validate our claims that: (1) drug prescription data on the day of admission contain rich information about the patient's situation and perspectives of evolution, and (2) the various perspectives of big medical data (such as veracity, volume, variety) help in extracting this information. We build binary classification models to identify at-risk patient profiles. We use a distributed architecture to ensure scalability of model construction with large volumes of medical records and clinical data.We report on practical experiments with real data of millions of patients and hundreds of hospitals. We demonstrate how the fine-grained analysis of such big data can improve the detection of at-risk patients, making it possible to construct more accurate predictive models that significantly benefit from volume and variety, while satisfying important criteria to be deployed in hospitals.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.