Abstract-Neuroscientists increasingly use computational tools to build and simulate models of the brain. The amounts of data involved in these simulations are immense and efficiently managing this data is key.One particular problem in analyzing this data is the scalable execution of range queries on spatial models of the brain. Known indexing approaches do not perform well, even on today's small models containing only few million densely packed spatial elements. The problem of current approaches is that with the increasing level of detail in the models, the overlap in the tree structure also increases, ultimately slowing down query execution. The neuroscientists' need to work with bigger and more importantly, with increasingly detailed (denser) models, motivates us to develop a new indexing approach.To this end we developed FLAT, a scalable indexing approach for dense data sets. We based the development of FLAT on the key observation that current approaches suffer from overlap in case of dense data sets. We hence designed FLAT as an approach with two phases, each independent of density.Our experimental results confirm that FLAT achieves independence from data set size as well as density and also outperforms R-Tree variants in terms of I/O overhead from a factor of two up to eight. I. INTRODUCTIONScientists in various disciplines increasingly use computational tools to simulate, process and analyze experimental data. Computational tools make it substantially simpler for them to conduct scientific tasks. At the same time however, scientists are also increasingly buried in the data deluge produced by their tools. Being able to access the relevant parts of their data, i.e., their spatial models, quickly in order to analyze, understand, and prepare new experiments is pivotal for them.In this paper we thus develop a new index that efficiently supports scientists in executing range queries on dense data sets stemming from increasingly detailed spatial models.The work presented in this paper is motivated by our collaboration with the Blue Brain Project (BBP [17]). With data acquired in anatomical research on the cortex of the rat brain the neuroscientists in the BBP build biophysically realistic models, the most detailed computer models of the brain to date, for simulation based research in neuroscience. The project began by focusing on the elementary building block of the neocortex, a neocortical column of about 10,000 neurons. Morphologically speaking, each of these neurons has branches extending into large parts of the tissue in order to receive and send out information to other neurons. Figure 1 (left) shows a cell morphology, with cylinders modeling the branching of the dendrite and axon in three dimensions.
Abstract. Simulations have become key in many scientific disciplines to better understand natural phenomena. Neuroscientists, for example, build and simulate increasingly fine-grained models (including subcellular details, e.g., neurotransmitter) of the neocortex to understand the mechanisms causing brain diseases and to test new treatments in-silico. The sheer size and, more importantly, the level of detail of their models challenges today's spatial data management techniques. In collaboration with the Blue Brain project (BBP) we develop new approaches that efficiently enable analysis, navigation and discovery in spatial models of the brain. More precisely, we develop an index for the scalable and efficient execution of spatial range queries supporting model building and analysis. Furthermore, we enable navigational access to the brain models, i.e., the execution of of series of range queries where he location of each query depends on the previous ones. To efficiently support navigational access, we develop a method that uses previous query results to prefetch spatial data with high accuracy and therefore speeds up navigation. Finally, to enable discovery based on the range queries, we conceive a novel in-memory spatial join. The methods we develop considerably outperform the state of the art, but more importantly, they enable the neuroscientists to scale to building, simulating and analyzing massively bigger and more detailed brain models.
An emerging class of database applications is characterized by frequent updates of low-dimensional data, e.g. coming from sensors that sample continuous real world phenomena. Traditional persistency requirements can be weakened in this setting of frequent updates, emphasizing a role of the main-memory in external storage index structures and enabling a higher update throughput. Moreover, in order for an index to be suitable for practical applications, efficient past-state queries should be supported without significantly penalizing other operations.These issues are not adequately addressed in the database research. We report on the R R -tree-our first step towards resolving them. Based on this, we outline a number of concrete short-term and more abstract longer-term future research directions.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.