Integrated platform and API for electrophysiological data

Sobolev, Andrey; Stoewer, Adrian; Leonhardt, Aljoscha; Rautenberg, Philipp L.; Kellner, Christian Johannes; Garbers, Christian; Wachtler, Thomas

doi:10.3389/fninf.2014.00032

Cited by 10 publications

(14 citation statements)

References 17 publications

“…The primary features of the Cloudwave data flow include the use of Hadoop MapReduce and HDFS together with the flexibility to configure multiple parameters based on the availability of resources on a Hadoop cluster. This allows Cloudwave data flow to be deployed on different types of Hadoop clusters and to be used as a template to develop scalable neuroscience data processing data flow in many existing neuroinformatics projects, such as the GNDataPlatform (Sobolev et al, 2014a).…”

Section: Discussion, mentioning

confidence: 99%

“…The existing work on electrophysiological signal data management can be divided into two categories: (a) Data Representation Formats; and (b) Data Processing Tools. Although there is no existing standard for signal data representation, there are a large number of data formats developed by instrument vendors, researchers, and different neuroscience projects (Schlögl, 2010; Sobolev et al, 2014a). Signal data representation formats need to meet the requirements of multiple stakeholders and address multiple challenges, including the inherent complexity of signal data such as different sampling rates and scaling factors (Schlögl, 2010).…”

Section: Introduction, mentioning

confidence: 99%

“…The German Neuroinformatics Node (G-Node) integrates the NEO format with the open metadata Markup Language (odML; Grewe et al, 2011) to define the GNData format for use in a data management platform (Sobolev et al, 2014b). In addition to the data format, the GNData signal data management platform aims to develop a common storage layer with a generic API based on Representation State Transfer (REST) web services for signal data annotation and access control (Sobolev et al, 2014a). The GNData platform also uses the Hierarchical Data Format (HDF5) (Hierarchical Data Format (HDF5), 2014), which has generated a lot of interest in the neuroinformatics community as a potential common representation standard, to store the signal data.…”

Section: Introduction, mentioning

confidence: 99%

A scalable neuroinformatics data flow for electrophysiological signals using MapReduce

Jayapandian, Wei, Ramesh, et al. 2015. Front. Neuroinform.

Data-driven neuroscience research is providing new insights in progression of neurological disorders and supporting the development of improved treatment approaches. However, the volume, velocity, and variety of neuroscience data generated from sophisticated recording instruments and acquisition methods have exacerbated the limited scalability of existing neuroinformatics tools. This makes it difficult for neuroscience researchers to effectively leverage the growing multi-modal neuroscience data to advance research in serious neurological disorders, such as epilepsy. We describe the development of the Cloudwave data flow that uses new data partitioning techniques to store and analyze electrophysiological signal in distributed computing infrastructure. The Cloudwave data flow uses MapReduce parallel programming algorithm to implement an integrated signal data processing pipeline that scales with large volume of data generated at high velocity. Using an epilepsy domain ontology together with an epilepsy focused extensible data representation format called Cloudwave Signal Format (CSF), the data flow addresses the challenge of data heterogeneity and is interoperable with existing neuroinformatics data representation formats, such as HDF5. The scalability of the Cloudwave data flow is evaluated using a 30-node cluster installed with the open source Hadoop software stack. The results demonstrate that the Cloudwave data flow can process increasing volume of signal data by leveraging Hadoop Data Nodes to reduce the total data processing time. The Cloudwave data flow is a template for developing highly scalable neuroscience data processing pipelines using MapReduce algorithms to support a variety of user applications.

“…• INCF Dataspace (INC Working Group, 2013) enables interested research groups to connect to a distributed data file system based on iRods. • G-Node provides tools for data access, data management and data sharing, including a data sharing platform (Sobolev et al, 2014) based on common data models for electrophysiological data and metadata (Garcia et al, 2014; Grewe et al, 2011). To structure metadata, G-Node has developed odML, an XML schema for the creation of complex metadata structure in computer-readable format (Grewe et al, 2011).…”

Section: State Of the Art, mentioning

confidence: 99%

Framework for Collection of Electrophysiology Data

Ježek, Mouček, Krauz, et al. 2015. Proceedings of the International Conference on Health Informatics

Self Cite

Experiments in electrophysiology produce a lot of unstructured metadata collected in electrophysiology databases. The data are usually accessed through a web interface implemented on the top of data model respecting given data format. A lot of experiments are conducted outside the laboratory where access to these databases is not always available. The usage of mobile devices such as tablets or smart phones seems to be a practical solution, but users would welcome the same structured user interface such as they know from a common computer. When user interfaces of electrophysiology databases are tailored to a unique data structure, they cannot be easily reused on a mobile device. As a solution, a mapping of a general data structure to a graphical template is proposed. This mapping is implemented in a framework that generates a template representing the database structure. The parsing process is driven by supplemented annotations added to the code. Next, an Android tool visualizing a graphical layout generated from the template is developed. A use case study is presented on a database of EEG/ERP experiments.

“…Beyond neuroimaging, Sobolev et al (2014) present a data management platform for neurophysiological data, and Mouček et al (2014), and Tripathy et al (2014) describe techniques and methodologies for collecting and managing electrophysiological data.…”

mentioning

confidence: 99%