Jørgen Gustava Brandt scite author profile

Jørgen Gustava Brandt

5Publications

29Citation Statements Received

70Citation Statements Given

How they've been cited

How they cite others

120

Affiliations

Humboldt-Universität zu Berlin, Federal Institute for Risk Assessment

Publications

Order By: Most citations

BiobankCloud: A Platform for the Secure Storage, Sharing, and Processing of Large Biomedical Data Sets

Bessani

Brandt

Bux

et al. 2016

View full text Add to dashboard Cite

Biobanks store and catalog human biological material that is increasingly being digitized using next-generation sequencing (NGS). There is, however, a computational bottleneck, as existing software systems are not scalable and secure enough to store and process the incoming wave of genomic data from NGS machines. In the BiobankCloud project, we are building a Hadoop-based platform for the secure storage, sharing, and parallel processing of genomic data. We extended Hadoop to include support for multi-tenant studies, reduced storage requirements with erasure coding, and added support for extensible and consistent metadata. On top of Hadoop, we built a scalable scientific workflow engine featuring a proper workflow definition language focusing on simple integration and chaining of existing tools, adaptive scheduling on Apache Yarn, and support for iterative dataflows. Our platform also supports the secure sharing of data across different, distributed Hadoop clusters. The software is easily installed and comes with a user-friendly web interface for running, managing, and accessing data sets behind a secure 2-factor authentication. Initial tests have shown that the engine scales well to dozens of nodes. The entire system is open-source and includes pre-defined workflows for popular tasks in biomedical data analysis, such as variant identification, differential transcriptome analysis using RNA-Seq, and analysis of miRNA-Seq and ChIP-Seq data.

show abstract

SoFIA: a data integration framework for annotating high-throughput datasets

Childs

Mamlouk

Brandt

et al. 2016

View full text Add to dashboard Cite

show abstract

Computation semantics of the functional scientific workflow language Cuneiform

Brandt¹,

Reisig²,

Leser³

2017

J. Funct. Prog.

View full text Add to dashboard Cite

Cuneiform is a minimal functional programming language for large-scale scientific data analysis. Implementing a strict black-box view on external operators and data, it allows the direct embedding of code in a variety of external languages like Python or R, provides data-parallel higher order operators for processing large partitioned data sets, allows conditionals and general recursion, and has a naturally parallelizable evaluation strategy suitable for multi-core servers and distributed execution environments like Hadoop, HTCondor, or distributed Erlang. Cuneiform has been applied in several data-intensive research areas including remote sensing, machine learning, and bioinformatics, all of which critically depend on the flexible assembly of pre-existing tools and libraries written in different languages into complex pipelines. This paper introduces the computation semantics for Cuneiform. It presents Cuneiform's abstract syntax, a simple type system, and the semantics of evaluation. Providing an unambiguous specification of the behavior of Cuneiform eases the implementation of interpreters which we showcase by providing a concise reference implementation in Erlang. The similarity of Cuneiform's syntax to the simply typed lambda calculus puts Cuneiform in perspective and allows a straightforward discussion of its design in the context of functional programming. Moreover, the simple type system allows the deduction of the language's safety up to black-box operators. Last, the formulation of the semantics also permits the verification of compilers to and from other workflow languages.

show abstract

An Open-Source Community Resource for Creating, Collecting, Sharing and Applying Predictive Microbial Models (PMM-Lab)

Weiser

Filter

Falenski

et al. 2012

View full text Add to dashboard Cite

AgED: Extraction and Evaluation of Elliptic Fourier Descriptors from Image Data in Phenotype Assessment Applications

Brandt

Heyl

2013

View full text Add to dashboard Cite

In biological experiments, phenotype evaluation is a common challenge. In a wide variety of applications, the phenotypic features of organisms have to be measured and statistically assessed. This is especially important as differences between wild-type and mutant or treated and untreated organisms are often very subtle. Here, we propose a set of digital image transformations that implement preprocessing, feature extraction and statistical analysis of image data that is typically generated in a biological experiment. Moreover we present AgED-Analysis given Experimental Data, a software toolkit that facilitates the process of phenotypic feature evaluation from digital image data in an automatized fashion. Suitable statistical analysis and visualization is performed and controlled via a Graphical User Interface. Furthermore, the use of open data structures allows for the convenient reuse of the acquired feature data with miscellaneous data-mining software and scientific workflow systems. The functionality of this software tool is demonstrated and validated by repeating a phytohormone response experiment carried out on the fresh water alga Coleochaete scutata. The results showed that the timely and automatic processing of digital image data aides the researcher and rationalizes the formerly lengthy and, at times, error prone data evaluation in spreadsheet documents. Furthermore, the software toolkit AgED establishes a comparable evaluation standard and provides ready-to-publish graphic export facilities.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.