Ingo Müller scite author profile

Serverless computing has recently attracted a lot of attention from research and industry due to its promise of ultimate elasticity and operational simplicity. However, there is no consensus yet on whether or not the approach is suitable for data processing. In this paper, we present Lambada, a serverless distributed data processing framework designed to explore how to perform data analytics on serverless computing. In our analysis, supported with extensive experiments, we show in which scenarios serverless makes sense from an economic and performance perspective. We address several important technical questions that need to be solved to support data analytics and present examples from several domains where serverless offers a cost and performance advantage over existing solutions. CCS CONCEPTS• Information systems → Parallel and distributed DBMSs; Online analytical processing engines; Database query processing.

show abstract

Distributed join algorithms on thousands of cores

Barthels

Müller

Schneider

et al. 2017

Proc. VLDB Endow.

View full text Add to dashboard Cite

Traditional database operators such as joins are relevant not only in the context of database engines but also as a building block in many computational and machine learning algorithms. With the advent of big data, there is an increasing demand for efficient join algorithms that can scale with the input data size and the available hardware resources.In this paper, we explore the implementation of distributed join algorithms in systems with several thousand cores connected by a low-latency network as used in high performance computing systems or data centers. We compare radix hash join to sort-merge join algorithms and discuss their implementation at this scale. In the paper, we explain how to use MPI to implement joins, show the impact and advantages of RDMA, discuss the importance of network scheduling, and study the relative performance of sorting vs. hashing. The experimental results show that the algorithms we present scale well with the number of cores, reaching a throughput of 48.7 billion input tuples per second on 4,096 cores.

show abstract

Cache-Efficient Aggregation

Müller

Sanders

Lacurie

et al. 2015

View full text Add to dashboard Cite

Communication efficient algorithms for fundamental big data problems

Sanders

Schlag

Müller

2013

View full text Add to dashboard Cite

Towards Agent-Based Coalition Formation for Service Composition

Müller

Kowalczyk

Braun³

2006

View full text Add to dashboard Cite

The topic of agent-based service composition has been experiencing much attention recently. Researchers are applying agent technology with the aim to improve adaptiveness and flexibility of prevailing static Web service composition solutions. One major characteristic of multi-agent systems in particular is their ability of emergent behavior that allows gaining complex system behavior from small distributed sets of simple rules. This paper describes a multi-agent-based coalition formation approach for service composition that achieves emergent behavior based on a lightweight interaction protocol and decentralized decision making. The paper also presents evaluation results of first experiments to underline the validity of the approach.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Ingo Müller

Lambada: Interactive Data Analytics on Cold Data Using Serverless Cloud Infrastructure

Distributed join algorithms on thousands of cores

Cache-Efficient Aggregation

Communication efficient algorithms for fundamental big data problems

Towards Agent-Based Coalition Formation for Service Composition

Contact Info

Product

Resources

About