2015
DOI: 10.1109/tbdata.2015.2472014
|View full text |Cite
|
Sign up to set email alerts
|

Petuum: A New Platform for Distributed Machine Learning on Big Data

Abstract: What is a systematic way to efficiently apply a wide spectrum of advanced ML programs to industrial scale problems, using Big Models (up to 100s of billions of parameters) on Big Data (up to terabytes or petabytes)? Modern parallelization strategies employ fine-grained operations and scheduling beyond the classic bulk-synchronous processing paradigm popularized by MapReduce, or even specialized graph-based execution that relies on graph representations of ML programs. The variety of approaches tends to pull sy… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

2
238
0
1

Year Published

2016
2016
2023
2023

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 381 publications
(241 citation statements)
references
References 21 publications
2
238
0
1
Order By: Relevance
“…The IoTs is applied to assess whether circuits are associated and whether the equipment is running naturally to improve visiting efficien y, which ensures the safety of multimedia files Many researchers have attempted to produce systematic ways of applying a wide spectrum of advanced machine learning (ML) programs to industrial-scale problems. Xing et al proposed a general-purpose framework, Petuum, which systematically addresses data-and model-parallel challenges in large-scale ML by observing that many ML programs are fundamentally optimization-centric and admit error-tolerant, iterative-convergent algorithmic solutions [46]. This presents unique opportunities for an integrative system design, such as bounded-error network synchronization and dynamic scheduling based on an ML program structure.…”
Section: Analysis Methodsmentioning
confidence: 99%
“…The IoTs is applied to assess whether circuits are associated and whether the equipment is running naturally to improve visiting efficien y, which ensures the safety of multimedia files Many researchers have attempted to produce systematic ways of applying a wide spectrum of advanced machine learning (ML) programs to industrial-scale problems. Xing et al proposed a general-purpose framework, Petuum, which systematically addresses data-and model-parallel challenges in large-scale ML by observing that many ML programs are fundamentally optimization-centric and admit error-tolerant, iterative-convergent algorithmic solutions [46]. This presents unique opportunities for an integrative system design, such as bounded-error network synchronization and dynamic scheduling based on an ML program structure.…”
Section: Analysis Methodsmentioning
confidence: 99%
“…1. Specially, to speed up the process of data analysis, the Economic Data P reprocess and Economic F eature Selection are deployed in distributed platform [36].…”
Section: The Framework Of Economic Big Data Analysismentioning
confidence: 99%
“…Big-data offers many opportunities to evaluate data for example to identify preferences from end-users, to target fraud and also to relate other issues derived from a combined and statistical processing of data [3][5] [33]. Industrial big-data refers to large amounts of time series data generated at high-speed by industrial equipment, typically located in worldwide factories [5][6] [7][8] [9].…”
Section: Introductionmentioning
confidence: 99%