2018
DOI: 10.25080/majora-4af1f417-003
|View full text |Cite
|
Sign up to set email alerts
|

Composable Multi-Threading and Multi-Processing for Numeric Libraries

Abstract: Python is popular among scientific communities that value its simplicity and power, especially as it comes along with numeric libraries such as [NumPy], [SciPy], [Dask], and [Numba]. As CPU core counts keep increasing, these modules can make use of many cores via multi-threading for efficient multi-core parallelism. However, threads can interfere with each other leading to overhead and inefficiency if used together in a single application on machines with a large number of cores. This performance loss can be p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
4
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 1 publication
0
4
0
Order By: Relevance
“…38 To tune the hyperparameters of our models, we used the TPE approach implemented in the Optuna framework. 39 We have used Intel Distribution for Python and Python API for Intel Data Analytics Acceleration Library (Intel DAAL)-named PyDAAL 40 -to boost ML and data analytics performance. Using the advantage of optimized Scikit-learn (Scikit-learn with Intel DAAL) that comes with it, we were able to achieve faster training time and accurate results for the prediction problem.…”
Section: Model Buildingmentioning
confidence: 99%
“…38 To tune the hyperparameters of our models, we used the TPE approach implemented in the Optuna framework. 39 We have used Intel Distribution for Python and Python API for Intel Data Analytics Acceleration Library (Intel DAAL)-named PyDAAL 40 -to boost ML and data analytics performance. Using the advantage of optimized Scikit-learn (Scikit-learn with Intel DAAL) that comes with it, we were able to achieve faster training time and accurate results for the prediction problem.…”
Section: Model Buildingmentioning
confidence: 99%
“…We have used Intel Distribution for Python and Python API for Intel Data Analytics Acceleration Library (Intel DAAL) -named PyDAAL 30 -to boost machine-learning and data analytics performance. Using the advantage of optimised scikit-learn (Scikit-learn with Intel DAAL) that comes with it, we were able to achieve faster training time and accurate results for the prediction problem.…”
Section: (D) Splitting the Train And Test Datamentioning
confidence: 99%
“…The application of parallel and multiprocessor algorithms can break down significant numerical problems into smaller subtasks, reducing the total computation time on multiprocessor computers and resulting in better performance [23]. In dealing with this parallel computing problem, the concept of a processing "pool" is used: "tasks" (data) are forwarded in bulk to the pool, and the pool handles the distribution of tasks to a number of available worker processes [27]- [29].…”
Section: Introductionmentioning
confidence: 99%