2017
DOI: 10.1186/s40537-017-0080-9
|View full text |Cite
|
Sign up to set email alerts
|

Host managed contention avoidance storage solutions for Big Data

Abstract: The performance gap between compute and storage is fairly considerable. This results in a mismatch between the application needs from storage and what storage can deliver. The full potential of storage devices cannot be harnessed till all layers of I/O hierarchy function efficiently. Despite advanced optimizations applied across various layers along the odyssey of data access, the I/O stack still remains volatile. The problems associated due to the inefficiencies in data management get amplified in Big Data sh… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
6
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
2
2
2

Relationship

1
5

Authors

Journals

citations
Cited by 8 publications
(6 citation statements)
references
References 54 publications
(191 reference statements)
0
6
0
Order By: Relevance
“…Most studies have focused on studying data center operations to consolidate the computing needs and organize and optimize computing for multiple applications. Computing resources are believed to be abundant, but without appropriate attention, they are mostly waiting for data and wasting cycles [3] [8] [9] [29] [14] [30]. Moreover, for lineage based applications, the impact is more severe due to data-dependency between tasks.…”
Section: Related Workmentioning
confidence: 99%
See 3 more Smart Citations
“…Most studies have focused on studying data center operations to consolidate the computing needs and organize and optimize computing for multiple applications. Computing resources are believed to be abundant, but without appropriate attention, they are mostly waiting for data and wasting cycles [3] [8] [9] [29] [14] [30]. Moreover, for lineage based applications, the impact is more severe due to data-dependency between tasks.…”
Section: Related Workmentioning
confidence: 99%
“…Data centers today cater to a wide diaspora of applications which process multiple data sets for multiple jobs in a multi-user environment concurrently. They also deploy storage systems organized in multiple heterogeneous tiers, which is necessary to achieve cost-performance-capacity trade-off [3] [4] [5]. Dedicating physical resources for every application is not economically feasible.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…A genetic algorithm is used to cross mutate these code fragments to generate candidate titles, and test sets are used to detect the correctness of the output titles [8]. Mishra and Somani proposed the PAR method, which assumes that the materials for repairing errors exist in the source code of other projects and generates candidate titles by manually defining repair templates and randomly selecting constrained mutation operation, which overcomes the shortage of generating meaningless candidate titles by random mutation based on genetic algorithm [9,10]. Belyaev and Ray think that the ability of different repair templates is not the same, so high-frequency repair templates are preferred to generate candidate titles, which improve the accuracy of title generation compared with the PAR method [11,12].…”
Section: Introductionmentioning
confidence: 99%