The ability to accurately estimate job runtime properties allows a scheduler to effectively schedule jobs. State-of-the-art online cluster job schedulers use history-based learning, which uses past job execution information to estimate the runtime properties of newly arrived jobs. However, with fast-paced development in cluster technology (in both hardware and software) and changing user inputs, job runtime properties can change over time, which leads to inaccurate predictions. In this paper, we explore the potential and limitations of real-time learning of job runtime properties, by proactively sampling and scheduling a small fraction of the tasks of each job. Such a task-sampling-based approach exploits the similarity among runtime properties of the tasks of the same job and is inherently immune to changing job behavior. Our analytical and experimental analysis of 3 production traces with different skew and job distribution shows that learning in space can be substantially more accurate. Our simulation and testbed evaluation on Azure of the two learning approaches anchored in a generic job scheduler using 3 production cluster job traces shows that despite its online overhead, learning in space reduces the average Job Completion Time (JCT) by 1.28×, 1.56×, and 1.32× compared to the prior-art history-based predictor. Finally, we show how the sampling-based learning can be extended to schedule DAG jobs and achieve similar speedups over the prior-art history-based predictor.
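The following is a minimal sketch, not the paper's implementation, of the task-sampling idea described above: run a small fraction of a job's tasks first, use their measured runtimes to estimate the job's size, and then order jobs shortest-first. All names (`estimate_avg_task_runtime`, `run_task`, `order_jobs_sjf`) and the sampling fraction are hypothetical choices for illustration.

```python
import random
from typing import Callable, Dict, List, Sequence, Tuple

def estimate_avg_task_runtime(
    tasks: Sequence[str],
    run_task: Callable[[str], float],
    sample_frac: float = 0.05,   # assumed sampling fraction, not from the paper
    min_samples: int = 2,
) -> Tuple[float, List[str]]:
    """Run a small sample of a job's tasks and use their mean runtime as the job's estimate."""
    k = min(len(tasks), max(min_samples, int(len(tasks) * sample_frac)))
    pilots = random.sample(list(tasks), k)
    runtimes = [run_task(t) for t in pilots]           # pilot tasks are scheduled first
    return sum(runtimes) / len(runtimes), pilots       # tasks of the same job are assumed similar

def order_jobs_sjf(job_tasks: Dict[str, List[str]],
                   estimates: Dict[str, float]) -> List[str]:
    """Approximate SJF: sort jobs by estimated remaining work = avg task runtime * #tasks."""
    return sorted(job_tasks, key=lambda j: estimates[j] * len(job_tasks[j]))

if __name__ == "__main__":
    # Toy demo with synthetic runtimes; a real scheduler would measure the pilot tasks.
    sizes = {"jobA": 2.0, "jobB": 0.5}
    jobs = {j: [f"{j}-t{i}" for i in range(40)] for j in sizes}
    run = lambda t: sizes[t.split("-")[0]] * random.uniform(0.9, 1.1)
    est = {j: estimate_avg_task_runtime(ts, run)[0] for j, ts in jobs.items()}
    print(order_jobs_sjf(jobs, est))   # expected: ['jobB', 'jobA']
```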
Coflow scheduling improves data-intensive application performance by improving their networking performance. State-of-the-art online coflow schedulers in essence approximate the classic Shortest-Job-First (SJF) scheduling by learning the coflow size online. In particular, they use multiple priority queues to simultaneously accomplish two goals: to sieve long coflows from short coflows, and to schedule short coflows with high priorities. Such a mechanism pays high overhead in learning the coflow size: moving a large coflow across the queues delays small and other large coflows, and moving similar-sized coflows across the queues results in inadvertent round-robin scheduling. We propose Philae, a new online coflow scheduler that exploits the spatial dimension of coflows, i.e., a coflow has many flows, to drastically reduce the overhead of coflow size learning. Philae pre-schedules sampled flows of each coflow and uses their sizes to estimate the average flow size of the coflow. It then resorts to Shortest Coflow First, where the notion of shortest is determined using the learned coflow sizes and coflow contention. We show that the sampling-based learning is robust to flow size skew and has the added benefit of much improved scalability from reduced coordinator-local agent interactions. Our evaluation using an Azure testbed and a publicly available production cluster trace from Facebook shows that compared to the prior art Aalo, Philae reduces the coflow completion time (CCT) in the average (P90) case by 1.50× (8.00×) on a 150-node testbed and 2.72× (9.78×) on a 900-node testbed. Evaluation using additional traces further demonstrates Philae's robustness to flow size skew.
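Below is a minimal sketch, under assumptions, of the sampling-then-Shortest-Coflow-First idea: estimate each coflow's average flow size from a few sampled flows, then rank coflows by estimated size scaled by contention. The names (`estimate_avg_flow_size`, `shortest_coflow_first`), the width-based contention term, and the sampling fraction are illustrative assumptions rather than the scheduler's actual design.

```python
import random
from typing import Dict, List

def estimate_avg_flow_size(flow_sizes: List[float], sample_frac: float = 0.1,
                           min_samples: int = 2) -> float:
    """Average the sizes of a few sampled flows; stands in for pre-scheduling pilot flows."""
    k = min(len(flow_sizes), max(min_samples, int(len(flow_sizes) * sample_frac)))
    sampled = random.sample(flow_sizes, k)
    return sum(sampled) / len(sampled)

def shortest_coflow_first(coflows: Dict[str, List[float]],
                          width: Dict[str, int]) -> List[str]:
    """Rank coflows by estimated total size divided by a simple contention proxy (width)."""
    def score(c: str) -> float:
        avg = estimate_avg_flow_size(coflows[c])
        return avg * len(coflows[c]) / max(width[c], 1)   # smaller score = scheduled first
    return sorted(coflows, key=score)

if __name__ == "__main__":
    # Toy demo: coflow c2 has much smaller flows, so it should be scheduled first.
    coflows = {"c1": [10.0] * 50, "c2": [1.0] * 50}
    width = {"c1": 10, "c2": 10}          # number of ports each coflow spans (assumed)
    print(shortest_coflow_first(coflows, width))   # expected: ['c2', 'c1']
```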
The Morris worm [10, 46] was one of the first worms spread via the internet. It was released on November 2, 1988, and changed how computer security was viewed by computer professionals as well as the general public [3]. Since its inception, the Morris worm has been studied extensively from the security point of view [46, 37, 43, 10] and is still a point of interest [31, 36].