Abstract-Most existing commercial video servers are designed for a single server. Consequently, the capacity of the system in terms of maximum sustainable concurrent sessions is limited by the performance of the video server hardware. This paper proposes and analyzes the performance of a novel parallel video server architecture where video data are striped across an array of autonomous servers. The architecture allows one to build incrementally scalable video servers without video data replication. The proposed concurrent-push scheduling algorithm allows the system to integrate with quality of service guarantees provided by today's switching networks. In this paper, the striping policy, the service model, and the concurrent-push scheduling algorithm are presented. A system model is constructed to quantify three performance metrics, namely, server buffer requirement, client buffer requirement, and system response time. Results show that a simple extension of the server-push service model does not perform well under the parallel video server architecture. To improve system performance, a novel extension of the grouped sweeping scheme called the asynchronous grouped sweeping scheme (AGSS) is introduced. To further increase the scalability of the architecture, a new subschedule striping scheme (SSS) is introduced. With the proposed AGSS and SSS, our parallel video server architecture can be scaled up to more than 10 000 concurrent users.Index Terms-Concurrent push, grouped sweeping scheme (GSS), parallel video server, performance analysis, scheduling algorithm, server push, server striping, video on demand.