Video summarization is a fundamental challenge in the field of computer vision and multimedia processing, aimed at condensing lengthy videos into concise representations without compromising the essential content and context. This project focuses on the integration of object detection techniques into the process of video summarization, harnessing the power of deep learning to automatically identify and extract key objects and events from video sequences. By leveraging state-of-the-art object detection models and innovative summarization algorithms, this project aims to enhance the efficiency and effectiveness of video summarization, enabling users to quickly grasp the content and significance of videos without the need for exhaustive playback. The proposed approach not only streamlines video browsing and content comprehension but also holds potential applications in various domains, including surveillance, video indexing, and content recommendation systems