In an era of increasingly crowd-focused activities, understanding the dynamics of people's flow is of paramount importance. However, data derived solely from detecting and counting individuals can prove inadequate for certain use-cases. Addressing this deficiency, this study introduces a robust method to enrich data extraction, thereby enhancing its value to a wide range of stakeholders. Presented herein is a novel system dubbed YOLO-Gender, an innovative integration of YOLO and CNN, designed to deliver comprehensive people tracking and gender classification. This gender recognition component provides a much-needed edge in crowd management and facilitates efficient planning of gender-specific services. The core foundation of the system is built upon YOLOv8, the apex of the YOLO model series, renowned for its unparalleled accuracy and efficiency. Through the use of transfer learning models pre-trained on ImageNet, gender recognition is achieved, showcasing a marked enhancement over conventional CNN models. Assessments of this system validate its robust performance, underlining its potential for large-scale deployment. This study represents a significant step forward in AI-powered surveillance, offering a solution that effectively enriches and analytically processes extracted data.