2022
DOI: 10.48550/arxiv.2201.06159
Preprint
YOLO -- You only look 10647 times

Abstract: With this work we explain the "You Only Look Once" (YOLO) single-stage object detection approach as a parallel classification of 10647 fixed region proposals. We support this view by showing that each of YOLO's output pixels is attentive to a specific sub-region of previous layers, comparable to a local region proposal. This understanding reduces the conceptual gap between YOLO-like single-stage object detection models, RCNN-like two-stage region-proposal-based models, and ResNet-like image classification …
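The 10647 in the title can be reproduced with a short back-of-the-envelope calculation. The sketch below assumes a YOLOv3-style head with a 416×416 input, three detection scales (strides 8, 16, and 32), and 3 anchors per grid cell — a common configuration, though the paper itself should be consulted for the exact setup:

```python
# Count the fixed "region proposals" a YOLOv3-style head evaluates in parallel.
# Assumed configuration: 416x416 input, three scales (strides 8, 16, 32),
# 3 anchors per grid cell.
input_size = 416
strides = [8, 16, 32]
anchors_per_cell = 3

# Each stride s yields a (input_size // s)^2 grid; every cell predicts
# anchors_per_cell boxes, each of which acts as one fixed region proposal.
total = sum((input_size // s) ** 2 * anchors_per_cell for s in strides)
print(total)  # 10647
```

The three grids contribute 52² + 26² + 13² = 3549 cells, and 3 anchors per cell gives the 10647 parallel classifications the abstract refers to.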

Cited by 3 publications (4 citation statements). References 6 publications.
“…See more details on our implementation in the Age Prediction notebook [5]. We have achieved good results in age prediction using the You Only Look Once (YOLO) architecture [9,10,11]. Although YOLO is an object detection algorithm, we mainly utilized its class prediction capabilities and abandoned the bounding box prediction functionality.…”
Section: Methods
confidence: 99%
“…Expressions and emotions from the target image are kept, while the facial identity is swapped. First, it collects a dataset of faces of two people, A and B, using an object detection method [102]. Secondly, it trains two auto-encoders E A , E B to encode and two decoders DC A , DC B to reconstruct the faces of A and B respectively.…”
Section: Face Replacement and Face Transfer
confidence: 99%
“…During training, 200 images were assigned to the test dataset. In the training process of the object detection, we need to define 9 anchor frames to accelerate the model training and improve the detection accuracy. The anchor frames are predefined bounding box patterns used by YOLOv5 to delineate regions for object candidates [19]. Each frame (x, y) indicates the width and height of the target size prediction bounding box.…”
Section: Object Detection
confidence: 99%
“…These frames were selected by using kmeans to cluster the labels in the training set. Please refer to [19] for more details regarding the selection of anchor frames. In this study, the following anchor frames are used.…”
Section: Object Detection
confidence: 99%
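The anchor-frame selection described in the last two citation statements — clustering the (width, height) pairs of the training labels into 9 groups — can be sketched with plain Lloyd's k-means. This is a minimal illustration, not the cited paper's code: YOLOv5's autoanchor uses an IoU-based distance metric, whereas the sketch below uses Euclidean distance for simplicity, and the input boxes are synthetic:

```python
import numpy as np

def kmeans_anchors(boxes, k=9, iters=50, seed=0):
    """Cluster (width, height) pairs into k anchor shapes via Lloyd's k-means.
    `boxes` is an (N, 2) array of ground-truth box sizes. Note: YOLOv5's
    autoanchor clusters with an IoU-based metric; Euclidean distance is used
    here only to keep the sketch short."""
    rng = np.random.default_rng(seed)
    # initialize centers from k randomly chosen label boxes
    centers = boxes[rng.choice(len(boxes), k, replace=False)]
    for _ in range(iters):
        # assign each box to its nearest center
        d = np.linalg.norm(boxes[:, None, :] - centers[None, :, :], axis=2)
        labels = d.argmin(axis=1)
        # move each center to the mean of its assigned boxes
        for j in range(k):
            if (labels == j).any():
                centers[j] = boxes[labels == j].mean(axis=0)
    # return anchors sorted by area, small to large
    return centers[np.argsort(centers.prod(axis=1))]

# synthetic stand-in for a training label set: 500 (width, height) pairs
rng = np.random.default_rng(1)
boxes = rng.uniform(10, 300, size=(500, 2))
anchors = kmeans_anchors(boxes, k=9)
print(anchors.shape)  # (9, 2)
```

The 9 resulting (width, height) pairs play the role of the predefined anchor frames: three small, three medium, and three large shapes that the detection head refines rather than regressing box sizes from scratch.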