Multi-object tracking (MOT) constructs multiple object trajectories by associating detections between consecutive frames while maintaining object identities. In many autonomous systems equipped with a camera and a radar, an amplitude and visual features can be measured. Therefore, our goal is to solve a MOT problem by associating detections with both features. To achieve it, we propose a unified MOT framework based on object model learning and confidence-based association. For improving discriminability between different objects, we present a method to learn several visual and amplitude object models during online tracking. By applying the learned object models for the affinity evaluation, we improve the confidence-based association further. In addition, we present a practical track management method to initialize and terminate tracks, and eliminate duplicated false tracks. We implement several MOT systems with different object model learning and association methods, and compare our system with them on challenging visual MOT datasets. We further compare our method with the recent deep appearance learning methods. These comparisons verify that our method can achieve the competitive tracking accuracy while maintaining a low MOT complexity.
The recent advance of synthetic image generation and manipulation methods allows us to generate synthetic face images close to real images. On the other hand, the importance of identifying the synthetic face images increases more and more to protect personal privacy from those. Although some deep learning-based image forensic methods have been developed recently, it is still challenging to distinguish synthetic images generated by recent image generation and manipulation methods such as the deep fake, face2face, and face swap. To resolve this challenge, we propose a novel generative adversarial ensemble learning method. We train multiple discriminative and generative networks based on the adversarial learning. Compared to the conventional adversarial learning, our method is however more focused on improving the discrimination ability rather than image generation one. To this end, we improve the discriminabilty by ensembling outputs from different two discriminators. In addition, we train two generators in order to generate general and hard synthetic images. By ensemble learning of all the generators and discriminators, we improve the discriminators by using the generated synthetic face images, and improve the generators by passing the combined feedback of the discriminators. On the FaceForensics benchmark challenge, we thoroughly evaluate our methods by comparing the recent methods. We also provide the ablation study to prove the effectiveness and usefulness of our method. INDEX TERMS Digital image forensics, generative adversarial ensemble learning, deep learning, synthetic image detection, face image.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.