Machine perception is a key challenge towards autonomous systems. Especially in the field of computer vision, numerous novel approaches have been introduced in recent years. This trend is based on the availability of public datasets. Logistics is one domain that could benefit from such innovations. Yet, there are no public datasets available. Accordingly, we create the first public dataset for scene understanding in logistics. The Logistics Objects in COntext (LOCO) dataset contains 39,101 images. In its first release there are 5,593 bounding-box annotated images. In total 151,428 instances of pallets, small load carriers, stillages, forklifts and pallet trucks were annotated. We also present and discuss our data acquisition approach which features enhanced privacy protection for workers. Finally, we provide an in-depth analysis of LOCO, compare it to other datasets (i.e. OpenImages and MS COCO) and show that it has far more annotations per image and also a considerably smaller annotation size. The dataset and future extensions will be available on our website (https://github.com/tum-fml/loco).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.