With the popularization of machine learning (ML) techniques and the increased chipset's performance, the application of ML to pedestrian localization systems has received significant attention in the last years. Several survey papers have attempted to provide a state-of-the-art overview, but they usually limit their scope to a particular type of positioning system or technology. In addition, they are written from the point of view of ML techniques and their practice, not from the point of view of the localization system and the specific problems that ML techniques can help to solve. This article is intended to offer a comprehensive state-of-the-art survey of the ML techniques that have been adopted over the last ten years to improve the performance of pedestrian localization systems, addressing the applicability of ML techniques in this domain, along with the main localization strategies. It concludes by indicating the underlying open issues and challenges associated with the existing systems, and possible future directions in which ML techniques could improve the performance of pedestrian localization systems. Among other open issues, most previous authors have focused their attention on position estimation accuracy, which wastes the potential of ML techniques to improve other performance parameters (e.g., response time, computational complexity, robustness, scalability or energy efficiency). This study shows that there is a strong trend towards the application of supervised learning. Consequently, there are many potential research opportunities in the use of other learning types, such as unsupervised and reinforcement learning, to improve the performance of pedestrian localization systems.