With the ever-increasing volume of scientific literature, there is a strong need for methods that identify relevant information rigorously. In this contribution, a state-of-the-art natural language processing (NLP) model was used to select perovskite materials for electrocatalytic applications from the literature. This was accomplished