“…One type of datasets relies at least partially on legal files being coded by humans (Pollack, 1994;Toshkov, 2010). The other type involves automated data collection using computer-based coding to derive information from various sources related to EU law and law-making, such as Eur-Lex, Pre-Lex, or other institutional sites (Fjelstul, 2019;Häge, 2011;Hurka et al, 2021;König et al, 2006;Ovádek, 2021). The advantages of automated data collection include the large amount of data involved, their potential to provide continuous updates, and the replicability for the data collection process.…”