“…Our focus here is on inferring the data type for each column in a table of data. Numerous studies have attempted to tackle type inference, including wrangling tools (Raman and Hellerstein, 2001; Kandel et al, 2011; Guo et al, 2011; Trifacta, 2018; Fisher and Gruber, 2005; Fisher et al, 2008), software packages (Petricek et al, 2016; Lindenberg, 2017; Stochastic Solutions, 2018; Döhmen et al, 2017a; Wickham et al, 2017), and probabilistic approaches (Valera and Ghahramani, 2017; Vergari et al, 2019; Limaye et al, 2010). However, these approaches often perform poorly in the presence of missing and anomalous data, which are commonly found in raw data sets due to the lack of a well-organized data collection procedure.…”