Mining Massive Fine-Grained Behavior Data to Improve Predictive Analytics

Martens, David; Provost, Foster; Clark, Jessica; Fortuny, Enric Junqué de

doi:10.25300/misq/2016/40.4.04

Cited by 109 publications

(59 citation statements)

References 25 publications


“…In terms of lift, adding the implied network score to the direct plus rating model does not appear to lead to higher performance: when using 5 months of data, the lift of the full ensemble model and the direct plus rating model overlap. The results are in line with other studies that use relational learners on fine‐grained data: network data give a boost to the model lift (Martens et al., ). In practical terms, this means that among the highest (worst) scores of the models that include payment data in a direct network there are more actual defaulters than among the highest scores of the traditional rating model.…”

Section: Results (mentioning)

confidence: 99%
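
The lift comparison described in the statement above can be illustrated with a minimal sketch. It assumes lift is computed as the default rate among the top-scored fraction divided by the overall default rate; the function name `lift_at` and the toy data are hypothetical and are not taken from the cited papers.

```python
def lift_at(scores, labels, fraction):
    """Lift among the `fraction` of cases with the highest scores.

    scores: risk scores, higher meaning riskier (assumed convention).
    labels: 1 for an actual defaulter, 0 otherwise.
    """
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    n = len(scores)
    if n == 0 or n != len(labels):
        raise ValueError("scores and labels must be non-empty and equal in length")
    base_rate = sum(labels) / n
    if base_rate == 0:
        raise ValueError("lift is undefined when there are no positives")
    # Keep at least one case in the top segment.
    k = max(1, int(round(n * fraction)))
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    top_rate = sum(label for _, label in ranked[:k]) / k
    return top_rate / base_rate


# Toy data (hypothetical): 10 customers, 2 defaulters, both ranked on top.
scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
labels = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
top_lift = lift_at(scores, labels, 0.2)
```

A lift above 1 for a given fraction means the top-scored segment contains proportionally more defaulters than the population as a whole, which is the sense in which one model "detects more defaulters" than another at the same cutoff.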

“…Contrary to previous studies (Junqué de Fortuny et al., 2013), we do not increase the number of clients, but the number of counterparties and known transactions per client. Nonetheless, we find the same conclusions: when working with fine‐grained data, bigger is better (Junqué de Fortuny et al., 2013; Martens et al., ).…”

Section: Results (mentioning)

confidence: 99%

“…The use of behavioural data has proven to be successful in other domains such as targeted advertising (Martens et al, 2016), fraud detection (Junqué de Fortuny et al, 2014) and customer retention (Verbeke et al, 2014). The nature of these large behavioural data sets requires a modelling approach that is different from the traditional, structured data sets.…”

Section: Credit Scoring and Behavioural Data (mentioning)

confidence: 99%

“…By creating both networks, we rely on the sociological concept of assortativity which states that people are more likely to form bonds with others who have similar characteristics such as values, beliefs and socio-economic status (McPherson et al, 2001). By creating a direct network, we build on the theory of assortativity and assume that people of similar creditworthiness tend to cluster (Martens et al, 2016).…”

Section: Transforming Transactions Into Predictions (mentioning)

confidence: 99%

“…The implied network is named a pseudosocial network: as in a true social network, strongly connected consumers demonstrate a strong similarity, at the very least in the particular merchants with whom they transact. It is a pseudosocial network because, by and large, the linked consumers probably have no true social relationship with one another (Martens et al, 2016).…”

Section: Transforming Transactions Into Predictions (mentioning)

confidence: 99%
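
The implied (pseudosocial) network described in the statement above can be sketched as a projection of consumer-merchant transactions onto consumers. This is an illustrative sketch only: the function name `implied_network`, the toy transactions, and the choice to weight each link by the number of shared merchants are assumptions, not the weighting scheme used in the cited papers.

```python
from collections import defaultdict
from itertools import combinations


def implied_network(transactions):
    """Link consumers who transact with at least one common merchant.

    transactions: iterable of (consumer, merchant) pairs.
    Returns a dict mapping a sorted (consumer_a, consumer_b) tuple to the
    number of merchants the two consumers share (assumed weighting).
    """
    # Group consumers by the merchant they transacted with.
    consumers_by_merchant = defaultdict(set)
    for consumer, merchant in transactions:
        consumers_by_merchant[merchant].add(consumer)

    # Every pair of consumers sharing a merchant gains one unit of weight.
    weights = defaultdict(int)
    for consumers in consumers_by_merchant.values():
        for a, b in combinations(sorted(consumers), 2):
            weights[(a, b)] += 1
    return dict(weights)


# Toy transactions (hypothetical names).
transactions = [
    ("ann", "bakery"), ("bob", "bakery"),
    ("ann", "gym"), ("bob", "gym"), ("cat", "gym"),
]
network = implied_network(transactions)
```

In this toy example `ann` and `bob` are linked most strongly because they share two merchants, even though nothing in the data says they know each other, which is what makes the network pseudosocial rather than social. Enumerating all pairs per merchant is quadratic in the number of consumers per merchant, so a popular merchant would need a different treatment at scale.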


Retail Credit Scoring using Fine-Grained Payment Data

Tobback; Martens. 2019. Journal of the Royal Statistical Society Series A: Statistics in Society

Self Cite


Summary: Banks are continuously looking for novel ways to leverage their existing data assets. A major source of data that has not yet been used to the full extent is massive fine‐grained payment data on the bank's customers. In the paper, a design is proposed that builds predictive credit scoring models by using the fine‐grained payment data. Using a real-life data set of 183 million transactions made by 2.6 million customers, we show that the scalable implementation that is put forward leads to a significant improvement in the receiver operating characteristic area under the curve, with only seconds of computation needed. When investigating the 1% riskiest customers, twice as many defaulters are detected when using the payment data. Such an improvement has a big effect on the overall working of the bank, from applicant scoring to minimum capital requirements.
