Building an efficient and effective credit scorer for enterprises is an important and urgent demand in the cross-border e-commerce industry. In this paper, we present a framework to build a credit scorer using e-commerce data integrated from various sources. First, an improved dependency graph approach is proposed to recognize distinct records in the dataset. Then, we apply logistic regression using a prejudice remover regularizer to train the model, preceded by predictor preparation through binning and evaluating their information value. Lastly, we build the credit scorer according to the coefficients of the model. We implement our framework on a dataset from the official customs database and a large cross-border e-commerce platform. The empirical results demonstrate that the scorer built by our methodology can be used to effectively evaluate enterprises, while also removing prejudice against small and medium enterprises to a certain extent.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.