2021
DOI: 10.1108/dta-12-2020-0312

Robust cross-lingual knowledge base question answering via knowledge distillation

Abstract: Purpose: Previous knowledge base question answering (KBQA) models consider only the monolingual scenario and cannot be directly extended to the cross-lingual scenario, in which the language of the questions differs from that of the knowledge base (KB). Although a machine translation (MT) model can bridge this gap by translating questions into the language of the KB, noise in the translated questions can accumulate and sharply impair the final performance. Therefore, the authors propose a method to improve …

Cited by 1 publication (1 citation statement)
References 24 publications
“…The idea of distilling the knowledge in a neural network was first put forth by Hinton as a way to improve the performance of a machine learning algorithm by compressing the knowledge of a complex model into a single, simpler model (Hinton et al., 2015). Although this idea was originally conceived as a compression mechanism rather than an enhancement, it has since evolved into a technique for improving performance (Wang and Dang, 2021; Hongyuan et al., 2022). In speech and object recognition tasks, training must extract structure from very large, highly redundant datasets, which often leads to a cumbersome model.…”
Section: Related Work
confidence: 99%
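
The distillation idea summarized in the citation statement above can be made concrete with a short sketch. Below is a minimal, illustrative PyTorch example of Hinton-style knowledge distillation; the toy teacher/student networks, the temperature T and the mixing weight alpha are assumptions chosen for illustration, not details taken from the cited paper.

```python
# Minimal sketch of knowledge distillation (Hinton et al., 2015): a compact
# "student" model is trained to match the temperature-softened output
# distribution of a larger, frozen "teacher" model, in addition to the
# ordinary hard-label objective. Architectures and hyperparameters here
# are illustrative assumptions only.
import torch
import torch.nn as nn
import torch.nn.functional as F

teacher = nn.Sequential(nn.Linear(128, 512), nn.ReLU(), nn.Linear(512, 10))  # cumbersome model
student = nn.Sequential(nn.Linear(128, 64), nn.ReLU(), nn.Linear(64, 10))    # compact model

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    # Soft-target term: KL divergence between temperature-scaled distributions.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)  # rescale so gradients stay comparable across temperatures
    # Hard-target term: standard cross-entropy against ground-truth labels.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1.0 - alpha) * hard

# Toy usage on random data.
x = torch.randn(32, 128)
labels = torch.randint(0, 10, (32,))
with torch.no_grad():
    teacher_logits = teacher(x)  # teacher is frozen during distillation
loss = distillation_loss(student(x), teacher_logits, labels)
loss.backward()
```

The temperature softens the teacher's distribution so the student also learns from the relative probabilities the teacher assigns to incorrect classes, which is what lets a compact model approximate the behavior of a cumbersome one.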