【Author】 Lin, Dan; Wu, Jiajing; Huang, Tao; Lin, Kaixin; Zheng, Zibin
【Source】IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS
【Abstract】To combat cybercrimes and maintain financial security for the blockchain ecosystem", know your customer" (KYC) is an essential and also challenging process due to the pseudonymity nature of blockchain technology. To unlock the potential of KYC on blockchain-based platforms like Ethereum, account labeling is a powerful means which can de-anonymize addresses by mining public transaction records. Existing studies on account labeling are mainly conducted via machine learning (ML) methods fed with hand-crafted features or graph neural networks based on the modeled transaction network. However, ML approaches based on hand-crafted features ignore the global interaction information between accounts, making it easy for criminals to evade detection. Moreover, the performance of traditional GCN methods when applied to Ethereum transaction network encounters limitations due to label sparsity, network heterophily, and large network size of the transaction network. In this article, we first analyze Ethereum accounts involved in typical businesses, in terms of both account and topological features. Then based on the analytical results, we propose a novel GCN method named know-your-customer graph convolutional network (KYC-GCN) which contains two key designs: 1) multihop aggregators and importance-based sampling are designed to tackle the dilemma between accuracy and efficiency. 2) GCN architecture is improved to explicitly capture local and more global information. Experimental results on a realistic Ethereum dataset show that the proposed KYC-GCN (90.2% accuracy, 86.2% Marco-F1) achieves state-of-the-art classification performance, and results on six benchmarks demonstrate that it yields great performance under homophily and heterophily.
【Keywords】Account labeling; cybersecurity; Ethereum; graph neural network (GNN); regulation technology
【发表时间】2023
【收录时间】2023-12-28
【文献类型】Article; Early Access
【论文大主题】链上数据分析
【论文小主题】交易实体识别
【影响因子】11.471
评论