“…3 sets: data from 2018 and 2019 (221,076 p, 503,945 a), mathematical analysis (98,702 p, 117,183 a), image processing (49,098 p, 107,290 a)
AMiner_tiny | [30] | 188 input p, 10 candidate p for each input
AMiner_huge | [108] | 2,092,356 p, 1,712,433 a, 8,024,869 c, 4,258,615 co-authorships
ACM C-D | [115] | 43,380 p from AMiner, a, ACM CSS tags
AAN_modified | [5, 49] | 21,455 p from 312 v from NLP, 17,342 a, 113,367 c
AAN_tiny | [106] | 2,082 p (ids, titles, publication year), 8,194 c, avg. 7.87 c per p, a, v
Sowiport | [28] | u i data from Mar 2017 to Oct 2018, 0.1% click-through rate
RARD_tiny | [30] | 800 input p from Related-Article Recommendation Dataset from Sowiport [13]
CiteSeer | [46] | 1,100 p, 10 sets of relevant p
CiteSeer_tiny | [94] | 400 c-pairs, 1,230 c contexts
CiteSeer_medium | [92] | 10 p, 226 c-pairs
…”