“…From the very beginning the main afford in CoAnSys have been put on document analysis algorithms, i.e. author name disambiguation [7,8,9], metadata extraction [10], document similarity and classification calculations [11,12], citation matching [13,14], etc. Some of algorithms can be used in Hadoop environment out-of-box, some need further amendments and some are entirely not applicable [15].…”