“…Request permissions from permissions@acm.org. page detection, identifying replications between documents has attracted remarkable attention from research community, and many approaches were proposed in the last two decades, e.g., by similarity search and join [27,10,3,4,35,33] or document fingerprinting [25,6,8,29,30,18,31]. For the body of work in similarity search and join, documents are regarded as (multi)sets of tokens or strings, and pairs of documents are identified if they satisfy a similarity constraint.…”