“…Unfortunately, previous works on RSS/Atom statistical characteristics [15,27,13] do not provide a precise and updated characterization of feeds' behavior and content which could be effectively used for tuning refreshing policies of RSS aggregators [24,22], benchmarking scalability and performance of RSS continuous monitoring mechanisms [19,9,7,8,25,5] or comparing various techniques for RSS items mining, recommendation, enrichment and archiving [3,26]. In this paper, we present the first thorough analysis of three complementary features of real-scale RSS/Atom feeds, namely, publication activity, items structure and length, as well as, vocabulary of the textual content.…”