Volodymyr Miz scite author profile

Volodymyr Miz

5Publications

36Citation Statements Received

67Citation Statements Given

How they've been cited

How they cite others

Affiliations

Computer Science Laboratory of Lille, École Polytechnique Fédérale de Lausanne

Publications

Order By: Most citations

Anomaly Detection in the Dynamics of Web and Social Networks Using Associative Memory

Miz

Benjamin

Benzi

2019

View full text Add to dashboard Cite

In this work, we propose a new, fast and scalable method for anomaly detection in large time-evolving graphs. It may be a static graph with dynamic node attributes (e.g. time-series), or a graph evolving in time, such as a temporal network. We define an anomaly as a localized increase in temporal activity in a cluster of nodes. The algorithm is unsupervised. It is able to detect and track anomalous activity in a dynamic network despite the noise from multiple interfering sources.We use the Hopfield network model of memory to combine the graph and time information. We show that anomalies can be spotted with a good precision using a memory network. The presented approach is scalable and we provide a distributed implementation of the algorithm.To demonstrate its efficiency, we apply it to two datasets: Enron Email dataset and Wikipedia page views. We show that the anomalous spikes are triggered by the real-world events that impact the network dynamics. Besides, the structure of the clusters and the analysis of the time evolution associated with the detected events reveals interesting facts on how humans interact, exchange and search for information, opening the door to new quantitative studies on collective and social behavior on large and dynamic datasets.

show abstract

What is Trending on Wikipedia? Capturing Trends and Language Biases Across Wikipedia Editions

Miz

Hanna

Aspert

et al. 2020

View full text Add to dashboard Cite

In this work, we propose an automatic evaluation and comparison of the browsing behavior of Wikipedia readers that can be applied to any language editions of Wikipedia. As an example, we focus on English, French, and Russian languages during the last four months of 2018. The proposed method has three steps. Firstly, it extracts the most trending articles over a chosen period of time. Secondly, it performs a semi-supervised topic extraction and thirdly, it compares topics across languages. The automated processing works with the data that combines Wikipedia's graph of hyperlinks, pageview statistics and summaries of the pages. The results show that people share a common interest and curiosity for entertainment, e.g. movies, music, sports independently of their language. Differences appear in topics related to local events or about cultural particularities. Interactive visualizations showing clusters of trending pages in each language edition are available online https://wiki-insights.epfl.ch/wikitrends CCS CONCEPTS • Applied computing → Sociology; • Human-centered computing → Social content sharing; Social navigation; Wikis.

show abstract

A Graph-Structured Dataset for Wikipedia Research

Aspert

Miz

Ricaud

2019

View full text Add to dashboard Cite

Wikipedia is a rich and invaluable source of information. Its central place on the Web makes it a particularly interesting object of study for scientists. Researchers from different domains used various complex datasets related to Wikipedia to study language, social behavior, knowledge organization, and network theory. While being a scientific treasure, the large size of the dataset hinders preprocessing and may be a challenging obstacle for potential new studies. This issue is particularly acute in scientific domains where researchers may not be technically and data processing savvy. On one hand, the size of Wikipedia dumps is large. It makes the parsing and extraction of relevant information cumbersome. On the other hand, the API is straightforward to use but restricted to a relatively small number of requests. The middle ground is at the mesoscopic scale, when researchers need a subset of Wikipedia ranging from thousands to hundreds of thousands of pages but there exists no efficient solution at this scale.In this work, we propose an efficient data structure to make requests and access subnetworks of Wikipedia pages and categories. We provide convenient tools for accessing and filtering viewership statistics or "pagecounts" of Wikipedia web pages. The dataset organization leverages principles of graph databases that allows rapid and intuitive access to subgraphs of Wikipedia articles and categories. The dataset and deployment guidelines are available on the LTS2 website https://lts2.epfl.ch/Datasets/Wikipedia/.

show abstract

Summary of Tutorials at The Web Conference 2021

West

Bhagat

Groth

et al. 2021

View full text Add to dashboard Cite

show abstract

Wikipedia. Events And Collective Memory Detection Dataset

Miz¹,

Benzi²,

Benjamin³

2017

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Volodymyr Miz

Anomaly Detection in the Dynamics of Web and Social Networks Using Associative Memory

What is Trending on Wikipedia? Capturing Trends and Language Biases Across Wikipedia Editions

A Graph-Structured Dataset for Wikipedia Research

Summary of Tutorials at The Web Conference 2021

Wikipedia. Events And Collective Memory Detection Dataset

Contact Info

Product

Resources

About