ChartDetective: Easy and Accurate Interactive Data Extraction from Complex Vector Charts

Masson, Damien; Malacria, Sylvain; Vogel, Daniel; Lank, Edward; Casiez, Géry

doi:10.1145/3544548.3581113

Cited by 9 publications

(7 citation statements)

References 75 publications


“…For the remaining 2,642 papers, additional values will be needed. Overall, a large proportion of CHI papers have useful statistical reports, and recall that our conservative analysis likely underestimates actual occurrences and it is possible that values like means can be estimated from figures [58,73]. 3.2.2 Number of Decimals. Consistent with APA recommendations [5, sec 6.36], standard deviations, means, F-values, t-scores, and CIs were reported with a median of two decimals.…”

Section: Reported (mentioning)

confidence: 95%

“…However, the SplitBoard paper does not report aggregated means with standard deviations. Sam decides to retrieve these values from the line chart of the WPM using an accurate chart data extraction tool [58,73]. For SplitBoard, the t-score was directly obtained enabling the calculation of Cohen's d. Once the missing values are added in the table, the other values are calculated and Sam can review the effect sizes: Cohen's 𝑑 = 0.64 for SwipeBoard, and 𝑑 = 7.27 for SplitBoard.…”

Section: Sam Hovers Over the Missing Values In The Table To Get An In... (mentioning)

confidence: 99%


Statslator: Interactive Translation of NHST and Estimation Statistics Reporting Styles in Scientific Documents

Masson, Malacria, Casiez et al. 2023

Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology

Self Cite


An independent Student's t-test did not show that the overall effect of gesture compared to tap on WPM was significant (t(36) = 1.26, p = 0.21). It did show that the effect on CER and KSPC was significant (t(36) = 2.12, p = 0.04 and t(36) = 15.77, p < 0.001). Figure 1: Statslator takes existing statistical reports (a) using NHST or estimation; (b) calculates all possible statistical values using accurate conversion equations; (c) shows the report using graphical and interactive figures configurable by readers.
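The conversion mentioned in step (b) of the caption can be illustrated with a minimal sketch. It assumes the standard relation between an independent-samples t-score and Cohen's d with pooled variance, d = t · sqrt(1/n1 + 1/n2); whether Statslator uses exactly this equation is not stated here. The function name `cohens_d_from_t` and the group sizes (19 and 19, chosen only because they are consistent with df = 36) are illustrative assumptions, not values taken from the paper.

```python
import math


def cohens_d_from_t(t: float, n1: int, n2: int) -> float:
    """Cohen's d from an independent two-sample t-score (pooled variance).

    Hypothetical helper for illustration; not Statslator's actual API.
    """
    return t * math.sqrt(1.0 / n1 + 1.0 / n2)


# Assumed group sizes: df = n1 + n2 - 2 = 36 is consistent with 19 + 19,
# but the quoted report does not give the actual split.
d_wpm = cohens_d_from_t(1.26, 19, 19)
print(f"Cohen's d for WPM (assumed n1 = n2 = 19): {d_wpm:.2f}")
```

Under a different split of the 38 participants the same t-score yields a different d, which is why the group sizes have to be known or assumed before converting.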


“…Some systems help readers interpret and manipulate the results by enhancing the visualizations already present in the document. For example, redesigning charts to be more useful [128], with added overlays [73], and interactive features [88,95]. Or the text can be leveraged to automatically annotate existing charts [62,76] and interactively connect text and charts [113].…”

Section: Augmenting Existing Documents (mentioning)

confidence: 99%

Charagraph: Interactive Generation of Charts for Realtime Annotation of Data-Rich Paragraphs

Masson, Malacria, Casiez et al. 2023

Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems

Self Cite

Figure 1: (a) Charagraphs are in-situ visualizations of numeric data included within text that are dynamically generated (b) by delimiting a selection and (c) selecting a data group. Charagraphs support common data exploration tasks through interactive features such as (d) identifying and (e) comparing values.

“…ReVision [32], Graphical Overlays [12], REV (Reverse-Engineering Visualizations) [23,24], ChartSense [11], and approaches surveyed with Chart Mining [7] use computer vision techniques to automatically extract marks and infer visual encoding channels and data attributes from bitmap images. However, these techniques suffer from a variety of issues that affect inference precision, such as pixel resolution, shape ambiguity (e.g., blurry edges), and mark occlusion [17]. While our work targets structured SVG images, these techniques might enable DIVI to support a broader set of visualization media types in the future.…”

Section: Deconstructing Visualizations (mentioning)

confidence: 99%

“…al [13] contribute a spatial-constraint model to enable interaction with static visualizations, although its deconstruction output is limited to evenly spaced axes (prohibiting use of log scales) and marks, without support for linked interaction and other non-spatial exploratory interactions (e.g., tooltips and selection / brushing). ChartDetective [17] provides semi-automated data extraction methods for vector charts in PDFs, but lacks support for chart metadata (such as axes) needed for interaction and linking. In addition, semi-automated approaches requiring extensive user input are not as scalable.…”

Section: Deconstructing Visualizations (mentioning)

confidence: 99%

DIVI: Dynamically Interactive Visualization

Snyder, Heer 2023

IEEE Trans. Visual. Comput. Graphics

Fig. 1: Automatic multi-view, cross-tool interactions with DIVI. Dynamically interactive charts of Seattle weather data, from left to right: Matplotlib bar chart (weather vs. sum(precip)), ggplot2 scatter plot (temp_max vs. precip), and Excel scatter plot (date vs. temp_max). DIVI automatically deconstructs charts to identify semantic components and coordinate user input. Here DIVI links temp_max and weather between scatter plots, and also leverages the source dataset to infer bar chart aggregation of sum(precip) over weather. Interactions are linked: the user brushes an area in the ggplot2 scatter plot with low temp_max and high precip, re-aggregating the bar chart and propagating the selection to the Excel scatter plot to indicate seasonal wintry periods.

Abstract: Dynamically Interactive Visualization (DIVI) is a novel approach for orchestrating interactions within and across static visualizations. DIVI deconstructs Scalable Vector Graphics charts at runtime to infer content and coordinate user input, decoupling interaction from specification logic. This decoupling allows interactions to extend and compose freely across different tools, chart types, and analysis goals. DIVI exploits positional relations of marks to detect chart components such as axes and legends, reconstruct scales and view encodings, and infer data fields. DIVI then enumerates candidate transformations across inferred data to perform linking between views. To support dynamic interaction without prior specification, we introduce a taxonomy that formalizes the space of standard interactions by chart element, interaction type, and input event. We demonstrate DIVI's usefulness for rapid data exploration and analysis through a usability study with 13 participants and a diverse gallery of dynamically interactive visualizations, including single chart, multi-view, and cross-tool configurations.

ChartDetective: Easy and Accurate Interactive Data Extraction from Complex Vector Charts

Abstract: elements partially or completely occluded. Compared to other approaches relying on raster images, our tool successfully recovered all data, even when hidden, with a 78% lower relative error.

CCS Concepts: Human-centered computing → Interactive systems and tools.

