Despite significant progress on the mesh-based Best Viewpoint Selection (BVS) problem using multiple views, the current state-of-the-art BVS method requires 20-30 rendered views and is limited to selecting from a predefined set of viewpoint samples, which may miss optimal viewpoints and precludes its use when response time is critical. To address these limitations, we present a fast dual-branch regression model for best viewpoint selection that significantly reduces the number of input views required, enables continuous viewpoint prediction, and improves interactive response speed. Our approach combines a geometry-enhanced multi-view feature extractor with a learnable token and employs a cross-modal distillation method to enrich the model's understanding of 3D structure. Specifically, view features are embedded together with a dimensionally matched learnable token and processed through three cascaded self-attention layers, so that the resulting token encapsulates fused features better suited to the viewpoint selection task. In addition, to reduce the number of views required, we incorporate cross-modal distillation into the BVS solution by imposing alignment constraints between 3D geometry descriptors and fused multi-view representations, avoiding the computational cost of rendering dozens of views. Experimental results on public benchmarks show that our method is approximately 35 times faster than the state-of-the-art method when only six views are used, while also achieving the best quantitative metrics.
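The token-based fusion and the distillation constraint outlined above can be illustrated with a minimal PyTorch sketch. This is not the released implementation: the feature dimension, the use of standard Transformer encoder layers for the three cascaded self-attention layers, the 3-D unit-direction output of the regression head, and the cosine form of the alignment (distillation) loss are all assumptions made for illustration.

```python
# Minimal sketch (illustrative, not the authors' code) of token-based multi-view
# fusion with three cascaded self-attention layers and a cross-modal alignment
# loss against a precomputed 3D geometry descriptor.
import torch
import torch.nn as nn
import torch.nn.functional as F


class TokenFusionBVS(nn.Module):
    def __init__(self, feat_dim: int = 512, num_layers: int = 3, num_heads: int = 8):
        super().__init__()
        # Learnable token, dimensionally matched to the per-view features.
        self.token = nn.Parameter(torch.zeros(1, 1, feat_dim))
        nn.init.trunc_normal_(self.token, std=0.02)
        # Three cascaded self-attention (Transformer encoder) layers.
        layer = nn.TransformerEncoderLayer(d_model=feat_dim, nhead=num_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=num_layers)
        # Regression head mapping the fused token to a continuous viewpoint
        # (here assumed to be a unit viewing direction).
        self.head = nn.Linear(feat_dim, 3)

    def forward(self, view_feats: torch.Tensor):
        # view_feats: (B, V, D) features extracted from V rendered views (e.g. V = 6).
        b = view_feats.size(0)
        token = self.token.expand(b, -1, -1)               # (B, 1, D)
        x = torch.cat([token, view_feats], dim=1)          # (B, 1 + V, D)
        x = self.encoder(x)
        fused = x[:, 0]                                    # fused multi-view token (B, D)
        viewpoint = F.normalize(self.head(fused), dim=-1)  # continuous view direction
        return viewpoint, fused


def alignment_loss(fused: torch.Tensor, geom_desc: torch.Tensor) -> torch.Tensor:
    # Cross-modal distillation constraint: align the fused multi-view token with
    # a 3D geometry descriptor (e.g. from a point-cloud encoder). A cosine
    # alignment form is assumed here.
    return 1.0 - F.cosine_similarity(fused, geom_desc, dim=-1).mean()


if __name__ == "__main__":
    model = TokenFusionBVS()
    feats = torch.randn(2, 6, 512)   # two meshes, six views each
    geom = torch.randn(2, 512)       # geometry-branch descriptors (teacher side)
    vp, fused = model(feats)
    loss = alignment_loss(fused, geom)
    print(vp.shape, loss.item())
```

At inference time only the multi-view branch above would be needed, which is consistent with the abstract's claim that the geometry descriptor serves as a training-time alignment target rather than a runtime input.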