Proceedings of the 30th ACM International Conference on Multimedia 2022
DOI: 10.1145/3503161.3547755
|View full text |Cite
|
Sign up to set email alerts
|

Learning Modality-Specific and -Agnostic Representations for Asynchronous Multimodal Language Sequences

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
3
3

Relationship

0
10

Authors

Journals

citations
Cited by 40 publications
(6 citation statements)
references
References 23 publications
0
6
0
Order By: Relevance
“…Using an explicit representation of meshes, ARA‐PReg [HHS*21] designs a spectral ARAP regularization for graph shape generators, constraining them to produce a series of shapes that move as rigidly as possible [SA07]. Recent advances include GenCorres [YHS*24], which brings the ARAPReg‐like principle to implicit neural fields, enforcing cycle consistency and rigidity constraints, and the work of Tang et al [TMW*22], which further extends the modeling of deformation spaces by allowing generation conditioned on user‐given constraints via dragging handles, exposing more control over deformation. HyperDiffusion [EMS*23] can sample from a 4D shape distribution by training a diffusion network to generate MLP weights of an SDF neural field.…”
Section: State‐of‐the‐art Methodsmentioning
confidence: 99%
“…Using an explicit representation of meshes, ARA‐PReg [HHS*21] designs a spectral ARAP regularization for graph shape generators, constraining them to produce a series of shapes that move as rigidly as possible [SA07]. Recent advances include GenCorres [YHS*24], which brings the ARAPReg‐like principle to implicit neural fields, enforcing cycle consistency and rigidity constraints, and the work of Tang et al [TMW*22], which further extends the modeling of deformation spaces by allowing generation conditioned on user‐given constraints via dragging handles, exposing more control over deformation. HyperDiffusion [EMS*23] can sample from a 4D shape distribution by training a diffusion network to generate MLP weights of an SDF neural field.…”
Section: State‐of‐the‐art Methodsmentioning
confidence: 99%
“…AugmNet , inspired by recent advancements in augmentation literature ( 53 ), is a learning-based approach designed to learn augmentation patterns within the latent space. Unlike traditional methods that apply image augmentations (e.g., rotation, cropping) directly in the pixel space, AugmNet generically implements transformations on the embeddings.…”
Section: Methodsmentioning
confidence: 99%
“…Deep neural networks are gradually developing toward largescale models [1][2][3][4][5][6]. The changes have brought about an impressive technological breakthrough [7][8][9][10][11][12][13][14][15][16], but applying these technologies to mobile devices such as mobile phones, driverless cars, and tiny robots is difficult. Besides, the source data cannot be obtained in many cases due to data security, such as fingerprints, faces, and medical records images.…”
Section: Introductionmentioning
confidence: 99%