2021
DOI: 10.1051/aacus/2021012

Particle-filter tracking of sounds for frequency-independent 3D audio rendering from distributed B-format recordings

Abstract: Six-Degree-of-Freedom (6DoF) audio rendering interactively synthesizes spatial audio signals for a variable listener perspective, based on surround recordings taken at multiple perspectives distributed across the listening area of the acoustic scene. Methods that rely on recording-implicit directional information and interpolate the listener perspective without attempting to localize and extract sounds often yield high audio quality, but are limited in spatial definition. Methods that perform sound locali…

Cited by 10 publications (10 citation statements)
References 30 publications
“…The system also separately renders the residual ambient components, which represent the Ambisonic receiver signals after the source objects have been subtracted from them. Therefore, in essence, the proposed system may be viewed as a natural multireceiver extension to the Coding and Multi-Parameterisation of Ambisonic Sound Scenes (COMPASS) single-receiver method [7] and is also similar to the approach proposed recently in [8]. With an emphasis on developing a practical system, the proposed processing approach is implemented as a real-time Virtual Studio Technology (VST) audio plug-in¹.…”
Section: Introduction (mentioning)
confidence: 99%
“…The closest work to the method proposed in the present article is described in [8], which operated in the time-domain and combined: building 2D planar activity maps based on broadband grid-scanning methods from each receiver, followed by peak-finding to ascertain source position estimates; subsequent particle-filtering based tracking of active sound objects; and then the application of broadband beamforming and spatialization of the objects, mixed with ambient rendering. Building on the work of [8], the proposed system instead operates in the time-frequency domain and lends particular emphasis on real-time operation. It forgoes the use of computationally expensive activity-map-based source position estimation in favor of continuous DoA estimation methods followed by computing the intersecting points between receivers.…”
Section: Introduction (mentioning)
confidence: 99%
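
The statement above mentions estimating a direction of arrival (DoA) at each receiver and then computing the intersecting points between receivers. As a rough illustration of that geometric step only, the following sketch finds the point closest, in the least-squares sense, to a set of DoA lines. It is not the implementation of either cited system: the function name `triangulate_doa`, the receiver positions, and the direction vectors are all hypothetical, and the lines are treated as unbounded, so no check is made that the result lies in front of each receiver.

```python
import numpy as np

def triangulate_doa(positions, directions):
    """Return the least-squares point closest to all lines p_i + t * d_i.

    positions:  (N, dim) receiver positions (hypothetical example data).
    directions: (N, dim) DoA vectors, one per receiver; need not be unit length.
    """
    positions = np.asarray(positions, dtype=float)
    directions = np.asarray(directions, dtype=float)
    dim = positions.shape[1]
    A = np.zeros((dim, dim))
    b = np.zeros(dim)
    for p, d in zip(positions, directions):
        d = d / np.linalg.norm(d)
        # Projector onto the subspace orthogonal to the line direction:
        # it measures the perpendicular offset of a point from the line.
        P = np.eye(dim) - np.outer(d, d)
        A += P
        b += P @ p
    # Normal equations of the summed squared perpendicular distances.
    # A is singular when all directions are parallel (no unique point).
    return np.linalg.solve(A, b)

# Two receivers on the x-axis, both pointing at a source at (2, 2).
receivers = [[0.0, 0.0], [4.0, 0.0]]
doas = [[1.0, 1.0], [-1.0, 1.0]]
source = triangulate_doa(receivers, doas)
```

With noisy DoA estimates the lines generally do not meet exactly, which is why a least-squares point is used here rather than an exact intersection; the same function accepts 3D positions and more than two receivers.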
“…This function can map the data of the test set to one of the given categories, thus realizing the category prediction of unknown data. At present, common classifiers include decision tree, logistic regression, support vector machine (SVM), Naive Bayes, k-nearest neighbor algorithm (KNN), BP neural network, and deep learning [11][12][13].…”
Section: Introduction (mentioning)
confidence: 99%