Interpolation between spatial room impulse responses (SRIRs) is necessary for dynamic acoustic rendering in which a listener can move with six degrees of freedom. The early part of an SRIR consists of sparse direct and reflected sound events whose arrival time, direction, and level vary with receiver position. Interpolating this spatio-temporal structure requires the non-trivial task of matching corresponding sound events across SRIRs. Instead of finding an exact map, we propose using partial optimal transport to find a coupling between reflections, which requires neither estimation of the room geometry nor explicit knowledge of the source-receiver configuration. Each SRIR is first decomposed into a virtual source space. The interpolated impulse response is then calculated from a partial optimal transport coupling obtained with linear programming. We compare the method against two baseline interpolation methods using simulated SRIR data and show that it best preserves the temporal fine structure of the omnidirectional response.
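
To make the coupling step concrete, the following is a minimal sketch of a partial optimal transport problem solved as a linear program with SciPy. It is not the authors' implementation: the function names `partial_ot_coupling` and `interpolate_matched`, the squared Euclidean cost between virtual source positions, the use of event energies as masses, and the toy data are all assumptions chosen for illustration. The program minimizes the total transport cost subject to row sums not exceeding the masses of the first set, column sums not exceeding the masses of the second set, and a fixed total transported mass.

```python
import numpy as np
from scipy.optimize import linprog


def partial_ot_coupling(cost, a, b, mass):
    """Partial OT as a linear program (illustrative sketch).

    Minimizes sum_ij cost[i, j] * T[i, j] subject to
    T >= 0, T.sum(axis=1) <= a, T.sum(axis=0) <= b, T.sum() == mass.
    """
    n, m = cost.shape
    # T is flattened row-major, so entry (i, j) sits at index i * m + j.
    row_sums = np.kron(np.eye(n), np.ones((1, m)))  # shape (n, n*m)
    col_sums = np.kron(np.ones((1, n)), np.eye(m))  # shape (m, n*m)
    res = linprog(
        cost.ravel(),
        A_ub=np.vstack([row_sums, col_sums]),
        b_ub=np.concatenate([a, b]),
        A_eq=np.ones((1, n * m)),
        b_eq=np.array([mass]),
        bounds=(0, None),
        method="highs",
    )
    if not res.success:
        raise RuntimeError(res.message)
    return res.x.reshape(n, m)


def interpolate_matched(x, y, T, t, tol=1e-9):
    """Move coupled mass along straight lines between matched positions.

    Assumption: simple displacement interpolation of the matched part only;
    unmatched mass is ignored in this sketch.
    """
    i, j = np.nonzero(T > tol)
    positions = (1.0 - t) * x[i] + t * y[j]
    return positions, T[i, j]


# Hypothetical virtual source positions (metres) seen from two receivers.
x = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [5.0, 5.0, 5.0]])
y = np.array([[0.1, 0.0, 0.0], [1.1, 0.0, 0.0]])
a = np.array([0.5, 0.3, 0.2])  # assumed energies of the events in x
b = np.array([0.5, 0.5])       # assumed energies of the events in y

# Squared Euclidean distance between every pair of virtual sources.
cost = ((x[:, None, :] - y[None, :, :]) ** 2).sum(axis=-1)

# Transport only part of the mass, so the outlier in x can stay unmatched.
T = partial_ot_coupling(cost, a, b, mass=0.8)
positions, weights = interpolate_matched(x, y, T, t=0.5)
```

Because only part of the total mass has to be transported, a sound event with no plausible counterpart (the third entry of `x` in this toy data) can be left uncoupled rather than forced onto a distant match. How the transported mass is chosen and how unmatched events are rendered are design decisions this sketch leaves open.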