Audio Processing and Digital Acoustics

The audio processing group at LCAV performs research and education on various topics related to capturing, processing, coding, and rendering of acoustic signals, with a special focus on 3D audio. We aim to develop expertise in every aspect of this broad field, from the foundations of signal processing, through the physics of wave phenomena, all the way to human auditory perception. The research is carried out in cooperation with partners from art, industry, and science.

Over the years, we have worked on a broad range of topics that includes:

  • Directional sound capture and playback (beamforming)
  • Room equalization and acoustic echo control
  • Room acoustics simulation
  • Virtual acoustics/auralization
  • Automatic multichannel format conversion (upmix)
  • Sound perception and spatial hearing
  • Sound field reproduction
  • Spatial audio coding
  • Spatial sampling and coding of sound fields

You can also consult our archives for a more detailed description of past projects.

Currently, LCAV focuses on various aspects of location-aware audio signal processing. We crafted this term to succinctly cover both typical and highly atypical problems where the terms sound and localization happen to coexist: from vanilla sound source localization with microphone arrays, through the more unconventional simultaneous localization of sound sources and microphones together with mapping of a room (the infamous acoustic SLAM), and finally to localizing concurrent sound sources using a single, albeit unconventional, microphone.

Recent LCAV publications in this area:

DeepWave: A Recurrent Neural-Network for Real-Time Acoustic Imaging

M. M. J-A. Simeoni; S. Kashani; P. Hurley; M. Vetterli 

2019. Thirty-third Conference on Neural Information Processing Systems (NeurIPS), Vancouver, British Columbia, Canada, December 9-14, 2019.

Structure from sound with incomplete data

M. Krekovic; G. Baechler; I. Dokmanic; M. Vetterli 

2018. 43rd International Conference on Acoustics, Speech and Signal Processing, Calgary, Alberta, Canada, April 15–20, 2018.

Omnidirectional bats, point-to-plane distances, and the price of uniqueness

M. Krekovic; I. Dokmanic; M. Vetterli 

2017. 42nd International Conference on Acoustics, Speech and Signal Processing, New Orleans, USA, March 5-9, 2017. p. 3261 – 3265. DOI : 10.1109/ICASSP.2017.7952759.

Acoustic DoA Estimation by One Unsophisticated Sensor

D. El Badawy; I. Dokmanic; M. Vetterli 

2017. 13th International Conference on Latent Variable Analysis and Signal Separation, Grenoble, France, February 21-23, 2017. p. 89 – 98. DOI : 10.1007/978-3-319-53547-0_9.

Hardware And Software For Reproducible Research In Audio Array Signal Processing

E. Bezzam; R. Scheibler; J. Azcarreta; H. Pan; M. Simeoni et al. 

2017. 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA, March 5-9, 2017. p. 6591 – 6592. DOI : 10.1109/ICASSP.2017.8005297.

FRIDA: FRI-Based DOA Estimation For Arbitrary Array Layouts

H. Pan; R. Scheibler; E. F. Bezzam; I. Dokmanic; M. Vetterli 

2017. ICASSP 2017, New Orleans, USA, March 5-9, 2017. p. 3186 – 3190. DOI : 10.1109/ICASSP.2017.7952744.

EchoSLAM: Simultaneous Localization and Mapping with Acoustic Echoes

M. Krekovic; I. Dokmanic; M. Vetterli 

2016. 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), Shanghai, China, 20-25 March 2016. p. 11 – 15. DOI : 10.1109/ICASSP.2016.7471627.

From Acoustic Room Reconstruction to SLAM

I. Dokmanic; L. Daudet; M. Vetterli 

2016. 41st International Conference on Acoustics, Speech, and Signal Processing, Shanghai, China, March 20-25, 2016. p. 6345 – 6349. DOI : 10.1109/ICASSP.2016.7472898.

Look, no beacons! Optimal all-in-one EchoSLAM

M. Krekovic; I. Dokmanic; M. Vetterli 

2016. 50th Asilomar Conference on Signals, Systems, and Computers, Asilomar, Pacific Grove, CA, November 6-9, 2016.

Raking echoes in the time domain

R. Scheibler; I. Dokmanic; M. Vetterli 

2015. ICASSP 2015 – 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), South Brisbane, Queensland, Australia, 19-24 April 2015. p. 554 – 558. DOI : 10.1109/ICASSP.2015.7178030.

Raking the Cocktail Party

I. Dokmanic; R. Scheibler; M. Vetterli 

IEEE Journal of Selected Topics in Signal Processing. 2015. Vol. 9, num. 5, p. 825 – 836. DOI : 10.1109/JSTSP.2015.2415761.

How to Localize Ten Microphones in One Fingersnap

I. Dokmanic; L. Daudet; M. Vetterli 

2014. 22nd European Signal Processing Conference, Lisbon, Portugal, September 1-5, 2014. p. 2275 – 2279.

Digital acoustics: processing wave fields in space and time using DSP tools

F. Pinto; M. Kolundzija; M. Vetterli 

APSIPA Transactions on Signal and Information Processing. 2014. Vol. 3, num. e18, p. 1 – 21. DOI : 10.1017/ATSIP.2014.13.

Acoustic Echoes Reveal Room Shape

I. Dokmanic; R. Parhizkar; A. Walther; Y. M. Lu; M. Vetterli 

Proceedings of the National Academy of Sciences. 2013. Vol. 110, num. 30, p. 12186 – 12191. DOI : 10.1073/pnas.1221464110.

Multi-channel low-frequency room equalization using perceptually motivated constrained optimization

M. Kolundzija; C. Faller; M. Vetterli 

2012. IEEE International Conference on Acoustics, Speech, and Signal Processing, Kyoto, Japan, March 25-30, 2012. p. 533 – 536. DOI : 10.1109/ICASSP.2012.6287934.

Reproducing Sound Fields Using MIMO Acoustic Channel Inversion

M. Kolundzija; C. Faller; M. Vetterli 

Journal of the Audio Engineering Society. 2011. Vol. 59, num. 10, p. 721 – 734.

Spatiotemporal Gradient Analysis of Differential Microphone Arrays

M. Kolundzija; C. Faller; M. Vetterli 

Journal of the Audio Engineering Society. 2011. Vol. 52, num. 1/2, p. 20 – 28.

Design of a Compact Cylindrical Loudspeaker Array for Spatial Sound Reproduction

M. Kolundzija; C. Faller; M. Vetterli 

2011. AES 130th Convention, May 13-16, 2011.
