Audible Panorama: Automatic Spatial Audio Generation for Panorama Imagery

As 360° cameras and virtual reality headsets become more popular, panorama images have become increasingly ubiquitous. While sounds are essential in delivering immersive and interactive user experiences, most panorama images, however, do not come with native audio. In this paper, we propose an automatic algorithm to augment static panorama images through realistic audio assignment. We accomplish this goal through object detection, scene classification, object depth estimation, and audio source placement. We built an audio file database composed of over 500 audio files to facilitate this process. We designed and conducted a user study to verify the efficacy of various components in our pipeline. We run our method on a large variety of panorama images of indoor and outdoor scenes. By analyzing the statistics, we learned the relative importance of these components, which can be used in prioritizing for power-sensitive time-critical tasks like mobile augmented reality (AR) applications.

Learn More

Publications

Audible Panorama: Automatic Spatial Audio Generation for Panorama Imagery

ACM Conference on Human Factors and Computing Systems (SIGCHI)

Publication date: May 3, 2019

Haikun Huang, Michael S. Solah, Dingzeyu Li, Lap-Fai Yu

Research Areas: AI & Machine Learning AR, VR & 360 Photography Audio Computer Vision, Imaging & Video Graphics (2D & 3D) Human Computer Interaction