COMPRESSION TRANSPARENT LOW-LEVEL DESCRIPTION OF AUDIO SIGNALS (WedPmOR5)

Author(s) :

Jason Lukasiak	(University of Wollongong, Australia)
Chris McElroy	(University of Wollongong, Australia)
Eva Cheng	(University of Wollongong, Australia)

Abstract :

A new low level audio descriptor that represents the psycho-acoustic noise floor shape of an audio frame is proposed. Results presented indicate that the proposed descriptor is far more resilient to compression noise than any of the MPEG-7 low level audio descriptors. In fact, across a wide range of files, on average the proposed scheme fails to uniquely identify only five frames in every ten thousand. In addition, the proposed descriptor maintains a high resilience to compression noise even when decimated to use only one quarter of the values per frame to represent the noise floor. This characteristic indicates the proposed descriptor presents a truly scalable mechanism for transparently describing the characteristics of an audio frame.

Menu