Cookies
O website necessita de alguns cookies e outros recursos semelhantes para funcionar. Caso o permita, o INESC TEC irá utilizar cookies para recolher dados sobre as suas visitas, contribuindo, assim, para estatísticas agregadas que permitem melhorar o nosso serviço. Ver mais
Aceitar Rejeitar
  • Menu
Publicações

Publicações por CTM

2005

A Fractal Self-Similarity Model for the spectral representation of audio signals

Autores
Sinha, D; Ferreira, AJS; Sen, D;

Publicação
Audio Engineering Society - 118th Convention Spring Preprints 2005

Abstract
In the application of conventional audio compression algorithms to low bit rate audio coding one is faced with the unsatisfactory tradeoff between coarser quantization and audio bandwidth reduction. Frequency Extension has therefore emerged as an important tool for the satisfactory performance of low bit rate audio codecs. In this paper we describe one of a newer class of Frequency Extension techniques which are applied directly to the high frequency resolution representation of the signal (e.g., MDCT). This particular technique is based on a Fractal Self-Similarity Model (FSSM) for the short-term frequency representation of the signal. The FSSM model, which may include multiple dilation and translation terms, has been found to be effective for a wide variety of speech and music signals and provides a compact description for long term correlation that may exist in frequency domain. The high frequency resolution of MDCT aids in accurate parameter estimation for the model, which in turn has shown promise as a Frequency Extension tool that offers a detailed and natural sounding quality at low bit rates. Structure of the FSSM model, issues related to parameter estimation, and its application to audio coding for bit rates of 8-48 kbps is discussed. Audio demos are available at http://www.atc-labs.com/fssm.

2005

An accurate method of detection and cancellation of multiple acoustic feedbacks

Autores
Rocha, AF; Ferreira, AJS;

Publicação
Audio Engineering Society - 118th Convention Spring Preprints 2005

Abstract
This paper presents a new method to the adaptive cancellation of acoustic feedbacks. The method uses high resolution frequency analysis and high-Q notch filters so as to accurately detect feedbacks and cancel them without disturbing noticeably the main audio spectrum. The method will be described, its implementation on a TMS320C6711 DSP platform for real time operation will be explained, and results for the adaptive cancellation of two simultaneous acoustic feedbacks will be presented.

2005

Accurate spectral replacement

Autores
Ferreira, AJS; Sinha, D;

Publicação
Audio Engineering Society - 118th Convention Spring Preprints 2005

Abstract
Recent advances in perceptual audio coding are strongly based on the concept of bandwidth extension. Most techniques implementing bandwidth extension require an analysis/synthesis filter bank in addition to that used by the associated perceptual audio coder, which increases the overall system complexity and coding delay, and makes difficult the correct alignment between the operation of the audio coder and the operation of the bandwidth extension technique. We present a new Accurate Spectral Replacement (ASR) technique that is based on a suitable decomposition of the MDCT filter bank, and that implements synthesis of sinusoidal components with an accuracy much higher than the natural frequency resolution of the filter bank. The ASR technique is described, its performance is assessed with both synthetic and natural audio signals, and its main areas of application are addressed. Audio demos are available at http://www.atc-labs.com/asr/.

2005

A new low-delay codec for two-way high-quality audio communication

Autores
Ferreira, AJS; Sinlia, D;

Publicação
Audio Engineering Society - 119th Convention Fall Preprints 2005

Abstract
High-quality audio bit-rate reduction systems are widely used in many application areas involving audio broadcast, streaming and download services. With the advent of 3G mobile and wireless communication networks, there is a clear opportunity for new multimedia services, notably those relying on two-way high- quality audio communication. In t his paper we describe a new source/perceptual audio coder that features low-delay, intrinsic error robustness and high subjective audio quality at competitive compression ratios. The structure of the audio coder is described and an emphasis is given on its innovative approaches to semantic signal segmentation and decomposition, independent coding of sinusoidal and noise components, and bandwidth extension using Accurate Spectral Replacement. A few test results are presented that illustrate the operation and performance of the new coder.

2005

A new broadcast quality low bit rate audio coding scheme utilizing novel bandwidth extension tools

Autores
Sinha, D; Ferreira, AJS;

Publicação
Audio Engineering Society - 119th Convention Fall Preprints 2005

Abstract
In this paper we describe the components of a novel audio coding algorithm capable of delivering high-fidelity CDlike stereo audio at the bit rates of 40-48 kbps and natural sounding FM grade mono at the bit rates of 18-22 kbps. Bandwidth Extension has emerged as an important tool for the satisfactory performance of low bit rate audio codecs. Recently we proposed two new bandwidth extension algorithms, Fractal Self-Similarity Model (FSSM) and Accurate Spectral Replacement (ASR), which belong to a new class of Bandwidth Extension techniques which are applied directly to the high resolution frequency representation of the signal (e.g., MDCT or ODFT). The proposed coding scheme uses FSSM and ASR in an adaptive and complementary framework. Another important component of the proposed codec is a wideband psychoacoustic model that makes an explicit use of the Comodulation Release of Masking (CMR) phenomenon. It also includes a novel parametric stereo coding technique. The proposed audio coding scheme is geared towards broadcast applications where codec latency and encoder complexity is generally not an overriding concern. In this paper we present algorithmic details of the new codec, audio demonstrations, and, comparison to other audio coding schemes. Further information and audio demonstrations are available at http://www.atc-labs.com/teslapro.

2005

A new class of smooth power complementary windows and their application to audio signal processing

Autores
Sinha, D; Ferreira, AJS;

Publicação
Audio Engineering Society - 119th Convention Fall Preprints 2005

Abstract
In this paper we describe a new family of smooth power complementary windows which exhibit a very high level of localization in both time and frequency domain. This window family is parameterized by a "smoothness quotient". As the smoothness quotient increases the window becomes increasingly localized in time (most of the energy gets concentrated in the center half of the window) and frequency (far field rejection becomes increasing stronger to the order of 150 dB or higher). A closed form solution for such window function exists and the associated design procedure is described. The new class of windows is quite attractive for a number of applications as switching functions, equalization functions, or as windows for overlap-add and modulated filter banks. An extension to the family of smooth windows which exhibits improved near-field response in the frequency domain is also discussed. More information is available at http://www.atc-labs.com/technology/misc/windows.

  • 293
  • 325