Home Technology Hardware Aplication Services About us ÖÐÎÄ
Technology
Classic Vocoder
Neural Vocoder
Echo Cancellation
Noise Suppression
Neural Vocoder Position : Home  Neural Vocoder 
 
Neural vocoders adopt an end-to-end neural audio codec framework, with the core architecture featuring a three-stage Encoder¨CQuantizer¨CDecoder structure.
At the encoding end, the algorithm is typically based on a one-dimensional convolutional network, mapping the time-domain waveform into a low-dimensional continuous latent representation. This is followed by multi-stage discretization using Residual Vector Quantization (RVQ), compressing the continuous representation into finite codebook indices to achieve a controllable bitrate (e.g., 0.6¨C24 kbps). Finally, a symmetrically structured decoder reconstructs the time-domain waveform. The training phase employs an end-to-end optimization strategy, with the loss function usually comprising multi-scale STFT loss + perceptual adversarial loss, where the discriminator is used to enhance subjective audio quality.
This approach¡ªtime-domain convolutional autoencoding + residual vector quantization compression + perceptual adversarial training optimization¡ªis well-suited for designing low-bitrate neural vocoders. Through end-to-end modeling, combined with joint constraints in both the frequency and time domains, it achieves significantly better perceptual quality compared to traditional parametric vocoders.
 
 
Copyright© Nanjing Wutong Microelectronics Technology Co.,Ltd. All rights reserved.    Record No. ËÕICP±¸18028114ºÅ
   info@indusic.com 025-84813173 025-84812173
Building 2, Huaye Park, Nanjing Qilin Qidi Science and Technology City