Meta's AI Audio Codec Achieves 10x Better Compression Than MP3
Meta has developed an AI-powered audio compression method called 'EnCodec' that claims to achieve 10x better compression than MP3 while maintaining high audio quality.
Person using audio production software
EnCodec uses a three-part system to compress audio:
- An encoder that converts uncompressed audio into a lower frame rate representation
- A quantizer that compresses the signal while preserving essential information
- A decoder that reconstructs the audio in real-time using neural networks
Audio compression comparison graph
The system employs discriminators in a cat-and-mouse game to ensure the reconstructed audio remains perceptually similar to the original. It's the first neural network-based compression system capable of handling 48 kHz stereo audio, slightly higher than CD quality (44.1 kHz).
Primary applications include:
- Improving voice call quality over poor network connections
- Enabling high-fidelity audio in metaverse experiences
- Delivering quality audio with minimal bandwidth requirements
While still in research phase, EnCodec represents a significant advancement in audio compression technology that could revolutionize digital audio transmission and storage.