Meta's New AI-Powered Audio Codec Claims 10x Better Compression Than MP3

Meta's New AI-Powered Audio Codec Claims 10x Better Compression Than MP3

By Marcus Stevenson

November 20, 2024 at 04:51 AM

Meta has developed an AI-powered audio compression method called 'EnCodec' that achieves 10x better compression than traditional MP3 format while maintaining high audio quality.

Person using audio production software

Person using audio production software

EnCodec uses a three-part system:

  • An encoder that converts uncompressed audio into a lower frame rate representation
  • A quantizer that compresses the signal while preserving essential information
  • A decoder that reconstructs the audio in real-time using neural networks

Audio compression comparison graph

Audio compression comparison graph

The system employs discriminators in a cat-and-mouse game to maintain perceptual quality. While one component tries to spot differences between original and compressed audio, another works to make the compressed version indistinguishable from the original.

Key features:

  • First neural network compression system for 48 kHz stereo audio
  • Exceeds CD-quality sampling rate (44.1 kHz)
  • Designed for voice calls over poor connections
  • Potential applications in metaverse experiences
  • Works without requiring increased bandwidth

While still in research phase, EnCodec represents a significant advancement in audio compression technology, potentially enabling high-quality audio delivery regardless of network conditions.

Businessman checking phone with charts

Businessman checking phone with charts

Fatboy Slim DJing with outstretched arm

Fatboy Slim DJing with outstretched arm

Related Articles

Previous Articles