MioCodec-25Hz-24kHz: A High-Efficiency Neural Audio Codec for Modern Spoken Language Modeling
The rapid advancement of speech AI and spoken language models has created an urgent need for efficient neural audio codecs. As models grow larger and multilingual datasets expand into tens of thousands of hours, storage efficiency, token compactness, and reconstruction quality become critical bottlenecks. Traditional codecs often focus on perceptual audio quality alone, without considering …