Interfaze, a YC startup, has open-sourced diffusion-gemma-asr-small, billed as the first multilingual diffusion ASR model. Instead of generating text autoregressively, it uses a diffusion decoder, and a single 42M-parameter adapter handles all six supported languages. The model reports a 6.6% word error rate (WER) on LibriSpeech, ahead of other diffusion-based approaches. https://www.marktechpost.com/2026/07/02/interfaze-ships-diffusion-gemma-asr-small-an-open-source-diffusion-asr-model-transcribing-six-languages-via-diffusiongemmas-parallel-denoising-decoder/ #AIagent #AI #GenAI #Research
