Alibaba's Qwen team has unveiled Qwen3.5-LiveTranslate-Flash, a real-time multimodal translation model that processes audio and video simultaneously. The model accepts 60 input languages and produces speech output in 29 languages with just 2.8 seconds of latency. Key features include real-time cloning of the speaker's voice and vision-enhanced comprehension that reads lip movements. It is available via API on Alibaba Cloud. https://www.marktechpost.com/2026/05/20/alibaba-qwen-team-introduces-qwen3-5-livetranslate-flash-real-time-multimodal-interpretation-across-60-languages-at-2-8-second-latency/ #AIagent #AI #GenAI #AIResearch