/// AI HUB
Dashboard News Models Tools Papers Repos Videos Companies Trending
Login

#Multimodal

1044 articles tagged with Multimodal

Latest Trending
Papers with Code paper Apr 6

General Multimodal Protein Design Enables DNA-Encoding of Chemistry

Evolution is an extraordinary engine for enzymatic diversity, yet the chemistry it has explored remains a narrow slice of what DNA can encode. Deep generative models can design new...

Multimodal
21
Mastodon discussion Apr 5

📰 OpenAI Replaces Sora with GPT-4o: New AI Strategy Beats Anthropic in 2026OpenAI has unveiled a groundbreaking new pret...

📰 OpenAI Replaces Sora with GPT-4o: New AI Strategy Beats Anthropic in 2026OpenAI has unveiled a groundbreaking new pretraining model dubbed 'Potato,' signaling a strategic pivot a...

OpenAI Anthropic Video Generation
24
Mastodon discussion Apr 5

📰 Creative AI Stack in 2026: How Human Vision + Machine Learning Are Reshaping Fashion DesignThe creative AI stack is re...

📰 Creative AI Stack in 2026: How Human Vision + Machine Learning Are Reshaping Fashion DesignThe creative AI stack is revolutionizing fashion design by blending human intuition wit...

Multimodal
24
Mastodon discussion Apr 5

📰 Latent Diffusion AI Generates Protein Sequences & 3D Structures in 2026 (PLAID Model)PLAID, a breakthrough multimodal ...

📰 Latent Diffusion AI Generates Protein Sequences & 3D Structures in 2026 (PLAID Model)PLAID, a breakthrough multimodal generative model, leverages latent diffusion to generate bot...

Multimodal
18
Mastodon discussion Apr 4

Explainable AI for blind and low-vision users: navigating trust, modality, and interpretability in the agentic era https...

Explainable AI for blind and low-vision users: navigating trust, modality, and interpretability in the agentic era https://arxiv.org/html/2604.00187v1 "users often internalize AI f...

Multimodal Agents
24
Mastodon discussion Apr 4

🧠 Z.AI ships GLM-5V-TurboZ.AI added GLM-5V-Turbo with positioning around vision-based coding, GUI task execution, and mu...

🧠 Z.AI ships GLM-5V-TurboZ.AI added GLM-5V-Turbo with positioning around vision-based coding, GUI task execution, and multimodal planning across images, video, and text. The notabl...

Multimodal
18
Mastodon discussion Apr 4

📰 Silicon Valley's Vision: The Rise of the Non-Human and the Tech RevolutionSilicon Valley is no longer driven by humans...

📰 Silicon Valley's Vision: The Rise of the Non-Human and the Tech RevolutionSilicon Valley is no longer driven by humans—it's ruled by algorithms. The rise of AI aristocracy marks ...

Multimodal
9
Mastodon discussion Apr 4

📰 OpenAI Retires GPT-4o After Users Said 'I Love You'OpenAI has retired the emotionally resonant GPT-4o AI model in Febr...

📰 OpenAI Retires GPT-4o After Users Said 'I Love You'OpenAI has retired the emotionally resonant GPT-4o AI model in February 2024, sparking global outcry from users who formed deep...

OpenAI Multimodal
9
Mastodon discussion Apr 4

📰 OpenAI, 'Seni Seviyorum' diyen GPT-4o'yı emekliye ayırıyorOpenAI, kullanıcılar tarafından duygusal bağ kurulan GPT-4o ...

📰 OpenAI, 'Seni Seviyorum' diyen GPT-4o'yı emekliye ayırıyorOpenAI, kullanıcılar tarafından duygusal bağ kurulan GPT-4o yapay zeka modelini Şubat 2024'te emekliye ayırıyor. Binlerc...

OpenAI Multimodal
9
Mastodon discussion Apr 4

📰 OpenAI Retires GPT-4o Amid User Outcry Over AI Model LossOpenAI has officially retired GPT-4o, triggering widespread b...

📰 OpenAI Retires GPT-4o Amid User Outcry Over AI Model LossOpenAI has officially retired GPT-4o, triggering widespread backlash from loyal ChatGPT users who valued its human-like r...

OpenAI Multimodal
9
Mastodon discussion Apr 4

📰 OpenAI, GPT-4o’yu Emekli Etti: Kullanıcılar Şokta ve Öğüt İstiyorOpenAI, 2026 yılında GPT-4o modelini resmen emekli et...

📰 OpenAI, GPT-4o’yu Emekli Etti: Kullanıcılar Şokta ve Öğüt İstiyorOpenAI, 2026 yılında GPT-4o modelini resmen emekli etti. Bu karar, milyonlarca kullanıcı tarafından sert tepkiyle...

OpenAI Multimodal
9
Dev.to tutorial Apr 4

Hacking with multimodal Gemma 4 in AI Studio

We’re in an incredibly fun era for building. The friction between "I have a weird idea" and "I have a...

Google Multimodal
20
NewsData.io news Apr 4

Google’s Gemma 4 Bets Big on Multimodal AI That Runs on a Single GPU

Google released Gemma 4, its first natively multimodal open-weight AI model family. The flagship 27B model processes text, images, video, and audio on a single GPU, targeting devel...

Google Multimodal AI Hardware
21
YouTube video Apr 4

2026-04-04 AI News Qwen 3.6 GPT4o 20:00

2026-04-04 AI News Qwen 3.6 GPT4o 20:00 #AI #AINews #TechNews #AITools #LearnAI #TECHNOLOGY #Claude #ChatGPT ...

OpenAI Anthropic Multimodal
15
NewsData.io news Apr 3

The future of RealSense 3D vision with Chris Matthieu

The podcast's guest this week is Chris Matthieu, VP of Developer Ecosystem for RealSense, looking to the future of 3D vision. The post The future of RealSense 3D vision with Chris ...

Multimodal
21
Mastodon discussion Apr 3

Z.AI veröffentlicht das Vision-Coding-Modell GLM-5V-Turbo für multimodale Code-Generierung.Die Architektur nutzt einen C...

Z.AI veröffentlicht das Vision-Coding-Modell GLM-5V-Turbo für multimodale Code-Generierung.Die Architektur nutzt einen CogViT-Vision-Encoder und bietet ein Kontextfenster von 200.0...

Multimodal
18
Mastodon discussion Apr 3

📰 Google Gemma 4 (2026): Open-Source AI with Advanced Reasoning & Multimodal CapabilitiesGoogle has unveiled Gemma 4, it...

📰 Google Gemma 4 (2026): Open-Source AI with Advanced Reasoning & Multimodal CapabilitiesGoogle has unveiled Gemma 4, its most advanced family of open AI models to date. The new of...

Google Multimodal Open Source
9
Mastodon discussion Apr 3

Welcome Gemma 4: Frontier multimodal intelligence on device | HuggingFace bloghttps://huggingface.co/blog/gemma4#ai #gem...

Welcome Gemma 4: Frontier multimodal intelligence on device | HuggingFace bloghttps://huggingface.co/blog/gemma4#ai #gemma #aimodels #gemma4 #opensource #oss

Google Hugging Face Multimodal
18
AlternativeTo tool Apr 3

Alibaba Cloud launches Qwen3.6-Plus with upgraded multimodal & agentic coding capabilities

Alibaba Cloud has released Qwen3.6-Plus, establishing a new standard in agent-driven coding and multimodal artificial intelligence. Building on the earlier Qwen3.5 release, Qwen3.6...

Multimodal Agents
9
Mastodon discussion Apr 3

📰 Discrete Visual Tokens: How Meituan’s 2026 AI Breakthrough Is Transforming Multimodal LearningDiscrete visual tokens a...

📰 Discrete Visual Tokens: How Meituan’s 2026 AI Breakthrough Is Transforming Multimodal LearningDiscrete visual tokens are emerging as a breakthrough in multimodal AI, with compani...

Multimodal
9
AI Blogs (RSS) news Apr 3

[AINews] Gemma 4: The best small Multimodal Open Models, dramatically better than Gemma 3 in every way

A welcome update from Google!

Google Multimodal
24
Mastodon discussion Apr 3

Google Releases Gemma 4 Family Under Apache 2.0, Featuring 2B to 31B Models with MoE and Multimodal CapabilitiesGoogle h...

Google Releases Gemma 4 Family Under Apache 2.0, Featuring 2B to 31B Models with MoE and Multimodal CapabilitiesGoogle has released the Gemma 4 family of open-weight models, derive...

Google Multimodal
18
Mastodon discussion Apr 3

📰 Qwen 3.6 Beats GPT-4o and Claude 3.5 in China’s Blind AI Coding Benchmark (2026)Qwen 3.6 has emerged as China's leadin...

📰 Qwen 3.6 Beats GPT-4o and Claude 3.5 in China’s Blind AI Coding Benchmark (2026)Qwen 3.6 has emerged as China's leading AI programming model after dominating a global blind bench...

Anthropic Multimodal Benchmark
9
Mastodon discussion Apr 3

I have prophetic vision of the AI bubble bursting. Vast data centers lay abandoned in a post apocalyptic wilderness. Rov...

I have prophetic vision of the AI bubble bursting. Vast data centers lay abandoned in a post apocalyptic wilderness. Roving bands of war-boys and other barbarian tribes strip the d...

Multimodal
30
« Previous Page 37 of 44 (1044 items) Next »
AI Hub // AI Intelligence Platform // LIVE FEED // Impressum // Datenschutz © 2026
0 new articles available