Mastodon discussion Discussions Apr 15 4 views

by Rost Glukhov

Compare llama.cpp speeds on a 16 GB GPU for dense and MoE models at 19K, 32K, and 64K context. Tables list VRAM, GPU load, and tokens per second. #Self-Hosting #LLM #AI #Hardware #NVidia https://www.glukhov.org/llm-performance/benchmarks/best-llm-on-16gb-vram-gpu/

AI Hardware

Metadata

Reblogs Count: 1
Favourites Count: 2
Account: ros@techhub.social

Mastodon discussion 36m ago

I've been embracing Claude Code a little more each week. It's really helpful for just going over really mundane cleanup tasks in a project when I open something ancient. I don't vi...

Mastodon discussion 37m ago

Google publishes an optimization guide for generative AI search, stating plainly that AEO/GEO is "the same as SEO" | TECH NOISY https://www.yayafa.com/2803426/ #AgenticAi #AI #ArtificialGeneralIntelligence #ArtificialIntelligence #DeepMind #Gemini #Go...

Mastodon discussion 38m ago

ICYMI: Conde Nast CEO: human journalism will win in the age of AI slop: Conde Nast CEO Roger Lynch explains why Vogue and The New Yorker thrive as AI floods the web with low-qualit...
