Mastodon discussion Discussions Apr 22 4 views

Compare llama.cpp speeds on a 16 GB GPU for dense and MoE models at 19K, 32K, and 64K context. Tables list VRAM, GPU loa...

by Rost Glukhov

Compare llama.cpp speeds on a 16 GB GPU for dense and MoE models at 19K, 32K, and 64K context. Tables list VRAM, GPU load, and tokens per second.#Self-Hosting #LLM #AI #Hardware #NVidia #llama.cpphttps://www.glukhov.org/llm-performance/benchmarks/best-llm-on-16gb-vram-gpu/

Read Original

AI Hardware

Metadata

Reblogs Count: 1
Account: ros@techhub.social

Mastodon discussion 1m ago

I tried Siri AI, and so far it actually worksSiri, are you there? Parents want one thing, and one thing only, out of AI:...

I tried Siri AI, and so far it actually worksSiri, are you there? Parents want one thing, and one thing only, out of AI: to add a list of soccer games or "spirit week" theme days f...

Mastodon discussion 2m ago

OpenAI、IPOを非公開で申請　「リークを予想し自ら発表」 – CNET Japan https://www.yayafa.com/2819047/ #AgenticAi #AI #ArtificialGeneralIntelligen...

OpenAI、IPOを非公開で申請　「リークを予想し自ら発表」 – CNET Japan https://www.yayafa.com/2819047/ #AgenticAi #AI #ArtificialGeneralIntelligence #ArtificialIntelligence #ipo #OpenAI #エージェント型AI #人工知能 #汎用...

Mastodon discussion 3m ago

“The lawyers on both sides of a federal court case in #Mississippi were caught using artificial intelligence, a situatio...

“The lawyers on both sides of a federal court case in #Mississippi were caught using artificial intelligence, a situation where, effectively, generative #AI tools were used to argu...

Compare llama.cpp speeds on a 16 GB GPU for dense and MoE models at 19K, 32K, and 64K context. Tables list VRAM, GPU loa...

Metadata

Related

I tried Siri AI, and so far it actually worksSiri, are you there? Parents want one thing, and one thing only, out of AI:...

OpenAI、IPOを非公開で申請 「リークを予想し自ら発表」 – CNET Japan https://www.yayafa.com/2819047/ #AgenticAi #AI #ArtificialGeneralIntelligen...

“The lawyers on both sides of a federal court case in #Mississippi were caught using artificial intelligence, a situatio...

OpenAI、IPOを非公開で申請　「リークを予想し自ら発表」 – CNET Japan https://www.yayafa.com/2819047/ #AgenticAi #AI #ArtificialGeneralIntelligen...