Mastodon discussion Discussions 1d ago

Fully Offline Local AI assistant is hungry for GPU memoryRunning Qwen3.6 27B Q8_0 with 256k context in reasoning mode lo...

Fully Offline Local AI assistant is hungry for GPU memoryRunning Qwen3.6 27B Q8_0 with 256k context in reasoning mode loads around 50GB of the GPU memory and gives around 64 tokens/s for prompt+generation and that is quite good for a local model with that much context.Originally published on My Tech Blog:https://minox.cosmichive.com/experiences-from-setting-up-fully-offline-local-only-ai-assisted-workstation/#ai #aiagents #linux #llm

Read Original

AI Hardware

Metadata

Account: minoxian

Mastodon discussion 7m ago

Steer-by-Wire-Fahrrad unterscheidet Kurvenfahrten von möglichem UmkippenEin Steer-by-Wire-Fahrrad ermöglicht die zuverlä...

Steer-by-Wire-Fahrrad unterscheidet Kurvenfahrten von möglichem UmkippenEin Steer-by-Wire-Fahrrad ermöglicht die zuverlässige Unterscheidung zwischen Kurvenfahrten und instabilen F...

Mastodon discussion 7m ago

seriously stupid https://www.cnbc.com/2026/06/16/spacex-spcx-cursor-acquisition-ipo.html #ai #business

Mastodon discussion 7m ago

https://winbuzzer.com/2026/07/01/google-opens-gemini-meeting-notes-to-paid-ai-subscribers-xcxwbn/Google has expanded Gem...

https://winbuzzer.com/2026/07/01/google-opens-gemini-meeting-notes-to-paid-ai-subscribers-xcxwbn/Google has expanded Gemini-powered Google Meet notes to AI Pro and Ultra subscriber...

Fully Offline Local AI assistant is hungry for GPU memoryRunning Qwen3.6 27B Q8_0 with 256k context in reasoning mode lo...

Metadata

Related

Steer-by-Wire-Fahrrad unterscheidet Kurvenfahrten von möglichem UmkippenEin Steer-by-Wire-Fahrrad ermöglicht die zuverlä...

seriously stupid https://www.cnbc.com/2026/06/16/spacex-spcx-cursor-acquisition-ipo.html #ai #business

https://winbuzzer.com/2026/07/01/google-opens-gemini-meeting-notes-to-paid-ai-subscribers-xcxwbn/Google has expanded Gem...