Running Qwen3.6-27B on a 16GB M1 MacBook Pro: A Practical Engineer’s Guide

A realistic guide to testing Qwen3.6-27B locally on a 16GB Apple Silicon Mac using MLX, quantization, short prompts, constrained KV cache, and conservative sampling.

Read Original

Related

Dev.to tutorial 21m ago

Chunking for RAG: stop tuning the wrong knob

A practical chunking playbook for RAG: why semantic splitters disappoint, what chunk size + overlap actually buy you, and a small eval harness in Python.