videlalvaro/emilio: LLM inference engine where every multiply is exp(ln(a) + ln(b)). Runs Qwen2.5-0.5B at ~30 tok/s on Apple GPU — entirely through a single algebraic primitive.

LLM inference engine where every multiply is exp(ln(a) + ln(b)). Runs Qwen2.5-0.5B at ~30 tok/s on Apple GPU — entirely through a single algebraic primitive.

Read Original

Related