GitHub Trending repo Repositories Apr 15 4 views

videlalvaro/emilio: LLM inference engine where every multiply is exp(ln(a) + ln(b)). Runs Qwen2.5-0.5B at ~30 tok/s on Apple GPU — entirely through a single algebraic primitive.

by videlalvaro

LLM inference engine where every multiply is exp(ln(a) + ln(b)). Runs Qwen2.5-0.5B at ~30 tok/s on Apple GPU — entirely through a single algebraic primitive.

Read Original

AI Hardware LLM

Metadata

Stars: 5
Forks: 1
Language: Rust
Watchers: 5
License: MIT

GitHub Trending repo 5h ago

AbhishekK130804/Claude-Mythos-AI-Anthropic-App: Claude pro free Mythos design Opus Cowork Sonnet AI Anthropic App: download free PC android apk iOS, Anthropic Claude API key setup, Claude roleplay mythos client, SillyTavern Claude prompt formatting, custom system prompt jailbreak, Mythos AI creative writing app, Claude 3.5 Sonnet Opus API cost, open source LLM frontend, Claude reverse proxy

Claude pro free Mythos design Opus Cowork Sonnet AI Anthropic App: download free PC android apk iOS, Anthropic Claude API key setup, Claude roleplay mythos client, SillyTavern Clau...

GitHub Trending repo 16h ago

Doorman11991/smallcode: AI coding agent optimized for small LLMs. 87% benchmark with 4B-active model.

AI coding agent optimized for small LLMs. 87% benchmark with 4B-active model.

GitHub Trending repo 1d ago

videlalvaro/emilio: LLM inference engine where every multiply is exp(ln(a) + ln(b)). Runs Qwen2.5-0.5B at ~30 tok/s on Apple GPU — entirely through a single algebraic primitive.

Metadata

Related

Doorman11991/smallcode: AI coding agent optimized for small LLMs. 87% benchmark with 4B-active model.

vorhersager/deep-learning-jax: No description