Parallax: Parameterized Local Linear Attention for Language Modeling
Large Language Models (LLMs) have become the central paradigm in artificial intelligence, yet the core computational primitive of attention has remained structurally unchanged. Loc...