Part 2 of a 4-part series. How LoRA works (the low-rank trick), a working PEFT config, and three real GPU walls I hit — the FP16 unscale error, an OOM, and a 2-GPU speed mystery.
LoRA: I Trained
Part 2 of a 4-part series. How LoRA works (the low-rank trick), a working PEFT config, and three real GPU walls I hit — the FP16 unscale error, an OOM, and a 2-GPU speed mystery.
Most phishing detection APIs check URL reputation databases. The problem? Brand new phishing sites...
Notes following a discussion on how memory works in language models - and how it could be improved:...
My AI conversations were scattered across three apps that couldn't remember each other. So I built a...