Dev.to tutorial Tutorials Jun 21

LoRA: I Trained

by Suman Nath

Part 2 of a 4-part series. Covers how LoRA works (the low-rank trick), a working PEFT config, and three real GPU walls I hit: the FP16 unscale error, an out-of-memory (OOM) error, and a 2-GPU speed mystery.
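As a rough illustration of the low-rank trick mentioned above: in the standard LoRA formulation, a frozen weight matrix W of shape (d_out, d_in) gets a trainable update B @ A, where B is (d_out, r) and A is (r, d_in), so only r * (d_in + d_out) parameters are trained per matrix. The sketch below just does that arithmetic. The 4096 x 4096 layer size and rank 8 are hypothetical values chosen for illustration, not figures from the article, and `lora_param_counts` is an illustrative helper, not part of PEFT.

```python
def lora_param_counts(d_out: int, d_in: int, rank: int) -> tuple[int, int]:
    """Return (full, lora) trainable-parameter counts for one weight matrix.

    full: parameters updated when fine-tuning W (d_out x d_in) directly.
    lora: parameters in the low-rank factors B (d_out x rank) and A (rank x d_in).
    """
    full = d_out * d_in
    lora = rank * (d_in + d_out)
    return full, lora


# Hypothetical layer size and rank, for illustration only.
full, lora = lora_param_counts(d_out=4096, d_in=4096, rank=8)
print(f"full fine-tune: {full:,} params")
print(f"LoRA (r=8):     {lora:,} params ({lora / full:.2%} of full)")
```

Under these assumed sizes the adapter trains 65,536 parameters against 16,777,216 for the full matrix, which is why the base weights can stay frozen while only the small factors receive gradients.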

Fine-Tuning

Metadata

Devto Id: 3955632
Reading Time Minutes: 3
