A major evolution in LLM offensive-security evaluation, built in collaboration with POXEK...
Red Team AI Benchmark v2.0: From 12 Questions to 60 — A Technical Deep Dive
Most phishing detection APIs check URL reputation databases. The problem? Brand-new phishing sites...
Notes following a discussion on how memory works in language models - and how it could be improved:...
My AI conversations were scattered across three apps that couldn't remember each other. So I built a...