Stop Wasting LLM Budgets: High-Performance Semantic Caching with Spring AI and pgvector
In 2024, Google paid $2.7 billion to bring Noam Shazeer back. Shazeer invented the transformer...
This is a submission for the June Solstice Game Jam. What I Built: Turing's Mirror is a...
The Problem: I was researching international relocation and found the existing tools...
TL;DR Most AI learning prompts help you recognize ideas. This one trains recall — via a...
When people talk about AI agents, the conversation usually revolves around reasoning, planning,...
Jen Easterly correctly identified that cybersecurity is an aftermarket for software quality failures. Then she celebrated an AI that makes the aftermarket faster. The cure she name...
When a stronger model ships, there are two questions every skill author should want answered, and...
In most TypeScript agent frameworks you wire the task graph by hand. open-multi-agent's runTeam() hands a goal to a coordinator that builds the DAG for you — here's the mechanism, ...
Build an AI agent blind spot detector with unresolved-intent clustering, outcome scoring, trace evidence, review queues, and product feedback loops.
Your agent's idle timer talks to the prompt cache, and most loops are written as if it does not. The five-minute nap is the worst number on the board. Here is why, and the timing h...
A few months ago, a client asked me to rank on 150+ local search queries across 5 cities — with a...
PayPal's real innovation in 1999 wasn't moving money. Banks already moved money. What PayPal did was...
What If Language Understanding Starts With a Dictionary Instead of a Model? LLM →...
The Trap Most Carbon Apps Fall Into: When you say "build a carbon footprint app," the...
A practical, straight-to-the-point field manual for the role The New Stack calls "AI's hottest job"...
Two years ago, my interaction with AI was mostly limited to asking ChatGPT questions and checking its...
A practical explainer on the agent loop, how it works, why it is different from a chatbot, and where human-in-the-loop controls fit in.
Two moments made me start building Loupe. Neither was some grand realization — they were just things...
The two ideas that did the work: Failure modes are already log lines. Turn them into...
If you've spent any time with ChatGPT, Gemini, or Claude, you already know they're impressive. Ask...
If you have been using Claude Code, Codex CLI, Gemini CLI, or any agent framework that supports...
If you have spent any time in the SQLite-adjacent corner of GitHub lately, you have probably seen the...
📖 Originally published on my blog. Part of a series on building with Claude Code. For a while I'd...