Blogs

AI in product

9 essays — builder-to-builder, no corporate filler.

All essays
Three Questions Before You Greenlight an AI Feature
Most AI features fail in production not because the model was wrong — but because nobody asked the right questions at scoping.
Mar 2025 · 4 min read
Give Your Coding Agent Memory in 5 Minutes
agentmemory is an open-source tool that lets Claude Code, Cursor, Copilot, and other agents remember your project across sessions. Here's how to run it.
May 2026 · 4 min read
The Fallback IS the Feature
What the user sees when the AI is wrong is more important than what they see when it's right. Spec the fallback first.
Apr 2025 · 5 min read
Why Your AI Accuracy Benchmark Is Dishonest
Curated test sets always perform better than production data. If you're not testing on production-shaped inputs, your benchmark is a press release.
Apr 2025 · 4 min read
Production AI Demands a Degradation Strategy
Models drift, providers throttle, prompts break. If your feature has no plan for what to do when the AI is unavailable, you don't have a production feature.
Mar 2025 · 5 min read
When AI Is a Liability, Not a Feature
The cases where adding AI actively makes the product worse. Recognize them before you spec.
Mar 2025 · 5 min read
The Remove-a-Step Rule
Every AI feature should be tested against one question: does this remove a step the user is already taking? If no, kill it.
Mar 2025 · 3 min read
Evals Matter When You Have Stakes
Build evals when wrong outputs cost money, customers, or trust. Skip them when you can fix a bad output in five minutes.
Feb 2025 · 4 min read
The Case Against Vector Databases
Most teams that 'need' a vector database actually need keyword search and a small JSON file.
Jan 2025 · 4 min read