RAG in Production: Beyond the Tutorial (Vector DB Selection, Chunking, Evaluation)
Every RAG tutorial follows the same script. Load documents, split into chunks, embed them, store in a vector database, retrieve the top-k results, pass to an LL...
All posts tagged with #llm
Every RAG tutorial follows the same script. Load documents, split into chunks, embed them, store in a vector database, retrieve the top-k results, pass to an LL...
Every product launch in 2026 seems to include the phrase "AI agent." Customer support chatbot? Agent. Form that auto-fills fields? Agent. A wrapper around a sin...
Last year I shipped an AI agent that cost us $400 in a single afternoon. It got stuck in a loop, calling the same API endpoint over and over, burning tokens on ...
I have been building software for over 18 years. In the last two years, my work has shifted more dramatically than in any previous decade. The shift is not abou...
There is a version of every AI agent that works in a conference room demo. The LLM call succeeds, the tool executes cleanly, the output looks impressive. Then s...