Software
Retrieval Cost Engineering for OpenSearch RAG
A production RAG system feels “fast” when it’s consistent: answers arrive quickly, follow-ups don’t suddenly slow down, and costs don’t jump with traffic. The biggest lever to achieve that isn’t usually the model—it’s retrieval. When retrieval is engineered like a disciplined subsystem (with budgets,