Tag: howto
All the articles with the tag "howto".
-
RAG Over Enterprise Records: The Boring Parts That Matter
Enterprise RAG is trustworthy because of the unglamorous parts: per-user permissions enforced at retrieval, freshness, lineage, and handling records that change. Retrieval is an access-control problem wearing a search costume.
-
Small Fine-Tuned Models Are Beating Frontier on My Workloads
On narrow, high-volume tasks, a fine-tuned small model matches frontier-model quality at a fraction of the cost and latency. Here is the pipeline, the eval bar, and the maintenance bill nobody quotes you.
-
Agentic Workflows Need Guardrails, Not Vibes
How to put real constraints around an agent that touches money or production: bounded tools, approval gates on irreversible actions, dry-run modes, spend limits, and a tool-call audit trail you can actually read.
-
Getting JSON Out of LLMs Without Crying
Function calling and JSON mode get you syntactically valid JSON. They do nothing about a model that fills the right shape with confident nonsense. Here is the validation-and-repair layer you still have to write.