My agent only calls one model. Why do I need an envelope?
Because a loop bug can call that one model ten thousand times in an hour. The model is irrelevant; the loop is the risk.
What is a reasonable starting envelope?
For most internal automation, start at 25 steps, 200,000 tokens, and 5 dollars per run. Tighten or loosen by product after two weeks of data.
Should I let agents call other agents?
Only with a parent envelope that includes the children. Otherwise the recursion compounds and you lose the cap.
How do I price a self hosted retrieval call?
Compute fully loaded cost per call (compute plus storage plus query) and assign it to the call. The number does not need to be perfect; it needs to be present so engineers see the trade off.
Do I need a separate dashboard from AI FinOps?
Yes. The AI FinOps dashboard is per feature. The Agentic dashboard is per agent run id with step level and tool level breakdowns.
What about long lived background agents?
Treat them as a budget per day with a sliding window. If the daily envelope is breached, the agent enters a quiescent mode and pages the owner.
How do I show ROI on agent investment?
Cost per resolved task on the agent versus cost per resolved task before the agent (human or scripted). Both numbers must include all attributable cost.