Guides
Guides for request behavior that cuts across API families, including streaming, tool calling, structured outputs, prompt caching, and fallbacks.
Guides for request behavior that cuts across API families, including streaming, tool calling, structured outputs, prompt caching, and fallbacks.
Choose by behavior
| Need | Page |
|---|---|
| Understand the feature set at a high level | Overview |
| Send partial output to users as it is generated | Streaming |
| Ask models to return predictable JSON-like output | Structured Outputs |
| Let models call functions or tools | Tool Calling |
| Reuse long prompt prefixes when the route supports it | Prompt Caching |
| Keep production workloads usable when a model path fails | Model Fallbacks |
| Configure coding agents or local tools against AnyInt | Agent Tool Integrations |
API family matters
These guides describe patterns, not universal request fields. Always confirm the exact payload shape in API Reference before shipping a production client.