AnyInt Docs
Models

Overview

Choose the right AnyInt route family for image understanding, image generation, video generation, and music workflows.

AnyInt currently exposes multimodal capabilities through several route families instead of one single abstraction layer. The main decision is whether you need image understanding, image generation, video generation, or full music creation workflows.

Current multimodal surfaces

SurfaceBest forRequest shapeSync or task-based
Anthropic-compatible messagesText plus image understandingClaude-style messages[].content[] blocksSync
Gemini-compatible routesText generation, image generation, Gemini-native function flowscontents[].parts[]Sync or streaming
DashScope media routesPrompt-driven image and video generationMedia-specific input and parameters payloadsSync for images, task-based for video
AI MusicSong generation, covers, lyrics, stems, and music video workflowsMusic-specific task payloadsTask-based

How to choose

NeedStart withRead next
Ask questions about one or more reference imagesImage InputAnthropic-compatible API or Gemini-compatible API
Generate an image from text or image contextImage GenerationGemini-compatible API and Media API
Generate or poll video tasksVideo GenerationMedia API
Create songs, covers, lyrics, stems, or music videosMedia & MusicAI Music API
Choose a text or reasoning model before adding mediaModelsModels API

Sync vs async matters

Not every multimodal route is synchronous:

  • image generation can return usable content in one response
  • video and music workflows usually involve tasks, polling, and webhooks

That affects UI design, retry logic, and how you store outputs.

Response patternDesign implication
Synchronous responseThe client can use the response immediately after request completion
Streaming responseThe UI must treat chunks as partial state until the stream closes
Task creation responseStore the task ID and show pending or processing state
Callback or polling resultMark the asset ready only after a successful terminal status and output URL

Input and output rules of thumb

RuleWhy it matters
Keep route families separateOpenAI, Anthropic, Gemini, media, and music routes do not share one universal body shape
Use public URLs or supported inline formats when sending mediaThe model provider or gateway must be able to access the asset
Do not assume a model ID implies every modalityThe route contract determines whether image, video, or music behavior is supported
Treat generated assets as customer dataStore output URLs, prompts, and metadata according to your privacy and retention policy
Test with small payloads firstIt is easier to isolate auth, model access, and request-shape issues before using large files

On this page