Models top-level element
The top-level models section defines AI models used by the Compose application.
Example 1
services:
app:
image: app
models:
- ai_model
models:
ai_model:
model: ai/model
Example 2
services:
app:
image: app
models:
my_model:
endpoint_var: MODEL_URL
models:
my_model:
model: ai/model
context_size: 1024
runtime_flags:
- "--a-flag"
- "--another-flag=42"
Attributes
model(required) is the OCI artifact identifier for model to be pulled and ran by a model runner, which exposes API to application servicescontext_sizedefines the context size for the model (tokens)runtime_flagsdefines some raw runtime flags to pass to the inference engine