21 changes: 13 additions & 8 deletions content/manuals/compose/bridge/customize.md
@@ -56,8 +56,8 @@
# access a nested attribute using dot notation
{{ if eq $service.deploy.mode "global" }}
kind: DaemonSet
{{ end }}
{{ end }}
{{ end }}
{{ end }}
```

You can check the [Compose Specification JSON schema](https://github.com/compose-spec/compose-go/blob/main/schema/compose-spec.json) for a full overview of the Compose model. This schema outlines all possible configurations and their data types in the Compose model.
@@ -125,13 +125,18 @@

The default transformation also includes templates for applications that use LLMs:

- `model-runner-deployment.tmpl`
- `model-runner-service.tmpl`
- `model-runner-pvc.tmpl`
- `/overlays/model-runner/kustomization.yaml`
- `/overlays/desktop/deployment.tmpl`
- `model-runner-deployment.tmpl`: Generates the Kubernetes Deployment for Docker Model Runner. Customize it to change replica counts, image tags, resource requests and limits, GPU scheduling settings, tolerations, or additional environment variables.
- `model-runner-service.tmpl`: Builds the Service that exposes Docker Model Runner. Update it to switch between ClusterIP, NodePort, or LoadBalancer types, adjust ports, or add annotations for ingress and service meshes.
- `model-runner-pvc.tmpl`: Defines the persistent volume claim used to store downloaded models. Edit it to set storage size, storage class, access modes, or volume annotations required by your storage provider.
- `/overlays/model-runner/kustomization.yaml`: Kustomize overlay applied when you deploy Model Runner to a standalone Kubernetes cluster. Extend it to add patches for labels and annotations, attach NetworkPolicies, or include extra manifests such as PodDisruptionBudgets.

Check failure on line 131 in content/manuals/compose/bridge/customize.md, reported by GitHub Actions / validate (vale): [Vale.Spelling] Did you really mean 'Kustomize'? (line 131, column 48, severity: ERROR)

- `/overlays/desktop/deployment.tmpl`: Desktop-specific deployment template that keeps the in-cluster Model Runner scaled down and points workloads to the host endpoint. Adjust it if you change the Desktop endpoint or want to deploy Model Runner on Desktop instead of relying on the host service.
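
As a rough illustration of the storage settings that `model-runner-pvc.tmpl` controls, a rendered claim might resemble the following. This is a generic Kubernetes PersistentVolumeClaim, not the template's actual output: the claim name, storage class, and size are hypothetical placeholders chosen for the example.

```yaml
# Sketch only: a generic PersistentVolumeClaim, not the real template output.
apiVersion: v1
kind: PersistentVolumeClaim
metadata:
  name: model-runner-models        # hypothetical claim name
spec:
  accessModes:
    - ReadWriteOnce                # access mode; depends on your storage provider
  storageClassName: fast-ssd       # hypothetical storage class
  resources:
    requests:
      storage: 100Gi               # size reserved for downloaded models
```

Access modes and storage classes vary by cluster, so check what your storage provider supports before changing these values.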

Common customization scenarios:

These templates can be extended or replaced to change how Docker Model Runner is deployed or configured.
- Enable GPU support by adding vendor-specific resource requests, limits, and node selectors in `model-runner-deployment.tmpl`.
- Increase or tune storage for model artifacts by editing `model-runner-pvc.tmpl` to set the desired size, storage class, or access mode.
- Expose Model Runner outside the cluster by switching the service type in `model-runner-service.tmpl` or adding ingress annotations in the model-runner overlay.
- Align cluster policies by adding labels, annotations, PodSecurity settings, or NetworkPolicies through `/overlays/model-runner/kustomization.yaml`.
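
For the cluster-policy scenario, a minimal Kustomize overlay sketch could look like this. It assumes a base directory at `../../base` and a `networkpolicy.yaml` file next to the overlay. Both paths, along with the label and annotation values, are hypothetical and are not taken from the Compose Bridge templates.

```yaml
# Sketch only: paths, labels, and annotations below are illustrative assumptions.
apiVersion: kustomize.config.k8s.io/v1beta1
kind: Kustomization

resources:
  - ../../base                     # hypothetical path to the base manifests
  - networkpolicy.yaml             # hypothetical extra manifest shipped with the overlay

# Add a label to every resource without touching selectors.
labels:
  - pairs:
      team: ml-platform            # hypothetical label
    includeSelectors: false

# Add an annotation to every resource.
commonAnnotations:
  example.com/owner: ml-platform   # hypothetical annotation
```

Keeping `includeSelectors: false` avoids changing Deployment and Service selectors, which is usually the safer choice when labeling resources that may already be deployed.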

For more details, see [Use Model Runner](use-model-runner.md).
