Model
Model
is a custom resource definition (CRD) that represents a Ollama server instance in the cluster.
When created, the creation of Model
triggers the creation of Deployment
, Service
.
If it is created for the first time, an additional StatefulSet and PersistentVolumeClaim will be created to store and persist the downloaded Ollama image.
Full CRD Reference
yaml
apiVersion: ollama.ayaka.io/v1
kind: Model
metadata:
name: phi
spec:
# Scale the model to 2 replicas
replicas: 2
# Use the model image `phi`
image: phi
imagePullPolicy: IfNotPresent
resources:
limits:
cpu: 4
memory: 8Gi
nvidia.com/gpu: 1 # If you got GPUs
requests:
cpu: 4
memory: 8Gi
nvidia.com/gpu: 1 # If you got GPUs
storageClassName: local-path
# If you have your own PersistentVolumeClaim created
persistentVolumeClaim: your-pvc
# If you need to specify the access mode for the PersistentVolume
persistentVolume:
accessMode: ReadWriteOnce
Contributors
Neko Ayaka