Commit 1e74ebd

Merge pull request #40 from kerthcet/cleanup/update-readme
Update README.md about preheating models
2 parents 13a41b4 + 204d177 commit 1e74ebd

2 files changed, +27 -6 lines changed

README.md

Lines changed: 25 additions & 4 deletions
@@ -47,9 +47,9 @@ _Name Story: the inspiration of the name `Manta` is coming from Dota2, called [M

 Read the [Installation](./docs//installation.md) for guidance.

-### Preheat Models
+### Preheat Model

-A sample to preload the `Qwen/Qwen2.5-0.5B-Instruct` model:
+A sample to preload the `Qwen/Qwen2.5-0.5B-Instruct` model. Once preheated, the model is loaded from the cache instead of being fetched from a cold start.

 ```yaml
 apiVersion: manta.io/v1alpha1
@@ -74,10 +74,31 @@ spec:
     name: Huggingface
     repoID: Qwen/Qwen2.5-0.5B-Instruct
   nodeSelector:
-    zone: zone-a
+    foo: bar
 ```

-### Delete Models
+### Use Model
+
+Once you have a Torrent, you can access the model from the host path `/mnt/models/`. All you need to do is set the Pod label, for example:
+
+```yaml
+metadata:
+  labels:
+    manta.io/torrent-name: "torrent-sample"
+```
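
For context, the label snippet in this hunk could sit inside a complete Pod manifest like the minimal sketch below. This is an illustration and not part of the commit. The Pod name, container image, command, and the explicit `hostPath` volume are all assumptions: the README only states that the model is available under the host path `/mnt/models/` once the label is set, and does not say whether the Pod must mount that directory itself.

```yaml
# Illustrative sketch only; not taken from the commit.
apiVersion: v1
kind: Pod
metadata:
  name: model-consumer                        # hypothetical Pod name
  labels:
    manta.io/torrent-name: "torrent-sample"   # label from the README, ties the Pod to the Torrent
spec:
  containers:
    - name: app
      image: busybox:1.36                     # placeholder image, assumed for illustration
      command: ["sh", "-c", "ls -R /mnt/models && sleep 3600"]
      volumeMounts:
        - name: models
          mountPath: /mnt/models
          readOnly: true
  volumes:
    - name: models
      hostPath:                               # assumption: the Pod mounts the host directory explicitly
        path: /mnt/models
        type: Directory
```

If Manta mounts the model directory into labeled Pods on its own, the `volumes` and `volumeMounts` entries would be unnecessary; check the project documentation before relying on either behavior.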
+
+Note: you can make the Torrent `Standby` by setting `preheat` to false (true by default). Preheating then happens at runtime, which slows down model loading.
+
+```yaml
+apiVersion: manta.io/v1alpha1
+kind: Torrent
+metadata:
+  name: torrent-sample
+spec:
+  preheat: false
+```
+
+### Delete Model

 If you want to remove the model weights once the `Torrent` is deleted, set `ReclaimPolicy=Delete`; the default is `Retain`:

docs/installation.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@
 ### Install

 ```cmd
-VERSION=v0.0.3
+VERSION=v0.0.4
 kubectl apply --server-side -f https://github.com/inftyai/manta/releases/download/$VERSION/manifests.yaml
 ```

@@ -32,7 +32,7 @@ After installation, you will see outputs like:
 ### Uninstall

 ```cmd
-VERSION=v0.0.3
+VERSION=v0.0.4
 kubectl delete -f https://github.com/inftyai/manta/releases/download/$VERSION/manifests.yaml
 ```
