1 parent 44a2212 commit 3f43b6e
examples/quantize/README.md
@@ -34,7 +34,7 @@ Run the quantized model:

```bash
# start inference on a gguf model
-./llama-cli -m ./models/mymodel/ggml-model-Q4_K_M.gguf -n 128
+./llama-cli -m ./models/mymodel/ggml-model-Q4_K_M.gguf -cnv -p "You are a helpful assistant"
```

When running the larger models, make sure you have enough disk space to store all the intermediate files.