Fix inference example that lacks required parameters (ggml-org#9035)

Aisuko · arthw · commit 3f43b6ec5e3d · 2024-11-15T11:36:38.000+08:00
Signed-off-by: Aisuko <urakiny@gmail.com>
diff --git a/examples/quantize/README.md b/examples/quantize/README.md
@@ -34,7 +34,7 @@ Run the quantized model:
 
 ```bash
 # start inference on a gguf model
-./llama-cli -m ./models/mymodel/ggml-model-Q4_K_M.gguf -n 128
+./llama-cli -m ./models/mymodel/ggml-model-Q4_K_M.gguf -cnv -p "You are a helpful assistant"
 ```
 
 When running the larger models, make sure you have enough disk space to store all the intermediate files.