Commit 21afd8c

authored
fix(inference): improve custom models requirements (#5079)
1 parent 0543961 commit 21afd8c

1 file changed

+3
-1
lines changed

pages/managed-inference/reference-content/supported-models.mdx

Lines changed: 3 additions & 1 deletion
@@ -52,6 +52,8 @@ Your model repository must include:
 * An `architectures` array (see [supported architectures](#supported-models-architecture) for the exact list of supported values).
 * `max_position_embeddings`
 * Model weights in the [`.safetensors`](https://huggingface.co/docs/safetensors/index) format
+* A `tokenizer.json` file
+  * If you are fine-tuning an existing model, we recommend using the same `tokenizer.json` file as the base model.
 * A chat template included in either:
   * `tokenizer_config.json` as a `chat_template` field, or
   * `chat_template.json` as a `chat_template` field
@@ -232,4 +234,4 @@ Custom models must conform to one of the architectures listed below. Click to ex
 * `EAGLEModel`
 * `MedusaModel`
 * `MLPSpeculatorPreTrainedModel`
-</Concept>
+</Concept>
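
The repository requirements in the first hunk can be checked locally before uploading a custom model. The following is a minimal sketch, not part of the commit: `check_model_repo` and `_load_json` are hypothetical helper names, and the check assumes that `architectures` and `max_position_embeddings` live in `config.json` (the usual Hugging Face layout) and that the `.safetensors` files sit at the top level of the repository. It does not verify that the architecture is one of the supported values.

```python
import json
from pathlib import Path


def _load_json(path):
    """Return the parsed JSON object at `path`, or None if it is missing or invalid."""
    try:
        with open(path, encoding="utf-8") as f:
            data = json.load(f)
    except (OSError, ValueError):
        return None
    return data if isinstance(data, dict) else None


def check_model_repo(repo_dir):
    """Return a list of problems found in a local model repository (empty if none)."""
    repo = Path(repo_dir)
    problems = []

    # Assumption: both fields are stored in config.json.
    config = _load_json(repo / "config.json")
    if config is None:
        problems.append("config.json is missing or is not a JSON object")
    else:
        archs = config.get("architectures")
        if not isinstance(archs, list) or not archs:
            problems.append("config.json has no `architectures` array")
        if "max_position_embeddings" not in config:
            problems.append("config.json has no `max_position_embeddings`")

    # Assumption: weights are stored at the top level of the repository.
    if not any(repo.glob("*.safetensors")):
        problems.append("no .safetensors weight files found")

    # The requirement added by this commit.
    if not (repo / "tokenizer.json").is_file():
        problems.append("tokenizer.json is missing")

    # The chat template may live in either of these two files.
    has_template = any(
        "chat_template" in (_load_json(repo / name) or {})
        for name in ("tokenizer_config.json", "chat_template.json")
    )
    if not has_template:
        problems.append("no `chat_template` field found")

    return problems
```

Calling `check_model_repo("./my-model")` (a placeholder path) returns an empty list when every check passes, and otherwise one message per missing item.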
