Skip to content

Commit 5c3cc91

Browse files
fpagnybene2k1
andauthored
feat(infr): add list of known working custom models (#5081)
* feat(infr): add list of known working custom models * Update pages/managed-inference/reference-content/supported-models.mdx --------- Co-authored-by: Benedikt Rollik <brollik@scaleway.com>
1 parent 21afd8c commit 5c3cc91

File tree

1 file changed

+14
-0
lines changed

1 file changed

+14
-0
lines changed

pages/managed-inference/reference-content/supported-models.mdx

Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -34,6 +34,7 @@ You can find a complete list of all models available in Scaleway's catalog on th
3434
We recommend starting with a variation of a supported model from the Scaleway catalog.
3535
For example, you can deploy a [quantized (4-bit) version of Llama 3.3](https:/huggingface.co/unsloth/Llama-3.3-70B-Instruct-bnb-4bit).
3636
If deploying a fine-tuned version of Llama 3.3, make sure your file structure matches the example linked above.
37+
Examples whose compatibility has been tested are available in [tested models](#known-compatible-models).
3738
</Message>
3839

3940
To deploy a custom model via Hugging Face, ensure the following:
@@ -235,3 +236,16 @@ Custom models must conform to one of the architectures listed below. Click to ex
235236
* `MedusaModel`
236237
* `MLPSpeculatorPreTrainedModel`
237238
</Concept>
239+
240+
## Known compatible models
241+
242+
Several models have already been verified to work on Managed Inference custom models. This list is not exhaustive and is updated gradually. Click to expand the full list.
243+
244+
<Concept>
245+
## Models verified for compatibility
246+
The following model compatibility has been verified:
247+
* `ibm-granite/granite-vision-3.2-2b`
248+
* `ibm-granite/granite-3.3-2b-instruct`
249+
* `microsoft/phi-4`
250+
* `Qwen/Qwen3-32B`
251+
</Concept>

0 commit comments

Comments
 (0)