HunyuanVideo w. BitsAndBytes (local): Expected all tensors to be on the same device #10500

### Describe the bug

The HunyuanVideo examples on this docs page fail when run locally:
[hunyuan_video](https://huggingface.co/docs/diffusers/main/en/api/pipelines/hunyuan_video)

### Reproduction

Run this code from the link:

```python
import torch
from diffusers import BitsAndBytesConfig as DiffusersBitsAndBytesConfig, HunyuanVideoTransformer3DModel, HunyuanVideoPipeline
from diffusers.utils import export_to_video

quant_config = DiffusersBitsAndBytesConfig(load_in_8bit=True)
transformer_8bit = HunyuanVideoTransformer3DModel.from_pretrained(
    "tencent/HunyuanVideo",
    subfolder="transformer",
    quantization_config=quant_config,
    torch_dtype=torch.float16,
)

pipeline = HunyuanVideoPipeline.from_pretrained(
    "tencent/HunyuanVideo",
    transformer=transformer_8bit,
    torch_dtype=torch.float16,
    device_map="balanced",
)

prompt = "A cat walks on the grass, realistic style."
video = pipeline(prompt=prompt, num_frames=61, num_inference_steps=30).frames[0]
export_to_video(video, "cat.mp4", fps=15)
```
This gives the following error:
`HTTPError: 404 Client Error: Not Found for url: https://huggingface.co/tencent/HunyuanVideo/resolve/main/transformer/config.json`

Changing the repo path to `hunyuanvideo-community/HunyuanVideo` gets past the 404 but gives a different error:
`RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cpu and cuda:0! (when checking argument for argument index in method wrapper_CUDA__index_select)`
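
To narrow down which component is left on the CPU, something like the following could be run after the pipeline is built. This is only a rough sketch, not code from the docs: `devices_by_component` is a hypothetical helper name, and it assumes that `pipeline.components` is a dict of component names to objects and that model components expose `.parameters()` the way a `torch.nn.Module` does.

```python
from collections import defaultdict


def devices_by_component(pipeline):
    """Return {device: sorted component names} for a diffusers-style pipeline.

    Assumption: `pipeline.components` maps names to components, and model
    components expose `.parameters()` like a torch.nn.Module.
    """
    placement = defaultdict(list)
    for name, component in pipeline.components.items():
        # Schedulers, tokenizers and None entries have no parameters; skip them.
        if not hasattr(component, "parameters"):
            continue
        # A component may be split across devices, so collect every device seen.
        devices = {str(param.device) for param in component.parameters()}
        for device in devices:
            placement[device].append(name)
    return {device: sorted(names) for device, names in placement.items()}
```

Calling `print(devices_by_component(pipeline))` right before the `pipeline(...)` call should show whether some components sit on `cpu` while others are on `cuda:0`, which is the mix the error message complains about.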

The other example on that page crashes with an out-of-memory (OOM) error on an RTX 4090.

(I wanted to check whether [FastHunyuan-diffusers](https://huggingface.co/FastVideo/FastHunyuan-diffusers) would be more VRAM friendly, but I couldn't because of these errors.)

### Logs

```shell
Logs inserted above.
```


### System Info

Windows 11

### Who can help?

@DN6 @a-r-r-o-w
