Merging tensors of larger models #1

> Currently, only LLaMA-7B is supported since I haven't figured out how to merge the tensors of the bigger models. However, in theory, you should be able to run 65B on a 64GB MacBook

It shouldn't be hard to merge the tensors with my zipslicer library (https://github.com/kir-gadjello/zipslicer), but it's pure Python! If you want to keep the project pure C++, you might want to write a standalone gist script that uses zipslicer to unpack the weight shards into binary files, which the C++ code can then read directly.
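To make the idea concrete, here is a rough sketch of the merge-and-dump step on its own, using only the Python standard library. It deliberately does not call zipslicer; loading the shards is left out. Everything here is an assumption for illustration: the names `merge_shards` and `write_tensor`, the `(rows, cols, values)` shard representation, and the output layout (two native-order int32 dimensions followed by raw float32 data) are all hypothetical. Which axis a given tensor has to be joined along depends on how the checkpoint was sharded, so it is passed in as a parameter.

```python
import struct
from array import array


def merge_shards(shards, axis):
    """Concatenate row-major 2-D shards along `axis` (0 = rows, 1 = columns).

    Each shard is a hypothetical (rows, cols, flat_values) tuple.
    Returns (rows, cols, array('f')) for the merged tensor.
    """
    merged = array("f")
    if axis == 0:
        cols = shards[0][1]
        if any(s[1] != cols for s in shards):
            raise ValueError("all shards must have the same column count")
        # Row-major shards stacked along rows are just appended back to back.
        for _, _, values in shards:
            merged.extend(values)
        return sum(s[0] for s in shards), cols, merged
    if axis == 1:
        rows = shards[0][0]
        if any(s[0] != rows for s in shards):
            raise ValueError("all shards must have the same row count")
        # Joining along columns means interleaving one row from each shard.
        for r in range(rows):
            for _, c, values in shards:
                merged.extend(values[r * c:(r + 1) * c])
        return rows, sum(s[1] for s in shards), merged
    raise ValueError("axis must be 0 or 1")


def write_tensor(path, rows, cols, values):
    """Write a hypothetical header (two int32 dims) plus raw float32 data."""
    with open(path, "wb") as f:
        f.write(struct.pack("=ii", rows, cols))  # native byte order
        values.tofile(f)                         # native-order float32
```

A C++ reader would then only need to read two integers and `rows * cols` floats per file, with no Python dependency at runtime.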
