Commit 71619f2

kv-cache : add TODO for doing split_equal when split_simple fails
ggml-ci
1 parent 3cf5186 commit 71619f2

1 file changed: +2 -0 lines changed

src/llama-kv-cache.cpp

Lines changed: 2 additions & 0 deletions
@@ -1811,6 +1811,8 @@ llama_memory_state_ptr llama_kv_cache_unified_iswa::init_batch(const llama_batch
 
     auto sbatch = llama_sbatch(batch, hparams.n_embd, true, logits_all);
 
+    // TODO: if we fail with split_simple, we should attempt split_equal
+
     std::vector<llama_ubatch> ubatches;
 
     while (sbatch.n_tokens > 0) {
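
The TODO describes a try-then-fall-back control flow. A minimal sketch of that pattern follows. It is not the llama.cpp implementation: `toy_ubatch`, `toy_split_fn`, and `split_with_fallback` are hypothetical names introduced only for illustration, and the two callables stand in for whatever the real `split_simple` and `split_equal` paths would do.

```cpp
#include <functional>
#include <optional>
#include <vector>

// Hypothetical stand-in for llama_ubatch; only the token count is modeled.
struct toy_ubatch {
    int n_tokens;
};

using toy_ubatches = std::vector<toy_ubatch>;

// A split strategy returns the ubatches on success, or std::nullopt on failure
// (assumption: failure means the resulting ubatches could not be prepared).
using toy_split_fn = std::function<std::optional<toy_ubatches>()>;

// Try the simple split first; only if it fails, attempt the equal split.
// Both callables are placeholders, not real llama.cpp functions.
inline std::optional<toy_ubatches> split_with_fallback(
        const toy_split_fn & try_split_simple,
        const toy_split_fn & try_split_equal) {
    if (auto ubatches = try_split_simple()) {
        return ubatches;
    }
    return try_split_equal();
}
```

In this sketch the second strategy is never invoked when the first one succeeds, so the fallback adds no cost to the common path.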
