Skip to content

Commit cbd950b

Browse files
committed
iq3_s: make it work on metal for QK_K = 64
Pleasent surprise: the coding was super-block size independent, so all it took was to delete some QK_K == 256 guards.
1 parent e6e61e3 commit cbd950b

File tree

1 file changed

+0
-7
lines changed

1 file changed

+0
-7
lines changed

ggml-metal.metal

Lines changed: 0 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -4489,7 +4489,6 @@ void kernel_mul_mv_iq3_s_f32_impl(
44894489
threadgroup_barrier(mem_flags::mem_threadgroup);
44904490
}
44914491

4492-
#if QK_K == 256
44934492
const int ix = tiisg;
44944493

44954494
device const float * y4 = y + 32 * ix;
@@ -4535,12 +4534,6 @@ void kernel_mul_mv_iq3_s_f32_impl(
45354534

45364535
y4 += 32 * 32;
45374536
}
4538-
#else
4539-
(void) x;
4540-
(void) y;
4541-
(void) yl;
4542-
(void) nb32;
4543-
#endif
45444537

45454538
for (int row = 0; row < N_DST; ++row) {
45464539
all_sum = simd_sum(sumf[row]);

0 commit comments

Comments
 (0)