Closed
Description
Description
When working on #1416 I found that numba has terrible performance on matrix-vector multiplication:
@ricardoV94 thinks this is because numba probably only uses gemm for everything, and never uses gemv. Since we have a GEMV Op already, it will be very easy to follow the pattern I used in #1416 to make a dispatch for GEMV.
We might also have to add the GEMV rewrite to the numba mode as well -- I know they're disabled for jax, for example. Haven't checked for numba, but something to be aware of.