Skip to content

Commit bbe1019

Browse files
Change to pass sorting direction as call argument, not template parameter
The intent is to reduce the build time, build memory footprint, and binary size of the sorting_impl module. With this change it stands at 46MB, before it was 72MB.
1 parent 3c05c1b commit bbe1019

File tree

4 files changed

+295
-162
lines changed

4 files changed

+295
-162
lines changed

dpctl/tensor/libtensor/include/kernels/sorting/merge_sort.hpp

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -807,8 +807,7 @@ sycl::event stable_argsort_axis1_contig_impl(
807807
const IndexComp<IndexTy, argTy, ValueComp> index_comp{arg_tp, ValueComp{}};
808808

809809
static constexpr size_t determine_automatically = 0;
810-
size_t sorted_block_size =
811-
(sort_nelems >= 512) ? 512 : determine_automatically;
810+
size_t sorted_block_size = determine_automatically;
812811

813812
const size_t total_nelems = iter_nelems * sort_nelems;
814813

0 commit comments

Comments
 (0)