Skip to content

Commit 9085854

Browse files
Merge pull request #1850 from IntelPython/add-as-contig-specialization
Dedicated code to copy array to C-contig/F-contig destinations
2 parents 23c9e14 + d088227 commit 9085854

File tree

9 files changed

+1570
-5
lines changed

9 files changed

+1570
-5
lines changed

CHANGELOG.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,8 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
1010

1111
### Change
1212

13-
* Improved efficiency of copy-and-cast operations from `numpy.ndarray` to `tensor.usm_ndarray` for contiguous inputs [gh-1829](https://github.com/IntelPython/dpctl/pull/1829)
13+
* Improved performance of copy-and-cast operations from `numpy.ndarray` to `tensor.usm_ndarray` for contiguous inputs [gh-1829](https://github.com/IntelPython/dpctl/pull/1829)
14+
* Improved performance of copying operation to C-/F-contig array, with optimization for batch of square matrices [gh-1850](https://github.com/IntelPython/dpctl/pull/1850)
1415

1516
### Fixed
1617

dpctl/tensor/CMakeLists.txt

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -123,6 +123,7 @@ set(_tensor_impl_sources
123123
${CMAKE_CURRENT_SOURCE_DIR}/libtensor/source/tensor_ctors.cpp
124124
${CMAKE_CURRENT_SOURCE_DIR}/libtensor/source/accumulators.cpp
125125
${CMAKE_CURRENT_SOURCE_DIR}/libtensor/source/copy_and_cast_usm_to_usm.cpp
126+
${CMAKE_CURRENT_SOURCE_DIR}/libtensor/source/copy_as_contig.cpp
126127
${CMAKE_CURRENT_SOURCE_DIR}/libtensor/source/copy_numpy_ndarray_into_usm_ndarray.cpp
127128
${CMAKE_CURRENT_SOURCE_DIR}/libtensor/source/copy_for_reshape.cpp
128129
${CMAKE_CURRENT_SOURCE_DIR}/libtensor/source/copy_for_roll.cpp

0 commit comments

Comments
 (0)