publish: Avoid redundant buffer copies #5903

Turbo87 · 2023-01-10T17:35:52Z

Previously, we were buffering the incoming hyper::Body into a bytes::Bytes instance, which we then wrapped in a Cursor, which we then used as a Read implementation to fill a Vec<u8>. This was somewhat wasteful because it meant that we were buffering the tarball payload twice: once in the Bytes and once more in the Vec<u8>.

This PR changes the implementation to take more advantage of the Arc-like behavior of bytes::Bytes. We implement a split_body() function, which turns one Bytes instance into two Bytes, one for the JSON metadata, and one for the tarball payload. This will just shuffle around a few pointers underneath, without performing any additional memory allocation.
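The splitting idea above can be sketched with the standard library alone. This is a minimal model, not the code from this PR: `SharedBytes` is a hypothetical stand-in for `bytes::Bytes` (a reference-counted buffer plus a view range), and the body layout (a little-endian `u32` length prefix before the JSON metadata and another before the tarball) is assumed from the Cargo registry publish API.

```rust
use std::ops::Range;
use std::sync::Arc;

/// Hypothetical stand-in for `bytes::Bytes`: a shared buffer plus the
/// range of it that this handle can see.
#[derive(Clone, Debug)]
struct SharedBytes {
    buf: Arc<[u8]>,
    range: Range<usize>,
}

impl SharedBytes {
    fn new(data: Vec<u8>) -> Self {
        let buf: Arc<[u8]> = data.into();
        let range = 0..buf.len();
        SharedBytes { buf, range }
    }

    fn as_slice(&self) -> &[u8] {
        &self.buf[self.range.clone()]
    }

    fn len(&self) -> usize {
        self.range.len()
    }

    /// Splits off the first `n` bytes as a new handle; `self` keeps the rest.
    /// Only the reference count and the range bounds change, so no payload
    /// bytes are copied.
    fn split_to(&mut self, n: usize) -> Option<SharedBytes> {
        if n > self.len() {
            return None;
        }
        let start = self.range.start;
        let head = SharedBytes {
            buf: Arc::clone(&self.buf),
            range: start..start + n,
        };
        self.range.start += n;
        Some(head)
    }
}

/// Consumes a 4-byte little-endian length prefix from the front of `bytes`.
fn read_len_prefix(bytes: &mut SharedBytes) -> Option<usize> {
    let head = bytes.split_to(4)?;
    let s = head.as_slice();
    Some(u32::from_le_bytes([s[0], s[1], s[2], s[3]]) as usize)
}

/// Sketch of a `split_body()`-style function under the assumed layout:
/// returns (JSON metadata, tarball), both backed by the same allocation.
fn split_body(mut body: SharedBytes) -> Option<(SharedBytes, SharedBytes)> {
    let json_len = read_len_prefix(&mut body)?;
    let json = body.split_to(json_len)?;
    let tarball_len = read_len_prefix(&mut body)?;
    let tarball = body.split_to(tarball_len)?;
    Some((json, tarball))
}
```

Both returned handles point into the buffer that was filled once from the request body, which is the property the description relies on. With the real `bytes` crate, `Bytes::split_to` plays the role of the hand-written `split_to` here.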

In this PR we also slightly refactor the S3 uploader implementation to not require a redundant seek operation, which makes the above implementation a bit easier.
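The uploader change can be illustrated in the same spirit. This is a rough sketch under stated assumptions: `Body` is a hypothetical stand-in for `reqwest::Body`, and `upload` is an invented function that returns the values it would put on the request instead of performing any network I/O.

```rust
/// Hypothetical stand-in for `reqwest::Body`: an in-memory payload
/// whose length is known up front.
struct Body {
    data: Vec<u8>,
}

impl From<Vec<u8>> for Body {
    fn from(data: Vec<u8>) -> Self {
        Body { data }
    }
}

impl From<&'static str> for Body {
    fn from(s: &'static str) -> Self {
        Body { data: s.as_bytes().to_vec() }
    }
}

/// Hypothetical uploader entry point. Accepting `Into<Body>` instead of a
/// `Read + Seek` value means the length comes from the body itself, so there
/// is no need to seek to the end and back just to measure the content.
fn upload<B: Into<Body>>(path: &str, content: B) -> (String, usize) {
    let body: Body = content.into();
    // A real uploader would send `body` here; this sketch only reports the
    // target path and the content length it would use.
    (path.to_string(), body.data.len())
}
```

A caller can then pass whatever buffer type it already holds, as long as a conversion into the body type exists.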

Turbo87 added 2 commits January 10, 2023 18:28

uploaders: Accept any content that can be turned into a reqwest::Body

d0aaac7

publish: Avoid redundant buffer copies

d9748c8

Turbo87 added the C-internal 🔧 (Category: Nonessential work that would make the codebase more consistent or clear) and A-backend ⚙️ labels Jan 10, 2023

Turbo87 merged commit b76f361 into rust-lang:master Jan 10, 2023

Turbo87 deleted the publish-buffers branch January 10, 2023 17:43

bors mentioned this pull request Jan 10, 2023

Port to diesel 2.0.3 #4892

Merged

Turbo87 mentioned this pull request Jan 10, 2023

ConduitRequest: Remove Cursor wrapper #5906

Merged
