Return pts of first frame in audio API #552

NicolasHug · 2025-03-12T18:07:17Z

I thought we didn't need this, but we do: it's required to implement the public sample-based API (which is out of scope for this PR).

NicolasHug · 2025-03-12T18:08:58Z

src/torchcodec/decoders/_core/VideoDecoder.cpp

@@ -838,7 +838,7 @@ VideoDecoder::FrameBatchOutput VideoDecoder::getFramesPlayedInRange(
  return frameBatchOutput;
 }

-torch::Tensor VideoDecoder::getFramesPlayedInRangeAudio(
+VideoDecoder::AudioFramesOutput VideoDecoder::getFramesPlayedInRangeAudio(


I decided to create a new AudioFramesOutput struct instead of relying on the existing FrameOutput, which contains unnecessary fields like streamIndex and durationSeconds. For now, those aren't needed for audio. No super strong opinion though.

scotts · 2025-03-12

I think that's the right call, since the tensors are going to be very different.
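To make the trade-off in this thread concrete, here is a minimal sketch of the two structs side by side. It is not the code from this PR: `SampleBuffer` is a hypothetical stand-in for `torch::Tensor` so the snippet compiles without libtorch, `FrameOutput` is reduced to the fields named in this discussion, and the field names on `AudioFramesOutput` are an assumption.

```cpp
#include <vector>

// Hypothetical stand-in for torch::Tensor, used only so this sketch
// builds without libtorch.
using SampleBuffer = std::vector<float>;

// Per-frame output, reduced to the fields mentioned in this thread.
struct FrameOutput {
  SampleBuffer data;
  int streamIndex = -1;
  double ptsSeconds = 0.0;
  double durationSeconds = 0.0;
};

// Audio-specific output (field names assumed): the decoded samples plus
// the pts of the first frame, without streamIndex or durationSeconds.
struct AudioFramesOutput {
  SampleBuffer data;
  double ptsSeconds = 0.0;
};
```

Keeping the audio struct separate means audio callers never see fields that have no meaning for them yet, at the cost of one more type to maintain.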

NicolasHug · 2025-03-12T18:10:17Z

src/torchcodec/decoders/_core/VideoDecoder.cpp

-      tensors.push_back(frameOutput.data);
+      firstFramePtsSeconds =
+          std::min(firstFramePtsSeconds, frameOutput.ptsSeconds);
+      frames.push_back(frameOutput.data);


It's now a bit ugly that this function manipulates both a FrameOutput and an AudioFramesOutput.

This is mostly because convertAVFrameToFrameOutput returns a FrameOutput, but maybe we could bypass it and directly call convertAudioAVFrameToFrameOutputOnCPU. I'll write a TODO to investigate this.

scotts · 2025-03-12

Oh, yeah, that's cumbersome. If we're in an audio-only call, we shouldn't have to deal with video structures.
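The pattern in the diff hunk above (track the smallest pts while collecting frame data) can be sketched in isolation as follows. This is an illustration under assumptions, not the PR's implementation: `collectAudioFrames` is a hypothetical helper, `SampleBuffer` stands in for `torch::Tensor`, both structs are trimmed to the fields used here, and returning a pts of 0.0 for empty input is a choice made only for this sketch.

```cpp
#include <algorithm>
#include <limits>
#include <vector>

// Hypothetical stand-in for torch::Tensor.
using SampleBuffer = std::vector<float>;

// Trimmed versions of the structs discussed in this thread.
struct FrameOutput {
  SampleBuffer data;
  double ptsSeconds = 0.0;
};

struct AudioFramesOutput {
  SampleBuffer data;
  double ptsSeconds = 0.0;
};

// Hypothetical helper: gathers per-frame outputs into one audio output
// and records the earliest pts seen, mirroring the std::min update in
// the diff.
AudioFramesOutput collectAudioFrames(const std::vector<FrameOutput>& decoded) {
  std::vector<SampleBuffer> frames;
  double firstFramePtsSeconds = std::numeric_limits<double>::max();

  for (const FrameOutput& frameOutput : decoded) {
    firstFramePtsSeconds =
        std::min(firstFramePtsSeconds, frameOutput.ptsSeconds);
    frames.push_back(frameOutput.data);
  }

  AudioFramesOutput output;
  // Concatenate the samples of all frames, in decode order.
  for (const SampleBuffer& frame : frames) {
    output.data.insert(output.data.end(), frame.begin(), frame.end());
  }
  // Assumption for this sketch only: report 0.0 when nothing was decoded.
  output.ptsSeconds = decoded.empty() ? 0.0 : firstFramePtsSeconds;
  return output;
}
```

The loop still touches a `FrameOutput` for every frame, which is the awkwardness raised in this thread; a converter that produced audio data directly would remove that intermediate step.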

…audio

Return pts of first frame in audio API

c881dcb

NicolasHug requested a review from scotts March 12, 2025 18:07

facebook-github-bot added the CLA Signed label Mar 12, 2025

NicolasHug commented Mar 12, 2025


scotts approved these changes Mar 12, 2025


NicolasHug added 2 commits March 12, 2025 19:25

Merge branch 'main' of github.com:pytorch/torchcodec into return_pts_…

e26012c

…audio

Add TODO

987b4fe

NicolasHug merged commit d75fc58 into pytorch:main Mar 12, 2025
12 checks passed
