Rough implementation of getting key frame indices #484

scotts · 2025-01-27T21:09:54Z

Only works in exact mode. Note that we had to start explicitly tracking frameIndex in our FrameInfo struct. That allows us to easily know the overall index for a key frame.
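To illustrate why storing the index on each frame record helps, here is a minimal Python sketch. It is not the C++ code from this PR. The `FrameInfo` dataclass, its `pts` and `is_key_frame` fields, and the `scan` helper are hypothetical stand-ins. Only the idea of a `frameIndex` field comes from the description above, and sorting by PTS is an assumption about how the overall index is assigned.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class FrameInfo:
    # Hypothetical stand-in for the C++ FrameInfo struct. Only frame_index
    # mirrors a field named in the PR description; the rest is illustrative.
    pts: int
    is_key_frame: bool
    frame_index: int = -1  # assigned during the scan below


def scan(frames: List[FrameInfo]) -> List[int]:
    """Assign overall indices in PTS order, then return the key frame indices."""
    # Assumption: the overall index is the frame's position in PTS order.
    ordered = sorted(frames, key=lambda f: f.pts)
    for index, frame in enumerate(ordered):
        frame.frame_index = index
    # Each record carries its own index, so collecting the key frames
    # needs no second lookup.
    return [f.frame_index for f in ordered if f.is_key_frame]


# Made-up frames, deliberately out of PTS order.
frames = [
    FrameInfo(pts=0, is_key_frame=True),
    FrameInfo(pts=2000, is_key_frame=False),
    FrameInfo(pts=1000, is_key_frame=False),
    FrameInfo(pts=3000, is_key_frame=True),
]
key_frame_indices = scan(frames)
```

With these made-up frames, the key frames end up at overall indices 0 and 3.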

NicolasHug

Some non-blocking suggestions, approving to unblock

NicolasHug · 2025-01-28T09:43:49Z

src/torchcodec/decoders/_core/VideoDecoderOps.cpp

@@ -48,6 +48,7 @@ TORCH_LIBRARY(torchcodec_ns, m) {
      "get_frames_by_pts_in_range(Tensor(a!) decoder, *, int stream_index, float start_seconds, float stop_seconds) -> (Tensor, Tensor, Tensor)");
  m.def(
      "get_frames_by_pts(Tensor(a!) decoder, *, int stream_index, float[] timestamps) -> (Tensor, Tensor, Tensor)");
+  m.def("get_key_frame_indices(Tensor(a!) decoder, int stream_index) -> int[]");


I think we can be conservative and expose it privately in the Python core API, for now? I.e. as _get_key_frame_indices instead of get_key_frame_indices

src/torchcodec/decoders/_core/video_decoder_ops.py

scotts · 2025-01-28T14:38:06Z

test/decoders/test_video_decoder.py

+        key_frame_indices = decoder._get_key_frame_indices()
+        size = key_frame_indices.size()
+        assert size[0] > 0
+        assert len(size) == 1


@NicolasHug, is there a more PyTorch-y way to assert this?

I can't think of an obvious one. In this specific case, maybe we can just hard-code assert key_frame_indices.shape == (42,), but of course that only works for this video.

On another note, I wonder if we should somewhat check the correctness of the returned values (still non-blocking)?

Yes, we should check the actual return value. I can probably hardcode what we expect in the test based off of what ffprobe returns.

NicolasHug · 2025-01-28T14:43:20Z

src/torchcodec/decoders/_core/video_decoder_ops.py

+@register_fake("torchcodec_ns::_get_key_frame_indices")
+def get_key_frame_indices_abstract(
+    decoder: torch.Tensor, *, stream_index: int
+) -> List[int]:


Nit: the type annotation says list but I think you changed it to tensor?


scotts · 2025-01-28T21:08:24Z

test/decoders/test_video_decoder.py

+        key_frame_indices = decoder._get_key_frame_indices()
+
+        # The key frame indices were generated from the following command:
+        #   $ ffprobe -v error -hide_banner -select_streams v:1 -show_frames -of csv test/resources/nasa_13013.mp4 | grep -n ",I," | cut -d ':' -f 1 > key_frames.txt


I left the line-noise of a shell command on a single line so that it's easier to copy-paste. That makes it harder to read, but making it readable means breaking it up over several lines, and then the # comment markers end up in whatever you copy-paste. Happy to change it if there's a better way.

NicolasHug · 2025-01-29T09:13:11Z

test/decoders/test_video_decoder.py

+        #   4. Using cut to extract just the count for the frame.
+        # Finally, because the above produces a count, which is index + 1, we subtract
+        # one from all values manually to arrive at the values below.


I suppose you meant the same thing, but my understanding is that -n will output the line number (not the count), and the cut part will select that line number (which is 1-based)?

Yeah, I meant the same thing. I can try to clarify.
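For reference, the line-number-to-index conversion discussed above can be sketched in Python. This is an illustration, not part of the PR. The helper name `key_frame_indices_from_csv` is hypothetical, and the sample rows are made up and much shorter than real ffprobe CSV output.

```python
from typing import Iterable, List


def key_frame_indices_from_csv(lines: Iterable[str]) -> List[int]:
    """Return 0-based positions of the lines that describe I-frames.

    Mirrors `grep -n ",I," | cut -d ':' -f 1`, except that enumerate()
    starts at 0, so the manual "subtract one" step is not needed.
    """
    return [index for index, line in enumerate(lines) if ",I," in line]


# Made-up, heavily shortened rows; real ffprobe rows have many more fields.
sample = [
    "frame,video,I,0",
    "frame,video,P,1",
    "frame,video,B,2",
    "frame,video,I,3",
]
indices = key_frame_indices_from_csv(sample)
one_based = [i + 1 for i in indices]  # the line numbers `grep -n` would print
```

Here `one_based` corresponds to what the shell pipeline writes to key_frames.txt, and `indices` corresponds to the values after subtracting one.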

scotts added 2 commits January 27, 2025 13:02

Rough implementation of getting key frame indices

0219f92

Formatting

1bcfaa5

facebook-github-bot added the CLA Signed label Jan 27, 2025

scotts marked this pull request as ready for review January 28, 2025 02:06

scotts requested a review from NicolasHug January 28, 2025 02:06

NicolasHug approved these changes Jan 28, 2025


scotts added 2 commits January 28, 2025 06:09

Make core op private

736401c

Change return type to a 1D tensor of int64

121a9fd

scotts commented Jan 28, 2025


NicolasHug reviewed Jan 28, 2025


scotts added 3 commits January 28, 2025 12:56

Create key frame index manually

ef644f9

Merge branch 'main' of github.com:pytorch/torchcodec into get_key_ind…

1670dc6

…ices

Fix return type

09ada7c

scotts commented Jan 28, 2025


NicolasHug approved these changes Jan 29, 2025


scotts added 2 commits January 29, 2025 10:59

More defensive init and more testing

d80e795

Formatting

1c06f1e

scotts merged commit 298c9c1 into pytorch:main Jan 29, 2025
44 checks passed

scotts deleted the get_key_indices branch January 29, 2025 19:36
