Core APIs do not demux, and the `stream_index` parameter has (almost) no effect

Alternative title: The C++ and core ops work fine as long as we add only one stream. They break if we add more than one stream.



Example 1:

```py
from torchcodec.decoders import _core as core

# This video has stream 0 with dimensions torch.Size([3, 180, 320]) and stream 3 with dimensions torch.Size([3, 270, 480])
decoder = core.create_from_file("test/resources/nasa_13013.mp4")
core.add_video_stream(decoder, stream_index=0)
core.add_video_stream(decoder, stream_index=3)

for frame_index in range(100):
    frame, _, _ = core.get_frame_at_index(decoder, stream_index=0, frame_index=frame_index)
    print(frame.shape)  # torch.Size([3, 270, 480]). This is stream 3, not stream 0.
```

Example 2:

```py
from torchcodec.decoders import _core as core

decoder = core.create_from_file("test/resources/nasa_13013.mp4")
core.add_video_stream(decoder, stream_index=0)

frame, _, _ = core.get_frame_at_index(decoder, stream_index=3, frame_index=5)  # This should error but doesn't
print(frame.shape)  # torch.Size([3, 180, 320]). This is Stream 0, not stream 3.
```

------

None of the core APIs or C++ APIs actually do demuxing. I.e. the `stream_index` parameter is **never** used to filter and select frames. The only way it is used is to seek.

This may be more clear by looking at the call-stack of our decoding entry-points.

![Image](https://github.com/user-attachments/assets/fb19ede5-4356-4a7f-a727-d5dbbfdc6057)

All but one rely on `getFrameAtIndexInternal`, which will use the `streamIndex` to set the cursor:

https://github.com/pytorch/torchcodec/blob/288bb838f95b1b6113b0f2785165c6ed8df5652b/src/torchcodec/decoders/_core/VideoDecoder.cpp#L1254-L1255

but then immediately return the frame that is returned by `getNextFrameNoDemuxInternal()`, which doesn't demux anything.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Core APIs do not demux, and the `stream_index` parameter has (almost) no effect #476

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

	setCursorPtsInSeconds(ptsToSeconds(pts, streamInfo.timeBase));
	return getNextFrameNoDemuxInternal(preAllocatedOutputTensor);

Core APIs do not demux, and the stream_index parameter has (almost) no effect #476

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Core APIs do not demux, and the `stream_index` parameter has (almost) no effect #476