Audio decoding TODOs

Basic support was added in https://github.com/pytorch/torchcodec/pull/538, we need the following before we make this public:

- [x] enable backwards seeks https://github.com/pytorch/torchcodec/pull/550
- [x] enable audio formats other than fltp https://github.com/pytorch/torchcodec/pull/556
  - [ ] maybe we don't always need to convert to fltp, especially flt (non-planar) formats, we can probably just output tensors with a different layout or so something smarter that avoids a copy. 
- [x] enable a user-defined `sample_rate` https://github.com/pytorch/torchcodec/pull/551
- [x] expose a public method in `AudioDecoder` https://github.com/pytorch/torchcodec/pull/555
- [ ] maybe, maybe not: something like the normalize parameter of the torchaudio reader, which allows users to specify whether they want a float tensor in [-1, 1], or a tensor with the same dtype as the audio format. We'll figure that out later. We'll probably always return float tensors by default anyway.
- [ ] perf: try to pre-allocate the output tensor and save copies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Audio decoding TODOs #549

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Audio decoding TODOs #549

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions