Skip to content

Pull requests: pytorch/rl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Performance] Hold a single copy of low/high in bounded specs CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2977 opened May 29, 2025 by vmoens Loading…
[Formatting] Rust linter CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2976 opened May 29, 2025 by vmoens Loading…
[Versioning] v0.9 CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. versioning Versioning change (version number etc)
#2975 opened May 29, 2025 by vmoens Loading…
[Formatting] headers and future imports checks CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2973 opened May 23, 2025 by vmoens Loading…
10 tasks
[Feature] Adds per-head entropy coefficients to PPOLoss CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2972 opened May 22, 2025 by felixsittenauer Loading…
4 of 9 tasks
[Algorithm] GRPO scripts CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. new algo New algorithm request or PR
#2970 opened May 22, 2025 by vmoens Loading…
[Refactor] Pass all keys at reset (prototype) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2956 opened May 15, 2025 by vmoens Loading…
10 tasks
[Feature] empty_lazy for lazy tensor storages CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. enhancement New feature or request
#2955 opened May 14, 2025 by vmoens Loading…
[BugFix] Base transform applies _call on reset bug Something isn't working CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2913 opened Apr 22, 2025 by louisfaury Loading…
2 of 9 tasks
[Bugfix] Fix VecNorm eps usage CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2866 opened Mar 22, 2025 by lin-erica Loading…
2 of 10 tasks
v0 param server (using collectives not object store) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2865 opened Mar 21, 2025 by mikaylagawarecki Draft
[Test] Add PEnv tests for devices CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2843 opened Mar 10, 2025 by vmoens Loading…
[DEBUG] ppo compile CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2814 opened Feb 27, 2025 by IvanKobzarev Loading…
10 tasks
[Feature,Deprecation] Split KLRewardTransform in more modules CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2813 opened Feb 27, 2025 by vmoens Loading…
[Feature,Example] Add MCTS algorithm and example CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Examples
#2796 opened Feb 19, 2025 by kurtamohler Loading…
[DRAFT] ppo chess with llm and ConditionalPolicySwitch to sunfish bot CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2763 opened Feb 5, 2025 by mikaylagawarecki Draft
[Example] Self-play chess PPO example CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Examples
#2709 opened Jan 21, 2025 by vmoens Loading…
[WIP] Compute lp during loss execution CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2688 opened Jan 10, 2025 by vmoens Loading…
[Tutorial] MCTS CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2673 opened Dec 19, 2024 by vmoens Loading…
First draft for modular Hindsight Experience Replay Transform CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. enhancement New feature or request
#2667 opened Dec 19, 2024 by dtsaras Draft
3 of 10 tasks
[Tutorial] Beam search with GPT models CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. tutorials
#2623 opened Dec 2, 2024 by vmoens Loading…
[Feature] PPOTrainer CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2550 opened Nov 11, 2024 by vmoens Loading…
[Feature] habitat env from config bug Something isn't working CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. enhancement New feature or request
#2539 opened Nov 6, 2024 by vmoens Loading…
10 tasks
[Examples] boiler plate code for multi-turn reward for RLHF CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. enhancement New feature or request
#2467 opened Oct 5, 2024 by rghosh08 Loading…
3 of 10 tasks
[Algorithm] Update scripts with compile CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2449 opened Sep 23, 2024 by vmoens Loading…
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.