Fixed reinforcement learning to run with any screen size; added diagram #389

mike9ant · 2018-12-17T23:46:03Z

Fixed reinforcement learning to run with different screen sizes (it looks like the resolution coming in was incorrect, which caused the wrong screen subregion to be accessed and training fo fail).

Also, added some improved comments in the tutorial and a diagram of data-plow to make things more understandable.

Don't merge until it passes tests (we see tutorial updated).

…ams.

soumith · 2018-12-18T03:58:46Z

intermediate_source/reinforcement_q_learning.py

@@ -23,7 +23,10 @@
 As the agent observes the current state of the environment and chooses
 an action, the environment *transitions* to a new state, and also
 returns a reward that indicates the consequences of the action. In this
-task, the environment terminates if the pole falls over too far.
+task, rewards are +1 for every incremental timestep and the environment
+terminates if the pole falls over too far or the crat mover more then 2.4


crat mover -> cart moves

soumith · 2018-12-18T04:00:10Z

intermediate_source/reinforcement_q_learning.py

-policy_net = DQN().to(device)
-target_net = DQN().to(device)
+# Get screen size so that we can initialize layers correctly based on shape
+# returned from AI gym. Typical dimentions at this pont are close to 3x40x90


dimentions -> dimensions

soumith · 2018-12-18T04:00:38Z

intermediate_source/reinforcement_q_learning.py

+# which is the result of a clamped and down-scaled buffer in get_screen()
+init_screen = get_screen()
+_, _, screen_height, screen_width = init_screen.shape
+#screen_height = init_screen.shape[2]


remove commented code?

soumith · 2018-12-19T23:07:56Z

the two failures are because of the mnist deadlock appearing again cc: @yf225

Fixed reinforcement learning to run with any screen size; added diagram

Fixed reinforcement learning to run with any screen size; added diagr…

5f1dcfe

…ams.

soumith reviewed Dec 18, 2018

View reviewed changes

Fixed comment typos and training values.

b7569a3

soumith merged commit 0fa8074 into pytorch:master Dec 19, 2018

holly1238 mentioned this pull request Jun 16, 2021

DQN Tutorial get_screen() constants #219

Closed

rodrigo-techera pushed a commit to Experience-Monks/tutorials that referenced this pull request Nov 29, 2021

Merge pull request pytorch#389 from mike7ant/master

f04ee2e

Fixed reinforcement learning to run with any screen size; added diagram

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fixed reinforcement learning to run with any screen size; added diagram #389

Fixed reinforcement learning to run with any screen size; added diagram #389

Uh oh!

mike9ant commented Dec 17, 2018

Uh oh!

soumith Dec 18, 2018

Uh oh!

soumith Dec 18, 2018

Uh oh!

soumith Dec 18, 2018

Uh oh!

soumith commented Dec 19, 2018

Uh oh!

Uh oh!

Fixed reinforcement learning to run with any screen size; added diagram #389

Fixed reinforcement learning to run with any screen size; added diagram #389

Uh oh!

Conversation

mike9ant commented Dec 17, 2018

Uh oh!

soumith Dec 18, 2018

Choose a reason for hiding this comment

Uh oh!

soumith Dec 18, 2018

Choose a reason for hiding this comment

Uh oh!

soumith Dec 18, 2018

Choose a reason for hiding this comment

Uh oh!

soumith commented Dec 19, 2018

Uh oh!

Uh oh!