We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent f361861 commit d23a4d2Copy full SHA for d23a4d2
intermediate_source/reinforcement_q_learning.py
@@ -150,7 +150,7 @@ def __len__(self):
150
# rewards from the uncertain far future less important for our agent
151
# than the ones in the near future that it can be fairly confident
152
# about. It also encourages agents to collect reward closer in time
153
-# than equivalent rewards temporally future away.
+# than equivalent rewards that are temporally far away in the future.
154
#
155
# The main idea behind Q-learning is that if we had a function
156
# :math:`Q^*: State \times Action \rightarrow \mathbb{R}`, that could tell
0 commit comments