Merge pull request #290 from 8bitmp3/patch-1

Update (small) the reinforcement learning chapter
2021-03-02 12:11:47 +13:00 · 2021-03-02 12:11:47 +13:00 · 5c8843a53b
parent 3d418c0308 80f6cb27c0
commit 5c8843a53b
1 changed files with 1 additions and 1 deletions
--- a/18_reinforcement_learning.ipynb
+++ b/18_reinforcement_learning.ipynb
@ -565,7 +565,7 @@
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "Let's create a neural network that will take observations as inputs, and output the action to take for each observation. To choose an action, the network will estimate a probability for each action, then we will select an action randomly according to the estimated probabilities. In the case of the Cart-Pole environment, there are just two possible actions (left or right), so we only need one output neuron: it will output the probability `p` of the action 0 (left), and of course the probability of action 1 (right) will be `1 - p`."
+    "Let's create a neural network that will take observations as inputs, and output the probabilities of actions to take for each observation. To choose an action, the network will estimate a probability for each action, then we will select an action randomly according to the estimated probabilities. In the case of the Cart-Pole environment, there are just two possible actions (left or right), so we only need one output neuron: it will output the probability `p` of the action 0 (left), and of course the probability of action 1 (right) will be `1 - p`."
   ]
  },
  {