Commit Graph

16 Commits (80eec21242bc15ec5781480d5eace6fe0ea653b7)

Author SHA1 Message Date
Aurélien Geron 5c8843a53b Merge pull request #290 from 8bitmp3/patch-1
Update (small) the reinforcement learning chapter
2021-03-02 12:11:47 +13:00
B D 64f0e05a94 Minor change on greedy policy variable usage
Chap 18, why not using directly the 'n_outputs' variable defined earlier, instead of hardcoded '2'
2021-02-28 12:02:23 +01:00
Aurélien Geron 749817ccfa Update libraries to latest version, including TensorFlow 2.4.1 and Scikit-Learn 0.24.1 2021-02-18 11:59:02 +13:00
Aurélien Geron 8ebdcffc6b Work around TF Agents issue: env.step(1) => env.step(np.array(1)) 2020-11-23 16:52:37 +13:00
8bitmp3 80f6cb27c0 Update (small) the reinforcement learning chapter 2020-10-17 15:04:51 +01:00
Aurélien Geron 7b3d280a86 Fix error in commented out code, fixes #89 2020-03-31 21:39:51 +13:00
Aurélien Geron cd4e2e1313 Add comment about the reshape operation from the training_step function 2020-03-12 22:51:36 +13:00
Aurélien Geron 49715d4b74 Fix bug in training_step: target_Q_values must be a column vector 2020-03-12 22:47:22 +13:00
Aurélien Geron d6fbc91cf2 Upgrade packages, and add environment-windows.yml 2019-12-14 18:58:01 +08:00
Aurélien Geron 88dccccd5f Make notebooks 14 to 19 runnable in Colab without changes 2019-11-06 21:06:55 +08:00
Aurélien Geron d8971f1767 Fix os.join() => os.path.join() 2019-10-12 18:05:41 +09:30
Aurélien Geron 4c3b7b9b06 Save agent's breakout performance to an animated gif 2019-05-28 09:30:16 +08:00
Aurélien Geron 3ef350ab4c Fix figure name and clarify a couple code examples 2019-05-27 20:35:00 +08:00
Aurélien Geron c5f4b41cf5 Fix breakout plot 2019-05-26 23:56:49 +08:00
Aurélien Geron 2edbb6e9d4 Add Reinforcement Learning notebook 2019-05-26 23:30:39 +08:00
Aurélien Geron 73a36f335f Add warning about TF issue regarding DenseFeatures and the Functional API, fixes #6 2019-05-15 20:23:24 +08:00