Commit Graph

45 Commits (c2d5a87137fd0927fbeaf343907a2ebfb5dcd846)

Author SHA1 Message Date
Aurélien Geron 4f5aa43f92 Remove a warning when computing left_proba 2024-01-19 19:01:59 +13:00
Aurélien Geron b2a2533543 Replace OpenAI Gym with Gymnasium and fix box-2d installation bug 2024-01-18 22:05:41 +13:00
Aurélien Geron 4ca56568ca Fix render code for LunarLander-v2 2022-09-25 22:05:11 +13:00
Aurélien Geron 2228feb218 Update gym library, and remove unneeded libraries for headless servers 2022-09-24 12:23:42 +12:00
Aurélien Geron 0573deb5d3 Use object instead of deprecated np.object 2022-09-22 23:29:25 +12:00
Aurélien Geron 79ce441212 Update lib versions and add pydot, fixes #29 2022-09-22 19:14:01 +12:00
Aurélien Geron 58e5232e29 Remove sklearn version check when sklearn is not needed 2022-04-17 12:17:13 +12:00
Aurélien Geron 5578c334d7 Big update to chapter 18 for the 3rd edition 2022-04-05 21:47:12 +12:00
Aurélien Geron dcb43f2881 Add GPU accelerator to notebooks 14 to 19 2022-03-23 11:02:35 +13:00
Aurélien Geron a2f8391951 Upgrade to TensorFlow 2.8 2022-03-01 11:41:27 +13:00
Aurélien Geron 85171acd17 Require Python 3.7+ 2022-02-19 23:03:20 +13:00
Aurélien Geron bbc1113951 Work in progress for chapters 16 to 19 2022-02-19 22:09:28 +13:00
Aurélien Geron a2c05861f3 Move notebooks again as the SVM chapter is added back 2021-11-23 13:41:17 +13:00
Aurélien Geron ce4fccf74c Large change: Chapter 5 on SVMs moves online 2021-10-15 21:59:45 +13:00
Aurélien Geron fa1ae51184 Large change: replace os.path with pathlib, move to Python 3.7 2021-10-15 21:46:27 +13:00
Aurélien Geron fdb5d1695e Replace pip install mcpi with %pip which always installs in the right environment 2021-10-07 18:10:34 +13:00
Aurélien Geron 7fc088e23d Use atari_py version with ROMs 2021-09-21 16:41:01 +12:00
Aurélien Geron 108fe1fa53 Replace lr with learning_rate in Keras optimizers, fixes #456 2021-08-31 20:54:35 +12:00
Aurélien Geron 1b96533668 Replace 'Open in Colab' button 2021-05-26 07:40:58 +12:00
Peretz Cohen 51b01b6db4 Update 18_reinforcement_learning.ipynb
fix link
2021-05-25 11:19:02 -07:00
Peretz Cohen 4b02bac9da Merge branch 'ageron:master' into add-kaggle-badge 2021-05-24 20:24:08 -07:00
Peretz Cohen 86d50828e2 Update 18_reinforcement_learning.ipynb
add Open in Kaggle badge
2021-05-24 20:17:10 -07:00
Aurélien Geron c3020f1da5 Support running on Kaggle 2021-05-25 12:07:29 +12:00
Aurélien Geron 90e53af92c Install gym[box2d] on Colab for LunarLander-v2 policy gradients exercise solution 2021-03-20 10:46:02 +13:00
Aurélien Geron 9af016e341 Remove redundant heading for LunarLander-v2 policy gradients exercise solution 2021-03-20 10:40:02 +13:00
Aurélien Geron cfd0837f5c Add LunarLander-v2 Policy Gradients exercise solution 2021-03-20 10:04:52 +13:00
Aurélien Geron e9b5dce122 Fix auto-fire, add exercises, explain Space Invaders delta 2021-03-18 22:16:38 +13:00
Aurélien Geron c98ee19363 Fix AtariPreprocessingWithAutoFire typo 2021-03-10 10:45:24 +13:00
Aurélien Geron dd94101c5d Speed up training: I tuned learning rate for DQN variants, and added auto-FIRE for Blockout. Fixes #117 2021-03-09 22:21:08 +13:00
Aurélien Geron 5c8843a53b Merge pull request #290 from 8bitmp3/patch-1
Update (small) the reinforcement learning chapter
2021-03-02 12:11:47 +13:00
B D 64f0e05a94 Minor change on greedy policy variable usage
Chap 18, why not using directly the 'n_outputs' variable defined earlier, instead of hardcoded '2'
2021-02-28 12:02:23 +01:00
Aurélien Geron 749817ccfa Update libraries to latest version, including TensorFlow 2.4.1 and Scikit-Learn 0.24.1 2021-02-18 11:59:02 +13:00
Aurélien Geron 8ebdcffc6b Work around TF Agents issue: env.step(1) => env.step(np.array(1)) 2020-11-23 16:52:37 +13:00
8bitmp3 80f6cb27c0 Update (small) the reinforcement learning chapter 2020-10-17 15:04:51 +01:00
Aurélien Geron 7b3d280a86 Fix error in commented out code, fixes #89 2020-03-31 21:39:51 +13:00
Aurélien Geron cd4e2e1313 Add comment about the reshape operation from the training_step function 2020-03-12 22:51:36 +13:00
Aurélien Geron 49715d4b74 Fix bug in training_step: target_Q_values must be a column vector 2020-03-12 22:47:22 +13:00
Aurélien Geron d6fbc91cf2 Upgrade packages, and add environment-windows.yml 2019-12-14 18:58:01 +08:00
Aurélien Geron 88dccccd5f Make notebooks 14 to 19 runnable in Colab without changes 2019-11-06 21:06:55 +08:00
Aurélien Geron d8971f1767 Fix os.join() => os.path.join() 2019-10-12 18:05:41 +09:30
Aurélien Geron 4c3b7b9b06 Save agent's breakout performance to an animated gif 2019-05-28 09:30:16 +08:00
Aurélien Geron 3ef350ab4c Fix figure name and clarify a couple code examples 2019-05-27 20:35:00 +08:00
Aurélien Geron c5f4b41cf5 Fix breakout plot 2019-05-26 23:56:49 +08:00
Aurélien Geron 2edbb6e9d4 Add Reinforcement Learning notebook 2019-05-26 23:30:39 +08:00
Aurélien Geron 73a36f335f Add warning about TF issue regarding DenseFeatures and the Functional API, fixes #6 2019-05-15 20:23:24 +08:00