handson-ml

Commit Graph

Author	SHA1	Message	Date
Aurélien Geron	4f5aa43f92	Remove a warning when computing left_proba	2024-01-19 19:01:59 +13:00
Aurélien Geron	b2a2533543	Replace OpenAI Gym with Gymnasium and fix box-2d installation bug	2024-01-18 22:05:41 +13:00
Aurélien Geron	4ca56568ca	Fix render code for LunarLander-v2	2022-09-25 22:05:11 +13:00
Aurélien Geron	2228feb218	Update gym library, and remove unneeded libraries for headless servers	2022-09-24 12:23:42 +12:00
Aurélien Geron	0573deb5d3	Use object instead of deprecated np.object	2022-09-22 23:29:25 +12:00
Aurélien Geron	79ce441212	Update lib versions and add pydot, fixes #29	2022-09-22 19:14:01 +12:00
Aurélien Geron	58e5232e29	Remove sklearn version check when sklearn is not needed	2022-04-17 12:17:13 +12:00
Aurélien Geron	5578c334d7	Big update to chapter 18 for the 3rd edition	2022-04-05 21:47:12 +12:00
Aurélien Geron	dcb43f2881	Add GPU accelerator to notebooks 14 to 19	2022-03-23 11:02:35 +13:00
Aurélien Geron	a2f8391951	Upgrade to TensorFlow 2.8	2022-03-01 11:41:27 +13:00
Aurélien Geron	85171acd17	Require Python 3.7+	2022-02-19 23:03:20 +13:00
Aurélien Geron	bbc1113951	Work in progress for chapters 16 to 19	2022-02-19 22:09:28 +13:00
Aurélien Geron	a2c05861f3	Move notebooks again as the SVM chapter is added back	2021-11-23 13:41:17 +13:00
Aurélien Geron	ce4fccf74c	Large change: Chapter 5 on SVMs moves online	2021-10-15 21:59:45 +13:00
Aurélien Geron	fa1ae51184	Large change: replace os.path with pathlib, move to Python 3.7	2021-10-15 21:46:27 +13:00
Aurélien Geron	fdb5d1695e	Replace pip install mcpi with %pip which always installs in the right environment	2021-10-07 18:10:34 +13:00
Aurélien Geron	7fc088e23d	Use atari_py version with ROMs	2021-09-21 16:41:01 +12:00
Aurélien Geron	108fe1fa53	Replace lr with learning_rate in Keras optimizers, fixes #456	2021-08-31 20:54:35 +12:00
Aurélien Geron	1b96533668	Replace 'Open in Colab' button	2021-05-26 07:40:58 +12:00
Peretz Cohen	51b01b6db4	Update 18_reinforcement_learning.ipynb fix link	2021-05-25 11:19:02 -07:00
Peretz Cohen	4b02bac9da	Merge branch 'ageron:master' into add-kaggle-badge	2021-05-24 20:24:08 -07:00
Peretz Cohen	86d50828e2	Update 18_reinforcement_learning.ipynb add Open in Kaggle badge	2021-05-24 20:17:10 -07:00
Aurélien Geron	c3020f1da5	Support running on Kaggle	2021-05-25 12:07:29 +12:00
Aurélien Geron	90e53af92c	Install gym[box2d] on Colab for LunarLander-v2 policy gradients exercise solution	2021-03-20 10:46:02 +13:00
Aurélien Geron	9af016e341	Remove redundant heading for LunarLander-v2 policy gradients exercise solution	2021-03-20 10:40:02 +13:00
Aurélien Geron	cfd0837f5c	Add LunarLander-v2 Policy Gradients exercise solution	2021-03-20 10:04:52 +13:00
Aurélien Geron	e9b5dce122	Fix auto-fire, add exercises, explain Space Invaders delta	2021-03-18 22:16:38 +13:00
Aurélien Geron	c98ee19363	Fix AtariPreprocessingWithAutoFire typo	2021-03-10 10:45:24 +13:00
Aurélien Geron	dd94101c5d	Speed up training: I tuned learning rate for DQN variants, and added auto-FIRE for Blockout. Fixes #117	2021-03-09 22:21:08 +13:00
Aurélien Geron	5c8843a53b	Merge pull request #290 from 8bitmp3/patch-1 Update (small) the reinforcement learning chapter	2021-03-02 12:11:47 +13:00
B D	64f0e05a94	Minor change on greedy policy variable usage Chap 18, why not using directly the 'n_outputs' variable defined earlier, instead of hardcoded '2'	2021-02-28 12:02:23 +01:00
Aurélien Geron	749817ccfa	Update libraries to latest version, including TensorFlow 2.4.1 and Scikit-Learn 0.24.1	2021-02-18 11:59:02 +13:00
Aurélien Geron	8ebdcffc6b	Work around TF Agents issue: env.step(1) => env.step(np.array(1))	2020-11-23 16:52:37 +13:00
8bitmp3	80f6cb27c0	Update (small) the reinforcement learning chapter	2020-10-17 15:04:51 +01:00
Aurélien Geron	7b3d280a86	Fix error in commented out code, fixes #89	2020-03-31 21:39:51 +13:00
Aurélien Geron	cd4e2e1313	Add comment about the reshape operation from the training_step function	2020-03-12 22:51:36 +13:00
Aurélien Geron	49715d4b74	Fix bug in training_step: target_Q_values must be a column vector	2020-03-12 22:47:22 +13:00
Aurélien Geron	d6fbc91cf2	Upgrade packages, and add environment-windows.yml	2019-12-14 18:58:01 +08:00
Aurélien Geron	88dccccd5f	Make notebooks 14 to 19 runnable in Colab without changes	2019-11-06 21:06:55 +08:00
Aurélien Geron	d8971f1767	Fix os.join() => os.path.join()	2019-10-12 18:05:41 +09:30
Aurélien Geron	4c3b7b9b06	Save agent's breakout performance to an animated gif	2019-05-28 09:30:16 +08:00
Aurélien Geron	3ef350ab4c	Fix figure name and clarify a couple code examples	2019-05-27 20:35:00 +08:00
Aurélien Geron	c5f4b41cf5	Fix breakout plot	2019-05-26 23:56:49 +08:00
Aurélien Geron	2edbb6e9d4	Add Reinforcement Learning notebook	2019-05-26 23:30:39 +08:00
Aurélien Geron	73a36f335f	Add warning about TF issue regarding DenseFeatures and the Functional API, fixes #6	2019-05-15 20:23:24 +08:00

45 Commits (c2d5a87137fd0927fbeaf343907a2ebfb5dcd846)