DLA exam project by Federico Lancini, Alessandro Maggioni, and Luca Tramonti.

The aim of this project is to build and compare three models (DDPG, DQL, and PPO) that solve the double inverted pendulum problem, using the environment provided by OpenAI's Gym library.

The notebooks are meant to be run in the following order:

1. environment_inspection.ipynb shows the main features of the environment and gives a short description of it.
2. DDPG_training.ipynb, DQL_training.ipynb, and PPO_training.ipynb train the respective models and track their performance over the course of training. You can skip this step if you do not want to re-train the models and prefer to use the ones already available in the models folder.
3. testing.ipynb tests the trained models and compares them.
4. DDPG_testing.ipynb tests the DDPG agent at different checkpoints and with different malfunction_probability values. malfunction_probability is the probability that the environment malfunctions, meaning that the force actually applied to the cart is smaller than the force commanded by the agent.
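The malfunction described above can be illustrated with a minimal sketch. This is not the code used in DDPG_testing.ipynb: the function name `applied_force` and the `attenuation` factor are hypothetical, and the README only states that the applied force is smaller than the commanded one, not by how much.

```python
import random


def applied_force(commanded_force, malfunction_probability,
                  rng=random, attenuation=0.5):
    """Return the force that actually reaches the cart.

    With probability `malfunction_probability` the commanded force is scaled
    by `attenuation` (a hypothetical factor in [0, 1), assumed here for
    illustration); otherwise it is passed through unchanged.
    """
    if not 0.0 <= malfunction_probability <= 1.0:
        raise ValueError("malfunction_probability must be in [0, 1]")
    if not 0.0 <= attenuation < 1.0:
        raise ValueError("attenuation must be in [0, 1)")
    # rng.random() is uniform on [0, 1), so this branch is taken
    # with probability malfunction_probability.
    if rng.random() < malfunction_probability:
        return commanded_force * attenuation
    return commanded_force


if __name__ == "__main__":
    rng = random.Random(0)  # seeded only to make the demo repeatable
    for p in (0.0, 0.1, 0.5):
        forces = [applied_force(1.0, p, rng=rng) for _ in range(10_000)]
        weakened = sum(f < 1.0 for f in forces) / len(forces)
        print(f"malfunction_probability={p}: weakened fraction ~ {weakened:.3f}")
```

In a real test run, a function like this would sit between the agent's action and the environment's step call, so that the same trained agent can be evaluated under increasing malfunction_probability values.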

Repository contents:

.idea
models
.gitignore
DDPG_testing.ipynb
DDPG_training.ipynb
DQL_training.ipynb
PPO_training.ipynb
README.md
ddpg.py
dql.py
environment_inspection.ipynb
instructions.txt
requirements.txt
testing.ipynb
trials.ipynb
