Reinforcement learning 13 Try Mountain_car with ChainerRL.

This post assumes you have worked through Reinforcement Learning 12. I'm running on Ubuntu 18.04. I took the CartPole program I made earlier and replaced CartPole-v0 with MountainCar-v0. The difficulty level seems to go up.

I swapped the environment in as-is, but something was off. I set gamma to 0.99.
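To see why gamma matters here, the following is a minimal sketch of the discounted return, not the author's code. It assumes the usual MountainCar-v0 reward of -1 per step and an episode that runs the full 200 steps without reaching the goal; the function name `discounted_return` is only illustrative.

```python
def discounted_return(rewards, gamma):
    """Sum of gamma**t * r_t over one episode."""
    total = 0.0
    for t, r in enumerate(rewards):
        total += (gamma ** t) * r
    return total


# Assumption: a failed MountainCar-v0 episode, reward -1 on each of 200 steps.
failed_episode = [-1.0] * 200

# Compare how much of the episode each gamma "sees".
for gamma in (0.9, 0.99, 1.0):
    print(gamma, round(discounted_return(failed_episode, gamma), 2))
```

With a smaller gamma, rewards far in the future contribute almost nothing, so a goal that takes many steps to reach is harder for the agent to value.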

Looking around other sites, people train for a very large number of steps. Is training longer the trick? I set it up as follows.

chainerrl.experiments.train_agent_with_evaluation(
    agent, env,
    steps=1000000,           # Train the agent for 1,000,000 steps
    eval_n_steps=None,       # Evaluate by episode count, not by step count
    eval_n_episodes=1,       # 1 episode is sampled for each evaluation
    eval_max_episode_len=200,  # Maximum length of each evaluation episode
    eval_interval=100,   # Evaluate the agent after every 100 steps
    outdir='result')      # Save everything to 'result' directory
print('Finished.')
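As a rough back-of-the-envelope sketch of what these settings imply: the arithmetic below assumes MountainCar-v0's standard 200-step time limit per episode. The variable names are only illustrative, and nothing here calls ChainerRL.

```python
steps = 1_000_000        # total training steps passed to the trainer
eval_interval = 100      # steps between evaluations
max_episode_len = 200    # assumed MountainCar-v0 time limit per episode

# Every episode lasts at most 200 steps, so training runs at least this many.
min_training_episodes = steps // max_episode_len

# At most one evaluation per interval, so this is an upper bound.
max_evaluations = steps // eval_interval

print(min_training_episodes, max_evaluations)  # 5000 10000
```

Because the evaluation interval is shorter than a full-length episode, evaluation runs are frequent, which is one likely reason the training takes a long time.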

I set epsilon = 0.003.
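Assuming epsilon here is the exploration rate of an epsilon-greedy explorer, the idea can be sketched in plain Python as below. This is not ChainerRL's implementation; `epsilon_greedy` and the Q-values are hypothetical, chosen only to match MountainCar's three actions.

```python
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """Pick a random action with probability epsilon, else the best one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


rng = random.Random(0)       # fixed seed so the run is repeatable
q = [0.1, 0.5, 0.2]          # hypothetical Q-values for the 3 actions
n = 100000

# Count how often the chosen action differs from the greedy action (index 1).
non_greedy = sum(epsilon_greedy(q, 0.003, rng) != 1 for _ in range(n))
print(non_greedy / n)
```

With epsilon this small, the agent almost always follows its current best estimate and explores only rarely.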

Training took a while, but the car can now climb the hill.

Learning curve after training 2000 times (figure: mountaincar_20191118124725.png)

Learning curve after training 10000 times (figure: mountaincar_20191119123112.png)

Training 10,000 times takes 85 minutes. I wish I had a spare computer I could leave running. What should I do when the machine I train on is the mobile computer I use every day?

I'm thinking of starting to use a GPU around the 30th post in this series. While looking into the preparation, I noticed that Chainer is extremely small: about 8 MB on disk. TensorFlow is large, at over 300 MB. I want to use a Radeon as the GPU, but I wonder whether Chainer works with it.
