strategy

Greedy Strategy

Greedy algorithm: simply select the move with the highest neural network output value. Logits are the values at the output stage of the network before they pass through the activation function.

import numpy as np

def greedy(logits):
    # Return the index of the largest element of logits.
    # In a neural network, logits are the output values before the activation function.
    return np.argmax(logits)
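As a small illustration of the greedy choice, the sketch below applies the same `np.argmax` call to an array of five values. The variable names `example_logits`, `best`, and `tied` are made up for this example. One detail worth knowing is that `np.argmax` returns the first index when several elements tie for the maximum.

```python
import numpy as np

def greedy(logits):
    # Index of the largest logit; ties resolve to the first occurrence.
    return np.argmax(logits)

# Illustrative logits for five candidate moves.
example_logits = np.array([-0.2, 0.3, 0.5, 0.0, -0.6])
best = greedy(example_logits)
print(best)  # 2, the position of 0.5

# With a tie for the maximum, the earlier index wins.
tied = greedy(np.array([0.5, 0.1, 0.5]))
print(tied)  # 0
```

Because there is no randomness, the same logits always give the same move.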

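Softmax Strategy

The probability of selecting each move changes with the temperature coefficient: a move is drawn at random with a probability given by the softmax of the logits divided by the temperature (a Boltzmann distribution).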

def boltzmann(logits, temperature):
    # Work on a float copy so the caller's array is not modified in place.
    logits = np.asarray(logits, dtype=float) / temperature  # scale by the temperature
    logits -= logits.max()  # shift so the maximum is 0 and every other value is negative
    probabilities = np.exp(logits)  # exp of values <= 0, so each result lies in (0, 1]
    probabilities /= probabilities.sum()  # normalize so the probabilities sum to 1
    # choice(n, p=probabilities) returns an integer from 0 to n-1, drawn with the given probabilities
    return np.random.choice(len(logits), p=probabilities)

Flow diagram

As a simple example, the diagram shows the processing up to the exp output when there are five outputs (output 1 is -0.2, output 2 is 0.3, output 3 is 0.5, output 4 is 0, and output 5 is -0.6). The temperature is 1.
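The same steps can be traced in code. This is a minimal sketch that mirrors the body of `boltzmann` one line at a time for the five outputs above. The intermediate names `scaled`, `shifted`, and `exp_values` are introduced here for illustration only.

```python
import numpy as np

# The five example outputs from the text, with temperature 1.
logits = np.array([-0.2, 0.3, 0.5, 0.0, -0.6])
temperature = 1.0

scaled = logits / temperature     # step 1: divide by the temperature
shifted = scaled - scaled.max()   # step 2: subtract the maximum, so the largest value becomes 0
exp_values = np.exp(shifted)      # step 3: exp of values <= 0 lies in (0, 1]
probabilities = exp_values / exp_values.sum()  # step 4: normalize to sum to 1

print("shifted:      ", np.round(shifted, 3))
print("exp:          ", np.round(exp_values, 3))
print("probabilities:", np.round(probabilities, 3))
```

With these inputs the shifted values are -0.7, -0.2, 0, -0.5, and -1.1, and output 3 ends up with the largest probability, roughly 0.31.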

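When a temperature is set, the larger the temperature, the closer the scaled outputs are to each other. In other words, the larger the temperature, the more even the move probabilities become. Conversely, the smaller the temperature, the wider the gaps between the scaled outputs, and the probability concentrates on the move with the highest output.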
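The effect of the temperature can be checked numerically. The sketch below computes the probabilities for the five example outputs at three temperatures. `softmax_with_temperature` is a hypothetical helper name: it is the probability part of `boltzmann` without the final sampling step, and the temperatures 0.1, 1, and 10 are chosen only for illustration.

```python
import numpy as np

def softmax_with_temperature(logits, temperature):
    # Hypothetical helper: same arithmetic as boltzmann(), minus the random draw.
    scaled = np.asarray(logits, dtype=float) / temperature
    scaled -= scaled.max()
    exp_values = np.exp(scaled)
    return exp_values / exp_values.sum()

logits = [-0.2, 0.3, 0.5, 0.0, -0.6]
results = {t: softmax_with_temperature(logits, t) for t in (0.1, 1.0, 10.0)}

for t, p in results.items():
    print(f"temperature={t}: {np.round(p, 3)}")
```

At temperature 0.1 most of the probability goes to output 3, while at temperature 10 the five probabilities are all close to 0.2.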

In Chapter 8, randomness is added at the end: the move is chosen at random, but the higher a move's probability, the more likely it is to be selected. Chapter 12 does not do this. I don't understand it well, but perhaps the choice depends on how the model is used?

    return np.random.choice(len(logits), p=probabilities) # choice(n, p=probabilities) returns an integer from 0 to n-1, drawn with the given probabilities
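To see what this random draw does in practice, the sketch below samples many times and compares the observed frequencies with the probabilities. It is an illustration under assumptions, not the book's code: it uses a seeded `np.random.default_rng` generator instead of `np.random.choice` so that the run is repeatable, and the sample count of 10,000 is arbitrary.

```python
import numpy as np

rng = np.random.default_rng(0)  # seeded generator so the run is repeatable

# Softmax probabilities for the five example outputs at temperature 1.
logits = np.array([-0.2, 0.3, 0.5, 0.0, -0.6])
exp_values = np.exp(logits - logits.max())
probabilities = exp_values / exp_values.sum()

# Draw many moves; each index appears roughly in proportion to its probability.
n_samples = 10_000
samples = rng.choice(len(logits), size=n_samples, p=probabilities)
frequencies = np.bincount(samples, minlength=len(logits)) / n_samples

print("greedy always picks:", np.argmax(logits))
print("probabilities:      ", np.round(probabilities, 3))
print("sampled frequencies:", np.round(frequencies, 3))
```

The greedy strategy returns the same index every time, while the sampled moves vary from draw to draw and still favor the outputs with higher probability.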
