I implemented the perceptron learning rule, one of the methods for finding the decision boundary of linearly separable data, in Python without using any machine learning libraries. Since I am a beginner in both Python and machine learning, please point out anything that could be improved.
For "Widrow-Hoff learning rules" that are compared alongside "Perceptron learning rules", see "Implementing Widrow-Hoff learning rules in Python" [http: / /qiita.com/s-kiriki/items/6a90beede4c139558bcc).
An overview of the perceptron learning rule and its mathematical formulas is summarized in the slides below (starting from the middle of the deck).
https://speakerdeck.com/kirikisinya/xin-zhe-renaiprmlmian-qiang-hui-at-ban-zang-men-number-2
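For reference, the update rule implemented in the code below can be written as follows (this summarizes the standard perceptron learning rule, with ρ the learning coefficient and x the feature vector augmented so that x[0] = 1; only misclassified samples trigger an update):

w ← w + ρx (a c1 sample misclassified as c2)
w ← w - ρx (a c2 sample misclassified as c1)

Correctly classified samples leave w unchanged, so w stops changing once every training sample is classified correctly.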
First, find the separating boundary for linearly separable one-dimensional training data, where each point belongs to one of two classes, as shown in the figure below.
As implementation details, the initial weight vector is w = (0.2, 0.3) and the learning coefficient is ρ = 0.5. The actual code looks like this:
# coding: UTF-8
# Implementation example of the one-dimensional perceptron learning rule
import numpy as np
import matplotlib.pyplot as plt

def train(wvec, xvec, is_c1):
    low = 0.5  # Learning coefficient (rho)
    # Update the weights only when the sample is misclassified
    if (np.dot(wvec, xvec) > 0) != is_c1:
        if is_c1:
            wvec_new = wvec + low * xvec
        else:
            wvec_new = wvec - low * xvec
        return wvec_new
    else:
        return wvec

if __name__ == '__main__':
    data = np.array([[1.0, 1], [0.5, 1], [-0.2, 2], [-1.3, 2]])  # Data group
    features = data[:, 0].reshape(data[:, 0].size, 1)  # Feature vectors
    labels = data[:, 1]  # Class labels (this time c1=1, c2=2)
    wvec = np.array([0.2, 0.3])  # Initial weight vector
    is_c1s = (labels == 1)  # Boolean array: True where the sample belongs to c1
    xvecs = np.c_[np.ones(features.size), features]  # Prepend bias component so that xvec[0] = 1
    loop = 100
    for j in range(loop):
        for xvec, is_c1 in zip(xvecs, is_c1s):
            wvec = train(wvec, xvec, is_c1)
    print(wvec)
    print(-(wvec[0] / wvec[1]))
    # Graph depiction
    plt.axhline(y=0, c='gray')
    plt.scatter(features[is_c1s], np.zeros(features[is_c1s].size), c='red', marker="o")
    plt.scatter(features[~is_c1s], np.zeros(features[~is_c1s].size), c='yellow', marker="o")
    # Separation border: w0 + w1*x = 0  =>  x = -w0/w1
    plt.axvline(x=-(wvec[0] / wvec[1]), c='green')
    plt.show()
The weight vector after training is w = (-0.3, 0.75). Substituting this into the boundary equation wx = 0 gives -0.3 + 0.75x = 0, so the discriminant boundary is x = 0.4, and the figure below shows that learning successfully separates the two classes.
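As a quick sanity check (a minimal sketch, not part of the original post), the reported weights can be verified against the four training points:

import numpy as np

wvec = np.array([-0.3, 0.75])  # Weight vector reported after training
data = np.array([[1.0, 1], [0.5, 1], [-0.2, 2], [-1.3, 2]])  # Same data group as above
for x, label in data:
    xvec = np.array([1.0, x])  # Augmented vector with bias component xvec[0] = 1
    predicted_c1 = np.dot(wvec, xvec) > 0  # True means the point is classified as c1
    print(x, label, predicted_c1)  # c1 points (label 1) print True, c2 points (label 2) print False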
Next, as shown in the figure below, find the separating boundary for linearly separable two-dimensional training data, where each point belongs to one of two classes.
As implementation details, np.random.rand is used to generate two linearly separable classes of data, the initial weight vector is w = (2, -1, 3), and the learning coefficient is ρ = 0.5. Note that unlike the one-dimensional example, the labels here are ±1, so the two update branches collapse into the single rule w ← w + ρ · label · x. The actual code looks like this:
# coding: UTF-8
# Implementation example of the two-dimensional perceptron learning rule
import numpy as np
import matplotlib.pyplot as plt

def train(wvec, xvec, label):
    low = 0.5  # Learning coefficient (rho)
    # Update the weights only when the sample is misclassified
    if np.dot(wvec, xvec) * label < 0:
        wvec_new = wvec + label * low * xvec
        return wvec_new
    else:
        return wvec

if __name__ == '__main__':
    train_num = 100  # Number of training data
    # Class 1 training data
    x1_1 = np.random.rand(train_num // 2) * 5 + 1  # x component
    x1_2 = np.random.rand(train_num // 2) * 5 + 1  # y component
    label_x1 = np.ones(train_num // 2)  # Labels (all 1)
    # Class 2 training data
    x2_1 = (np.random.rand(train_num // 2) * 5 + 1) * -1  # x component
    x2_2 = (np.random.rand(train_num // 2) * 5 + 1) * -1  # y component
    label_x2 = np.ones(train_num // 2) * -1  # Labels (all -1)
    x0 = np.ones(train_num // 2)  # x0 is always 1 (bias component)
    x1 = np.c_[x0, x1_1, x1_2]
    x2 = np.c_[x0, x2_1, x2_2]
    xvecs = np.r_[x1, x2]
    labels = np.r_[label_x1, label_x2]
    wvec = np.array([2, -1, 3])  # Initial weight vector, chosen arbitrarily
    loop = 100
    for j in range(loop):
        for xvec, label in zip(xvecs, labels):
            wvec = train(wvec, xvec, label)
    print(wvec)
    plt.scatter(x1[:, 1], x1[:, 2], c='red', marker="o")
    plt.scatter(x2[:, 1], x2[:, 2], c='yellow', marker="o")
    # Separation border: w0 + w1*x + w2*y = 0  =>  y = -(w1/w2)*x - w0/w2
    x_fig = np.array(range(-8, 8))
    y_fig = -(wvec[1] / wvec[2]) * x_fig - (wvec[0] / wvec[2])
    plt.plot(x_fig, y_fig)
    plt.show()
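As a quick check (a hypothetical addition, assuming it is appended to the end of the script above after training), counting the training samples that the learned weights still misclassify should yield zero for linearly separable data like this:

# Count how many training samples are still misclassified after training
errors = sum(1 for xvec, label in zip(xvecs, labels)
             if np.dot(wvec, xvec) * label < 0)
print('misclassified:', errors)  # Expected to print 0 once training has converged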
Since the training data are randomly generated, the weight vector and discriminant function after training differ on each run. The figure below shows one example of an actual execution result, where it can be seen that learning successfully performs the linear separation.