Topics

  • The set of hypotheses we hope to be , or the function that can predict the labels of a data point
  • is the activation function

Margin

  • The margin of a hyperplane (when varying ) is the minimum distance between the given hyperplane and a sample
  • If there’s no hyperplane, it’s

  • The margin of a dataset is the margin of the hyperplane with the greatest margin

Training

  • PerceptronTrain(data = N samples/instances = , maxIter)
    • for i = 1 through to maxIter
      • Potentially shuffle data here
      • for (x, y) in data
        • if (i.e. the data is misclassified) then
    • return
  • AveragedPerceptronTrain(data = N samples/instances = , maxIter)
    • for i = 1 through to maxIter11
      • Potentially shuffle here
      • for (x, y) in data
        • if (i.e. the data is misclassified) then
            • This effectively moves the boundary plane defined by toward classifying more correct examples
    • return