Topics
H = { h | h : X → Y, h(x) = sign(a) }
- The hypothesis class: the set of functions h we hope will output ŷ, i.e. functions that predict the label of a data point
- a = wᵀx is the activation; the prediction is its sign
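The hypothesis above can be sketched directly: compute the activation a = wᵀx and take its sign. A minimal example (the weight and feature vectors here are made up for illustration):

```python
import numpy as np

def predict(w, x):
    """Hypothesis h(x) = sign(a) with activation a = w^T x."""
    a = w @ x                    # activation: inner product of weights and features
    return 1 if a > 0 else -1    # sign; a == 0 is treated as the negative class here

w = np.array([1.0, -2.0])
print(predict(w, np.array([3.0, 1.0])))   # activation 1.0  -> 1
print(predict(w, np.array([1.0, 1.0])))   # activation -1.0 -> -1
```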
Margin
Margin(D, w) = { min_{(x,y)∈D} y · (wᵀx) / ‖w‖   if w is a separating hyperplane for D;   −∞   otherwise }
- The margin of a hyperplane w is the minimum distance between that hyperplane and any sample in D
- If w does not separate the data, its margin is defined to be −∞
Margin(D) = max_w Margin(D, w)
- The margin of a dataset is the margin of the hyperplane with the greatest margin
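A sketch of both definitions, assuming NumPy. Margin(D, w) follows the formula directly; for Margin(D) the exact maximization over w is the hard-margin SVM problem, so this only approximates it with random search over directions. The 2D dataset is made up (its true margin is 1, achieved by w = [1, 0]):

```python
import numpy as np

def margin(D, w):
    """Margin(D, w): minimum of y * w^T x / ||w|| over D, or -inf if w
    does not separate the data (some sample has y * w^T x <= 0)."""
    m = min(y * (w @ x) / np.linalg.norm(w) for x, y in D)
    return m if m > 0 else -np.inf

def dataset_margin_estimate(D, n_trials=10_000, seed=0):
    """Rough estimate of Margin(D) = max_w Margin(D, w) by sampling random
    directions -- an illustrative sketch, not the exact maximization."""
    rng = np.random.default_rng(seed)
    d = len(D[0][0])
    return max(margin(D, rng.standard_normal(d)) for _ in range(n_trials))

# Hypothetical dataset: margin of the best hyperplane w = [1, 0] is min(2, 1) = 1
D = [(np.array([2.0, 0.0]), 1), (np.array([-1.0, 0.0]), -1)]
```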
Training
- PerceptronTrain(data = N samples/instances = {(x_1, y_1), …, (x_N, y_N)}, maxIter)
    - w := 0 (initialize the weight vector)
    - for i = 1 through maxIter
        - Potentially shuffle data here
        - for (x, y) in data
            - a := wᵀx
            - if ay ≤ 0 (i.e. the sample is misclassified) then
                - w := w + yx
    - return w
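The pseudocode above translates almost line for line into NumPy; the tiny separable dataset at the bottom is made up for illustration:

```python
import numpy as np

def perceptron_train(data, max_iter):
    """Perceptron training loop: w starts at 0 and is nudged by y*x on mistakes."""
    w = np.zeros(len(data[0][0]))
    for _ in range(max_iter):        # optionally shuffle data each pass
        for x, y in data:
            a = w @ x                # activation
            if a * y <= 0:           # misclassified (ay <= 0 counts the boundary too)
                w = w + y * x        # move the boundary toward this example
    return w

# Hypothetical linearly separable data:
data = [(np.array([1.0, 1.0]), 1), (np.array([-1.0, -1.0]), -1)]
w = perceptron_train(data, 10)
```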
- AveragedPerceptronTrain(data = N samples/instances = {(x_1, y_1), …, (x_N, y_N)}, maxIter)
    - w := 0, μ := 0
    - for i = 1 through maxIter
        - Potentially shuffle here
        - for (x, y) in data
            - a := wᵀx
            - if ay ≤ 0 (i.e. the sample is misclassified) then
                - w := w + yx
                - This effectively moves the boundary plane defined by w toward classifying this example correctly
            - μ := μ + w (accumulated after every example, whether or not w was updated)
    - return μ
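The averaged variant can be sketched the same way; μ accumulates w after every example, and since rescaling a weight vector by a positive constant doesn't change its decision boundary, returning the sum is equivalent to returning the average. The example data is made up:

```python
import numpy as np

def averaged_perceptron_train(data, max_iter):
    """Averaged perceptron: same update as the plain perceptron, but the
    returned vector is the accumulation of w over all steps."""
    d = len(data[0][0])
    w, mu = np.zeros(d), np.zeros(d)
    for _ in range(max_iter):       # optionally shuffle each pass
        for x, y in data:
            a = w @ x               # activation
            if a * y <= 0:          # misclassified
                w = w + y * x
            mu = mu + w             # accumulate even when no update happened
    return mu

# Hypothetical linearly separable data:
data = [(np.array([1.0, 1.0]), 1), (np.array([-1.0, -1.0]), -1)]
mu = averaged_perceptron_train(data, 2)
```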
- Runtime: O(dN) per pass through the data (d = number of features, N = number of samples), so O(dN · maxIter) overall