Projects data onto the dimension that maximizes variance of the data along that dimension, which so happens to be along the eigenvector with the largest eigenvalue when looking at the sample covariance matrix of the data

$Z = XP$

$Z \in R^{n \times k}$ the lower-dimensional representation of the data, where $n$ is the number of datapoints and $k$ is the number of principal components
$X \in R^{n \times d}$ is the data, where $d$ is the dimensionality of the data
$P \in R^{d \times k}$ is the transformation matrix

$Fraction of retained variance = \frac{\sum _{i = 1}^{k} λ _{i}}{\sum _{i = 1}^{d} λ _{i}}$

If you choose the first $k$ highest eigenvalues, this is proportion of variance you retain after PCA

$max_{p} p^{^{⊤}} C_{X} p, ∥ p ∥_{2} = 1$

The goal of PCA is to maximize the variance represented by $p^{^{⊤}} C_{x} p$ by varying $p$ with the constraint that $p$ is normal
$C_{X}$ is the covariance matrix

Knowledge

Explorer

PCA

$Z = XP$

$Fraction of retained variance = \frac{\sum _{i = 1}^{k} λ _{i}}{\sum _{i = 1}^{d} λ _{i}}$

$max_{p} p^{^{⊤}} C_{X} p, ∥ p ∥_{2} = 1$

Training

Graph View

Table of Contents

Backlinks

Knowledge

Explorer

PCA

Z=XP

Fraction of retained variance=∑i=1d​λi​∑i=1k​λi​​

maxp​p⊤CX​p,∥p∥2​=1

Training

Graph View

Table of Contents

Backlinks

$Z = XP$

$Fraction of retained variance = \frac{\sum _{i = 1}^{k} λ _{i}}{\sum _{i = 1}^{d} λ _{i}}$

$max_{p} p^{^{⊤}} C_{X} p, ∥ p ∥_{2} = 1$