Topics

  • The hats are considered estimators, or values we have to calculate to predict
  • and are regression coefficients
  • is random noise

  • Gradient of loss function

  • Coefficient of regression that minimizes
  • Same as

  • Coefficient of regression that minimizes

  • If is not invertible, this happens both/either because
    • , or there are fewer data points than features
    • is not linearly independent, in which case

  • is the variance in
  • is the average value of our sample

  • Used for when the noise is unknown