What exactly is logistic regression. For binary classification, logistic regression computes the posterior probability of class C1 as a logistic sigmoid acting on a linear function of the feature vector x: p(C1|x)=y(x)=σ(wTx).

σ (.) is the logistic sigmoid function. σ(a)=1/(1+exp(-a)) + d(σ)/d(a)=σ(1-σ).

Now the goal is to maximize the conditional likelihood p(t/w).

  • we take the log to work with the equation and minimize the negative term.
  • To find the minimum of the error, we use simple gradient descent.