What exactly is logistic regression. For binary classification, logistic regression computes the posterior probability of class C1 as a logistic sigmoid acting on a linear function of the feature vector x: p(C1|x)=y(x)=σ(wTx)
.
σ (.) is the logistic sigmoid function. σ(a)=1/(1+exp(-a))
+ d(σ)/d(a)=σ(1-σ)
.
Now the goal is to maximize the conditional likelihood p(t/w).
- we take the log to work with the equation and minimize the negative term.
- To find the minimum of the error, we use simple gradient descent.