Source: View original notebook on GitHub
Category: Machine Learning / Learn ML
Logistic :
- used for 2 class problems
- Assumption : all samples(rows) are considered independent of each other
- dataset can be numerical, talking about X part of dataset not Y (Y is is always categorical)
Naive :
- can be used for multiclass problems as well
- Assumption : all features(columns) are considered independent of each other
- Of course these assumptions of independence are rarely true, which may explain why some have referred to the model as the "Idiot Bayes" model, but in practice Naive Bayes models have performed surprisingly well, even on complex tasks where it is clear that the strong independence assumptions are false.
- All the dataset columns must be categorical or say nominal(number associated with each class groups) -> multinomial and bernoulli naive bayes , and can be numerical as well (Gaussian Naive Bayes)

