CS231N Lec. 2 | Image Classification pipeline

Please find the lecture reference here.


KNN classifier

K-nearest neighbor classifier

The Big-O complexity of this algorithm is:
Train: O(1), since it just memorizes (copies) all the data. Predict: O(N), since it compares against all N training examples.

Meaning, training is fast but prediction is slow. This is the opposite of what we want: users expect fast prediction, while slow training is acceptable. That makes KNN unsuitable in practice. A minimal sketch of this trade-off follows.
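Here is a minimal sketch of a nearest-neighbor classifier in NumPy, along the lines of the lecture's example (the class and method names are illustrative):

```python
import numpy as np

class NearestNeighbor:
    def train(self, X, y):
        # O(1): just memorize the training data
        self.X_train = X
        self.y_train = y

    def predict(self, X):
        # O(N) per test example: compare against every training example
        y_pred = np.zeros(X.shape[0], dtype=self.y_train.dtype)
        for i in range(X.shape[0]):
            # L1 distances from test example i to all training examples
            distances = np.sum(np.abs(self.X_train - X[i]), axis=1)
            y_pred[i] = self.y_train[np.argmin(distances)]
        return y_pred
```

Training only stores references, which is why it is constant time; all of the work is deferred to prediction.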

Then, how do we calculate distance?

L1 (Manhattan) distance: $d_1(I_1, I_2) = \sum_p \lvert I_1^p - I_2^p \rvert$. It depends on the choice of coordinate system: rotating the coordinate frame changes L1 distances.

L2 (Euclidean) distance: $d_2(I_1, I_2) = \sqrt{\sum_p (I_1^p - I_2^p)^2}$. It is invariant to rotations of the coordinate frame.
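A quick sketch of both metrics on flattened pixel vectors (NumPy, toy data):

```python
import numpy as np

I1 = np.array([10.0, 20.0, 30.0])  # flattened pixel values (toy example)
I2 = np.array([12.0, 18.0, 33.0])

d_l1 = np.sum(np.abs(I1 - I2))          # Manhattan: sum of absolute differences
d_l2 = np.sqrt(np.sum((I1 - I2) ** 2))  # Euclidean: root of summed squares

print(d_l1)  # 7.0
print(d_l2)  # ~4.12
```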


Which one is better? It's a hyperparameter, so it's best to try both.

Hyperparameters

Choices about the algorithm that we set rather than learn.
They are quite problem-dependent; in practice, try out the candidate settings and see which works best.
e.g.) in the case of KNN: the value of k and the distance metric.

How do we set hyperparameters?

  1. split data
    When setting hyperparameters, split the data into train, validation, and test sets.
    Choose hyperparameters on the validation set and evaluate only once on the test set.

  2. cross-validation
    Split the training data into folds, use each fold in turn as the validation set, and average the results.

Cross-validation is useful for small datasets, but it is not commonly used in deep learning because training is too expensive. A sketch follows.
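A hedged sketch of k-fold cross-validation for choosing k in KNN (NumPy; the fold count, candidate values, and the helper `knn_accuracy` are all illustrative, and labels are assumed to be non-negative integers):

```python
import numpy as np

def knn_accuracy(X_tr, y_tr, X_val, y_val, k):
    # classify each validation point by majority vote among its k nearest (L2) neighbors
    correct = 0
    for x, y_true in zip(X_val, y_val):
        dists = np.sqrt(np.sum((X_tr - x) ** 2, axis=1))
        nearest_labels = y_tr[np.argsort(dists)[:k]]
        correct += np.argmax(np.bincount(nearest_labels)) == y_true
    return correct / len(y_val)

def cross_validate(X, y, k_choices, num_folds=5):
    # average validation accuracy over the folds for each candidate k
    X_folds = np.array_split(X, num_folds)
    y_folds = np.array_split(y, num_folds)
    results = {}
    for k in k_choices:
        accs = []
        for i in range(num_folds):
            # fold i is validation; the remaining folds are training
            X_tr = np.concatenate(X_folds[:i] + X_folds[i + 1:])
            y_tr = np.concatenate(y_folds[:i] + y_folds[i + 1:])
            accs.append(knn_accuracy(X_tr, y_tr, X_folds[i], y_folds[i], k))
        results[k] = np.mean(accs)
    return results
```

You would then pick the k with the highest averaged accuracy and evaluate it once on the held-out test set.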

KNN is never used on images in practice

pros:

  1. simple to understand and implement
  2. training takes no time (O(1))

cons:

  1. very slow at test time
  2. distance metrics on raw pixels are not informative
  3. curse of dimensionality: the number of training examples needed to densely cover the space grows exponentially with dimension (e.g., 4 points per axis means 4 points in 1D, 16 in 2D, 64 in 3D)

More on con #2:
Even though the four images below are clearly different, they can all have the same L2 distance.
[Figure: four visually different images with the same L2 distance]
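A tiny sketch of why this happens (toy data): two perturbations that look completely different can move every pixel by the same magnitude, producing identical L2 distances.

```python
import numpy as np

rng = np.random.default_rng(0)
orig = rng.random(3072)  # a "flattened image" (toy data)

brightened = orig + 0.1                            # every pixel shifted up by 0.1
noisy = orig + rng.choice([-0.1, 0.1], size=3072)  # random +/-0.1 per pixel

# both perturbations change each pixel by magnitude 0.1,
# so the L2 distances to the original are identical
print(np.linalg.norm(brightened - orig))  # sqrt(3072) * 0.1, about 5.54
print(np.linalg.norm(noisy - orig))       # same value
```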

Linear classification

Concept of Linear classification

A linear classifier computes class scores as a linear function of the input: f(x, W) = Wx + b, where x is the flattened image, W is a weight matrix with one row per class, and b is a bias vector. The class with the highest score is the prediction.
[Figure: linear classification, mapping an image to per-class scores]
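A minimal sketch of this scoring function on a CIFAR-10-sized input (the shapes follow the lecture; the weights here are random placeholders rather than learned values):

```python
import numpy as np

num_classes, dim = 10, 32 * 32 * 3  # CIFAR-10: 10 classes, 3072-dim flattened images

W = np.random.randn(num_classes, dim) * 0.01  # weight matrix (would be learned)
b = np.zeros(num_classes)                     # bias vector (would be learned)

x = np.random.rand(dim)    # one flattened image (placeholder data)
scores = W.dot(x) + b      # one score per class
pred = np.argmax(scores)   # highest score is the predicted class
```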

Limit of Linear classification: it is hard to classify non-linear cases like the ones below.
[Figure: cases a linear classifier cannot separate]

