Why is accuracy used to train perceptron?

why is the instructor using accuracy and not loss function in training the perceptron model? Also, why is the standard normalization not performed on the data to scale the feature values to range between 0 and 1?

It is fairly a base model, and considering the fact of simplicity course was designed such way.