Why Training Neural Networks is Hard
- Andromeda AI
- Sep 4, 2021
- 1 min read
Hitting a neural network involves using a training dataset to update the model weights to create a good mapping of inputs to outputs.
This training process is solved using an optimization algorithm that searches through a space of possible values for the neural network model weights for a set of weights that results in good performance on the training dataset.
After reading this blog, you will know:
Training a neural network involves using an optimization algorithm to find a set of weights to best map inputs to outputs.
The problem is hard, not least because the error surface is non-convex and contains local minima, flat spots, and is highly multidimensional.
The stochastic gradient descent algorithm is the best general algorithm to address this challenging problem.
Comments