Why Training Neural Networks is Hard

Andromeda AI
Sep 4, 2021
1 min read

Hitting a neural network involves using a training dataset to update the model weights to create a good mapping of inputs to outputs.

This training process is solved using an optimization algorithm that searches through a space of possible values for the neural network model weights for a set of weights that results in good performance on the training dataset.

After reading this blog, you will know:

Training a neural network involves using an optimization algorithm to find a set of weights to best map inputs to outputs.
The problem is hard, not least because the error surface is non-convex and contains local minima, flat spots, and is highly multidimensional.
The stochastic gradient descent algorithm is the best general algorithm to address this challenging problem.

Why Training Neural Networks is Hard

Recent Posts

Comments

Subscribe Form