A brief introduction to deep neural networks.

The main framework for neural networks

In very simple terms, let’s say we plot one red point and one blue point on a 2D graph. An artificial neural network (ANN) tries to find the line that best separates them. It does this by measuring how far the line is from each point, then moving the line closer to the point or farther from it, depending on whether that point is classified correctly. Behind the scenes, each move of the line corresponds to a change in the weights and biases of the network’s nodes.

The blue point in this picture is not classified properly, so the line needs to move toward it until the point ends up on the correct side. This is essentially what a neural network does.
A more complex dataset. You would not often see something like this in real life, but an ANN can classify it with the same techniques it uses on any other dataset.
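
The line-moving idea described above can be sketched in a few lines of Python. This is a minimal illustration that assumes the classic perceptron update rule as a stand-in for "move the line when a point is misclassified". The coordinates, the learning rate `lr`, and the function names `predict` and `train` are all made up for this example.

```python
def predict(w, b, point):
    # The line is w[0]*x + w[1]*y + b = 0; the sign of the score says
    # which side of the line the point falls on.
    score = w[0] * point[0] + w[1] * point[1] + b
    return 1 if score >= 0 else -1


def train(points, labels, lr=0.1, epochs=100):
    # Assumed perceptron rule: only misclassified points move the line.
    w, b = [0.0, 0.0], 0.0
    for _ in range(epochs):
        mistakes = 0
        for p, y in zip(points, labels):
            if predict(w, b, p) != y:
                # Nudge the weights and bias so this point's score
                # shifts toward the correct sign.
                w[0] += lr * y * p[0]
                w[1] += lr * y * p[1]
                b += lr * y
                mistakes += 1
        if mistakes == 0:
            break  # every point is on the correct side of the line
    return w, b


# Hypothetical data: one red point (label -1) and one blue point (label +1).
points = [(1.0, 3.0), (3.0, 1.0)]
labels = [-1, 1]
w, b = train(points, labels)
```

After training, `predict(w, b, point)` returns the right label for both points. A real network stacks many such units and adjusts them with gradient descent, but the weights and bias shift to fix mistakes in the same spirit.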

Activation Functions

Now, let’s take a deeper look at activation functions. An activation function maps each node’s output into a set range, such as 0 to 1. It is also what converts the output of one layer into the input for the next layer. There are many activation functions, so let’s take a closer look at a few notable ones.
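
To make the "output of one layer becomes the input of the next" idea concrete, here is a rough sketch of two layers chained together. The helper `dense_layer`, the weights, and the biases are invented for illustration, and `math.tanh` simply stands in for whichever activation function you choose.

```python
import math


def dense_layer(inputs, weights, biases, activation):
    # Each node computes a weighted sum of the inputs plus its bias,
    # then passes that sum through the activation function.
    outputs = []
    for node_weights, bias in zip(weights, biases):
        z = sum(w * x for w, x in zip(node_weights, inputs)) + bias
        outputs.append(activation(z))
    return outputs


x = [0.5, -1.0]  # hypothetical input with two features

# Hidden layer with two nodes; its activated outputs feed the next layer.
hidden = dense_layer(x, [[1.0, 2.0], [-3.0, 0.5]], [0.0, 1.0], math.tanh)

# Output layer with one node, taking the hidden layer's outputs as input.
output = dense_layer(hidden, [[2.0, -1.0]], [0.5], math.tanh)
```

Because tanh is the activation here, every value in `hidden` and `output` lies between -1 and 1, no matter how large the weighted sums get.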

The Sigmoid Function
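
As a quick sketch, the sigmoid is commonly defined as 1 / (1 + e^(-z)), which squashes any real number into the range 0 to 1. The version below is one possible way to write it. Splitting it into two branches is a choice made here so that `math.exp` is never called on a large positive number.

```python
import math


def sigmoid(z):
    # Standard definition: 1 / (1 + e^(-z)).
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    # For negative z, use the algebraically equal form e^z / (1 + e^z)
    # so math.exp never receives a large positive argument.
    e = math.exp(z)
    return e / (1.0 + e)
```

An input of 0 maps to exactly 0.5. Large positive inputs approach 1 and large negative inputs approach 0.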

The Hyperbolic Tangent Function
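
The hyperbolic tangent is usually written as (e^z - e^(-z)) / (e^z + e^(-z)), and its outputs fall between -1 and 1. The sketch below spells that formula out for illustration. The name `tanh_from_exp` is made up here, and in practice Python's built-in `math.tanh` computes the same thing more robustly.

```python
import math


def tanh_from_exp(z):
    # Textbook definition; fine for moderate z, but math.exp overflows
    # for very large inputs, which is why math.tanh is preferable.
    pos = math.exp(z)
    neg = math.exp(-z)
    return (pos - neg) / (pos + neg)
```

Unlike the sigmoid, tanh is centered on zero: an input of 0 gives 0, and negative inputs give negative outputs.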

Rectified Linear Unit Activation Function (ReLU)
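
ReLU is the simplest of the four: it passes positive values through unchanged and replaces anything negative with zero. A minimal sketch:

```python
def relu(z):
    # Keep positive values as-is; clamp negative values to zero.
    return max(0.0, z)
```

For example, `relu(-2.0)` gives `0.0` and `relu(3.5)` gives `3.5`, so the output range runs from 0 upward with no upper limit.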

Softmax Function
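
Softmax takes a whole list of scores rather than a single number and turns it into probabilities that add up to 1. Here is a small sketch. Subtracting the largest score before exponentiating is a common numerical-stability trick added here, and it does not change the result.

```python
import math


def softmax(scores):
    # Shift by the maximum so the largest exponent is e^0 = 1,
    # which avoids overflow for large scores.
    largest = max(scores)
    exps = [math.exp(s - largest) for s in scores]
    total = sum(exps)
    # Each output is that score's share of the total.
    return [e / total for e in exps]


probs = softmax([2.0, 1.0, 0.1])  # hypothetical raw scores for three classes
```

The highest score receives the highest probability, which is why softmax is typically used in the final layer of a classifier.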

A picture of the softmax function returning probabilities to the
  • Weights, biases and nodes
  • Multi-layer neural networks
  • Activation functions
