Artificial neural networks
Artificial networks are some of the hottest applications in artificial intelligence, especially machine learning. The main aim of artificial neural networks is building models that can learn like a human mind; in other words, we try to mimic the human mind. That is why, in order to learn how to build neural network systems, we need to have a clear understanding of how a human mind actually works. The human mind is an amazing entity. The mind is composed and wired by neurons. Neurons are responsible for transferring and processing information.
We all know that the human mind can perform a lot of tasks, like hearing, seeing, tasting, and many other complicated tasks. So logically, one might think that the mind is composed of many different areas, with each area responsible for a specific task, thanks to a specific algorithm. But this is totally wrong. According to research, all of the different parts of the human mind function thanks to one algorithm, not different algorithms. This hypothesis is called the one algorithm hypothesis.
Now we know that the mind works by using one algorithm. But what is this algorithm? How is it used? How is information processed with it?
To answer the preceding questions, we need to look at the logical representation of a neuron. The artificial representation of a human neuron is called a perceptron. A perceptron is represented by the following graph:
There are many Activation Functions used. You can view them as logical gates:
- Step function: A predefined threshold value.
- Sigmoid function:
- Tanh function:
- ReLu function:
Many fully connected perceptrons comprise what we call a Multi-Layer Perceptron (MLP) network. A typical neural network contains the following:
- An input layer
- Hidden layers
- Output layers
We will discuss the term deep learning once we have more than three hidden layers. There are many types of deep learning networks used in the world:
- Convolutional neural networks (CNNs)
- Recursive neural networks (RNNs)
- Long short-term memory (LSTM)
- Shallow neural networks
- Autoencoders (AEs)
- Restricted Boltzmann machines
Don't worry; we will discuss the preceding algorithms in detail in future chapters.
To build deep learning models, we follow five steps, suggested by Dr. Jason Brownlee. The five steps are as follows:
- Network definition
- Network compiling
- Network fitting
- Network evaluation
- Prediction