lesson1Title

lesson2Title

lesson3Title

lesson4Title

lesson5Title

lesson6Title

lesson7Title

lesson8Title

lesson9Title

aiFundamentalsDeepLearningChapter5Title

lesson10Title

lesson11Title

lesson12Title

lesson13Title

lesson14Title

lesson15Title

lesson16Title

lesson17Title

aiFundamentalsDeepLearningChapter1Title

aiFundamentalsDeepLearningChapter2Title

aiFundamentalsDeepLearningChapter3Title

aiFundamentalsDeepLearningChapter4Title

# Simplified Recurrent Neural Network Structure, GRU

`GRU (Gated Recurrent Unit)` is a structure created to solve the limitations of RNNs. It offers similar functionality to `LSTM` but as a *simplified recurrent neural network*.

GRU addresses the long-term dependency problem by retaining essential information and discarding irrelevant data.

<br />

## Why was GRU developed?

LSTM has the advantage of retaining long-term information, but its complex structure can lead to slower learning speeds.

GRU was developed to preserve the performance of LSTM while offering a simpler, faster architecture.

Like LSTM, GRU uses gates but has **fewer gates with simpler calculations**.

This makes it easy to implement and also quicker to train.

<br />

## Key Structure of GRU

GRU is composed of the following two gates:

- *Update Gate*: Decides how much of the past information to retain. It regulates the amount of information to remember.

- *Reset Gate*: Determines how much of the past information should be ignored. It controls how much of the previous state to combine with the current input.

These two gates work together to maintain important information and discard unnecessary information. Consequently, GRU can effectively process information in a sequence over time.

<br />

## How does GRU operate?

GRU functions through the following process:

1. It calculates both the update gate and reset gate based on the current input and previous state.

2. The reset gate determines the extent to which past information is reflected.

3. The update gate decides how much the new state should be reflected.

4. Finally, it calculates the new state and passes it to the next time step.

In this way, GRU can sequentially process information like an RNN, while effectively remembering even older information with fewer calculations.

<br />

In the next lesson, we will explore the `Transformer` structure, which is often compared with recurrent neural network-based models.

GRUs offer similar performance to LSTMs but with a simpler architecture, which allows for faster training. This is because GRUs use fewer gates than LSTMs.

Simplified Recurrent Neural Network Structure, GRU

Why was GRU developed?

Key Structure of GRU

How does GRU operate?

What is the main advantage of GRUs compared to LSTMs?