What is GPT?
GPT stands for Generative Pre-trained Transformer, a type of generative AI model that is pre-trained on large datasets and built on the transformer architecture introduced by a team at Google in 2017.
Transformer: An efficient AI architecture that can simultaneously understand the relationships between the words in a sentence.
Nowadays, GPT is used for almost every text-centric task, including document summarization, translation, writing, and coding.
In this lesson, we'll explore what GPT means and how it came to be so prominent.
What does GPT stand for?
GPT is an acronym for the following three words:
- Generative: The AI model can generate text.
- Pre-trained: It is trained in advance on a large amount of data.
- Transformer: It utilizes the transformer architecture.
In essence, GPT represents a "pre-trained text generation model based on the transformer architecture."
Simply put, it’s an AI model that has learned from countless documents to naturally generate new sentences.
What makes GPT special?
GPT popularized the use of the transformer model, which can grasp the context of entire sentences at once.
Earlier RNN-based models processed sentences word by word, which made text generation slow; GPT compares all words simultaneously, so it can understand and generate sentences quickly and accurately.
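The "compare all words simultaneously" idea is the transformer's attention mechanism. The sketch below is a minimal, illustrative version of scaled dot-product self-attention (using raw word vectors in place of learned query/key/value projections, which a real model would have); note there is no sequential loop over the sentence.

```python
import numpy as np

def self_attention(X):
    """Toy scaled dot-product self-attention over a sequence of word vectors.

    Every word's vector is scored against every other word's vector in one
    matrix product, so the whole sentence is processed at once rather than
    word by word as in an RNN.
    """
    d = X.shape[-1]
    scores = X @ X.T / np.sqrt(d)                    # all pairwise word comparisons
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # softmax: one distribution per word
    return weights @ X                               # each output mixes all words' information

# A "sentence" of 4 words, each represented by an 8-dimensional vector
X = np.random.default_rng(0).normal(size=(4, 8))
out = self_attention(X)
print(out.shape)  # (4, 8): one context-aware vector per word
```

A real transformer adds learned projection matrices, multiple attention heads, and stacked layers on top of this core operation.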
Additionally, GPT supports fine-tuning, enabling a large pre-trained model to be retrained for specific tasks like summarization and translation.
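The pre-train-then-fine-tune idea can be shown with a deliberately tiny stand-in for a language model: a character-bigram counter. This is an analogy, not GPT's actual training procedure; the corpora and the `train`/`prob` helpers are invented for illustration. The key point is that fine-tuning continues training the already-learned model on task-specific data rather than starting from scratch.

```python
from collections import Counter

def train(counts, text):
    """Accumulate character-bigram counts -- a toy stand-in for training."""
    for a, b in zip(text, text[1:]):
        counts[(a, b)] += 1
    return counts

def prob(counts, a, b):
    """Probability that character b follows character a under the counts."""
    total = sum(n for (first, _), n in counts.items() if first == a)
    return counts[(a, b)] / total if total else 0.0

# "Pre-training": learn general statistics from a (pretend-large) corpus
counts = train(Counter(), "the cat sat on the mat. the dog sat too.")

# "Fine-tuning": keep the learned counts and continue on task-specific text
counts = train(counts, "summarize: the cat sat. summary: cat sat.")
```

The fine-tuned model retains everything from pre-training and merely shifts its statistics toward the new task, which is why fine-tuning needs far less data than training from scratch.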
In the next lesson, we'll delve deeper into the background and history of GPT's development.
Send a question to the AI using the prompt below.
Feel free to ask GPT any question.