Predicting Data with a Line - Linear Regression
Linear Regression is a method that fits a line (or a plane in higher dimensions) to data in order to learn its patterns. Given new input data, it can then predict corresponding numerical values.
The simplest form, Simple Linear Regression, can be expressed with the following formula:

Y = WX + B
The meaning of each variable is as follows:
- $X$: Input data (e.g., study hours)
- $Y$: The value to be predicted (e.g., test score)
- $W$: Slope (Weight); determines how much the result changes as the input value increases
- $B$: Intercept (Bias); the point where the line meets the Y-axis
Using this equation, you can predict the $Y$ value for any given $X$ value.
For instance, if a student scored 40 points after 1 hour of study and 60 points after 2 hours, with a base score of 10, the $W$ and $B$ values are calculated as follows:

B = 10
W = (60 - 40) / (2 - 1) = 20
Based on this information, the linear regression model is learned as Y = 20X + 10.
For 3 hours of study, the test score is calculated as follows:
Y = 20 * 3 + 10 = 70
According to the regression model, this student is predicted to score 70 points with 3 hours of study.
While this example uses only 2 data points for simplicity, in reality, linear regression models are trained using extensive datasets.
How Linear Regression Works
Linear regression learns by finding the optimal line in the given data.
To achieve this, it needs to find the optimal $W$ and $B$ values that minimize the loss function.
The most commonly used loss function is MSE (Mean Squared Error), which measures the average squared difference between the model's predictions and the actual values. Smaller values indicate that the model has learned better.
Machine learning models use an algorithm known as Gradient Descent to reduce the loss and find the best $W$ and $B$.
Limitations of Linear Regression
Linear regression is a simple and easy-to-interpret algorithm, but it has several limitations.
1. Can Only Learn Linear Relationships
If the data does not follow a linear relationship, the predictive performance of a linear regression model can diminish.
For example, in cases with U-shaped or S-shaped data patterns, linear regression is not suitable.
2. Sensitive to Outliers
If there are extreme data points (outliers), they can pull the fitted line significantly away from the overall trend.
3. Limited Predictive Power with Insufficient Features
In many real-world situations, it's difficult to determine outcomes (Y) using just one variable (X).
In such cases, Multiple Linear Regression can be used to incorporate multiple input variables.
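A brief sketch of multiple linear regression, assuming NumPy is available (the data here is hypothetical, generated from score = 20 * study + 5 * sleep + 10 so the recovered coefficients are easy to check):

```python
import numpy as np

# Hypothetical data: [study hours, sleep hours] -> test score.
X = np.array([[1, 6], [2, 7], [3, 5], [4, 8]], dtype=float)
y = 20 * X[:, 0] + 5 * X[:, 1] + 10

# Append a column of ones so the solver also fits the intercept B.
X_with_bias = np.hstack([X, np.ones((len(X), 1))])
coeffs, *_ = np.linalg.lstsq(X_with_bias, y, rcond=None)
print(coeffs)  # recovers approximately [20., 5., 10.]
```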
Linear regression is one of the fundamental algorithms in machine learning that analyzes data in a linear form to predict numerical values.
In the next lesson, we will explore Logistic Regression.
Linear regression is a method of finding the line that best explains the pattern in the data.