학습 자료

Introduction to scipy.stats


The scipy.stats module is one of the most powerful parts of SciPy.
It provides tools for statistical analysis, including probability distributions, statistical tests, and summary statistics.

In this lesson, we’ll:

  • Explore summary statistics
  • Perform a basic statistical test
  • Work with probability distributions

Setting Up

Importing the required modules:

Import NumPy and SciPy Stats
import numpy as np from scipy import stats

Example 1: Summary Statistics

Summary Statistics
data = [5, 7, 8, 7, 2, 17, 2, 9, 4, 11] mean = np.mean(data) median = np.median(data) mode = stats.mode(data, keepdims=True) print("Mean:", mean) print("Median:", median) print("Mode:", mode.mode[0], "Frequency:", mode.count[0])

Example 2: Hypothesis Testing

One-Sample t-Test
# Test if the mean of data is significantly different from 5 t_stat, p_value = stats.ttest_1samp(data, 5) print("t-statistic:", t_stat) print("p-value:", p_value)

If the p-value is less than 0.05, we reject the null hypothesis.


Example 3: Probability Distributions

Normal Distribution PDF
x = np.linspace(-3, 3, 100) pdf = stats.norm.pdf(x, loc=0, scale=1) print("First 5 PDF values:", pdf[:5])

Probability density functions (PDF) are useful for understanding the likelihood of different outcomes.


Key Takeaways

  • scipy.stats is your go-to module for statistical work in Python.

  • It provides:

    • Summary statistics
    • Hypothesis tests
    • Probability distributions

In the next lessons, we’ll dig deeper into descriptive and inferential statistics with SciPy.

Quiz
0 / 1

What is the purpose of the scipy.stats module?

The scipy.stats module provides tools for analysis.
statistical
mathematical
graphical
numerical

학습 자료

AI 튜터

디자인

업로드

수업 노트

즐겨찾기

도움말