Skip to main content

Inferential Statistics in Practice: From Probability to ANOVA


🔍 Project Overview 

This project demonstrates the application of inferential statistics to solve multiple real-world problems across sports analytics, manufacturing quality control, marketing operations and healthcare.

The objective was to move beyond descriptive statistics and apply probability theory, hypothesis testing, and ANOVA techniques to draw meaningful conclusions and support data-driven decision-making.

Download Complete Report from Git

Open on Git


🎯 Key Objectives

  • Apply probability concepts to real datasets

  • Use normal distribution and Z-tests for quality analysis

  • Perform hypothesis testing (Z-test, T-test)

  • Analyze multi-factor effects using One-Way & Two-Way ANOVA

  • Translate statistical results into business insights and recommendations


🧠 Problem 1: Sports Injury Probability Analysis

Business Question

Can player position help explain the likelihood of foot injuries in a football team?

Approach

  • Used conditional probability and joint probability

  • Analyzed injury distribution across playing positions

Key Insight

  • Overall injury probability: 61%

  • Strikers had the highest injury likelihood among injured players

  • Player position plays a significant role in injury risk

Impact

Helps coaching and medical staff focus preventive care strategies on high-risk positions.


🏭 Problem 2: Manufacturing Quality Control (Normal Distribution)

Business Question

What proportion of cement gunny bags fail strength requirements?

Approach

  • Assumed normal distribution

  • Used Z-score-based probability estimation

  • Visualized probability regions for decision clarity

Key Insights

  • ~11% of bags fall below minimum strength threshold

  • Over 82% meet acceptable strength criteria

  • Identified risk zones contributing to material loss

Impact

Supports supply chain quality checks and reduces wastage risk.


🧪 Problem 3: Stone Hardness Testing (Hypothesis Testing)

Business Question

Are unpolished stones suitable for high-quality printing?

Statistical Techniques Used

  • Z-test (large sample, known population mean)

  • Independent two-sample T-test

  • Outlier treatment and distribution analysis

Key Findings

  • Mean hardness of unpolished stones is significantly below required threshold

  • Polished stones show higher and more consistent hardness

Recommendation

Zingaro is justified in rejecting unpolished stones for printing applications.


🦷 Problem 4: Dental Implant Hardness Analysis (ANOVA)

Business Question

How do dentist, method, and alloy influence implant hardness?

Techniques Used

  • One-Way ANOVA

  • Two-Way ANOVA with interaction effects

  • Shapiro-Wilk Test (normality)

  • Levene Test (variance equality)

  • Tukey post-hoc analysis

Key Insights

  • Dentist alone does not significantly impact hardness

  • Implant method significantly affects hardness

  • Strong interaction exists between dentist and method

  • Optimal methods vary by alloy type

Business Impact

  • Standardizes implant procedures

  • Improves treatment outcomes

  • Reduces variability in medical results


🛠 Skills Demonstrated

Statistical & Analytical Skills

  • Probability theory

  • Hypothesis testing

  • Z-test, T-test

  • One-Way & Two-Way ANOVA

  • Post-hoc analysis

Tools & Techniques

  • Python

  • Pandas, NumPy

  • SciPy, StatsModels

  • Data visualization

  • Statistical interpretation


📈 Overall Impact

This project showcases the ability to:

  • Choose the right statistical test for each problem

  • Validate assumptions before modeling

  • Interpret statistical output in business terms

  • Support decisions with data-backed evidence


🏁 Conclusion

Inferential statistics is a critical foundation for data science and analytics.
This project demonstrates how statistical methods can directly support sports strategy, manufacturing quality, marketing optimization, and healthcare decision-making.













Comments

Popular posts from this blog

Raghvendra Singh Portfolio

  I’m Raghvendra Singh Business Analytics & Data Science Professional I help businesses make data-driven decisions using analytics, dashboards and data science techniques across Ecommerce, Retail, Finance and Marketing . I specialize in converting raw data into clear insights, measurable impact and actionable recommendations for business leaders and teams. Profile Links Github LinkedIn Portfolio  Below are selected projects showcasing my work in analytics, data science and business problem-solving . 1. Digital Marketing Ads Clustering for Ads24x7 2. Inferential statistics: Probability to ANOVA 📬 Let’s Connect 📧 Email: raghavsingh0027 @gmail.com 🔗 LinkedIn: https://www.linkedin.com/in/raghvendra0027

25 Game-Changing Use Cases of Data Science in Marketing

In today’s competitive and fast-paced marketing landscape, businesses are constantly seeking ways to optimize their strategies, engage with customers more effectively, and increase ROI. Enter data science , which has proven to be a powerful tool in transforming marketing practices. By leveraging data, machine learning, and artificial intelligence (AI), marketers can extract valuable insights, predict trends, and enhance decision-making. This article will explore 25 use cases of data science in marketing and illustrate how it can help companies unlock new opportunities and drive better outcomes. 1. Customer Segmentation What it is: Data science enables marketers to categorize customers based on shared traits, behaviors, or preferences, which allows for more targeted and personalized campaigns. Example: By analyzing purchasing history and browsing patterns, data science tools can create customer segments, enabling businesses to deliver tailored marketing messages for each group....

Machine Learning vs Deep Learning : Understand the difference!

In the world of artificial intelligence (AI), terms like "Machine Learning" (ML) and "Deep Learning" (DL) are frequently used, often interchangeably. However, while both fall under the umbrella of AI, they are distinct in their methodologies, applications, and capabilities. In this post, we'll explore the key differences between machine learning and deep learning, helping you understand when and why each is used. What is Machine Learning? Machine Learning is a subset of AI focused on developing algorithms that allow computers to learn from and make predictions based on data. The core idea behind machine learning is that the system can automatically learn and improve from experience without being explicitly programmed for each task. There are three main types of machine learning: Supervised Learning : The model is trained on labeled data, which means the input data has corresponding output labels. The algorithm's goal is to learn a mapping from inputs ...