Skip to main content

Inferential Statistics in Practice: From Probability to ANOVA


🔍 Project Overview 

This project demonstrates the application of inferential statistics to solve multiple real-world problems across sports analytics, manufacturing quality control, marketing operations and healthcare.

The objective was to move beyond descriptive statistics and apply probability theory, hypothesis testing, and ANOVA techniques to draw meaningful conclusions and support data-driven decision-making.

Download Complete Report from Git

Open on Git


🎯 Key Objectives

  • Apply probability concepts to real datasets

  • Use normal distribution and Z-tests for quality analysis

  • Perform hypothesis testing (Z-test, T-test)

  • Analyze multi-factor effects using One-Way & Two-Way ANOVA

  • Translate statistical results into business insights and recommendations


🧠 Problem 1: Sports Injury Probability Analysis

Business Question

Can player position help explain the likelihood of foot injuries in a football team?

Approach

  • Used conditional probability and joint probability

  • Analyzed injury distribution across playing positions

Key Insight

  • Overall injury probability: 61%

  • Strikers had the highest injury likelihood among injured players

  • Player position plays a significant role in injury risk

Impact

Helps coaching and medical staff focus preventive care strategies on high-risk positions.


🏭 Problem 2: Manufacturing Quality Control (Normal Distribution)

Business Question

What proportion of cement gunny bags fail strength requirements?

Approach

  • Assumed normal distribution

  • Used Z-score-based probability estimation

  • Visualized probability regions for decision clarity

Key Insights

  • ~11% of bags fall below minimum strength threshold

  • Over 82% meet acceptable strength criteria

  • Identified risk zones contributing to material loss

Impact

Supports supply chain quality checks and reduces wastage risk.


🧪 Problem 3: Stone Hardness Testing (Hypothesis Testing)

Business Question

Are unpolished stones suitable for high-quality printing?

Statistical Techniques Used

  • Z-test (large sample, known population mean)

  • Independent two-sample T-test

  • Outlier treatment and distribution analysis

Key Findings

  • Mean hardness of unpolished stones is significantly below required threshold

  • Polished stones show higher and more consistent hardness

Recommendation

Zingaro is justified in rejecting unpolished stones for printing applications.


🦷 Problem 4: Dental Implant Hardness Analysis (ANOVA)

Business Question

How do dentist, method, and alloy influence implant hardness?

Techniques Used

  • One-Way ANOVA

  • Two-Way ANOVA with interaction effects

  • Shapiro-Wilk Test (normality)

  • Levene Test (variance equality)

  • Tukey post-hoc analysis

Key Insights

  • Dentist alone does not significantly impact hardness

  • Implant method significantly affects hardness

  • Strong interaction exists between dentist and method

  • Optimal methods vary by alloy type

Business Impact

  • Standardizes implant procedures

  • Improves treatment outcomes

  • Reduces variability in medical results


🛠 Skills Demonstrated

Statistical & Analytical Skills

  • Probability theory

  • Hypothesis testing

  • Z-test, T-test

  • One-Way & Two-Way ANOVA

  • Post-hoc analysis

Tools & Techniques

  • Python

  • Pandas, NumPy

  • SciPy, StatsModels

  • Data visualization

  • Statistical interpretation


📈 Overall Impact

This project showcases the ability to:

  • Choose the right statistical test for each problem

  • Validate assumptions before modeling

  • Interpret statistical output in business terms

  • Support decisions with data-backed evidence


🏁 Conclusion

Inferential statistics is a critical foundation for data science and analytics.
This project demonstrates how statistical methods can directly support sports strategy, manufacturing quality, marketing optimization, and healthcare decision-making.













Comments

Popular posts from this blog

Text Analytics on U.S. Presidential Inaugural Speeches

Project Overview In this project, I performed text analytics and natural language processing (NLP) on three historic U.S. Presidential inaugural speeches to understand their linguistic structure, vocabulary usage, and dominant themes . Speeches Analyzed Franklin D. Roosevelt – 1941 John F. Kennedy – 1961 Richard Nixon – 1973 The goal was not political analysis, but language analysis using Python and NLP libraries. Git Link Problem Definition The objectives of this analysis were: Compute text statistics for each speech: Number of characters Number of words Number of sentences Average word length Perform text preprocessing : Lowercasing Removing punctuation, numbers, and special characters Stopword removal Stemming Identify the most frequently used words across all three speeches Visualize dominant themes using a Word Cloud Data Source The speeches were sourced from the NLTK Inaugural Corpus , which contains official U.S. presidential inaugural addresses dating back to 1789. from nlt...

Raghvendra Singh Portfolio

  I’m Raghvendra Singh Business Analytics & Data Science Professional I help businesses make data-driven decisions using analytics, dashboards and data science techniques across Ecommerce, Retail, Finance and Marketing . I specialize in converting raw data into clear insights, measurable impact and actionable recommendations for business leaders and teams. Profile Links Github LinkedIn Portfolio  Below are selected projects showcasing my work in analytics, data science and business problem-solving . 1. Digital Marketing Ads Clustering for Ads24x7 2. Inferential statistics: Probability to ANOVA 3. Power BI Sales & Invetory forecasting using SARIMA, SQL, Python 4. Power BI/ Looker/ Tableu- Neerus Dashboards - Myntra payments dashboard 5. Text Analytics using NLP on political speeches analysis 6.  Election Data Classification: End to end analysis 7.  📬 Let’s Connect 📧 Email: raghavsingh0027 @gmail.com 🔗 LinkedIn: https://www.linkedin.com/in/raghvendra0...

Introducing The Cat Poet: Your Personal AI Cat Wordsmith by AI Councel Lab

Poetry is the rhythmical creation of beauty in words.     – Edgar Allan Poe Now, imagine that beauty, powered by AI. Welcome to AI Councel Lab , your go-to space for cutting-edge AI tools that blend creativity and intelligence. Today, we're thrilled to introduce a truly unique creation: The  Cat Poet — a next-generation poetic companion that turns your ideas into art. ✨ What Is The AI   Cat Poet ? Try Cat Poet App Now → The Cat Poet is an AI-powered poetry generator designed to take a keyword or phrase of your choice and craft beautiful poems in a wide range of poetic styles — from minimalist Haikus to heartfelt Elegies , powerful Odes , and over 30 diverse poetic forms . Whether you're a writer, student, creative thinker, or someone just looking for a moment of lyrical joy, The Cat Poet is here to inspire you. 🧠 How It Works Simply enter a word, feeling, or concept — and let the AI weave its magic. Behind the scenes, a fine-tuned language model selects from a c...