🔍 Project Overview
This project demonstrates the application of inferential statistics to solve multiple real-world problems across sports analytics, manufacturing quality control, marketing operations and healthcare.
The objective was to move beyond descriptive statistics and apply probability theory, hypothesis testing, and ANOVA techniques to draw meaningful conclusions and support data-driven decision-making.
Download Complete Report from Git
🎯 Key Objectives
-
Apply probability concepts to real datasets
-
Use normal distribution and Z-tests for quality analysis
-
Perform hypothesis testing (Z-test, T-test)
-
Analyze multi-factor effects using One-Way & Two-Way ANOVA
-
Translate statistical results into business insights and recommendations
🧠 Problem 1: Sports Injury Probability Analysis
Business Question
Can player position help explain the likelihood of foot injuries in a football team?
Approach
-
Used conditional probability and joint probability
-
Analyzed injury distribution across playing positions
Key Insight
-
Overall injury probability: 61%
-
Strikers had the highest injury likelihood among injured players
-
Player position plays a significant role in injury risk
Impact
Helps coaching and medical staff focus preventive care strategies on high-risk positions.
🏭 Problem 2: Manufacturing Quality Control (Normal Distribution)
Business Question
What proportion of cement gunny bags fail strength requirements?
Approach
-
Assumed normal distribution
-
Used Z-score-based probability estimation
-
Visualized probability regions for decision clarity
Key Insights
-
~11% of bags fall below minimum strength threshold
-
Over 82% meet acceptable strength criteria
-
Identified risk zones contributing to material loss
Impact
Supports supply chain quality checks and reduces wastage risk.
🧪 Problem 3: Stone Hardness Testing (Hypothesis Testing)
Business Question
Are unpolished stones suitable for high-quality printing?
Statistical Techniques Used
-
Z-test (large sample, known population mean)
-
Independent two-sample T-test
-
Outlier treatment and distribution analysis
Key Findings
-
Mean hardness of unpolished stones is significantly below required threshold
-
Polished stones show higher and more consistent hardness
Recommendation
Zingaro is justified in rejecting unpolished stones for printing applications.
🦷 Problem 4: Dental Implant Hardness Analysis (ANOVA)
Business Question
How do dentist, method, and alloy influence implant hardness?
Techniques Used
-
One-Way ANOVA
-
Two-Way ANOVA with interaction effects
-
Shapiro-Wilk Test (normality)
-
Levene Test (variance equality)
-
Tukey post-hoc analysis
Key Insights
-
Dentist alone does not significantly impact hardness
-
Implant method significantly affects hardness
-
Strong interaction exists between dentist and method
-
Optimal methods vary by alloy type
Business Impact
-
Standardizes implant procedures
-
Improves treatment outcomes
-
Reduces variability in medical results
🛠 Skills Demonstrated
Statistical & Analytical Skills
-
Probability theory
-
Hypothesis testing
-
Z-test, T-test
-
One-Way & Two-Way ANOVA
-
Post-hoc analysis
Tools & Techniques
-
Python
-
Pandas, NumPy
-
SciPy, StatsModels
-
Data visualization
-
Statistical interpretation
📈 Overall Impact
This project showcases the ability to:
-
Choose the right statistical test for each problem
-
Validate assumptions before modeling
-
Interpret statistical output in business terms
-
Support decisions with data-backed evidence
🏁 Conclusion
Inferential statistics is a critical foundation for data science and analytics.
This project demonstrates how statistical methods can directly support sports strategy, manufacturing quality, marketing optimization, and healthcare decision-making.
Comments
Post a Comment