Predictive Modeling with Python
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Predictive Modeling with Python
This course is part of Applied Data Analytics Specialization
Instructor: Edureka
Included with
Learn more
Ask Coursera
Recommended experience
Recommended experience
What you'll learn
Manage and preprocess data for statistical analysis and modeling.
Conduct hypothesis testing using advanced statistical techniques.
Build exploratory data analysis (EDA) models to uncover insights.
Build and evaluate models to solve real-world data challenges.
Skills you'll gain
Details to know
23 assignments
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 6 modules in this course
This Predictive Modeling with Python course provides a practical introduction to statistical analysis and machine learning with Python. You will learn essential machine learning concepts, methods, and algorithms with a strong emphasis on applying them to solve real-world business and data problems.
By the end of the course, you will: - Understand different data types used in statistical analysis. - Learn techniques to manage inconsistent data effectively. - Perform hypothesis testing using parametric and non-parametric tests. - Develop exploratory data analysis (EDA) models using statistical and machine learning methods. - Enhance machine learning models through evaluation and optimization techniques. This course is designed for individuals with a foundational knowledge of Python programming and basic statistical concepts. This course is ideal for aspiring data analysts, data scientists, business executives, machine learning engineers, and anyone passionate about data-driven decision-making Throughout the program, you will gain hands-on experience in statistical and predictive modeling and apply your skills to real-world scenarios. Enroll in "Predictive Modeling with Python" today and take your expertise to the next level!
In the first module of this course, learners will explore various data types and utilize different measures of central tendency and measures of dispersion to address data inconsistencies.
What's included
12 videos3 readings4 assignments2 discussion prompts
12 videosβ’Total 40 minutes
- Course Introductionβ’3 minutes
- Data Typesβ’3 minutes
- Categorical Dataβ’2 minutes
- Nominal Dataβ’2 minutes
- Demonstration of Data Types - Dataset Descriptionβ’4 minutes
- Demonstration of Typesβ’5 minutes
- What is Statistics?β’4 minutes
- Measures of Central Tendencyβ’3 minutes
- Demonstration of Central Tendencyβ’4 minutes
- Measures of Dispersionβ’3 minutes
- Demonstration: Measures of Dispersionβ’4 minutes
- Summary of Data and Informationβ’2 minutes
3 readingsβ’Total 30 minutes
- Welcome to Predictive Modeling with Pythonβ’10 minutes
- Applications of Central Tendency in Statistics β’10 minutes
- Libraries for Statistical Analysisβ’10 minutes
4 assignmentsβ’Total 29 minutes
- Practice Quiz : Data Types in Statisticsβ’3 minutes
- Practice Quiz : Statistical Analysisβ’3 minutes
- Practice Quiz : Statistical Dispersionβ’3 minutes
- Knowledge Check : Data and Informationβ’20 minutes
2 discussion promptsβ’Total 20 minutes
- Introduce Yourselfβ’10 minutes
- Why are measures of dispersion important in understanding datasets?β’10 minutes
In this module, learners will learn to manage data using probability distribution functions. Learners will start by applying the Bernoulli distribution to model categorical data, explore the Poisson distribution for forecasting, and utilize the Exponential and Normal distributions for regression modeling.
What's included
17 videos3 readings5 assignments
17 videosβ’Total 69 minutes
- Probability Density and Mass Functionβ’4 minutes
- Cumulative Distribution Functionβ’4 minutes
- Discrete Probabilityβ’2 minutes
- Negative Bernoulli Distributionβ’2 minutes
- Demonstration of Negative Bernoulli Distributionβ’7 minutes
- Geometric Distributionβ’2 minutes
- Demonstration of Geometric Distributionβ’7 minutes
- Poisson Distributionβ’5 minutes
- Example of Poisson Distributionβ’2 minutes
- Demonstration of Poisson Distributionβ’4 minutes
- Continuous Probability Distributionβ’5 minutes
- Uniform Distributionβ’4 minutes
- Exponential Distributionβ’3 minutes
- Demonstration of Exponential Distributionβ’3 minutes
- Normal Distributionβ’6 minutes
- Demonstration of Normal Distributionβ’6 minutes
- Summary of Probability Distribution Functionsβ’2 minutes
3 readingsβ’Total 30 minutes
- Example of PDF and PMFβ’10 minutes
- Importance of Negative Bernoulli and Geometric Distributionsβ’10 minutes
- Continuous Probability Distribution and Uniform Distribution: Mathematical Exampleβ’10 minutes
5 assignmentsβ’Total 32 minutes
- Practice Quiz : General Probability Distributionβ’3 minutes
- Practice Quiz : Negative Bernoulli and Geometric Distributionβ’3 minutes
- Practice Quiz : Poisson and Uniform Distributionβ’3 minutes
- Practice Quiz : Exponential and Normal Distributionβ’3 minutes
- Knowledge Check : Probability Distribution Functionβ’20 minutes
In the third module of this course, Learners will learn to apply the Central Limit Theorem in scenarios where data may be improperly distributed. Identify and analyze sample data, using both parametric and non-parametric methods to handle various test cases for hypothesis testing and decision-making.
What's included
30 videos3 readings5 assignments1 discussion prompt
30 videosβ’Total 138 minutes
- Central Limit Theoremβ’5 minutes
- Demonstration of Central Limit Theoremβ’6 minutes
- Demontration: Conclusion of Central Limit Theoremβ’3 minutes
- Population and Sample Spaceβ’3 minutes
- Parameter and Statisticsβ’5 minutes
- Forms of Inferential Statisticsβ’1 minute
- Point and Interval Estimationβ’4 minutes
- Maximum Likelihoodβ’4 minutes
- Demonstration Exploring Data β’5 minutes
- Demonstration: Drawing Sample Data β’4 minutes
- Hypothesis Testingβ’5 minutes
- Hypothesis Testing Exampleβ’5 minutes
- Statistical Test Implementationβ’6 minutes
- One Tailed and Two Tailed Testβ’5 minutes
- Z - Test and T - Testβ’6 minutes
- Power Analysisβ’6 minutes
- Demonstration of Confidence Interval and Margin of Error β’3 minutes
- Demonstration of Hypothesis Testingβ’4 minutes
- Demonstrating Power Analysisβ’8 minutes
- Chi - Square Testβ’4 minutes
- Pearson and Spearman Correlationβ’6 minutes
- Chi square Test Demonstrationβ’6 minutes
- Pearson Correlation Demonstrationβ’6 minutes
- Spearman Correlation Demonstrationβ’7 minutes
- ANOVAβ’6 minutes
- Example for One Way ANOVA - Part 1β’2 minutes
- Example for Two Way ANOVA - Part 2β’2 minutes
- Demonstration for One way ANOVAβ’5 minutes
- Demonstration for Two way ANOVAβ’3 minutes
- Summary for Inferential Statisticsβ’2 minutes
3 readingsβ’Total 30 minutes
- Central Limit Theorem (CLT): Mathematical Exampleβ’10 minutes
- Statistical Inference Real World Applicationsβ’10 minutes
- Shapiro-Wilk Test β’10 minutes
5 assignmentsβ’Total 32 minutes
- Practice Quiz : Introduction to Central Limit Theoremβ’3 minutes
- Practice Quiz : Statistical Inference Methodsβ’3 minutes
- Practice Quiz : Statistical Hypothesis and Significance Testingβ’3 minutes
- Practice Quiz : Parametric and Non Parametric Testsβ’3 minutes
- Knowledge Check : Inferential Statisticsβ’20 minutes
1 discussion promptβ’Total 10 minutes
- Have any of the you used the above methods in their professional careersβ’10 minutes
In the fourth module, learners will explore implementing Exploratory Data Analysis (EDA) on large, complex datasets by conducting both univariate and multivariate analysis. They will also learn how to clean and process data, as well as perform feature engineering to prepare the data for analysis.
What's included
29 videos3 readings4 assignments1 discussion prompt
29 videosβ’Total 138 minutes
- What is EDA?β’5 minutes
- Univariate Analysis: Data and Outliersβ’4 minutes
- Univariate Analysis: Kurtosis and Chart Typesβ’4 minutes
- Multivariate Analysisβ’4 minutes
- Multivariate Analysis: Covariance, Correlation, and Associationβ’4 minutes
- Multivariate Analysis: Correlation Matrixβ’4 minutes
- Multivariate Analysis: Scatter Plots and HeatMapsβ’6 minutes
- Identifying and Handling Missing Dataβ’7 minutes
- Sampling Methodsβ’4 minutes
- Mean Median Mode Imputationβ’5 minutes
- Data Normalization and Standardization β’3 minutes
- Methods to Transform Dataβ’6 minutes
- Univariate, Bivariate and Multivariate Imputationβ’4 minutes
- Demonstration I: Understanding the Dataβ’4 minutes
- Demonstration II: Visualizing and Handling Missing Dataβ’6 minutes
- Demonstration III: Scaling and Imputation of Dataβ’4 minutes
- Demonstration IV: Train Test Splitβ’7 minutes
- Demonstration V: Stratified K-Fold Cross-Validationβ’4 minutes
- Demonstration VI: Sampling and Evaluationβ’4 minutes
- Introduction to Feature Engineering β’5 minutes
- Feature Transformationβ’6 minutes
- Encoding: One Hot Encodingβ’6 minutes
- Encoding: Label Encodingβ’2 minutes
- Autofeat Libraryβ’3 minutes
- Demonstration I: Setting up the Scenarioβ’4 minutes
- Demonstration II: Data Transformationβ’7 minutes
- Demonstration III: Encodingβ’8 minutes
- Demonstration IV: Autofeatβ’7 minutes
- Summary for Introduction to EDAβ’2 minutes
3 readingsβ’Total 30 minutes
- Understanding Exploratory Data Analysis (EDA)β’10 minutes
- Best Practices in Data Pre-processing β’10 minutes
- Overview of Autofeat libraryβ’10 minutes
4 assignmentsβ’Total 29 minutes
- Practice Quiz : Understanding EDAβ’3 minutes
- Practice Quiz : Data Cleaning and Pre-processingβ’3 minutes
- Practice Quiz : Feature Engineering and Data Transformationβ’3 minutes
- Knowledge Check : Introduction to (Exploratory Data Analysis) EDAβ’20 minutes
1 discussion promptβ’Total 10 minutes
- Explain the importance of Data Pre-processingβ’10 minutes
In this module, learners will learn how to use machine learning models to extract insights from data. They will apply regression and classification algorithms and then optimize the results produced by these models.
What's included
42 videos2 readings4 assignments1 discussion prompt
42 videosβ’Total 197 minutes
- Introduction to Linear Regressionβ’6 minutes
- Assumptions in Linear Regressionβ’4 minutes
- Working of Linear Regressionβ’4 minutes
- Cost function in Linear Regressionβ’2 minutes
- Gradient Descent in Linear Regressionβ’3 minutes
- Demonstration of Linear Regression: Building Modelβ’3 minutes
- Demonstration of Linear Regression: Testing the Modelβ’6 minutes
- Logistic Regressionβ’7 minutes
- Cost function in Logistic Regressionβ’4 minutes
- Gradient Descent in Logistic Regressionβ’4 minutes
- Importance of Sigmoid Functionβ’1 minute
- Demonstration: Logistic Regression - Data Processingβ’6 minutes
- Demonstration: Logistic Regression - Model Executionβ’7 minutes
- Classification in Machine Learningβ’5 minutes
- Decision Tree Part 1: What is Decision Tree?β’5 minutes
- Decision Tree Part 2: What is Random Forest?β’5 minutes
- Basic Terminologies of Decision Treeβ’4 minutes
- Working of Decision Treeβ’6 minutes
- Building a Decision Treeβ’6 minutes
- Advantages and Disadvantages of Decision Treeβ’6 minutes
- Demonstration Part 1: Explaining the Scenarioβ’5 minutes
- Demonstration Part 2: Exploring the Dataβ’2 minutes
- Demonstration Part 3: Profiling Reportβ’7 minutes
- Demonstration Part 4: Attrition and Univariate Graphβ’7 minutes
- Demonstration Part 5: Data Pre - processingβ’4 minutes
- Demonstration Part 6: Building Decision Treeβ’4 minutes
- Demonstration Part 7:Tree Classifierβ’5 minutes
- Demonstration Part 8: Pros and Consβ’5 minutes
- Random Forest Example Part 1: Ensemble Learning and Bagging β’4 minutes
- Random Forest Example Part 2: Working of Random Forestβ’4 minutes
- Performance Metrics for Regression - MAE and MAPE β’4 minutes
- Performance Metrics for Regression - MSE, RMSE, RMSLE and R-squareβ’4 minutes
- Confusion Matrixβ’5 minutes
- ROC and AUCβ’5 minutes
- Hyperparameter Tuning and Optimizationβ’5 minutes
- Model Selectionβ’7 minutes
- Model Evaluation β’6 minutes
- Bias Variance Trade-off β’4 minutes
- Cross Validationβ’4 minutes
- Demonstration I: Grid Search - Analyze the Dataβ’4 minutes
- Demonstration II: Grid Search - Building Modelβ’4 minutes
- Summary of Predictive Modelsβ’2 minutes
2 readingsβ’Total 20 minutes
- Regularization in Regressionβ’10 minutes
- Optuna: A Powerful Tool for Hyperparameter Optimizationβ’10 minutes
4 assignmentsβ’Total 29 minutes
- Practice Quiz : Regressionβ’3 minutes
- Practice Quiz : Classification: Decision Tree and Random Forestβ’3 minutes
- Practice Quiz : Model Evaluation and Optimizationβ’3 minutes
- Knowledge Check : Predictive Modeling and Analysisβ’20 minutes
1 discussion promptβ’Total 10 minutes
- Which among the following evaluation metrics are easy to use and fairly accurate?β’10 minutes
This module is designed to assess an individual on the various concepts and teachings covered in this course. Evaluate your knowledge with a comprehensive graded quiz on Probability, Statistical Modeling, and Machine Learning.
What's included
1 video1 reading1 assignment1 discussion prompt
1 videoβ’Total 2 minutes
- Course Summary of Predictive Modeling with Pythonβ’2 minutes
1 readingβ’Total 30 minutes
- Black Friday - Sales Analysisβ’30 minutes
1 assignmentβ’Total 20 minutes
- Knowledge Check : Predictive Modeling with Pythonβ’20 minutes
1 discussion promptβ’Total 10 minutes
- Describe Your Learning Journeyβ’10 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor
Offered by
Explore more from Probability and Statistics
- Status: Preview
Course
- Status: Free TrialU
University of Pennsylvania
Course
- Status: Free Trial
Course
- Status: Free Trial
Course
Why people choose Coursera for their career
Frequently asked questions
This course is designed with emphasizes on predictive modeling and statistical analysis, providing learners with the skills and methods to examine data, discern trends, and make well-informed forecasts about future results.
Predictive Modeling with Python is tailored for professionals and enthusiasts seeking to deepen their expertise in predictive modeling and statistical analysis, including data analysts, aspiring data scientists, business leaders, and individuals dedicated to data-driven decision-making.
The course spans approximately 6 weeks, allowing flexibility based on the learner's pace, with an estimated weekly commitment of 2-3 hours for lectures, practical projects, and assessments.
More questions
Financial aid available,
