VOOZH about

URL: https://www.coursera.org/learn/predictive-modeling-with-python

⇱ Predictive Modeling with Python | Coursera


Predictive Modeling with Python

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

Predictive Modeling with Python

Included with

β€’

Learn more

Ask Coursera

Gain insight into a topic and learn the fundamentals.
Intermediate level

Recommended experience

2 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

Gain insight into a topic and learn the fundamentals.
Intermediate level

Recommended experience

2 weeks to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Manage and preprocess data for statistical analysis and modeling.

  • Conduct hypothesis testing using advanced statistical techniques.

  • Build exploratory data analysis (EDA) models to uncover insights.

  • Build and evaluate models to solve real-world data challenges.

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

23 assignments

Taught in English

Build your subject-matter expertise

This course is part of the Applied Data Analytics Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 6 modules in this course

This Predictive Modeling with Python course provides a practical introduction to statistical analysis and machine learning with Python. You will learn essential machine learning concepts, methods, and algorithms with a strong emphasis on applying them to solve real-world business and data problems.

By the end of the course, you will: - Understand different data types used in statistical analysis. - Learn techniques to manage inconsistent data effectively. - Perform hypothesis testing using parametric and non-parametric tests. - Develop exploratory data analysis (EDA) models using statistical and machine learning methods. - Enhance machine learning models through evaluation and optimization techniques. This course is designed for individuals with a foundational knowledge of Python programming and basic statistical concepts. This course is ideal for aspiring data analysts, data scientists, business executives, machine learning engineers, and anyone passionate about data-driven decision-making Throughout the program, you will gain hands-on experience in statistical and predictive modeling and apply your skills to real-world scenarios. Enroll in "Predictive Modeling with Python" today and take your expertise to the next level!

In the first module of this course, learners will explore various data types and utilize different measures of central tendency and measures of dispersion to address data inconsistencies.

What's included

12 videos3 readings4 assignments2 discussion prompts

12 videosβ€’Total 40 minutes
  • Course Introductionβ€’3 minutes
  • Data Typesβ€’3 minutes
  • Categorical Dataβ€’2 minutes
  • Nominal Dataβ€’2 minutes
  • Demonstration of Data Types - Dataset Descriptionβ€’4 minutes
  • Demonstration of Typesβ€’5 minutes
  • What is Statistics?β€’4 minutes
  • Measures of Central Tendencyβ€’3 minutes
  • Demonstration of Central Tendencyβ€’4 minutes
  • Measures of Dispersionβ€’3 minutes
  • Demonstration: Measures of Dispersionβ€’4 minutes
  • Summary of Data and Informationβ€’2 minutes
3 readingsβ€’Total 30 minutes
  • Welcome to Predictive Modeling with Pythonβ€’10 minutes
  • Applications of Central Tendency in Statistics β€’10 minutes
  • Libraries for Statistical Analysisβ€’10 minutes
4 assignmentsβ€’Total 29 minutes
  • Practice Quiz : Data Types in Statisticsβ€’3 minutes
  • Practice Quiz : Statistical Analysisβ€’3 minutes
  • Practice Quiz : Statistical Dispersionβ€’3 minutes
  • Knowledge Check : Data and Informationβ€’20 minutes
2 discussion promptsβ€’Total 20 minutes
  • Introduce Yourselfβ€’10 minutes
  • Why are measures of dispersion important in understanding datasets?β€’10 minutes

In this module, learners will learn to manage data using probability distribution functions. Learners will start by applying the Bernoulli distribution to model categorical data, explore the Poisson distribution for forecasting, and utilize the Exponential and Normal distributions for regression modeling.

What's included

17 videos3 readings5 assignments

17 videosβ€’Total 69 minutes
  • Probability Density and Mass Functionβ€’4 minutes
  • Cumulative Distribution Functionβ€’4 minutes
  • Discrete Probabilityβ€’2 minutes
  • Negative Bernoulli Distributionβ€’2 minutes
  • Demonstration of Negative Bernoulli Distributionβ€’7 minutes
  • Geometric Distributionβ€’2 minutes
  • Demonstration of Geometric Distributionβ€’7 minutes
  • Poisson Distributionβ€’5 minutes
  • Example of Poisson Distributionβ€’2 minutes
  • Demonstration of Poisson Distributionβ€’4 minutes
  • Continuous Probability Distributionβ€’5 minutes
  • Uniform Distributionβ€’4 minutes
  • Exponential Distributionβ€’3 minutes
  • Demonstration of Exponential Distributionβ€’3 minutes
  • Normal Distributionβ€’6 minutes
  • Demonstration of Normal Distributionβ€’6 minutes
  • Summary of Probability Distribution Functionsβ€’2 minutes
3 readingsβ€’Total 30 minutes
  • Example of PDF and PMFβ€’10 minutes
  • Importance of Negative Bernoulli and Geometric Distributionsβ€’10 minutes
  • Continuous Probability Distribution and Uniform Distribution: Mathematical Exampleβ€’10 minutes
5 assignmentsβ€’Total 32 minutes
  • Practice Quiz : General Probability Distributionβ€’3 minutes
  • Practice Quiz : Negative Bernoulli and Geometric Distributionβ€’3 minutes
  • Practice Quiz : Poisson and Uniform Distributionβ€’3 minutes
  • Practice Quiz : Exponential and Normal Distributionβ€’3 minutes
  • Knowledge Check : Probability Distribution Functionβ€’20 minutes

In the third module of this course, Learners will learn to apply the Central Limit Theorem in scenarios where data may be improperly distributed. Identify and analyze sample data, using both parametric and non-parametric methods to handle various test cases for hypothesis testing and decision-making.

What's included

30 videos3 readings5 assignments1 discussion prompt

30 videosβ€’Total 138 minutes
  • Central Limit Theoremβ€’5 minutes
  • Demonstration of Central Limit Theoremβ€’6 minutes
  • Demontration: Conclusion of Central Limit Theoremβ€’3 minutes
  • Population and Sample Spaceβ€’3 minutes
  • Parameter and Statisticsβ€’5 minutes
  • Forms of Inferential Statisticsβ€’1 minute
  • Point and Interval Estimationβ€’4 minutes
  • Maximum Likelihoodβ€’4 minutes
  • Demonstration Exploring Data β€’5 minutes
  • Demonstration: Drawing Sample Data β€’4 minutes
  • Hypothesis Testingβ€’5 minutes
  • Hypothesis Testing Exampleβ€’5 minutes
  • Statistical Test Implementationβ€’6 minutes
  • One Tailed and Two Tailed Testβ€’5 minutes
  • Z - Test and T - Testβ€’6 minutes
  • Power Analysisβ€’6 minutes
  • Demonstration of Confidence Interval and Margin of Error β€’3 minutes
  • Demonstration of Hypothesis Testingβ€’4 minutes
  • Demonstrating Power Analysisβ€’8 minutes
  • Chi - Square Testβ€’4 minutes
  • Pearson and Spearman Correlationβ€’6 minutes
  • Chi square Test Demonstrationβ€’6 minutes
  • Pearson Correlation Demonstrationβ€’6 minutes
  • Spearman Correlation Demonstrationβ€’7 minutes
  • ANOVAβ€’6 minutes
  • Example for One Way ANOVA - Part 1β€’2 minutes
  • Example for Two Way ANOVA - Part 2β€’2 minutes
  • Demonstration for One way ANOVAβ€’5 minutes
  • Demonstration for Two way ANOVAβ€’3 minutes
  • Summary for Inferential Statisticsβ€’2 minutes
3 readingsβ€’Total 30 minutes
  • Central Limit Theorem (CLT): Mathematical Exampleβ€’10 minutes
  • Statistical Inference Real World Applicationsβ€’10 minutes
  • Shapiro-Wilk Test β€’10 minutes
5 assignmentsβ€’Total 32 minutes
  • Practice Quiz : Introduction to Central Limit Theoremβ€’3 minutes
  • Practice Quiz : Statistical Inference Methodsβ€’3 minutes
  • Practice Quiz : Statistical Hypothesis and Significance Testingβ€’3 minutes
  • Practice Quiz : Parametric and Non Parametric Testsβ€’3 minutes
  • Knowledge Check : Inferential Statisticsβ€’20 minutes
1 discussion promptβ€’Total 10 minutes
  • Have any of the you used the above methods in their professional careersβ€’10 minutes

In the fourth module, learners will explore implementing Exploratory Data Analysis (EDA) on large, complex datasets by conducting both univariate and multivariate analysis. They will also learn how to clean and process data, as well as perform feature engineering to prepare the data for analysis.

What's included

29 videos3 readings4 assignments1 discussion prompt

29 videosβ€’Total 138 minutes
  • What is EDA?β€’5 minutes
  • Univariate Analysis: Data and Outliersβ€’4 minutes
  • Univariate Analysis: Kurtosis and Chart Typesβ€’4 minutes
  • Multivariate Analysisβ€’4 minutes
  • Multivariate Analysis: Covariance, Correlation, and Associationβ€’4 minutes
  • Multivariate Analysis: Correlation Matrixβ€’4 minutes
  • Multivariate Analysis: Scatter Plots and HeatMapsβ€’6 minutes
  • Identifying and Handling Missing Dataβ€’7 minutes
  • Sampling Methodsβ€’4 minutes
  • Mean Median Mode Imputationβ€’5 minutes
  • Data Normalization and Standardization β€’3 minutes
  • Methods to Transform Dataβ€’6 minutes
  • Univariate, Bivariate and Multivariate Imputationβ€’4 minutes
  • Demonstration I: Understanding the Dataβ€’4 minutes
  • Demonstration II: Visualizing and Handling Missing Dataβ€’6 minutes
  • Demonstration III: Scaling and Imputation of Dataβ€’4 minutes
  • Demonstration IV: Train Test Splitβ€’7 minutes
  • Demonstration V: Stratified K-Fold Cross-Validationβ€’4 minutes
  • Demonstration VI: Sampling and Evaluationβ€’4 minutes
  • Introduction to Feature Engineering β€’5 minutes
  • Feature Transformationβ€’6 minutes
  • Encoding: One Hot Encodingβ€’6 minutes
  • Encoding: Label Encodingβ€’2 minutes
  • Autofeat Libraryβ€’3 minutes
  • Demonstration I: Setting up the Scenarioβ€’4 minutes
  • Demonstration II: Data Transformationβ€’7 minutes
  • Demonstration III: Encodingβ€’8 minutes
  • Demonstration IV: Autofeatβ€’7 minutes
  • Summary for Introduction to EDAβ€’2 minutes
3 readingsβ€’Total 30 minutes
  • Understanding Exploratory Data Analysis (EDA)β€’10 minutes
  • Best Practices in Data Pre-processing β€’10 minutes
  • Overview of Autofeat libraryβ€’10 minutes
4 assignmentsβ€’Total 29 minutes
  • Practice Quiz : Understanding EDAβ€’3 minutes
  • Practice Quiz : Data Cleaning and Pre-processingβ€’3 minutes
  • Practice Quiz : Feature Engineering and Data Transformationβ€’3 minutes
  • Knowledge Check : Introduction to (Exploratory Data Analysis) EDAβ€’20 minutes
1 discussion promptβ€’Total 10 minutes
  • Explain the importance of Data Pre-processingβ€’10 minutes

In this module, learners will learn how to use machine learning models to extract insights from data. They will apply regression and classification algorithms and then optimize the results produced by these models.

What's included

42 videos2 readings4 assignments1 discussion prompt

42 videosβ€’Total 197 minutes
  • Introduction to Linear Regressionβ€’6 minutes
  • Assumptions in Linear Regressionβ€’4 minutes
  • Working of Linear Regressionβ€’4 minutes
  • Cost function in Linear Regressionβ€’2 minutes
  • Gradient Descent in Linear Regressionβ€’3 minutes
  • Demonstration of Linear Regression: Building Modelβ€’3 minutes
  • Demonstration of Linear Regression: Testing the Modelβ€’6 minutes
  • Logistic Regressionβ€’7 minutes
  • Cost function in Logistic Regressionβ€’4 minutes
  • Gradient Descent in Logistic Regressionβ€’4 minutes
  • Importance of Sigmoid Functionβ€’1 minute
  • Demonstration: Logistic Regression - Data Processingβ€’6 minutes
  • Demonstration: Logistic Regression - Model Executionβ€’7 minutes
  • Classification in Machine Learningβ€’5 minutes
  • Decision Tree Part 1: What is Decision Tree?β€’5 minutes
  • Decision Tree Part 2: What is Random Forest?β€’5 minutes
  • Basic Terminologies of Decision Treeβ€’4 minutes
  • Working of Decision Treeβ€’6 minutes
  • Building a Decision Treeβ€’6 minutes
  • Advantages and Disadvantages of Decision Treeβ€’6 minutes
  • Demonstration Part 1: Explaining the Scenarioβ€’5 minutes
  • Demonstration Part 2: Exploring the Dataβ€’2 minutes
  • Demonstration Part 3: Profiling Reportβ€’7 minutes
  • Demonstration Part 4: Attrition and Univariate Graphβ€’7 minutes
  • Demonstration Part 5: Data Pre - processingβ€’4 minutes
  • Demonstration Part 6: Building Decision Treeβ€’4 minutes
  • Demonstration Part 7:Tree Classifierβ€’5 minutes
  • Demonstration Part 8: Pros and Consβ€’5 minutes
  • Random Forest Example Part 1: Ensemble Learning and Bagging β€’4 minutes
  • Random Forest Example Part 2: Working of Random Forestβ€’4 minutes
  • Performance Metrics for Regression - MAE and MAPE β€’4 minutes
  • Performance Metrics for Regression - MSE, RMSE, RMSLE and R-squareβ€’4 minutes
  • Confusion Matrixβ€’5 minutes
  • ROC and AUCβ€’5 minutes
  • Hyperparameter Tuning and Optimizationβ€’5 minutes
  • Model Selectionβ€’7 minutes
  • Model Evaluation β€’6 minutes
  • Bias Variance Trade-off β€’4 minutes
  • Cross Validationβ€’4 minutes
  • Demonstration I: Grid Search - Analyze the Dataβ€’4 minutes
  • Demonstration II: Grid Search - Building Modelβ€’4 minutes
  • Summary of Predictive Modelsβ€’2 minutes
2 readingsβ€’Total 20 minutes
  • Regularization in Regressionβ€’10 minutes
  • Optuna: A Powerful Tool for Hyperparameter Optimizationβ€’10 minutes
4 assignmentsβ€’Total 29 minutes
  • Practice Quiz : Regressionβ€’3 minutes
  • Practice Quiz : Classification: Decision Tree and Random Forestβ€’3 minutes
  • Practice Quiz : Model Evaluation and Optimizationβ€’3 minutes
  • Knowledge Check : Predictive Modeling and Analysisβ€’20 minutes
1 discussion promptβ€’Total 10 minutes
  • Which among the following evaluation metrics are easy to use and fairly accurate?β€’10 minutes

This module is designed to assess an individual on the various concepts and teachings covered in this course. Evaluate your knowledge with a comprehensive graded quiz on Probability, Statistical Modeling, and Machine Learning.

What's included

1 video1 reading1 assignment1 discussion prompt

1 videoβ€’Total 2 minutes
  • Course Summary of Predictive Modeling with Pythonβ€’2 minutes
1 readingβ€’Total 30 minutes
  • Black Friday - Sales Analysisβ€’30 minutes
1 assignmentβ€’Total 20 minutes
  • Knowledge Check : Predictive Modeling with Pythonβ€’20 minutes
1 discussion promptβ€’Total 10 minutes
  • Describe Your Learning Journeyβ€’10 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

Edureka
203 Coursesβ€’185,724 learners

Explore more from Probability and Statistics

Why people choose Coursera for their career

πŸ‘ Image

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
πŸ‘ Image

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
πŸ‘ Image

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
πŸ‘ Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Frequently asked questions

This course is designed with emphasizes on predictive modeling and statistical analysis, providing learners with the skills and methods to examine data, discern trends, and make well-informed forecasts about future results.

Predictive Modeling with Python is tailored for professionals and enthusiasts seeking to deepen their expertise in predictive modeling and statistical analysis, including data analysts, aspiring data scientists, business leaders, and individuals dedicated to data-driven decision-making.

The course spans approximately 6 weeks, allowing flexibility based on the learner's pace, with an estimated weekly commitment of 2-3 hours for lectures, practical projects, and assessments.

The course primarily utilizes Google Colab for coding exercises. However, learners have the option to use integrated development environments (IDEs) such as Jupyter Notebook, PyCharm, Spyder, or VS Code for more advanced coding projects, if preferred.

This course requires a foundational understanding of mathematics, particularly algebra and basic statistics. Familiarity with data analysis concepts, including data interpretation and manipulation, is also recommended. Proficiency in programming languages, especially Python, is essential for successful completion of the course.

Yes, predictive modeling is a core skill for data science, business analytics, and AI-driven decision-making roles.

Predictive modeling skills prepare you for careers as Data Scientists, Machine Learning Engineers, Business Analysts, and Risk Modelers.

Yes, you’ll receive a Coursera certificate that you can showcase to employers and add to your LinkedIn profile.

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Financial aid available,