VOOZH about

URL: https://www.coursera.org/learn/data-prep-for-machine-learning-python

⇱ Data Prep for Machine Learning in Python | Coursera


Data Prep for Machine Learning in Python

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

Gain insight into a topic and learn the fundamentals.
Advanced level
Designed for those already in the industry
6 hours to complete
Flexible schedule
Learn at your own pace

Gain insight into a topic and learn the fundamentals.
Advanced level
Designed for those already in the industry
6 hours to complete
Flexible schedule
Learn at your own pace

Build your subject-matter expertise

This course is part of the Practical Data Science for Data Analysts Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 10 modules in this course

Machine learning models rely on good data to produce meaningful insights. For that reason, data prep is one of the most critical skills for machine learning.

In this course, you’ll learn how to import and clean data before populating missing values using imputation. You’ll learn how to visualize histograms, scatter charts, and box plots to identify trends of interest before using the analysis to select the most important features. Feature engineering techniques such as one hot encoding, binning and scaling will help us transform the structure of our data to produce higher quality machine learning insights. This data prep course in Python includes more interactive exercises and challenges than previous BIDA courses have. You will also have the opportunity to test your skills on a comprehensive guided Python case study before completing the final exam. Upon completing this course, you will be able to: β€’ Import and clean your data in Python β€’ Apply imputation to estimate missing values in the dataset β€’ Conduct exploratory data analysis (EDA) to find initial patterns to guide our analysis β€’ Select features to focus on the most important variables β€’ Apply feature engineering to make datasets machine learning-friendly β€’ Select appropriate feature engineering techniques based on the model type Whether you are a business leader or an aspiring analyst exploring data science, this Data Prep for Machine Learning in Python course will serve as your comprehensive introduction to this fascinating subject. You’ll learn all the key terminology to allow you to talk data science with your teams, begin implementing analysis, and understand how data science can help your business.

In this course, we’ll learn how to import and clean data before populating missing values using imputation. We’ll learn how to visualize histograms, scatter charts, and box plots to identify trends of interest before using the analysis to select the most important features. Feature engineering techniques such as one hot encoding, binning and scaling will help us transform the structure of our data to produce higher quality machine learning insights.

What's included

3 videos1 reading

3 videosβ€’Total 6 minutes
  • Course Introductionβ€’1 minute
  • Pre-requisite Knowledgeβ€’2 minutes
  • A Quick Guide to Course Structure, Notebooks, and Exercisesβ€’3 minutes
1 readingβ€’Total 10 minutes
  • Downloadable Filesβ€’10 minutes

What's included

18 videos

18 videosβ€’Total 46 minutes
  • Introduction - Importing & Cleaning Dataβ€’1 minute
  • Importing Data - CSV, Excel and SQLβ€’5 minutes
  • Selecting Columnsβ€’3 minutes
  • Filtering Rowsβ€’3 minutes
  • Exercise - Import & Filter Dataβ€’1 minute
  • Exercise Review - Import & Filter Dataβ€’2 minutes
  • Data Types Theoryβ€’3 minutes
  • Basic Data Validationβ€’4 minutes
  • Comparing to a Trusted Datasourceβ€’4 minutes
  • Exercise - Data Validationβ€’1 minute
  • Exercise Review - Data Validationβ€’1 minute
  • Imputation Theoryβ€’2 minutes
  • Cleaning Dataβ€’4 minutes
  • Data Type Errorsβ€’2 minutes
  • Imputation with Zerosβ€’2 minutes
  • Basic Imputation of Valuesβ€’4 minutes
  • Exercise - Cleaning & Imputationβ€’1 minute
  • Exercise Review - Cleaning & Imputationβ€’3 minutes

What's included

11 videos

11 videosβ€’Total 28 minutes
  • Introduction - Exploratory Data Analysisβ€’2 minutes
  • Descriptive Stats for Numeric Featuresβ€’2 minutes
  • Basic Plots for Numeric Features + Combining Axis & Functionsβ€’4 minutes
  • Basic Plots for Categorical Featuresβ€’3 minutes
  • Exercise - Visuals for Numeric & Categoric Featuresβ€’2 minutes
  • Exercise Review - Visuals for Numeric & Categoric Featuresβ€’3 minutes
  • Continuous vs Continuous Variable Analysis 1β€’3 minutes
  • Continuous vs Continuous Variable Analysis Part 2β€’3 minutes
  • Categorical vs Continuous Variable Analysisβ€’3 minutes
  • Exercise - Creating and Analyzing Multivariate Plotsβ€’3 minutes
  • Exercise Review - Creating and Analyzing Multivariate Plotsβ€’3 minutes

What's included

2 videos

2 videosβ€’Total 5 minutes
  • Training Vs Testingβ€’1 minute
  • Train-Test Split in SKLearnβ€’5 minutes

What's included

1 assignment

1 assignmentβ€’Total 45 minutes
  • Week 1 Challengeβ€’45 minutes

What's included

16 videos

16 videosβ€’Total 46 minutes
  • Introduction - Feature Engineeringβ€’2 minutes
  • Training Vs Testing Theoryβ€’4 minutes
  • Encoding Theory (inc One Hot Encoding)β€’3 minutes
  • Identifying Categorical Columns & Valuesβ€’2 minutes
  • One Hot Encoding in Pandasβ€’3 minutes
  • One Hot Encoding in SKLearnβ€’6 minutes
  • Exercise - One Hot Encodingβ€’1 minute
  • Exercise Review - One Hot Encodingβ€’2 minutes
  • Exercise Review On Hot Encoding Pt 2β€’3 minutes
  • GetDummies vs OneHotEncoderβ€’4 minutes
  • Transforming Distributions Theoryβ€’3 minutes
  • Identifying Skew in Pythonβ€’3 minutes
  • Transforming Features in Pythonβ€’5 minutes
  • Taking Logs Scenariosβ€’2 minutes
  • Exercise - Transformationsβ€’1 minute
  • Exercise Review - Transformationsβ€’2 minutes

What's included

24 videos

24 videosβ€’Total 60 minutes
  • Outliers Theoryβ€’1 minute
  • Removing Outliersβ€’4 minutes
  • Modifying Outliersβ€’2 minutes
  • Exercise - Outliersβ€’0 minutes
  • Exercise Review - Outliersβ€’1 minute
  • Binning Theoryβ€’2 minutes
  • Categorical Binningβ€’4 minutes
  • Binning by Width & Frequencyβ€’5 minutes
  • Manual Binningβ€’3 minutes
  • Final Thoughts on Binningβ€’2 minutes
  • Smoothingβ€’1 minute
  • Smoothing in Practiceβ€’2 minutes
  • Exercise - Binningβ€’1 minute
  • Exercise Review - Binningβ€’2 minutes
  • Advanced Thoughts on Binningβ€’2 minutes
  • Why Feature Scaling Mattersβ€’2 minutes
  • Scaling Features Theoryβ€’5 minutes
  • Min Max Scalingβ€’4 minutes
  • Scaling Testing Dataβ€’3 minutes
  • Final Thoughts on Scalingβ€’1 minute
  • Standard Scalerβ€’3 minutes
  • Exercise - Scalingβ€’1 minute
  • Exercise Review - Scalingβ€’4 minutes
  • Making Feature Engineering Decisionsβ€’3 minutes

What's included

9 videos

9 videosβ€’Total 21 minutes
  • Introduction - Feature Selectionβ€’1 minute
  • Manual Feature Selectionβ€’4 minutes
  • Feature Selection with Continuous Targetβ€’2 minutes
  • Correlation Coefficients - Continuous Var + Continuous Featureβ€’3 minutes
  • ANOVA - Continuous Target + Categorical Featureβ€’5 minutes
  • Feature Selection with Categorical Target Variableβ€’1 minute
  • Box Plots - Categorical Var + Continous Featureβ€’3 minutes
  • Chi-square - Categorical Var + Categorical Featureβ€’2 minutes
  • Summary of Feature Selection Techniquesβ€’0 minutes

What's included

1 video

1 videoβ€’Total 1 minute
  • Conclusionβ€’1 minute

What's included

1 assignment

1 assignmentβ€’Total 80 minutes
  • Week 2 Challengeβ€’80 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

Corporate Finance Institute
47 Coursesβ€’146,269 learners

Explore more from Finance

Why people choose Coursera for their career

πŸ‘ Image

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
πŸ‘ Image

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
πŸ‘ Image

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
πŸ‘ Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Financial aid available,