Data Prep for Machine Learning in Python
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Data Prep for Machine Learning in Python
This course is part of Practical Data Science for Data Analysts Specialization
Included with
Learn more
Ask Coursera
Skills you'll gain
Tools you'll learn
Details to know
2 assignments
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 10 modules in this course
Machine learning models rely on good data to produce meaningful insights. For that reason, data prep is one of the most critical skills for machine learning.
In this course, youβll learn how to import and clean data before populating missing values using imputation. Youβll learn how to visualize histograms, scatter charts, and box plots to identify trends of interest before using the analysis to select the most important features. Feature engineering techniques such as one hot encoding, binning and scaling will help us transform the structure of our data to produce higher quality machine learning insights. This data prep course in Python includes more interactive exercises and challenges than previous BIDA courses have. You will also have the opportunity to test your skills on a comprehensive guided Python case study before completing the final exam. Upon completing this course, you will be able to: β’ Import and clean your data in Python β’ Apply imputation to estimate missing values in the dataset β’ Conduct exploratory data analysis (EDA) to find initial patterns to guide our analysis β’ Select features to focus on the most important variables β’ Apply feature engineering to make datasets machine learning-friendly β’ Select appropriate feature engineering techniques based on the model type Whether you are a business leader or an aspiring analyst exploring data science, this Data Prep for Machine Learning in Python course will serve as your comprehensive introduction to this fascinating subject. Youβll learn all the key terminology to allow you to talk data science with your teams, begin implementing analysis, and understand how data science can help your business.
In this course, weβll learn how to import and clean data before populating missing values using imputation. Weβll learn how to visualize histograms, scatter charts, and box plots to identify trends of interest before using the analysis to select the most important features. Feature engineering techniques such as one hot encoding, binning and scaling will help us transform the structure of our data to produce higher quality machine learning insights.
What's included
3 videos1 reading
3 videosβ’Total 6 minutes
- Course Introductionβ’1 minute
- Pre-requisite Knowledgeβ’2 minutes
- A Quick Guide to Course Structure, Notebooks, and Exercisesβ’3 minutes
1 readingβ’Total 10 minutes
- Downloadable Filesβ’10 minutes
What's included
18 videos
18 videosβ’Total 46 minutes
- Introduction - Importing & Cleaning Dataβ’1 minute
- Importing Data - CSV, Excel and SQLβ’5 minutes
- Selecting Columnsβ’3 minutes
- Filtering Rowsβ’3 minutes
- Exercise - Import & Filter Dataβ’1 minute
- Exercise Review - Import & Filter Dataβ’2 minutes
- Data Types Theoryβ’3 minutes
- Basic Data Validationβ’4 minutes
- Comparing to a Trusted Datasourceβ’4 minutes
- Exercise - Data Validationβ’1 minute
- Exercise Review - Data Validationβ’1 minute
- Imputation Theoryβ’2 minutes
- Cleaning Dataβ’4 minutes
- Data Type Errorsβ’2 minutes
- Imputation with Zerosβ’2 minutes
- Basic Imputation of Valuesβ’4 minutes
- Exercise - Cleaning & Imputationβ’1 minute
- Exercise Review - Cleaning & Imputationβ’3 minutes
What's included
11 videos
11 videosβ’Total 28 minutes
- Introduction - Exploratory Data Analysisβ’2 minutes
- Descriptive Stats for Numeric Featuresβ’2 minutes
- Basic Plots for Numeric Features + Combining Axis & Functionsβ’4 minutes
- Basic Plots for Categorical Featuresβ’3 minutes
- Exercise - Visuals for Numeric & Categoric Featuresβ’2 minutes
- Exercise Review - Visuals for Numeric & Categoric Featuresβ’3 minutes
- Continuous vs Continuous Variable Analysis 1β’3 minutes
- Continuous vs Continuous Variable Analysis Part 2β’3 minutes
- Categorical vs Continuous Variable Analysisβ’3 minutes
- Exercise - Creating and Analyzing Multivariate Plotsβ’3 minutes
- Exercise Review - Creating and Analyzing Multivariate Plotsβ’3 minutes
What's included
2 videos
2 videosβ’Total 5 minutes
- Training Vs Testingβ’1 minute
- Train-Test Split in SKLearnβ’5 minutes
What's included
1 assignment
1 assignmentβ’Total 45 minutes
- Week 1 Challengeβ’45 minutes
What's included
16 videos
16 videosβ’Total 46 minutes
- Introduction - Feature Engineeringβ’2 minutes
- Training Vs Testing Theoryβ’4 minutes
- Encoding Theory (inc One Hot Encoding)β’3 minutes
- Identifying Categorical Columns & Valuesβ’2 minutes
- One Hot Encoding in Pandasβ’3 minutes
- One Hot Encoding in SKLearnβ’6 minutes
- Exercise - One Hot Encodingβ’1 minute
- Exercise Review - One Hot Encodingβ’2 minutes
- Exercise Review On Hot Encoding Pt 2β’3 minutes
- GetDummies vs OneHotEncoderβ’4 minutes
- Transforming Distributions Theoryβ’3 minutes
- Identifying Skew in Pythonβ’3 minutes
- Transforming Features in Pythonβ’5 minutes
- Taking Logs Scenariosβ’2 minutes
- Exercise - Transformationsβ’1 minute
- Exercise Review - Transformationsβ’2 minutes
What's included
24 videos
24 videosβ’Total 60 minutes
- Outliers Theoryβ’1 minute
- Removing Outliersβ’4 minutes
- Modifying Outliersβ’2 minutes
- Exercise - Outliersβ’0 minutes
- Exercise Review - Outliersβ’1 minute
- Binning Theoryβ’2 minutes
- Categorical Binningβ’4 minutes
- Binning by Width & Frequencyβ’5 minutes
- Manual Binningβ’3 minutes
- Final Thoughts on Binningβ’2 minutes
- Smoothingβ’1 minute
- Smoothing in Practiceβ’2 minutes
- Exercise - Binningβ’1 minute
- Exercise Review - Binningβ’2 minutes
- Advanced Thoughts on Binningβ’2 minutes
- Why Feature Scaling Mattersβ’2 minutes
- Scaling Features Theoryβ’5 minutes
- Min Max Scalingβ’4 minutes
- Scaling Testing Dataβ’3 minutes
- Final Thoughts on Scalingβ’1 minute
- Standard Scalerβ’3 minutes
- Exercise - Scalingβ’1 minute
- Exercise Review - Scalingβ’4 minutes
- Making Feature Engineering Decisionsβ’3 minutes
What's included
9 videos
9 videosβ’Total 21 minutes
- Introduction - Feature Selectionβ’1 minute
- Manual Feature Selectionβ’4 minutes
- Feature Selection with Continuous Targetβ’2 minutes
- Correlation Coefficients - Continuous Var + Continuous Featureβ’3 minutes
- ANOVA - Continuous Target + Categorical Featureβ’5 minutes
- Feature Selection with Categorical Target Variableβ’1 minute
- Box Plots - Categorical Var + Continous Featureβ’3 minutes
- Chi-square - Categorical Var + Categorical Featureβ’2 minutes
- Summary of Feature Selection Techniquesβ’0 minutes
What's included
1 video
1 videoβ’Total 1 minute
- Conclusionβ’1 minute
What's included
1 assignment
1 assignmentβ’Total 80 minutes
- Week 2 Challengeβ’80 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor
Offered by
Explore more from Finance
- Status: Free TrialI
Illinois Tech
Course
- Status: Free Trial
Course
- Status: Free Trial
Course
- Status: Free Trial
Why people choose Coursera for their career
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Yes. In select learning programs, you can apply for financial aid or a scholarship if you canβt afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, youβll find a link to apply on the description page.
More questions
Financial aid available,
