AI Workflow: Feature Engineering and Bias Detection
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
AI Workflow: Feature Engineering and Bias Detection
This course is part of IBM AI Enterprise Workflow Specialization
Instructors: Mark J Grover
9,194 already enrolled
Included with
Learn more
Ask Coursera
82 reviews
82 reviews
Skills you'll gain
- Machine Learning
- Anomaly Detection
- Text Mining
- Responsible AI
- Natural Language Processing
- Unsupervised Learning
- Exploratory Data Analysis
- Data Science
- Data Preprocessing
- Machine Learning Algorithms
- Data Ethics
- Feature Engineering
- Quality Assurance
- Design Thinking
- Data Transformation
- Dimensionality Reduction
- Data Pipelines
Details to know
10 assignments
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 2 modules in this course
This is the third course in the IBM AI Enterprise Workflow Certification specialization. You are STRONGLY encouraged to complete these courses in order as they are not individual independent courses, but part of a workflow where each course builds on the previous ones.
Course 3 introduces you to the next stage of the workflow for our hypothetical media company. In this stage of work you will learn best practices for feature engineering, handling class imbalances and detecting bias in the data. Class imbalances can seriously affect the validity of your machine learning models, and the mitigation of bias in data is essential to reducing the risk associated with biased models. These topics will be followed by sections on best practices for dimension reduction, outlier detection, and unsupervised learning techniques for finding patterns in your data. The case studies will focus on topic modeling and data visualization. By the end of this course you will be able to: 1. Employ the tools that help address class and class imbalance issues 2. Explain the ethical considerations regarding bias in data 3. Employ ai Fairness 360 open source libraries to detect bias in models 4. Employ dimension reduction techniques for both EDA and transformations stages 5. Describe topic modeling techniques in natural language processing 6. Use topic modeling and visualization to explore text data 7. Employ outlier handling best practices in high dimension data 8. Employ outlier detection algorithms as a quality assurance tool and a modeling tool 9. Employ unsupervised learning techniques using pipelines as part of the AI workflow 10. Employ basic clustering algorithms Who should take this course? This course targets existing data science practitioners that have expertise building machine learning models, who want to deepen their skills on building and deploying AI in large enterprises. If you are an aspiring Data Scientist, this course is NOT for you as you need real world expertise to benefit from the content of these courses. What skills should you have? It is assumed that you have completed Courses 1 and 2 of the IBM AI Enterprise Workflow specialization and you have a solid understanding of the following topics prior to starting this course: Fundamental understanding of Linear Algebra; Understand sampling, probability theory, and probability distributions; Knowledge of descriptive and inferential statistical concepts; General understanding of machine learning techniques and best practices; Practiced understanding of Python and the packages commonly used in data science: NumPy, Pandas, matplotlib, scikit-learn; Familiarity with IBM Watson Studio; Familiarity with the design thinking process.
This module will introduce you to skills required for effective feature engineering in today's business enterprises. The skills are presented as a series of best practices representing years of practical experience.
What's included
6 videos14 readings5 assignments1 ungraded lab
6 videosβ’Total 31 minutes
- Data Transformations Overviewβ’3 minutes
- Introduction to Class Imbalanceβ’2 minutes
- Class Imbalance Deep Diveβ’9 minutes
- Introduction to Dimensionality Reductionβ’2 minutes
- Dimension Reductionβ’13 minutes
- Case Study Intro / Feature Engineeringβ’2 minutes
14 readingsβ’Total 162 minutes
- Data Transformation: Through the eyes of our Working Exampleβ’3 minutes
- Transforms with scikit-learnβ’3 minutes
- Pipelinesβ’3 minutes
- Class imbalance: Through the Eyes of our Working Exampleβ’3 minutes
- Class Imbalanceβ’5 minutes
- Sampling Techniquesβ’2 minutes
- Models that Naturally Handle Imbalanceβ’2 minutes
- Data Biasβ’2 minutes
- Dimensionality Reduction: Through the Eyes of Our Working Exampleβ’3 minutes
- Why is Dimensionality Reduction Important?β’3 minutes
- Dimensionality Reduction and Topic modelsβ’5 minutes
- Topic modeling: Through the Eyes of our Working Exampleβ’3 minutes
- Getting Started with the Topic Modeling Case Study (hands-on)β’120 minutes
- Data Transforms and Feature Engineering: Summary/Reviewβ’5 minutes
5 assignmentsβ’Total 103 minutes
- Data Transforms and Feature Engineering: End of Module Quizβ’10 minutes
- Getting Started: Check for Understandingβ’30 minutes
- Class Imbalance, Data Bias: Check for Understandingβ’30 minutes
- Dimensionality Reduction: Check for Understandingβ’3 minutes
- CASE STUDY - Topic Modeling: Check for Understandingβ’30 minutes
1 ungraded labβ’Total 60 minutes
- Case Study Answer Key Notebookβ’60 minutes
This module will continue the discussion of skill related to feature engineering for practicing data scientists, with a focus on outliers and the use of unsupervised learning techniques for finding patterns.
What's included
5 videos11 readings5 assignments1 ungraded lab
5 videosβ’Total 16 minutes
- Exploring IBM's AI Fairness 360 Toolkitβ’2 minutes
- Introduction to Outliersβ’3 minutes
- Outlier Detectionβ’3 minutes
- Introduction to Unsupervised learningβ’2 minutes
- Unsupervised Learningβ’6 minutes
11 readingsβ’Total 172 minutes
- ai360: Through the Eyes of our Working Exampleβ’3 minutes
- Introduction to 360 (hands-on)β’15 minutes
- Outlier Detection: Through the Eyes of our Working Exampleβ’3 minutes
- Outliersβ’3 minutes
- Unsupervised learning: Through the Eyes of our Working Exampleβ’3 minutes
- An Overview of Unsupervised Learningβ’2 minutes
- Clusteringβ’3 minutes
- Clustering Evaluationβ’3 minutes
- Clustering: Through the Eyes of our Working Exampleβ’3 minutes
- Getting Started with the Clustering Case Study (hands-on)β’130 minutes
- Pattern Recognition and Data Mining Best Practices: Summary/Reviewβ’4 minutes
5 assignmentsβ’Total 132 minutes
- Pattern Recognition and Data Mining Best Practices: End of Module Quizβ’12 minutes
- ai360 Tutorial: Check for Understandingβ’30 minutes
- Outlier Detection: Check for Understandingβ’30 minutes
- Unsupervised Learning: Check for Understandingβ’30 minutes
- CASE STUDY - Clustering: Check for Understandingβ’30 minutes
1 ungraded labβ’Total 60 minutes
- Case Study Answer Key Notebookβ’60 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructors
Offered by
Explore more from Machine Learning
Why people choose Coursera for their career
Learner reviews
- 5 stars
62.19%
- 4 stars
24.39%
- 3 stars
8.53%
- 2 stars
2.43%
- 1 star
2.43%
Showing 3 of 82
Reviewed on Jul 5, 2020
Dear Team,Namaste !! Well...All Instructer Very Help Full ...Quick Reply for any Queries ...Concept Clearance.Thanks & RegardsNeela Mistry
Reviewed on May 3, 2020
It's quite good but the content could be more in-depth as an 'advance' course.
Frequently asked questions
This course assumes that you are already familiar with basic data science concepts including probability and statistics, linear algebra, machine learning, and the use of Python and Jupyter. It is assumed you have completed the first two courses of the specialization: AI Workflow: Business Priorities and Data Ingestion, AI Workflow: Data Analysis and Hypothesis Testing.
No. The certification exam is administered by Pearson VUE and must be taken at one of their testing facilities. You may visit their site at https://home.pearsonvue.com/ for more information.
Please visit the Pearson VUE web site at https://home.pearsonvue.com/ for the latest information on taking the AI Enterprise Workflow certification test.
More questions
Financial aid available,
