VOOZH about

URL: https://www.coursera.org/learn/python-data-science

⇱ Python for Data Science | Coursera


Python for Data Science

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

Python for Data Science

9,771 already enrolled

Included with

Ask Coursera

Gain insight into a topic and learn the fundamentals.
4.1

148 reviews

Beginner level

Recommended experience

Flexible schedule
4 weeks at 10 hours a week
Learn at your own pace
94%
Most learners liked this course

Gain insight into a topic and learn the fundamentals.
4.1

148 reviews

Beginner level

Recommended experience

Flexible schedule
4 weeks at 10 hours a week
Learn at your own pace
94%
Most learners liked this course

What you'll learn

  • Build pandas pipelines to clean, transform, and aggregate real‑world datasets.

  • Perform EDA and compute descriptive statistics to summarize data quality and behavior.

  • Apply hypothesis tests (t‑test/chi‑square) and interpret results for business decisions.

  • Create publication‑quality charts (bar/line/box/heatmaps) with matplotlib & seaborn.

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

17 assignments

Taught in English

Build your Data Analysis expertise

This course is part of the Fractal Data Science Professional Certificate
When you enroll in this course, you'll also be enrolled in this Professional Certificate.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate from Fractal Analytics

There are 5 modules in this course

Master Python for data science with hands‑on projects. Learn pandas, statistics, and visualization to solve real‑world business problems. Build job‑ready skills in data wrangling, exploratory data analysis (EDA), and charting with matplotlib/seaborn—no prior experience required. This beginner‑friendly course guides you through cleaning messy data, applying descriptive and inferential statistics, and preparing datasets for machine learning. You’ll design analyses that answer business questions, communicate insights with compelling visuals, and complete challenging assessments aligned to workplace scenarios.

By the end, you’ll confidently manipulate data in pandas, automate workflows, and build dashboards that stakeholders understand. Start your data‑driven journey and turn raw data into decisions.

In the first module of the Python for Data Science course, learners will be introduced to the fundamental concepts of Python programming. The module begins with the basics of Python, covering essential topics like introduction to Python.Next, the module delves into working with Jupyter notebooks, a popular interactive environment for data analysis and visualization. Learners will learn how to set up Jupyter notebooks, create, run, and manage code cells, and integrate text and visualizations using Markdown. Additionally, the module will showcase real-life applications of Python in solving data-related problems. Learners will explore various data science projects and case studies where Python plays a crucial role, such as data cleaning, data manipulation, statistical analysis, and machine learning.By the end of this module, learners will have a good understanding of Python, be proficient in using Jupyter notebooks for data analysis, and comprehend how Python is used to address real-world data science challenges.

What's included

12 videos6 readings2 assignments

12 videosTotal 60 minutes
  • Welcome to python for data science6 minutes
  • Expert Talk - A data scientist's experience with Python4 minutes
  • What is python?4 minutes
  • Working with Jupyter notebooks8 minutes
  • Introduction to the problem4 minutes
  • Solution approach - Preparing tables and charts4 minutes
  • Solution approach - Gaining Insights4 minutes
  • Solution Approach - Airline traffic analysis5 minutes
  • Solution summary4 minutes
  • Expert Talk - Why Python is the language of choice for data science professionals9 minutes
  • Introduction to the Problem4 minutes
  • Exploring the Problem5 minutes
6 readingsTotal 60 minutes
  • Course syllabus10 minutes
  • Installation guide 10 minutes
  • Working effectively with Jupyter notebooks10 minutes
  • Important note!10 minutes
  • The Global Problem Statement10 minutes
  • Tell us what you think!10 minutes
2 assignmentsTotal 60 minutes
  • Python fundamentals30 minutes
  • Data Analysis 30 minutes

By the end of this module, learners will acquire essential skills in working with various types of data. They will have a solid grasp of Python programming fundamentals, including data structures and libraries. They will be proficient in loading, cleaning, and transforming data, and will possess the ability to perform exploratory data analysis, employing data visualization techniques. They will also gain insights into basic statistical concepts, such as probability, distributions, and hypothesis testing.

What's included

32 videos4 readings6 assignments2 programming assignments5 ungraded labs

32 videosTotal 174 minutes
  • Introduction1 minute
  • Diving into CSV Data7 minutes
  • Data inspection5 minutes
  • Finding missing data in the POS data7 minutes
  • Deleting missing data and saving the cleaned data set7 minutes
  • Lab data and problem3 minutes
  • A note on assessments1 minute
  • Basic data structures - lists and dictionaries14 minutes
  • Basic data structures - series3 minutes
  • Creating a data frame using lists, dictionaries and series4 minutes
  • Slicing with precision6 minutes
  • Changing the indices and saving the new DataFrame4 minutes
  • Navigating data insights6 minutes
  • Selecting data that match certain criteria4 minutes
  • Selecting data that match multiple criteria4 minutes
  • Expert Talk - Understanding your data6 minutes
  • What are the unique products in the POS data set?6 minutes
  • Finding specific values in the data7 minutes
  • How much did we sell per category? 6 minutes
  • Finding totals and averages by brand and by category6 minutes
  • Grouping by multiple attributes6 minutes
  • Displaying aggregated data in a pivot table8 minutes
  • Expert talk - How insights and data analysis guide each other5 minutes
  • Working with dates7 minutes
  • How much did we sell each month?6 minutes
  • What is the monthly average of sales?5 minutes
  • Were there specific dates when sales were high?9 minutes
  • What if we have more than one dataset?6 minutes
  • Merging some simple data sets5 minutes
  • Merging POS data with the online data6 minutes
  • Walkthrough - How to approach a graded assignment4 minutes
  • Summary1 minute
4 readingsTotal 60 minutes
  • Data cleaning with python10 minutes
  • Resources - Datasets and Jupyter notebooks 10 minutes
  • Python statistics fundamentals 10 minutes
  • Working with dates 30 minutes
6 assignmentsTotal 180 minutes
  • DataFame essentials30 minutes
  • DataFrame operations30 minutes
  • Data selection & filtering30 minutes
  • Data manipulation & aggregation30 minutes
  • Date time operations30 minutes
  • Merging & joining dataframes30 minutes
2 programming assignmentsTotal 300 minutes
  • Graded Assignment120 minutes
  • New Programming Assignment180 minutes
5 ungraded labsTotal 150 minutes
  • Data cleaning & manipulation 30 minutes
  • Data slicing & manipulations30 minutes
  • Data aggregations30 minutes
  • Practice Programming Assignment30 minutes
  • Merging the data30 minutes

By the end of this module, learners will gain a comprehensive understanding of statistical concepts, data exploration techniques, and visualization methods. Learners will develop the skills to identify patterns, outliers, and relationships in data, making informed decisions and formulating hypotheses. Ultimately, they will emerge with the ability to transform raw data into meaningful insights, effectively communicate their findings through data storytelling, and apply EDA across diverse real-world applications.

What's included

34 videos1 reading5 assignments1 programming assignment4 ungraded labs

34 videosTotal 206 minutes
  • Introduction0 minutes
  • Expert Talk - Why EDA is a superpower6 minutes
  • Finding the average of the data7 minutes
  • Understanding the spread of the data9 minutes
  • Quantiles - how to understand and visualize them7 minutes
  • Exploring variability in the POS data7 minutes
  • What shape is my data? 7 minutes
  • Understanding the distributions of features in the POS data7 minutes
  • Understanding Data Distributions4 minutes
  • Some other common shapes of data - Part I11 minutes
  • Some other common shapes of data - Part I6 minutes
  • Some other common shapes of data - Part II8 minutes
  • Some other common shapes of data - Part III8 minutes
  • What chance of revenue falls in a given range3 minutes
  • How are the features related to each other? - Part I6 minutes
  • How are the features related to each other? - Part I5 minutes
  • How are the features related to each other? - Part II5 minutes
  • How are the features related to each other? - Part II5 minutes
  • Visualizing categorical features7 minutes
  • Visualizing proportions8 minutes
  • Expert Talk - Power of visualization & its importance in storytelling 7 minutes
  • Using boxplots to compare revenues across segments in the POS data8 minutes
  • Making better visuals - Part III9 minutes
  • Communicating insights better by creating multiple subplots within the same plot3 minutes
  • Comparing the distribution of revenue for each sector by overlaying their KDE plots 8 minutes
  • Sampling our data - Part I 5 minutes
  • Sampling our data - Part II5 minutes
  • Introduction to hypothesis testing - Part I6 minutes
  • Introduction to hypothesis testing - Part II4 minutes
  • Hypothesis testing using Z - Test - Part I6 minutes
  • Hypothesis testing using Z - Test - Part II6 minutes
  • Hypothesis testing using t - Test7 minutes
  • Hypothesis testing using Chi-square test7 minutes
  • Summary1 minute
1 readingTotal 10 minutes
  • Resources - Datasets and Jupyter notebooks10 minutes
5 assignmentsTotal 150 minutes
  • Statistics fundamentals30 minutes
  • Data distributions30 minutes
  • Understanding relationships between features30 minutes
  • Practice Quiz30 minutes
  • Practice quiz30 minutes
1 programming assignmentTotal 120 minutes
  • Graded Assignment120 minutes
4 ungraded labsTotal 120 minutes
  • Understanding data distributions 30 minutes
  • Practice Programming Assignment30 minutes
  • Practice Programming Assignment30 minutes
  • Practice Programming Assignment30 minutes

By the end of this module, learners will acquire the essential skills to effectively transform raw and often messy data into a structured and suitable format for advanced analysis. They will master the techniques for handling missing values, identifying and dealing with outliers, encoding categorical variables, scaling and normalizing numerical features, and handling textual or unstructured data. Learners will also be proficient in detecting and addressing data inconsistencies, such as duplicates and errors. Learners will be able to treat data to make it suitable for further analysis. Upon completion of this module, Upon completion

What's included

25 videos2 readings3 assignments1 programming assignment3 ungraded labs

25 videosTotal 135 minutes
  • Introduction4 minutes
  • Expert Talk - Handling missing data7 minutes
  • What to do with missing values?5 minutes
  • Missing values in the POS data3 minutes
  • Missing values within a hierarchy8 minutes
  • Missing values within a hierarchy (contd.)6 minutes
  • What if parts of the hierarchy are also missing?3 minutes
  • Finishing up missing value treatment in the POS data5 minutes
  • Missing values - another simpler example9 minutes
  • Working with categoric features5 minutes
  • Transforming features - binning and discretization8 minutes
  • Transforming features - binning and discretization (contd.)6 minutes
  • Encoding categoric features - one-hot and label encoding9 minutes
  • Encoding features in the POS data6 minutes
  • Finishing up the encoding and saving the encoded data4 minutes
  • What is data normalization and why do we need it?5 minutes
  • Data normalization using min-max scaling6 minutes
  • Data normalization using z-score scaling4 minutes
  • Other types of data transformation5 minutes
  • Applying log transformation to the online data5 minutes
  • Finding outlying data6 minutes
  • Removing outliers by dropping them5 minutes
  • How to deal with outliers - imputation7 minutes
  • How to deal with outliers - capping4 minutes
  • Summary2 minutes
2 readingsTotal 40 minutes
  • Resources - Datasets and Jupyter notebooks10 minutes
  • Data pre-processing 30 minutes
3 assignmentsTotal 90 minutes
  • Missing values30 minutes
  • Dealing with categorical data30 minutes
  • Data normalization30 minutes
1 programming assignmentTotal 120 minutes
  • Graded Assignment120 minutes
3 ungraded labsTotal 90 minutes
  • Handling missing values30 minutes
  • Handling categorical features30 minutes
  • Data normalization & treating outliers30 minutes

By the end of this module, learners will develop a profound understanding of how to craft and enhance features to optimize the performance of machine learning models. They will be adept at identifying relevant variables, creating new features through techniques such as one-hot encoding, binning, and polynomial expansion, and extracting valuable information from existing data, like dates or text, using methods like feature extraction and text vectorization. Learners will also grasp the concept of feature scaling and normalization to ensure the consistency and comparability of feature ranges. With these skills, they will possess the ability to shape data effectively, amplifying its predictive power and contributing to the construction of robust, high-performing machine learning pipelines.

What's included

11 videos2 readings1 assignment1 programming assignment1 ungraded lab

11 videosTotal 53 minutes
  • Introduction1 minute
  • Reducing the dimensionality of data sets6 minutes
  • Exploring the features of the obesity data set7 minutes
  • What is Principal Component Analysis(PCA)?8 minutes
  • Applying PCA to the obesity data5 minutes
  • Creating a transformed version of the data through feature engineering9 minutes
  • Expert Talk - Gen AI in Python5 minutes
  • Introduction to Gen AI in Python for Data science4 minutes
  • Some quick data analysis using PandasAI4 minutes
  • Some quick data visualization using PandasAI3 minutes
  • Summary1 minute
2 readingsTotal 40 minutes
  • Complete guide to Feature Engineering30 minutes
  • Resources - Datasets and Jupyter notebooks10 minutes
1 assignmentTotal 30 minutes
  • Feature engineering & PCA30 minutes
1 programming assignmentTotal 120 minutes
  • Graded Assignment120 minutes
1 ungraded labTotal 30 minutes
  • Dimensionality reduction, PCA30 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

Instructor ratings
4.4 (28 ratings)
Fractal Analytics
23 Courses136,400 learners

Explore more from Data Analysis

Why people choose Coursera for their career

👁 Image

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
👁 Image

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
👁 Image

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
👁 Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

  • 5 stars

    64.42%

  • 4 stars

    14.76%

  • 3 stars

    2.68%

  • 2 stars

    2.01%

  • 1 star

    16.10%

Showing 3 of 148

KS
·

Reviewed on Feb 18, 2024

Good course. Need more in-depth details with case studies.

VY
·

Reviewed on Nov 14, 2023

All expert did a comprehending way of giving their knowledge for learning, a great work.

DK
·

Reviewed on Nov 28, 2023

Its a great course if you want to learn how to apply concepts in solving real business problems

Frequently asked questions

A practical, beginner‑friendly introduction to Python for data science focused on data wrangling, statistics, and visualization—skills employers value and use daily.

Beginners and professionals transitioning into data analysis or business analytics who want hands‑on, job‑ready skills.

Clean and analyze datasets with pandas, run statistical tests, build insightful visualizations, and prepare data for ML—then present findings that drive decisions.

Typically 4–6 weeks with 4–6 hours/week (pace‑yourself; includes labs, quizzes, and a capstone assessment).

No coding experience required. Basic comfort with math and spreadsheets is helpful.

Python fundamentals, pandas, data cleaning, EDA, visualization with matplotlib/seaborn, applied statistics, feature engineering, and a capstone project.

A blend of interactive mentoring dialogues, self‑assessments, practice quizzes, guided labs, and a real‑world capstone.

It’s skill‑first and project‑driven, emphasizing business problem solving, clear communication of insights, and ML‑readiness—not just coding in isolation.

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Financial aid available,