Data Analysis and Visualization with Python
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Data Analysis and Visualization with Python
This course is part of multiple programs.
Instructor: Microsoft
17,930 already enrolled
Included with
Ask Coursera
114 reviews
Recommended experience
114 reviews
Recommended experience
Skills you'll gain
Details to know
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 5 modules in this course
Description: This course delves into the world of data analysis with Python. You'll learn how to use libraries like pandas and Matplotlib to manipulate, analyze, and visualize data, extracting valuable insights and communicating findings effectively.
Benefits: Become proficient in data analysis techniques, enabling you to extract meaningful insights from data and present them in compelling visualizations. By the end of this course, you'll be able to: • Perform data cleaning, transformation, and manipulation using pandas. • Create various types of visualizations using Matplotlib. • Understand the fundamentals of generative AI and its applications in data analysis. • Implement basic machine learning models for data analysis. Tools/Software: Python, Jupyter Notebook, pandas, Matplotlib, Scikit-learn This course is for entry-Level professionals looking to build a foundational understanding and experience with Python, while seeking employment as a Python developer. No prior work experience or degree is required.
This module provides a foundational understanding of data analysis and its role in various industries. Learners will explore the data analysis process, key concepts, and ethical considerations. They will also be introduced to essential Python libraries and tools like Jupyter Notebook, equipping them with the necessary skills to begin their data analysis journey. By the end of this module, learners will be able to define data analysis, differentiate it from data science, explain the data analysis process, identify key data analysis concepts, and set up their data analysis toolkit.
What's included
10 videos7 readings5 assignments
10 videos•Total 49 minutes
- What is data analysis?•6 minutes
- The data analysis process•3 minutes
- Data ethics and privacy: Navigating the responsible use of data•6 minutes
- Data governance•6 minutes
- Setting up your environment for data analysis•2 minutes
- Jupyter notebook tips and tricks•6 minutes
- Demo: Jupyter notebook shortcuts and productivity tips•6 minutes
- Use cases for Python libraries•6 minutes
- Understanding datasets•2 minutes
- Finding and accessing real-world datasets•5 minutes
7 readings•Total 70 minutes
- Data analysis and visualization with Python syllabus•10 minutes
- Foundations of data analysis•10 minutes
- The difference between data analysis and data science•10 minutes
- Key concepts in data analysis•10 minutes
- Essential Python libraries for data analysis•10 minutes
- Common dataset types and sources•10 minutes
- Data cleaning 101•10 minutes
5 assignments•Total 90 minutes
- Unveiling data analysis•15 minutes
- Activity: A simple analysis in Jupyter Notebook•15 minutes
- Setting up your data analysis toolkit•15 minutes
- Diving into datasets•15 minutes
- Introduction to data analysis•30 minutes
This module focuses on equipping learners with practical data processing and manipulation skills. Learners will be introduced to pandas, a powerful Python library, as a core tool for data manipulation. Learners will become proficient in using pandas dataFrames, mastering essential operations such as indexing, slicing, and filtering data. They will gain a thorough understanding of various indexing techniques (loc, iloc, boolean indexing) and their appropriate applications. The module emphasizes the importance of data cleaning for accurate analysis and guides learners through various techniques to identify and handle missing values and outliers. It also covers different data types in Python, enabling learners to make informed choices for their analysis. Learners will practice loading, inspecting, and transforming datasets using pandas functions, applying these skills to real-world scenarios. By the end of this module, learners will confidently leverage pandas to clean, transform, and prepare data for subsequent analysis and visualization, ensuring data integrity and reliability in their data analysis projects.
What's included
13 videos5 readings5 assignments
13 videos•Total 60 minutes
- Manipulating data with pandas•2 minutes
- pandas Dataframes: The basics•6 minutes
- Demo: Loading and inspecting datasets in pandas•5 minutes
- Exploring data transformations through pandas•5 minutes
- Demo: Transforming data with pandas•5 minutes
- Exploratory data analysis (EDA)•2 minutes
- The importance of data cleaning•5 minutes
- Identifying and handling missing data•5 minutes
- Handling duplicate values in datasets•6 minutes
- Detecting and removing outliers from datasets•6 minutes
- Data types in Python: Choosing the right fit•2 minutes
- Demo: pandas for exploration and cleaning•5 minutes
- Taming messy data with pandas•5 minutes
5 readings•Total 50 minutes
- pandas indexing explained•10 minutes
- pandas cheat sheet•10 minutes
- Essential tactics for data manipulation•10 minutes
- Common causes of missing data•10 minutes
- pandas for essential analysis tasks•10 minutes
5 assignments•Total 95 minutes
- Activity: Loading and inspecting datasets in pandas•20 minutes
- pandas: Your data manipulation powerhouse•15 minutes
- The hero of data analysis: Data cleaning•15 minutes
- Using pandas for cleaning and exploration•15 minutes
- Data processing and manipulation•30 minutes
This module focuses on the essential skill of data visualization. Learners examine a variety of visualization types, such as line charts, bar charts, and scatter plots, learning how to choose the most effective ones for different data and analysis goals. The module provides a comparison of popular visualization libraries, including Matplotlib, Plotly, and Bokeh, highlighting the unique strengths of each to help learners select the right tool. Learners gain practical experience creating visualizations with Matplotlib, mastering the basics of plot customization for clear and informative communication. The module also introduces advanced techniques with Plotly and Bokeh, enabling learners to design interactive and highly customized visualizations. It emphasizes the importance of communicating data insights effectively, teaching learners how to construct narratives with data. Learners are introduced to best practices for data visualization design, ensuring their visuals are clear, informative, and engaging. By the end of this module, learners will be able to transform data into impactful visuals that support effective communication and informed decision-making.
What's included
10 videos8 readings5 assignments
10 videos•Total 49 minutes
- Charting your data visually•2 minutes
- Common visualizations tools•6 minutes
- Introduction to Matplotlib•7 minutes
- Explore visualization libraries•3 minutes
- Interactive plots with Plotly•5 minutes
- Customizing visualizations with Bokeh•5 minutes
- Use data for storytelling•3 minutes
- The art of data storytelling•6 minutes
- Presenting data insights•6 minutes
- Avoid conclusion bias in data analysis•6 minutes
8 readings•Total 80 minutes
- What is data visualization?•10 minutes
- Anatomy of a Matplotlib Plot•10 minutes
- Matplotlib gallery•10 minutes
- Choosing the right visualization library•10 minutes
- Plotly interactive dashboards•10 minutes
- Application of data storytelling across industries•10 minutes
- Data visualization best practices•10 minutes
- Cognitive load theory and data visualization•10 minutes
5 assignments•Total 95 minutes
- Introduction to visualization•15 minutes
- Creating visualizations•15 minutes
- Interpreting and presenting data insights•15 minutes
- Activity: Visualizing trends•20 minutes
- Data visualization•30 minutes
This module provides learners with a foundational understanding of generative AI, its applications, and ethical implications, along with practical techniques for leveraging it in data analysis and visualization. Learners will explore the core concepts of generative AI, including transformer models, large language models (LLMs), and natural language processing (NLP). They will delve into the distinctions between generative AI and other AI types, examining real-world applications across various sectors. The module also emphasizes the ethical considerations surrounding generative AI, covering topics such as ownership, authenticity, and responsible use of AI-generated content. Additionally, learners will gain hands-on experience with techniques for generating synthetic data using generative adversarial networks (GANs) and other models, and explore data augmentation methods for enhancing the size and diversity of datasets, ultimately improving the performance of machine learning models.
What's included
8 videos6 readings4 assignments
8 videos•Total 38 minutes
- What is generative AI?•6 minutes
- Real-world applications of generative AI•7 minutes
- The ethics of AI-Generated content•2 minutes
- Filling the gaps in your data•2 minutes
- Using Generative Adversarial Networks (GANs)•6 minutes
- Data augmentation: Supercharging your dataset•2 minutes
- Why augmentation matters•5 minutes
- Text augmentation techniques•7 minutes
6 readings•Total 60 minutes
- Generative AI vs. other AI•10 minutes
- Ethical guidelines for generative AI•10 minutes
- Introduction to synthetic data•10 minutes
- Synthetic data generation techniques•10 minutes
- Image augmentation techniques•10 minutes
- Best practices for data augmentation•10 minutes
4 assignments•Total 75 minutes
- Basics of generative AI•15 minutes
- Generating synthetic data with GenAI•15 minutes
- Data augmentation•15 minutes
- Introduction to generative AI•30 minutes
This module provides a foundational understanding of machine learning, its applications, and how to build basic models. Learners will explore core concepts like supervised and unsupervised learning, delve into model evaluation techniques using metrics like precision, recall, and F1-score, and gain hands-on experience building linear and logistic regression models with Scikit-learn. Additionally, the module covers the use of synthetic data in machine learning, including ethical considerations and practical applications.
What's included
14 videos9 readings6 assignments2 programming assignments
14 videos•Total 71 minutes
- What is machine learning?•6 minutes
- How machine learning works•2 minutes
- Machine learning in the real world•6 minutes
- Why model evaluation matters•6 minutes
- Regression metrics•6 minutes
- Demo: Using metrics for classification•5 minutes
- Regression metrics for machine learning•6 minutes
- From data to predictions: The magic of machine learning•2 minutes
- What is a neural network in machine learning?•6 minutes
- Demo: Linear regression with Scikit-Learn•5 minutes
- Classification with logistic regression•6 minutes
- Synthetic data in ML: Case studies•6 minutes
- Demo: Training and testing with synthetic data•6 minutes
- Synthetic data: Balancing innovation and responsibility•2 minutes
9 readings•Total 90 minutes
- Key terminology in machine learning•10 minutes
- Unraveling the confusion matrix•10 minutes
- Best practices for analyzing and presenting data sets•10 minutes
- Beyond the numbers: Interpreting evaluation metrics in context•10 minutes
- Machine learning basics•10 minutes
- Scikit-Learn documentation•10 minutes
- Your first machine learning model: A guide•10 minutes
- Ethical considerations of synthetic data•10 minutes
- Data and visualization with Python: Pulling it all together•10 minutes
6 assignments•Total 135 minutes
- Machine learning 101•15 minutes
- Evaluating model performance•15 minutes
- Building your first machine learning models•15 minutes
- Leveraging synthetic data in machine learning•30 minutes
- Activity: Synthetic data generation•30 minutes
- Introduction to machine learning•30 minutes
2 programming assignments•Total 100 minutes
- Activity: Your first machine learning model implementation•50 minutes
- Activity: Analyzing and predicting customer churn•50 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor
Explore more from Design and Product
- Status: Free Trial
Course
- Status: Free TrialA
Arizona State University
Course
- Status: Free TrialG
Google
Course
Why people choose Coursera for their career
Learner reviews
- 5 stars
62.93%
- 4 stars
23.27%
- 3 stars
6.03%
- 2 stars
2.58%
- 1 star
5.17%
Showing 3 of 114
Reviewed on Sep 24, 2025
This course was challenging. The content didn't flow or connect for me as the previous courses. Maybe dealing with life events, I lost some of my focus.
Reviewed on May 14, 2026
Was a great course, a lot of information and knowledge, but with some imbalance related to practice.
Reviewed on Nov 18, 2025
4.5 stars. it gets better and hands on towards the end
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
More questions
Financial aid available,
