Introduction to Data Science in Python
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Introduction to Data Science in Python
This course is part of Applied Data Science with Python Specialization
Instructor: Christopher Brooks
861,586 already enrolled
Included with
Learn more
Ask Coursera
27,289 reviews
27,289 reviews
What you'll learn
Understand techniques such as lambdas and manipulating csv files
Describe common Python functionality and features used for data science
Query DataFrame structures for cleaning and processing
Explain distributions, sampling, and t-tests
Skills you'll gain
- Data Wrangling
- Data Analysis
- Data Transformation
- Data Manipulation
- Data-Driven Decision-Making
- Statistical Methods
- Programming Principles
- Data Science
- Data Import/Export
- Data Preprocessing
- Statistical Analysis
- Text Mining
- Pivot Tables And Charts
- Data Cleansing
- Scripting Languages
- Probability & Statistics
- Data Processing
Tools you'll learn
Details to know
9 assignments
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 4 modules in this course
This course will introduce the learner to the basics of the python programming environment, including fundamental python programming techniques such as lambdas, reading and manipulating csv files, and the numpy library. The course will introduce data manipulation and cleaning techniques using the popular python pandas data science library and introduce the abstraction of the Series and DataFrame as the central data structures for data analysis, along with tutorials on how to use functions such as groupby, merge, and pivot tables effectively. By the end of this course, students will be able to take tabular data, clean it, manipulate it, and run basic inferential statistical analyses.
This course should be taken before any of the other Applied Data Science with Python courses: Applied Plotting, Charting & Data Representation in Python, Applied Machine Learning in Python, Applied Text Mining in Python, Applied Social Network Analysis in Python.
In this module, you'll get an introduction to the field of data science, review common Python functionality and features that data scientists use, and be introduced to the Coursera Jupyter Notebook for the lectures. All of the course information on grading, prerequisites, and expectations are on the course syllabus, and you can find more information about the Jupyter Notebooks on our Course Resources page.
What's included
20 videos4 readings3 assignments1 programming assignment2 ungraded labs1 plugin
20 videosβ’Total 112 minutes
- Introduction to Specializationβ’4 minutes
- Introduction to the Courseβ’5 minutes
- The Coursera Jupyter Notebook Systemβ’8 minutes
- Python Functionsβ’8 minutes
- Python Types and Sequencesβ’9 minutes
- Python More on Stringsβ’3 minutes
- Python Demonstration: Reading and Writing CSV filesβ’4 minutes
- Python Dates and Timesβ’2 minutes
- Advanced Python Objects, map()β’6 minutes
- Advanced Python Lambda and List Comprehensionsβ’3 minutes
- Creating Arrayβ’4 minutes
- Manipulating Arrayβ’6 minutes
- From Arrays to Images and Backβ’6 minutes
- Indexing, Slicing, and Iteratingβ’6 minutes
- Trying NumPy with Datasetsβ’10 minutes
- Regex Matching and Anchorsβ’6 minutes
- Patterns and Character Classesβ’3 minutes
- Quantifiersβ’7 minutes
- Groupsβ’5 minutes
- Advanced Assertions and Applicationsβ’7 minutes
4 readingsβ’Total 200 minutes
- Syllabusβ’10 minutes
- Pre-Course Surveyβ’10 minutes
- Regular Expression Operations Documentationβ’60 minutes
- Module 1 Textbook Reading Assignmentβ’120 minutes
3 assignmentsβ’Total 50 minutes
- Practice Quiz: Python Programmingβ’10 minutes
- Practice Quiz: Numerical Python Library (NumPy)β’10 minutes
- Quiz 1β’30 minutes
1 programming assignmentβ’Total 180 minutes
- Assignment 1β’180 minutes
2 ungraded labsβ’Total 30 minutes
- Your Personal Jupyter Notebook Workspaceβ’15 minutes
- Module 1 Jupyter Notebooksβ’15 minutes
1 pluginβ’Total 60 minutes
- Regex Practice Sessionβ’60 minutes
In this module of the course, you'll learn the fundamentals of one of the most important toolkits Python has for data cleaning and processing -- pandas. You'll learn how to read in data into DataFrame structures, how to query these structures, and the details about such structures are indexed.
What's included
13 videos1 reading3 assignments1 programming assignment1 ungraded lab
13 videosβ’Total 89 minutes
- Introduction to Pandasβ’3 minutes
- The Series Data Structureβ’6 minutes
- Creating Pandas' Seriesβ’4 minutes
- Querying a Seriesβ’4 minutes
- Vectorized Operationsβ’8 minutes
- Appending Seriesβ’4 minutes
- DataFrame Data Structureβ’13 minutes
- DataFrame Indexing and Loadingβ’9 minutes
- Querying a DataFrameβ’10 minutes
- Indexing Dataframesβ’8 minutes
- Missing Values, Part 1β’4 minutes
- Missing Values, Part 2β’7 minutes
- Example: Manipulating DataFrameβ’9 minutes
1 readingβ’Total 60 minutes
- Module 2 Textbook Reading Assignmentβ’60 minutes
3 assignmentsβ’Total 50 minutes
- Practice Quiz: Pandas and Series Dataβ’10 minutes
- Practice Quiz: DataFrameβ’10 minutes
- Quiz 2 β’30 minutes
1 programming assignmentβ’Total 180 minutes
- Assignment 2β’180 minutes
1 ungraded labβ’Total 15 minutes
- Module 2 Jupyter Notebooksβ’15 minutes
In this module, you'll deepen your understanding of the python pandas library by learning how to merge DataFrames, generate summary tables, group data into logical pieces, and manipulate dates. We'll also refresh your understanding of scales of data, and discuss issues with creating metrics for analysis. The week ends with a more significant programming assignment.
What's included
12 videos1 reading2 assignments1 programming assignment1 ungraded lab
12 videosβ’Total 84 minutes
- Merging Dataframesβ’7 minutes
- Handling Conflicts between Dataframesβ’4 minutes
- Concatenating DataFramesβ’5 minutes
- Pandas Idioms, Part 1β’7 minutes
- Pandas Idioms, Part 2β’9 minutes
- Group by, Part 1β’9 minutes
- Group by, Part 2β’5 minutes
- Group by, Part 3β’7 minutes
- Scalesβ’10 minutes
- Pivot Tableβ’10 minutes
- Date/Time Functionalityβ’8 minutes
- Working with Dates in a Dataframeβ’5 minutes
1 readingβ’Total 120 minutes
- Module 3 Textbook Reading Assignmentβ’120 minutes
2 assignmentsβ’Total 40 minutes
- Practice Quiz: More Data Processing with Pandasβ’10 minutes
- Quiz 3 β’30 minutes
1 programming assignmentβ’Total 180 minutes
- Assignment 3β’180 minutes
1 ungraded labβ’Total 15 minutes
- Module 3 Jupyter Notebooksβ’15 minutes
In this final module of the course, you'll be introduced to a variety of statistical techniques such a distributions, sampling and t-tests. The week ends with two discussions of science and the rise of the fourth paradigm -- data driven discovery.
What's included
2 videos6 readings1 assignment1 programming assignment1 ungraded lab
2 videosβ’Total 20 minutes
- Basic Statistical Testingβ’14 minutes
- Other Forms of Structured Dataβ’6 minutes
6 readingsβ’Total 115 minutes
- Science Isn't Broken: p-hackingβ’45 minutes
- Goodhart's Lawβ’30 minutes
- The 5 Graph Algorithms that you should knowβ’10 minutes
- Post-Course Surveyβ’10 minutes
- Keep Learning with Michigan Online!β’10 minutes
- Progress Note from Christopher Brooksβ’10 minutes
1 assignmentβ’Total 30 minutes
- Final Quiz β’30 minutes
1 programming assignmentβ’Total 180 minutes
- Assignment 4β’180 minutes
1 ungraded labβ’Total 15 minutes
- Module 4 Jupyter Notebooksβ’15 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor
Offered by
Explore more from Data Analysis
- Status: Free TrialG
Google
Course
- Status: Free Trial
Course
- Status: Free Trial
Course
Why people choose Coursera for their career
Learner reviews
- 5 stars
66.35%
- 4 stars
24.23%
- 3 stars
5.36%
- 2 stars
1.92%
- 1 star
2.12%
Showing 3 of 27289
Reviewed on Jul 26, 2020
Quizzes were very challenging and interesting. I learned alot. The main drawback in this course is that the materials are insufficient to answer the assignments.And some questions were not so clear.
Reviewed on Aug 24, 2017
The course is good but the oral explanations are at times very tiresome. A more constructive approach in which the explanations are followed by step-by-step examples whould be far better.Best regards
Reviewed on Jul 17, 2022
An excellent course offered by the university of michigan which provides the basic knowledge required for starting career in data science and the concepts explianing by the proffesors were profound.
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Yes. In select learning programs, you can apply for financial aid or a scholarship if you canβt afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, youβll find a link to apply on the description page.
More questions
Financial aid available,
