Python and Pandas for Data Engineering
Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
Python and Pandas for Data Engineering
This course is part of Python, Bash and SQL Essentials for Data Engineering Specialization
32,069 already enrolled
Included with
Learn more
Ask Coursera
277 reviews
Recommended experience
277 reviews
Recommended experience
What you'll learn
Setup a provisioned Python project environment
Use Pandas libraries to read and write data into data structures and files
Employ Vim and Visual Studio Code to write Python code
Skills you'll gain
Tools you'll learn
Details to know
25 assignments
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 4 modules in this course
In this first course of the Python, Bash and SQL Essentials for Data Engineering Specialization, you will learn how to set up a version-controlled Python working environment which can utilize third party libraries. You will learn to use Python and the powerful Pandas library for data analysis and manipulation. Additionally, you will also be introduced to Vim and Visual Studio Code, two popular tools for writing software. This course is valuable for beginning and intermediate students in order to begin transforming and manipulating data as a data engineer.
In this module, you will learn how to set up an isolated Python environment with third party libraries and apply it by setting up a virtual environment including Pandas and Jupyter.
What's included
15 videos13 readings6 assignments1 discussion prompt6 ungraded labs
15 videosβ’Total 72 minutes
- Overview of Python, Bash and SQL Essentials for Data Engineeringβ’7 minutes
- Meet your Course Instructor: Kennedy Behrmanβ’1 minute
- Overview of Key Conceptsβ’6 minutes
- Introduction to Setting Up Your Python Environmentβ’0 minutes
- Installing Packages with pip in Pythonβ’6 minutes
- Saving Requirements File in Pythonβ’3 minutes
- Creating and Using a Python Virtual Environmentβ’6 minutes
- Expression Statements in Pythonβ’3 minutes
- Assignment Statements in Pythonβ’5 minutes
- Import Statements in Pythonβ’4 minutes
- Other Simple Statements in Pythonβ’5 minutes
- Compound Statements in Pythonβ’5 minutes
- If Statements in Pythonβ’7 minutes
- While Loops in Pythonβ’5 minutes
- Functions in Pythonβ’8 minutes
13 readingsβ’Total 130 minutes
- Key Termsβ’10 minutes
- Report a problem with the courseβ’10 minutes
- Key Termsβ’10 minutes
- Meet your Supporting Instructors: Alfredo Deza and Noah Giftβ’10 minutes
- Course Structure and Discussion Etiquetteβ’10 minutes
- Getting Started and Best Practicesβ’10 minutes
- Key Termsβ’10 minutes
- Lesson Reflectionβ’10 minutes
- Key Termsβ’10 minutes
- Lesson Reflectionβ’10 minutes
- Key Termsβ’10 minutes
- Evaluating to True or Falseβ’10 minutes
- Lesson Reflectionβ’10 minutes
6 assignmentsβ’Total 330 minutes
- Python Statementsβ’30 minutes
- Quiz-Setting Up Your Python Environmentβ’180 minutes
- Assignment Statementsβ’30 minutes
- Import Statementsβ’30 minutes
- If Statementsβ’30 minutes
- While Loopsβ’30 minutes
1 discussion promptβ’Total 10 minutes
- Meet and Greet (optional)β’10 minutes
6 ungraded labsβ’Total 360 minutes
- Install a Package with the pip Commandβ’60 minutes
- Export a Requirements Fileβ’60 minutes
- Create a Virtual Environmentβ’60 minutes
- Practicing with Expression Statementsβ’60 minutes
- Decorator Functionsβ’60 minutes
- Setting up a Python Environmentβ’60 minutes
In this module, you will learn how to create and use Python Sequences, Dictionaries, Sets, List Comprehensions, and Generators. Additionally, you will learn how to apply these by manipulating client data in a Jupyter notebook.
What's included
12 videos6 readings8 assignments5 ungraded labs
12 videosβ’Total 65 minutes
- Introduction to Python Essentialsβ’1 minute
- Sequences in Pythonβ’8 minutes
- Lists and Tuples in Pythonβ’6 minutes
- Strings in Pythonβ’11 minutes
- Creating Range Objects in Pythonβ’3 minutes
- Creating Dictionaries in Pythonβ’5 minutes
- Accessing Dictionary Data in Pythonβ’4 minutes
- Dictionary Views in Pythonβ’3 minutes
- Sets and Set Operations in Pythonβ’7 minutes
- List Comprehensions in Pythonβ’7 minutes
- Generator Expressions in Pythonβ’4 minutes
- Generator Functions in Pythonβ’7 minutes
6 readingsβ’Total 60 minutes
- Key Termsβ’10 minutes
- Lesson Reflectionβ’10 minutes
- Key Termsβ’10 minutes
- Lesson Reflectionβ’10 minutes
- Key Termsβ’10 minutes
- Lesson Reflectionβ’10 minutes
8 assignmentsβ’Total 240 minutes
- Essential Python Conceptsβ’30 minutes
- Sequence Operationsβ’30 minutes
- Lists and Tuplesβ’30 minutes
- Range Objectsβ’30 minutes
- Accessing Data in Dictionariesβ’30 minutes
- Sets and Set Operationsβ’30 minutes
- List Comprehensionsβ’30 minutes
- Generator Expressionsβ’30 minutes
5 ungraded labsβ’Total 300 minutes
- Practicing with Strings in Pythonβ’60 minutes
- Creating Dictionaries in Pythonβ’60 minutes
- Dictionary Views in Pythonβ’60 minutes
- Comprehensions and Generators in Pythonβ’60 minutes
- Practicing Essential Pythonβ’60 minutes
In this module, you will learn how to load data into a Pandas DataFrame and write statements to select columns and rows from a DataFrame. Additionally, you will apply comparison and boolean operators as a method of selecting data.
What's included
10 videos7 readings4 assignments8 ungraded labs
10 videosβ’Total 63 minutes
- Introduction to Data in Python: Pandas and Alternativesβ’1 minute
- Creating Pandas DataFrames in Pythonβ’4 minutes
- Investigating Data in a Pandas DataFrameβ’7 minutes
- Selecting Data in a Pandas DataFrameβ’7 minutes
- Manipulating Pandas DataFramesβ’5 minutes
- Updating Pandas DataFrame Dataβ’5 minutes
- Applying Functions in a Pandas DataFrameβ’6 minutes
- Creating NumPy Arrays in Pythonβ’16 minutes
- Spark and PySpark DataFrames in Pythonβ’6 minutes
- Creating Dask DataFrames in Pythonβ’7 minutes
7 readingsβ’Total 70 minutes
- Key Termsβ’10 minutes
- Lesson Reflectionβ’10 minutes
- Key Termsβ’10 minutes
- Lesson Reflectionβ’10 minutes
- Key Termsβ’10 minutes
- Polarsβ’10 minutes
- Lesson Reflectionβ’10 minutes
4 assignmentsβ’Total 120 minutes
- Pandas and Alternativesβ’30 minutes
- NumPyβ’30 minutes
- PySparkβ’30 minutes
- Daskβ’30 minutes
8 ungraded labsβ’Total 480 minutes
- Creating DataFramesβ’60 minutes
- Looking at Data in DataFramesβ’60 minutes
- Selecting Data in a Pandas DataFrameβ’60 minutes
- Manipulating DataFramesβ’60 minutes
- Updating Data in a DataFrameβ’60 minutes
- Applying Functions in a Pandas DataFrameβ’60 minutes
- Manipulate DataFrames with Polars to gain insightsβ’60 minutes
- Pandas and Alternativesβ’60 minutes
This week, you will learn the basics of some popular development environments and apply it by writing code in Vim and Visual Studio Code. Additionally, you will learn how to check your code into a Git repository.
What's included
12 videos8 readings7 assignments8 ungraded labs
12 videosβ’Total 48 minutes
- Introduction to Python Development Environmentsβ’0 minutes
- Introduction to Vim Normal Modeβ’7 minutes
- Switching from Normal to Insert and Visual Modes in Vimβ’5 minutes
- Working with the Vim Command Lineβ’7 minutes
- Vim Configurationβ’4 minutes
- Introduction to Visual Studio Codeβ’2 minutes
- Setting Up Visual Studio Codeβ’3 minutes
- Debugging Visual Studio Codeβ’3 minutes
- What is Version Control?β’3 minutes
- Introduction to Git and Git Conceptsβ’8 minutes
- Version Control with GitHubβ’6 minutes
- Summary of Python and Pandas for Data Engineeringβ’0 minutes
8 readingsβ’Total 80 minutes
- Key Termsβ’10 minutes
- Lesson Reflectionβ’10 minutes
- Key Termsβ’10 minutes
- Lesson Reflectionβ’10 minutes
- Key Termsβ’10 minutes
- Lesson Reflectionβ’10 minutes
- Next Stepsβ’10 minutes
- Share your learning experienceβ’10 minutes
7 assignmentsβ’Total 225 minutes
- Cumulative Python and Pandas for Data Engineering Quizβ’45 minutes
- Insert and Visual Modesβ’30 minutes
- Vim Command Line Modeβ’30 minutes
- Features of Visual Studio Codeβ’30 minutes
- Version Controlβ’30 minutes
- Git Commandsβ’30 minutes
- Hosted Gitβ’30 minutes
8 ungraded labsβ’Total 480 minutes
- Basic Vim Commandsβ’60 minutes
- Explore Visual Studio Codeβ’60 minutes
- Visual Studio Code Debuggerβ’60 minutes
- Setup and Provision a Python Projectβ’60 minutes
- Pandas Final Challenge: Life Expectancy and Happinessβ’60 minutes
- Final Jupyter Sandboxβ’60 minutes
- Final VS Code Sandboxβ’60 minutes
- Final Sandbox Linux Desktopβ’60 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructors
Offered by
Explore more from Data Analysis
- Status: Free TrialD
Duke University
Course
- Status: Free TrialD
Duke University
Specialization
- Status: Free TrialD
Duke University
Specialization
- Status: Free TrialD
Duke University
Course
Why people choose Coursera for their career
Learner reviews
- 5 stars
74.72%
- 4 stars
17.68%
- 3 stars
3.61%
- 2 stars
1.80%
- 1 star
2.16%
Showing 3 of 277
Reviewed on Mar 9, 2023
The lessons were laid out well and were easy to follow. I enjoyed how the labs reinforced the material that was covered.
Reviewed on Jun 29, 2022
Good for quick basics of working with bash, github, python, virtual environments and such
Reviewed on Jul 4, 2022
It's all overview tool that need to use for work in data engineer
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Yes. In select learning programs, you can apply for financial aid or a scholarship if you canβt afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, youβll find a link to apply on the description page.
More questions
Financial aid available,
