Data Structures in Python
Data Structures in Python
This course is part of Google Data Analysis with Python Specialization
4,134 already enrolled
Included with
Ask Coursera
14 reviews
14 reviews
What you'll learn
Use core NumPy and pandas data structures to store and organize data
Define Python tools such as libraries, packages, modules, and global variables
Describe the main features and methods of built-in Python data structures such as lists, tuples, dictionaries, and sets
Skills you'll gain
Details to know
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 5 modules in this course
In this course, you’ll explore data structures in Python, which are methods of storing and organizing data in a computer. You’ll focus on data structures that are among the most useful for data professionals: lists, tuples, dictionaries, sets, and arrays. You’ll also discover how to categorize data using data loading, cleaning, and binning. Lastly, you’ll learn about two of the most widely used and important Python tools for advanced data analysis: NumPy and pandas.
By the end of this course, you will be able to: • Explain how to manipulate dataframes using techniques such as selecting and indexing, boolean masking, grouping and aggregating, and merging and joining • Describe the main features and methods of core pandas data structures such as dataframes • Describe the main features and methods of core NumPy data structures such as arrays and series • Define Python tools such as libraries, packages, modules, and global variables • Describe the main features and methods of built-in Python data structures such as lists, tuples, dictionaries, and sets
In this module, you will explore data structures in Python, which are methods of storing and organizing data in a computer. You’ll focus on lists and tuples, data structures that are among the most useful for data professionals.
What's included
5 videos3 readings1 assignment3 ungraded labs
5 videos•Total 20 minutes
- Introduction to data structures in Python•1 minute
- Introduction to lists•5 minutes
- Modify the contents of a list•4 minutes
- Introduction to tuples•4 minutes
- More with loops, lists, and tuples•6 minutes
3 readings•Total 20 minutes
- Reference guide: Lists•8 minutes
- Compare lists, strings, and tuples•8 minutes
- zip(), enumerate(), and list comprehension•4 minutes
1 assignment•Total 8 minutes
- Test your knowledge: Lists and tuples•8 minutes
3 ungraded labs•Total 50 minutes
- Annotated follow-along guide: Data structures in Python•20 minutes
- Activity: Lists & tuples •20 minutes
- Exemplar: Lists & tuples •10 minutes
In this module, you will focus on dictionaries and sets, some more data structures that are among the most useful for data professionals.
What's included
3 videos2 readings1 assignment2 ungraded labs
3 videos•Total 15 minutes
- Introduction to dictionaries•5 minutes
- Dictionary methods•5 minutes
- Introduction to sets•6 minutes
2 readings•Total 8 minutes
- Reference guide: Dictionaries•4 minutes
- Reference guide: Sets•4 minutes
1 assignment•Total 6 minutes
- Test your knowledge: Dictionaries and sets•6 minutes
2 ungraded labs•Total 30 minutes
- Activity: Dictionaries & sets•20 minutes
- Exemplar: Dictionaries & sets•10 minutes
In this module, you will focus on arrays. You’ll learn about one of the most widely used and important Python tools for advanced data analysis: NumPy.
What's included
3 videos3 readings1 assignment2 ungraded labs
3 videos•Total 15 minutes
- The power of packages•4 minutes
- Introduction to NumPy•4 minutes
- Basic array operations•6 minutes
3 readings•Total 12 minutes
- Understand Python libraries, packages, and modules•4 minutes
- Python’s new versions and features•4 minutes
- Reference guide: Arrays•4 minutes
1 assignment•Total 6 minutes
- Test your knowledge: Arrays and vectors with NumPy•6 minutes
2 ungraded labs•Total 30 minutes
- Activity: Arrays and vectors with NumPy•20 minutes
- Exemplar: Arrays and vectors with NumPy•10 minutes
In this module, you will learn about one of the most widely used and important Python tools for advanced data analysis: pandas. You’ll also discover how to categorize data using data loading, cleaning, and binning.
What's included
5 videos3 readings1 assignment2 ungraded labs
5 videos•Total 35 minutes
- Introduction to pandas•5 minutes
- pandas basics•10 minutes
- Boolean masking•6 minutes
- Grouping and aggregation•6 minutes
- Merging and joining data•9 minutes
3 readings•Total 12 minutes
- The fundamentals of pandas•4 minutes
- Boolean masking in pandas •4 minutes
- More on grouping and aggregation•4 minutes
1 assignment•Total 8 minutes
- Test your knowledge: Dataframes with pandas•8 minutes
2 ungraded labs•Total 30 minutes
- Activity: Dataframes with pandas•20 minutes
- Exemplar: Dataframes with pandas•10 minutes
Review everything you’ve learned and take the final assessment.
What's included
1 reading1 assignment
1 reading•Total 5 minutes
- Wrap-up•5 minutes
1 assignment•Total 55 minutes
- Course 4 challenge: Data structures in Python•55 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Instructor
Explore more from Data Analysis
- Status: Free TrialU
University of Michigan
Course
- Status: Free TrialU
University of Colorado Boulder
Course
- C
Coursera
Guided Project
Why people choose Coursera for their career
Advance your career with an online degree
Earn a degree from world-class universities - 100% online
Frequently asked questions
Organizations of all types and sizes have business processes that generate massive volumes of data. Every moment, all sorts of information gets created by computers, the internet, phones, texts, streaming video, photographs, sensors, and much more. In the global digital landscape, data is increasingly imprecise, chaotic, and unstructured. As the speed and variety of data increases exponentially, organizations are struggling to keep pace.
Data science is part of a field of study that uses raw data to create new ways of modeling and understanding the unknown. To gain insights, businesses rely on data professionals to acquire, organize, and interpret data, which helps inform internal projects and processes. Data scientists rely on a combination of critical skills, including statistics, scientific methods, data analysis, and artificial intelligence.
A data professional is a term used to describe any individual who works with data and/or has data skills. At a minimum, a data professional is capable of exploring, cleaning, selecting, analyzing, and visualizing data. They may also be comfortable with writing code and have some familiarity with the techniques used by statisticians and machine learning engineers, including building models, developing algorithmic thinking, and building machine learning models.
Data professionals are responsible for collecting, analyzing, and interpreting large amounts of data within a variety of different organizations. The role of a data professional is defined differently across companies. Generally speaking, data professionals possess technical and strategic capabilities that require more advanced analytical skills such as data manipulation, experimental design, predictive modeling, and machine learning. They perform a variety of tasks related to gathering, structuring, interpreting, monitoring, and reporting data in accessible formats, enabling stakeholders to understand and use data effectively. Ultimately, the work of data professionals helps organizations make informed, ethical decisions.
Large volumes of data — and the technology needed to manage and analyze it — are becoming increasingly accessible. Because of this, there has been a surge in career opportunities for people who can tell stories using data, such as senior data analysts and data scientists. These professionals collect, analyze, and interpret large amounts of data within a variety of different organizations. Their responsibilities require advanced analytical skills such as data manipulation, experimental design, predictive modeling, and machine learning.
More questions
Financial aid available,
