VOOZH about

URL: https://www.coursera.org/learn/open-source-tools-for-data-science

⇱ Tools for Data Science | Coursera


Tools for Data Science

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

Tools for Data Science

This course is part of multiple programs.

598,906 already enrolled

Included with

β€’

Learn more

Ask Coursera

Gain insight into a topic and learn the fundamentals.
4.5

30,352 reviews

Beginner level

Recommended experience

Flexible schedule
2 weeks at 10 hours a week
Learn at your own pace
89%
Most learners liked this course

Gain insight into a topic and learn the fundamentals.
4.5

30,352 reviews

Beginner level

Recommended experience

Flexible schedule
2 weeks at 10 hours a week
Learn at your own pace
89%
Most learners liked this course

What you'll learn

  • Describe the Data Scientist’s tool kit which includes: Libraries & Packages, Data sets, Machine learning models, and Big Data tools 

  • Utilize languages commonly used by data scientists like Python, R, and SQL 

  • Demonstrate working knowledge of tools such as Jupyter notebooks and RStudio and utilize their various features  

  • Create and manage source code for data science using Git repositories and GitHub. 

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

13 assignments

Taught in English

Build your subject-matter expertise

This course is available as part of
When you enroll in this course, you'll also be asked to select a specific program.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 6 modules in this course

In order to be successful in Data Science, you need to be skilled with using tools that Data Science professionals employ as part of their jobs. This course teaches you about the popular tools in Data Science and how to use them.

You will become familiar with the Data Scientist’s tool kit which includes: Libraries & Packages, Data Sets, Machine Learning Models, Kernels, as well as the various Open source, commercial, Big Data and Cloud-based tools. Work with Jupyter Notebooks, JupyterLab, RStudio IDE, Git, GitHub, and Watson Studio. You will understand what each tool is used for, what programming languages they can execute, their features and limitations. This course gives plenty of hands-on experience in order to develop skills for working with these Data Science Tools. With the tools hosted in the cloud on Skills Network Labs, you will be able to test each tool and follow instructions to run simple code in Python, R, or Scala. Towards the end the course, you will create a final project with a Jupyter Notebook. You will demonstrate your proficiency preparing a notebook, writing Markdown, and sharing your work with your peers.

In this module, you will learn about the different types and categories of tools that data scientists use and popular examples of each. You will also become familiar with Open Source, Cloud-based, and Commercial options for data science tools.

What's included

6 videos4 readings3 assignments1 plugin

6 videosβ€’Total 39 minutes
  • Course Introductionβ€’3 minutes
  • Categories of Data Science Toolsβ€’8 minutes
  • Open Source Tools for Data Science - Part 1β€’8 minutes
  • Open Source Tools for Data Science - Part 2β€’5 minutes
  • Commercial Tools for Data Scienceβ€’7 minutes
  • Cloud Based Tools for Data Scienceβ€’8 minutes
4 readingsβ€’Total 45 minutes
  • Learning goals for the courseβ€’10 minutes
  • Model Developmentβ€’10 minutes
  • Summary: Open Source Tools for Data Scienceβ€’20 minutes
  • Module 1 Summaryβ€’5 minutes
3 assignmentsβ€’Total 54 minutes
  • Graded Quiz - Data Science Tools β€’30 minutes
  • Practice Quiz: Introduction to Data Science Toolsβ€’12 minutes
  • Practice Quiz: Commercial and Cloud-Based Data Science Toolsβ€’12 minutes
1 pluginβ€’Total 15 minutes
  • Open Source Tool Boardβ€’15 minutes

For users who are just starting on their data science journey, the range of programming languages can be overwhelming. So, which language should you learn first? This module will bring awareness about the criteria that would determine which language you should learn. You will learn the benefits of Python, R, SQL, and other common languages such as Java, Scala, C++, JavaScript, and Julia. You will explore how you can use these languages in Data Science. You will also look at some sites to locate more information about the languages.

What's included

5 videos1 reading2 assignments

5 videosβ€’Total 21 minutes
  • Languages of Data Scienceβ€’3 minutes
  • Introduction to Pythonβ€’4 minutes
  • Introduction to R Languageβ€’4 minutes
  • Introduction to SQLβ€’4 minutes
  • Other Languages for Data Scienceβ€’6 minutes
1 readingβ€’Total 2 minutes
  • Module 2 Summaryβ€’2 minutes
2 assignmentsβ€’Total 40 minutes
  • Graded Quiz - Languagesβ€’30 minutes
  • Practice Quiz - Languages β€’10 minutes

In this module, you will learn about the various libraries in data science. In addition, you will understand an API in relation to REST request and response. Further, in the module, you will explore open data sets on the Data Asset eXchange. Finally, you will learn how to use a machine learning model to solve a problem and navigate the Model Asset eXchange.

What's included

5 videos3 readings2 assignments

5 videosβ€’Total 27 minutes
  • Libraries for Data Scienceβ€’5 minutes
  • Application Programming Interfaces (APIs)β€’5 minutes
  • Data Sets - Powering Data Scienceβ€’6 minutes
  • Machine Learning Models – Learning from Models to Make Predictionsβ€’7 minutes
  • The Model Asset eXchangeβ€’4 minutes
3 readingsβ€’Total 23 minutes
  • Additional Sources of Datasetsβ€’5 minutes
  • Hands on Lab: Getting Started with Open Source Datasets and Deep Learning Modelsβ€’15 minutes
  • Module 3 Summaryβ€’3 minutes
2 assignmentsβ€’Total 42 minutes
  • Graded Quiz - Libraries, APIs, Data Sets, Models β€’30 minutes
  • Practice Quiz - Libraries, APIs, Data Sets, Models β€’12 minutes

With the advancement of digital data, Jupyter Notebook allows a Data Scientist to record their data experiments and results that others can reuse. This module introduces the Jupyter Notebook and Jupyter Lab. You will learn how to work with different kernels in a Notebook session and about the basic Jupyter architecture. In addition, you will identify the tools in an Anaconda Jupyter environment. Finally, the module gives an overview of cloud based Jupyter environments and their data science features.

What's included

6 videos3 readings2 assignments3 app items

6 videosβ€’Total 21 minutes
  • Introduction to Jupyter Notebooksβ€’3 minutes
  • Getting Started with Jupyterβ€’4 minutes
  • Jupyter Kernelsβ€’2 minutes
  • Jupyter Architectureβ€’2 minutes
  • Additional Anaconda Jupyter Environmentsβ€’6 minutes
  • Additional Cloud Based Jupyter Environmentsβ€’4 minutes
3 readingsβ€’Total 22 minutes
  • (Optional): Hands-on Lab: Download & Install Anaconda on Windowsβ€’15 minutes
  • Jupyter Notebooks on the Internetβ€’5 minutes
  • Module 4 Summaryβ€’2 minutes
2 assignmentsβ€’Total 40 minutes
  • Graded Quiz - Jupyter Notebooks and JupyterLab β€’30 minutes
  • Practice Quiz - Jupyter Notebooks and Jupyter Labβ€’10 minutes
3 app itemsβ€’Total 40 minutes
  • Hands-on Lab: Getting Started with Jupyter Notebooksβ€’10 minutes
  • Hands-on Lab: Using Markdown in Jupyter Notebooksβ€’15 minutes
  • Hands-on Lab: Working with Files in Jupyter Notebooksβ€’15 minutes

R is a statistical programming language and is a powerful tool for data processing and manipulation. This module will start with an introduction to R and RStudio. You will learn about the different R visualization packages and how to create visual charts using the plot function. In addition, Distributed Version Control Systems (DVCS) have become critical tools in software development and key enablers for social and collaborative coding. While there are many distributed versioning systems, Git is amongst the most popular ones. Further in the module, you will develop the essential conceptual and hands-on skills to work with Git and GitHub. You will start with an overview of Git and GitHub, followed by creation of a GitHub account and a project repository, adding files to it, and committing your changes using the web interface. Next, you will become familiar with Git workflows involving branches and pull requests (PRs) and merges. You will also complete a project at the end to apply and demonstrate your newly acquired skills.

What's included

7 videos5 readings3 assignments5 app items

7 videosβ€’Total 29 minutes
  • Introduction to R and RStudioβ€’3 minutes
  • Plotting in RStudioβ€’4 minutes
  • Overview of Git/GitHubβ€’4 minutes
  • Introduction to GitHubβ€’5 minutes
  • GitHub Repositoriesβ€’4 minutes
  • GitHub - Getting Startedβ€’3 minutes
  • GitHub - Working with Branches β€’6 minutes
5 readingsβ€’Total 68 minutes
  • [Optional] Download & Install R and RStudioβ€’15 minutes
  • Hands-on Lab: Getting Started with GitHubβ€’20 minutes
  • Hands-On Lab: Branching and Merging (Web UI)β€’20 minutes
  • Module 5 Summaryβ€’3 minutes
  • Glossaryβ€’10 minutes
3 assignmentsβ€’Total 50 minutes
  • Graded Quiz - RStudio & GitHub β€’30 minutes
  • Practice Quiz - RStudio β€’10 minutes
  • Practice Quiz - GitHubβ€’10 minutes
5 app itemsβ€’Total 220 minutes
  • R Basics with RStudioβ€’15 minutes
  • Getting started with RStudio and Installing packagesβ€’60 minutes
  • Creating Data Visualizations using ggplotβ€’60 minutes
  • Plotting with RStudioβ€’60 minutes
  • [Optional] Getting Started with Branches using Git Commandsβ€’25 minutes

In this module, you will work on a final project to demonstrate some of the skills learned in the course. You will also be tested on your knowledge of various components and tools in a Data Scientist's toolkit learned in the previous modules.

What's included

2 readings1 assignment2 app items

2 readingsβ€’Total 22 minutes
  • Final Project Overviewβ€’20 minutes
  • Final Project Submission Guidelines and Deliverablesβ€’2 minutes
1 assignmentβ€’Total 36 minutes
  • Final Exam β€’36 minutes
2 app itemsβ€’Total 120 minutes
  • AI Graded: Final Project - Submission and Evaluationβ€’60 minutes
  • Hands-on Lab: Create your Jupyter Notebookβ€’60 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructors

Instructor ratings
4.5 (5,131 ratings)
IBM
6 Coursesβ€’802,361 learners
IBM
1 Courseβ€’598,906 learners
IBM
10 Coursesβ€’840,013 learners

Explore more from Data Analysis

Why people choose Coursera for their career

πŸ‘ Image

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
πŸ‘ Image

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
πŸ‘ Image

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
πŸ‘ Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Learner reviews

  • 5 stars

    67.99%

  • 4 stars

    21.61%

  • 3 stars

    6.37%

  • 2 stars

    2.06%

  • 1 star

    1.95%

Showing 3 of 30352

SS
Β·

Reviewed on Jul 19, 2020

Good content, excellent delivery. Week1 had too much of information at once. I was left with little motivation after week-1. All the other weeks were good. Exercises were engaging and very useful.

MO
Β·

Reviewed on Apr 17, 2023

the best course for the beginner who is going to start his data science journey. This course tells you all options like tools, libraries, programming languages, etc. Highly recommended for beginners.

GC
Β·

Reviewed on Apr 12, 2020

It serves perfecty its aim that is giving a first glance of the open course tools for data science. Of course each tool is briefly touched and it hands over the student the duty to deepen each tool.

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Financial aid available,