NoSQL Databases: Analyze & Implement Scalable Systems
Ends soon! Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.
NoSQL Databases: Analyze & Implement Scalable Systems
This course is part of Hadoop & Big Data Foundations Mastery Course Specialization
Included with
What you'll learn
Compare NoSQL models and consistency approaches.
Implement workflows with Oozie and streaming via Storm.
Build recommendation and clustering models using Mahout.
Skills you'll gain
Tools you'll learn
Details to know
See how employees at top companies are mastering in-demand skills
Build your subject-matter expertise
- Learn new concepts from industry experts
- Gain a foundational understanding of a subject or tool
- Develop job-relevant skills with hands-on projects
- Earn a shareable career certificate
There are 4 modules in this course
By the end of this course, learners will be able to explain the origins of NoSQL databases, evaluate their features and data models, compare ACID and BASE consistency approaches, apply workflow orchestration with Apache Oozie, and implement real-time stream processing using Apache Storm. They will also design recommendation systems, apply classification techniques, and implement clustering algorithms with Apache Mahout.
This course equips learners with both foundational knowledge and hands-on skills in distributed big data systems. Through a structured progression, learners gain practical experience with tasks, workers, topologies, and coordinators, while also exploring advanced topics such as data versioning, stream reliability, and scalable machine learning models. What makes this course unique is its integration of multiple cutting-edge technologies—NoSQL, Oozie, Storm, and Mahout—into a single, cohesive learning journey. Instead of studying these tools in isolation, learners will analyze how they interact in real-world scenarios to build scalable, fault-tolerant, and intelligent data solutions. Ideal for aspiring data engineers, developers, and analysts, this course provides the skills to design, evaluate, and implement modern big data architectures that drive insights and innovation.
This module introduces learners to the origins, features, and benefits of NoSQL databases. It explores schema flexibility, consistency models, and application development, while also introducing concepts like data versioning and workflow orchestration. Learners build a strong foundation to understand why NoSQL emerged as a solution for big data and distributed systems.
What's included
21 videos5 assignments
21 videos•Total 159 minutes
- A Brief History of NoSQL•9 minutes
- Schema Agnostic•7 minutes
- Nonrelational•6 minutes
- Enterprise NoSQL•9 minutes
- Recent Trends in IT•8 minutes
- NoSQL Benefits and Precautions•10 minutes
- Managing Different Data Types•8 minutes
- Triple and Graph Store•8 minutes
- Hybrid NoSQL Databases•8 minutes
- Applying Consistency Method•7 minutes
- Choosing ACID or BASE•10 minutes
- Developing Application on NoSQL•5 minutes
- Semantics•9 minutes
- Public Cloud•7 minutes
- Managing Availability•6 minutes
- Versioning Data•6 minutes
- Introduction to Apache Oozie•9 minutes
- Discuss Action in Detail•9 minutes
- Discuss Parameters•9 minutes
- Email Action in Oozie•5 minutes
- Hadoop FS Action in Oozie•5 minutes
5 assignments•Total 70 minutes
- Graded - Foundations of NoSQL Databases•30 minutes
- Origins of NoSQL•10 minutes
- Features and Data Types•10 minutes
- Consistency and Application Development•10 minutes
- Data Versioning and Workflow Introduction•10 minutes
This module provides hands-on insights into Apache Oozie for workflow orchestration in big data environments. Learners examine Hive and Pig actions, control nodes, coordinators, and workflow applications. The module also introduces Apache Storm basics, stream processing, and reliability concepts essential for modern big data solutions.
What's included
23 videos5 assignments
23 videos•Total 132 minutes
- Hive Action in Oozie•5 minutes
- Hive Action in Oozie Continue•5 minutes
- Control Node•3 minutes
- Control Node Continue•7 minutes
- Pig Action in Oozie•10 minutes
- Pig Action in Oozie Continues•9 minutes
- Oozie Coordinators•12 minutes
- Oozie Workflow Applications•9 minutes
- Oozie Workflow Applications Continues•11 minutes
- Introduction•2 minutes
- Description of Hadoop•4 minutes
- Storm Introduction•4 minutes
- Apache Storm History•4 minutes
- Features of Apache Storm•3 minutes
- Architecture of Apache Storm•3 minutes
- Architecture Explanation in Detail•7 minutes
- Topology•5 minutes
- Spouts and Bolts•4 minutes
- Stream•3 minutes
- Installation Process•2 minutes
- Stream Grouping•11 minutes
- Stream Grouping Continue•5 minutes
- Reliability•5 minutes
5 assignments•Total 70 minutes
- Graded - Workflow Orchestration with Apache Oozie•30 minutes
- Hive and Control Nodes•10 minutes
- Coordinators and Workflow Applications•10 minutes
- Apache Storm Basics•10 minutes
- Topology, Streams, and Reliability•10 minutes
This module dives deeper into Apache Storm, covering tasks, workers, deployment, and parallelism. It bridges Storm’s real-time processing with Apache Mahout’s machine learning capabilities, focusing on recommendations, classifiers, and practical examples. Learners gain practical skills in deploying, scaling, and integrating real-time ML applications.
What's included
20 videos5 assignments
20 videos•Total 145 minutes
- Tasks•3 minutes
- Workers•3 minutes
- Java Installation and Zookeeper•9 minutes
- Zookeeper installation•9 minutes
- Eclipse Installation•2 minutes
- Command line Client•4 minutes
- Parallelism in Storm Topology•8 minutes
- What is Mahout•7 minutes
- Mahout Architecture•9 minutes
- Subversion Installation•7 minutes
- Item Based Recommendation•7 minutes
- Example- CBayes Classifier•8 minutes
- Command Line Options•11 minutes
- Canopy Clustering•11 minutes
- Basic Recommender•11 minutes
- Practical Examples•8 minutes
- Mahout Seqdumper Command•7 minutes
- Running Code through Eclipse•6 minutes
- Reading from Code•6 minutes
- Introduction to Apache Mahout Deep Dive•9 minutes
5 assignments•Total 70 minutes
- Graded - Real-Time Processing with Apache Storm•30 minutes
- Tasks and Workers •10 minutes
- Deployment and Parallelism•10 minutes
- Recommendations and Classifiers•10 minutes
- Practical Examples and Deep Dive •10 minutes
This module focuses on machine learning algorithms implemented in Apache Mahout. Learners study recommendation systems, clustering, classification, evaluation techniques, and advanced algorithms like KMeans and Logistic Regression. By the end, learners will be able to design and implement scalable ML models on big data platforms.
What's included
12 videos4 assignments
12 videos•Total 106 minutes
- Use Cases•9 minutes
- Recommendation•11 minutes
- Example - Tanimoto Distance•7 minutes
- How to Use Mahout?•10 minutes
- Exercise•7 minutes
- Example - Evaluation•7 minutes
- Deep Dive Canopy Clustering•10 minutes
- Classification•10 minutes
- Vector File•8 minutes
- Naïve Bayes Classifier from Code•11 minutes
- KMeans Clustering•6 minutes
- Logistic Regression•8 minutes
4 assignments•Total 60 minutes
- Graded - Machine Learning with Apache Mahout•30 minutes
- Use Cases and Recommendations •10 minutes
- Evaluation and Clustering•10 minutes
- Clustering Algorithms •10 minutes
Earn a career certificate
Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.
Explore more from Data Analysis
- Status: Free Trial
Specialization
- Status: Free Trial
Course
- Status: Free TrialU
Universidad Nacional Autónoma de México
Course
- Status: PreviewO
O.P. Jindal Global University
Course
Why people choose Coursera for their career
Frequently asked questions
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.
More questions
Financial aid available,
