VOOZH about

URL: https://www.coursera.org/learn/big-data-analytics-hive-pig-mapreduce

⇱ Big Data Analytics with Hive, Pig & MapReduce | Coursera


Big Data Analytics with Hive, Pig & MapReduce

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

Big Data Analytics with Hive, Pig & MapReduce

Included with

β€’

Learn more

Ask Coursera

Gain insight into a topic and learn the fundamentals.
9 hours to complete
Flexible schedule
Learn at your own pace

Gain insight into a topic and learn the fundamentals.
9 hours to complete
Flexible schedule
Learn at your own pace

What you'll learn

  • Design and optimize Hive databases for large datasets.

  • Process XML data and execute MapReduce and Pig scripts.

  • Apply analytics to real-world telecom and social data.

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

15 assignments

Taught in English

Build your subject-matter expertise

This course is part of the Hadoop Big Data Analytics & Projects Mastery Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 4 modules in this course

By the end of this course, learners will be able to design Hive databases, manage complex tables, process XML data with Pig, execute MapReduce jobs, and analyze large-scale social media datasets to extract meaningful insights. The course begins with foundational concepts of Hive, including databases, partitions, and bucketing, then advances into table optimization and constraints for schema design. Learners will gain practical experience in ingesting data with Sqoop, processing it using MapReduce, and applying location- and author-based analytics to real-world datasets. Finally, the course explores Pig scripting for XML processing and Hive complex data types for advanced bookmarking dataset analysis.

This course is unique because it combines two hands-on case studies: one from the telecom industry and another from social media analytics, offering a blend of foundational Hive knowledge and advanced Hadoop ecosystem tools. Designed for professionals, students, and data enthusiasts, the course emphasizes practical application over theory, ensuring learners can confidently apply big data technologies to solve real business problems.

This module introduces Apache Hive and its role in the Hadoop ecosystem. Learners will explore Hive’s basic features, database commands, table operations, and foundational concepts like external tables, partitions, and bucketing. By the end, they will have a strong foundation to query and manage data effectively in Hadoop using Hive.

What's included

10 videos4 assignments

10 videosβ€’Total 65 minutes
  • Introduction of Hiveβ€’8 minutes
  • Simple and Complex Datatype in Hiveβ€’9 minutes
  • Clustersβ€’0 minutes
  • Database Command in Hiveβ€’12 minutes
  • Tables Commands in Hiveβ€’6 minutes
  • Manage Tablesβ€’6 minutes
  • External Tablesβ€’2 minutes
  • Introduction to Partitioningβ€’7 minutes
  • Partition Commandβ€’7 minutes
  • Bucketingβ€’8 minutes
4 assignmentsβ€’Total 60 minutes
  • Foundations of Hive and Big Dataβ€’30 minutes
  • Getting Started with Hiveβ€’10 minutes
  • Hive Database Essentialsβ€’10 minutes
  • Advanced Table Management in Hiveβ€’10 minutes

This module dives deeper into advanced Hive functionality, including table constraints and complex table creation. Learners will understand how to design optimized tables and implement constraints to improve schema structure and maintainability in Hive.

What's included

4 videos3 assignments

4 videosβ€’Total 33 minutes
  • Table Contr Services in Hiveβ€’11 minutes
  • Example of Contr Servicesβ€’7 minutes
  • Example of Contr Services Continuesβ€’5 minutes
  • Creating Contract All Tableβ€’11 minutes
3 assignmentsβ€’Total 50 minutes
  • Optimizing Data with Hiveβ€’30 minutes
  • Hive Constraints in Actionβ€’10 minutes
  • Creating Advanced Tablesβ€’10 minutes

This module focuses on importing social media data into Hadoop, processing it with MapReduce, and analyzing it for insights. Learners will practice using Sqoop for RDBMS to HDFS transfers, run MapReduce programs, and analyze datasets by location, authors, and reader preferences.

What's included

11 videos4 assignments

11 videosβ€’Total 90 minutes
  • Introduction to Social Media Industryβ€’9 minutes
  • Book Marking Websiteβ€’8 minutes
  • Book Marking Website Continuesβ€’5 minutes
  • Understanding Sqoopβ€’7 minutes
  • Get Data from RDMS to HDFSβ€’9 minutes
  • Execute Map Reduce Program in order to Process XML Fileβ€’12 minutes
  • Analyze Book Performance By Reviews Using Codeβ€’7 minutes
  • Analyze Book Performance By Reviews Using Code Continuesβ€’9 minutes
  • Analyse Book By Locationβ€’7 minutes
  • Example of Analyse Book By Locationβ€’7 minutes
  • Analyse Book Reader Against Authorβ€’10 minutes
4 assignmentsβ€’Total 60 minutes
  • Social Media Data Integration and Processingβ€’30 minutes
  • Social Media Landscape and Data Ingestionβ€’10 minutes
  • Processing Data with MapReduceβ€’10 minutes
  • Location and Reader Analysisβ€’10 minutes

This module explores Pig and Hive for advanced social media analytics. Learners will process XML data with Pig, store and explore outputs, and utilize Hive complex data types with MapReduce for deep insights into bookmarking datasets and user interactions.

What's included

12 videos4 assignments

12 videosβ€’Total 112 minutes
  • How to process XML File in PIGβ€’6 minutes
  • How to process XML File in PIG Continuesβ€’8 minutes
  • Analyze Book Performance in XML File in PIGβ€’10 minutes
  • More on Analyze Book Performance in XML File in PIGβ€’10 minutes
  • Pig XML File Output Using Bookβ€’9 minutes
  • Pig XML File Output Using Locationβ€’10 minutes
  • Pig XML File Output Using Location Continuesβ€’9 minutes
  • Understanding Complex Data Set Using Hiveβ€’12 minutes
  • Understanding Complex Data Set Using Hive Continuesβ€’10 minutes
  • Create Array in Map Reduce Using Hiveβ€’10 minutes
  • Book Marking Type Data Set Using Complex Typeβ€’9 minutes
  • Output of Book Marking Type Data Setβ€’10 minutes
4 assignmentsβ€’Total 60 minutes
  • Social Media Insights with Pig and Hiveβ€’30 minutes
  • XML Data Processing with Pigβ€’10 minutes
  • Pig Outputs and Data Explorationβ€’10 minutes
  • Complex Data Structures with Hive and MapReduceβ€’10 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

EDUCBA
1,591 Coursesβ€’326,930 learners

Explore more from Data Analysis

Why people choose Coursera for their career

πŸ‘ Image

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
πŸ‘ Image

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
πŸ‘ Image

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
πŸ‘ Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Financial aid available,