VOOZH about

URL: https://www.coursera.org/learn/apache-hive-design-query-optimize-big-data

⇱ Apache Hive: Design, Query & Optimize Big Data | Coursera


Apache Hive: Design, Query & Optimize Big Data

Keep adding new skills with 10,000+ programs for $239 (usually $399). Save now.

Apache Hive: Design, Query & Optimize Big Data

Included with

β€’

Learn more

Ask Coursera

Gain insight into a topic and learn the fundamentals.
1 week to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

Gain insight into a topic and learn the fundamentals.
1 week to complete
at 10 hours a week
Flexible schedule
Learn at your own pace

What you'll learn

  • Design and manage Hive databases, tables, and partitions.

  • Implement joins, UDFs, and SerDe for data transformation.

  • Optimize queries and tune performance for big data workflows.

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

25 assignments

Taught in English

Build your subject-matter expertise

This course is part of the Hadoop & Big Data Foundations Mastery Course Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 5 modules in this course

Learners will be able to design Hive databases and tables, implement partitions and bucketing, apply joins, configure SerDe, create custom UDFs, and optimize queries for efficient big data processing. By the end of the course, participants will not only understand Hive fundamentals but also apply advanced operations such as indexing, views, Slowly Changing Dimensions (SCDs), XML data handling, variable substitution, and performance tuning.

This course provides a step-by-step pathway from beginner to advanced Hive skills, ensuring a solid foundation in HiveQL while introducing real-world scenarios that mirror enterprise big data challenges. Unlike generic SQL courses, this program is specifically tailored to Hive within the Hadoop ecosystem, highlighting its schema-on-read model, distributed query execution, and integration with Hadoop’s scalability. Learners will gain hands-on practice with query optimization, compression, and Hive architecture, making them confident in handling large-scale datasets. Upon completion, they will be able to analyze, transform, and optimize big data effectively, preparing for careers in data engineering, analytics, and Hadoop ecosystem management.

This module introduces Apache Hive and its core fundamentals, including databases, tables, partitions, and bucketing. Learners will explore how Hive enables SQL-like queries on Hadoop, manage datasets, and apply key commands for efficient data handling.

What's included

13 videos5 assignments

13 videosβ€’Total 85 minutes
  • Introduction to HIVEβ€’11 minutes
  • HIVE Data Baseβ€’10 minutes
  • Load Data Commandβ€’6 minutes
  • How to Replace Columnβ€’4 minutes
  • External Tableβ€’6 minutes
  • HIVE Metastoreβ€’3 minutes
  • What is Hive Partitionβ€’10 minutes
  • Creating Partition Tableβ€’9 minutes
  • Insert Overwrite Tableβ€’4 minutes
  • Dynamic Partition Trueβ€’2 minutes
  • Hive Bucketingβ€’5 minutes
  • Decomposing Data Setsβ€’6 minutes
  • Hive Joinsβ€’9 minutes
5 assignmentsβ€’Total 70 minutes
  • Hive Fundamentalsβ€’30 minutes
  • Getting Started with Hiveβ€’10 minutes
  • Tables and Data Management Basicsβ€’10 minutes
  • Partitions and Bucketingβ€’10 minutes
  • Dataset Operations and Decompositionβ€’10 minutes

This module focuses on Hive joins, serialization and deserialization (SerDe), and user-defined functions (UDFs). Learners will practice how to extend HiveQL functionality and apply advanced data transformation techniques.

What's included

12 videos5 assignments

12 videosβ€’Total 88 minutes
  • Hive Joins Continueβ€’10 minutes
  • Skew Joinβ€’3 minutes
  • What is Serdeβ€’7 minutes
  • Serde in Hiveβ€’9 minutes
  • Hive UDFβ€’10 minutes
  • Hive UDF Continuesβ€’7 minutes
  • More Hive UDFβ€’7 minutes
  • Maxcale Functionβ€’3 minutes
  • Hive Example Use Caseβ€’12 minutes
  • Introduction to Hive Concepts and Hands-on Demonstrationβ€’6 minutes
  • Internal Table and External Tableβ€’6 minutes
  • Inserting Data Into Tablesβ€’7 minutes
5 assignmentsβ€’Total 70 minutes
  • Joins, SerDe, and UDFsβ€’30 minutes
  • Advanced Joinsβ€’10 minutes
  • Serialization and Deserializationβ€’10 minutes
  • Hive Functions and Use Casesβ€’10 minutes
  • Core Hive Demonstrationβ€’10 minutes

This module covers Hive operations, functions, and expressions, along with advanced partitioning strategies. Learners will gain hands-on experience with sorting, joins, alter commands, and table sampling for data optimization.

What's included

12 videos5 assignments

12 videosβ€’Total 81 minutes
  • Date and Mathematical Functionsβ€’9 minutes
  • Conditional Statementsβ€’7 minutes
  • Explode and Lateral Viewβ€’8 minutes
  • Sortingβ€’6 minutes
  • Joinβ€’9 minutes
  • Map Joinβ€’2 minutes
  • Static and Dynamic Partitioningβ€’7 minutes
  • More on Dynamic Partitioningβ€’7 minutes
  • Alter Commandβ€’6 minutes
  • MSCK Commandβ€’9 minutes
  • Bucketingβ€’8 minutes
  • Table Samplingβ€’3 minutes
5 assignmentsβ€’Total 70 minutes
  • Hive Operations and Partitioningβ€’30 minutes
  • Functions and Expressionsβ€’10 minutes
  • Sorting and Joinsβ€’10 minutes
  • Partitioning and Alter Commandsβ€’10 minutes
  • Commands, Bucketing, and Samplingβ€’10 minutes

This module explores Hive views, indexing techniques, and configuration of Hive variables. Learners will learn to create reusable query structures, apply compact and bitmap indexes, and configure variable substitution for query optimization.

What's included

12 videos5 assignments

12 videosβ€’Total 70 minutes
  • Archivingβ€’3 minutes
  • Ranksβ€’9 minutes
  • Creating Viewsβ€’9 minutes
  • Advantages of views and Altering Viewsβ€’7 minutes
  • What is Indexingβ€’6 minutes
  • Compact and Bitmap Index Running Timeβ€’5 minutes
  • Hive Commands in Bash Shellβ€’5 minutes
  • Hive Variables - Hiveconfβ€’4 minutes
  • Hive Variables -Hiveconf in Bash Shellβ€’5 minutes
  • Configuring a Hive Var Variableβ€’9 minutes
  • Variable Substitutionβ€’2 minutes
  • Word Countβ€’6 minutes
5 assignmentsβ€’Total 70 minutes
  • Views, Indexing, and Variablesβ€’30 minutes
  • Archiving and Rankingβ€’10 minutes
  • Views in Hiveβ€’10 minutes
  • Indexing and Commands in Hiveβ€’10 minutes
  • Hive Variables and Substitutionβ€’10 minutes

This module introduces Hive’s internal architecture, execution modes, and advanced features. Learners will explore SCDs, XML data handling, immutable tables, compression techniques, and performance configurations.

What's included

23 videos5 assignments

23 videosβ€’Total 141 minutes
  • Hive Architectureβ€’3 minutes
  • Parallelism in Hiveβ€’6 minutes
  • Table Properties in Hiveβ€’6 minutes
  • Null Format Propertiesβ€’6 minutes
  • Null Format Properties Continuesβ€’4 minutes
  • Purge Commands in Hivesβ€’5 minutes
  • Slowing Changing Dimensionβ€’7 minutes
  • Implement the SCDβ€’9 minutes
  • Example of the SCDβ€’4 minutes
  • How to Load XML Data in Hiveβ€’5 minutes
  • How to Load XML Data in Hive Continueβ€’9 minutes
  • No Drop and Offline in Hiveβ€’8 minutes
  • Immutable Tableβ€’9 minutes
  • How to Create Hive RC Fileβ€’9 minutes
  • Multiple Tablesβ€’6 minutes
  • Merging Hive Created Files and Function rLikeβ€’6 minutes
  • Various Configuration Settings in Hiveβ€’9 minutes
  • Various Configuration Settings in Hive Continuesβ€’3 minutes
  • Compressing Various Files in Hiveβ€’6 minutes
  • Different Modes in Hiveβ€’4 minutes
  • File Compression in Hiveβ€’6 minutes
  • Type of Mode in Hiveβ€’4 minutes
  • Comparison of Internal and External Tableβ€’8 minutes
5 assignmentsβ€’Total 70 minutes
  • Hive Architecture and Advanced Featuresβ€’30 minutes
  • Architecture and Table Propertiesβ€’10 minutes
  • Storage and Data Dimensionsβ€’10 minutes
  • XML and Table Managementβ€’10 minutes
  • Configuration, Compression, and Modesβ€’10 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

EDUCBA
1,591 Coursesβ€’326,930 learners

Explore more from Data Analysis

Why people choose Coursera for their career

πŸ‘ Image

Felipe M.

Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
πŸ‘ Image

Jennifer J.

Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
πŸ‘ Image

Larry W.

Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
πŸ‘ Image

Chaitanya A.

"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Financial aid available,