Let's break down the process of how a Hive query is executed in simple terms:
1. Execute Query
● What Happens: You submit a query using a Hive interface, like a command line or web
interface.
● Example: Imagine you want to find the average sales for your store. You write a SQL-like query in Hive, such as:
SELECT AVG(sales) FROM store_data;
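Any HiveQL statement is submitted the same way. For example, before writing the query you could explore what is available, straight from the same interface:
SHOW DATABASES;  -- list the databases you can query
SHOW TABLES;     -- list the tables in the current database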
2. Get Plan
● What Happens: The query is passed to a "driver," which uses a query compiler to first
check if your query is written correctly (syntax check) and then decide on the best way to
get the answer (query plan).
● Example: The compiler checks if you've written "AVG" and "sales" correctly and figures
out which parts of your data need to be read.
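You can ask Hive to show you this plan with the EXPLAIN statement. A minimal sketch, reusing the query from step 1:
EXPLAIN SELECT AVG(sales) FROM store_data;
The output describes the stages Hive intends to run, such as a map stage that scans the table and a reduce stage that computes the aggregate.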
3. Get Metadata
● What Happens: The compiler now asks the Metastore (a database storing metadata) for
information about the tables in your query, like their structure.
● Example: It might ask, "What is the structure of the store_data table? Does it have a
sales column?"
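You can look up the same metadata yourself from the Hive interface, again assuming the store_data table exists:
DESCRIBE store_data;            -- column names and types
DESCRIBE FORMATTED store_data;  -- adds storage location, file format, and other details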
4. Send Metadata
● What Happens: The Metastore sends back details like the table's schema, location, and
column types.
● Example: The Metastore might respond, "Yes, the store_data table has a column
called sales and it's stored in this specific format on these servers."
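That response reflects how the table was originally defined. A hypothetical definition of store_data that would produce such an answer (the sale_date column and ORC format are illustrative assumptions, not details from this walkthrough):
CREATE TABLE store_data (
  sale_date DATE,   -- hypothetical column, reused in the step 7 example below
  sales     DOUBLE  -- the column our AVG query reads
)
STORED AS ORC;      -- the storage format is one of the details the Metastore tracks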
5. Send Plan
● What Happens: After getting the metadata, the compiler finalizes the plan to run your
query and gives it back to the driver.
● Example: The driver now knows how it will execute the query, what data to read, and in
what order.
6. Execute Plan
● What Happens: The driver sends this plan to the execution engine.
● Example: The execution engine starts preparing for the actual work, much like a chef
getting ingredients ready based on a recipe.
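One note on the engine: the flow described here is the classic MapReduce path, but newer Hive versions can also hand the plan to Tez or Spark. The engine is chosen with a configuration property:
SET hive.execution.engine=mr;  -- classic MapReduce, as in this walkthrough
-- newer deployments may use 'tez' or 'spark' instead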
7. Execute Job (MapReduce)
● What Happens: The execution engine runs the query as MapReduce jobs, which break the work into small tasks distributed across different machines. It sends each job to the JobTracker, which assigns tasks to TaskTrackers running on the data nodes (the computers in the cluster).
● Example: The task might be, "Find the total sales per day across many servers," and
each server handles a chunk of the data, reporting results back.
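The per-day example above would look like this in HiveQL, using the hypothetical sale_date column from step 4:
-- Map tasks read chunks of store_data and emit (sale_date, sales) pairs;
-- reduce tasks then sum the values for each date.
SELECT sale_date, SUM(sales) AS total_sales
FROM store_data
GROUP BY sale_date;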
7.1 Metadata Ops (During Execution)
● What Happens: While the query is running, the execution engine may go back to the Metastore for additional metadata.
● Example: It might need to check details about a table's partitions or where certain data
is stored.
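Partition listings are a typical mid-run lookup. For a partitioned table, you can see the same information yourself:
SHOW PARTITIONS store_data;  -- only works if the table is actually partitioned
(Our hypothetical store_data definition in step 4 is unpartitioned, so this line is illustrative only.)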
8. Fetch Results
● What Happens: After the MapReduce job finishes, the execution engine collects all the
results from different nodes (servers).
● Example: Each server that processed part of the data sends back its partial result, such as the sum and count of the sales values it handled.
9. Send Results to Driver
● What Happens: The execution engine sends the final results to the driver.
● Example: The execution engine combines those partial results into the final average and passes that figure to the driver.
10. Send Results to Hive Interface
● What Happens: The driver sends the final result back to the interface where you
submitted the query, like the command line or web UI.
● Example: Finally, you see the result on your screen, say, an average sales figure of $500.
Summary:
In simple terms, Hive takes your SQL-like query, checks if it's written correctly, figures out how
to run it, and distributes the work across many machines in the background. It uses MapReduce
to break down the job, fetches results from different parts of the system, and then sends the
answer back to you.