Toc 9780134049984

The Hadoop 2 Quick-Start Guide provides a comprehensive overview of the Apache Hadoop 2 ecosystem, covering essential concepts, installation procedures, and tools for big data computing. It includes detailed sections on the Hadoop Distributed File System (HDFS), the MapReduce framework, and various applications within the Hadoop ecosystem. The guide also offers practical examples, administration procedures, and troubleshooting tips for users and administrators.

Uploaded by

SS Return of rebel


Hadoop 2®

Quick-Start Guide
Hadoop 2 Quick-Start Guide: Learn the Essentials of Big Data Computing in the Apache Hadoop 2 Ecosystem

Table of Contents

Cover
Half Title
Title
Copyright
Contents
Foreword
Preface
Acknowledgments
About the Author
1 Background and Concepts
Defining Apache Hadoop
A Brief History of Apache Hadoop
Defining Big Data
Hadoop as a Data Lake
Using Hadoop: Administrator, User, or Both
First There Was MapReduce
Apache Hadoop Design Principles
Apache Hadoop MapReduce Example
MapReduce Advantages
Apache Hadoop V1 MapReduce Operation
Moving Beyond MapReduce with Hadoop V2
Hadoop V2 YARN Operation Design
The Apache Hadoop Project Ecosystem
Summary and Additional Resources
2 Installation Recipes
Core Hadoop Services
Hadoop Configuration Files
Planning Your Resources
Hardware Choices
Software Choices
Installing on a Desktop or Laptop
Installing Hortonworks HDP 2.2 Sandbox
Installing Hadoop from Apache Sources
Installing Hadoop with Ambari
Performing an Ambari Installation
Undoing the Ambari Install
Installing Hadoop in the Cloud Using Apache Whirr
Step 1: Install Whirr
Step 2: Configure Whirr
Step 3: Launch the Cluster
Step 4: Take Down Your Cluster
Summary and Additional Resources
3 Hadoop Distributed File System Basics
Hadoop Distributed File System Design Features
HDFS Components
HDFS Block Replication
HDFS Safe Mode
Rack Awareness
NameNode High Availability
HDFS Namespace Federation
HDFS Checkpoints and Backups
HDFS Snapshots
HDFS NFS Gateway
HDFS User Commands
Brief HDFS Command Reference
General HDFS Commands
List Files in HDFS
Make a Directory in HDFS
Copy Files to HDFS
Copy Files from HDFS
Copy Files within HDFS
Delete a File within HDFS
Delete a Directory in HDFS
Get an HDFS Status Report
HDFS Web GUI
Using HDFS in Programs
HDFS Java Application Example
HDFS C Application Example
Summary and Additional Resources
4 Running Example Programs and Benchmarks
Running MapReduce Examples
Listing Available Examples
Running the Pi Example
Using the Web GUI to Monitor Examples
Running Basic Hadoop Benchmarks
Running the Terasort Test
Running the TestDFSIO Benchmark
Managing Hadoop MapReduce Jobs
Summary and Additional Resources
5 Hadoop MapReduce Framework
The MapReduce Model
MapReduce Parallel Data Flow
Fault Tolerance and Speculative Execution
Speculative Execution
Hadoop MapReduce Hardware
Summary and Additional Resources
6 MapReduce Programming
Compiling and Running the Hadoop WordCount Example
Using the Streaming Interface
Using the Pipes Interface
Compiling and Running the Hadoop Grep Chaining Example
Debugging MapReduce
Listing, Killing, and Job Status
Hadoop Log Management
Summary and Additional Resources
7 Essential Hadoop Tools
Using Apache Pig
Pig Example Walk-Through
Using Apache Hive
Hive Example Walk-Through
A More Advanced Hive Example
Using Apache Sqoop to Acquire Relational Data
Apache Sqoop Import and Export Methods
Apache Sqoop Version Changes
Sqoop Example Walk-Through
Using Apache Flume to Acquire Data Streams
Flume Example Walk-Through
Manage Hadoop Workflows with Apache Oozie
Oozie Example Walk-Through
Using Apache HBase
HBase Data Model Overview
HBase Example Walk-Through
Summary and Additional Resources
8 Hadoop YARN Applications
YARN Distributed-Shell
Using the YARN Distributed-Shell
A Simple Example
Using More Containers
Distributed-Shell Examples with Shell Arguments
Structure of YARN Applications
YARN Application Frameworks
Distributed-Shell
Hadoop MapReduce
Apache Tez
Apache Giraph
Hoya: HBase on YARN
Dryad on YARN
Apache Spark
Apache Storm
Apache REEF: Retainable Evaluator Execution Framework
Hamster: Hadoop and MPI on the Same Cluster
Apache Flink: Scalable Batch and Stream Data Processing
Apache Slider: Dynamic Application Management
Summary and Additional Resources
9 Managing Hadoop with Apache Ambari
Quick Tour of Apache Ambari
Dashboard View
Services View
Hosts View
Admin View
Views View
Admin Pull-Down Menu
Managing Hadoop Services
Changing Hadoop Properties
Summary and Additional Resources
10 Basic Hadoop Administration Procedures
Basic Hadoop YARN Administration
Decommissioning YARN Nodes
YARN WebProxy
Using the JobHistoryServer
Managing YARN Jobs
Setting Container Memory
Setting Container Cores
Setting MapReduce Properties
Basic HDFS Administration
The NameNode User Interface
Adding Users to HDFS
Perform an FSCK on HDFS
Balancing HDFS
HDFS Safe Mode
Decommissioning HDFS Nodes
SecondaryNameNode
HDFS Snapshots
Configuring an NFSv3 Gateway to HDFS
Capacity Scheduler Background
Hadoop Version 2 MapReduce Compatibility
Enabling ApplicationMaster Restarts
Calculating the Capacity of a Node
Running Hadoop Version 1 Applications
Summary and Additional Resources
A: Book Webpage and Code Download
B: Getting Started Flowchart and Troubleshooting Guide
Getting Started Flowchart
General Hadoop Troubleshooting Guide
Rule 1: Don't Panic
Rule 2: Install and Use Ambari
Rule 3: Check the Logs
Rule 4: Simplify the Situation
Rule 5: Ask the Internet
Other Helpful Tips
C: Summary of Apache Hadoop Resources by Topic
General Hadoop Information
Hadoop Installation Recipes
HDFS
Examples
MapReduce
MapReduce Programming
Essential Tools
YARN Application Frameworks
Ambari Administration
Basic Hadoop Administration
D: Installing the Hue Hadoop GUI
Hue Installation
Steps Performed with Ambari
Install and Configure Hue
Starting Hue
Hue User Interface
E: Installing Apache Spark
Spark Installation on a Cluster
Starting Spark across the Cluster
Installing and Starting Spark on the Pseudo-distributed Single-Node Installation
Run Spark Examples
Index
