AMC ENGINEERING COLLEGE
Dept. Of Computer Science and Engineering
Big Data Analytics [21CS71] Assignment-2
Topic: Installation of Hadoop Kalyan G V (1AM21CS077)
Installation of Hadoop
Steps for Hadoop Installation:
1. Install Java Development Kit (JDK):
Hadoop requires Java to be installed on your system.
• To check if Java is installed:
java -version
• If Java is not installed, install it using:
sudo apt update
sudo apt install openjdk-8-jdk
• Verify the installation:
java -version
2. Install SSH:
Hadoop uses SSH to communicate between its nodes.
• Install SSH if it is not already present:
sudo apt install openssh-server
• Ensure SSH is running:
sudo systemctl start ssh
sudo systemctl enable ssh
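Hadoop's start scripts log in to localhost over SSH, so a single-node setup also needs passwordless SSH to the local machine. A minimal sketch, assuming the default ssh-keygen key path ~/.ssh/id_rsa:

```shell
# Generate an RSA key pair with an empty passphrase (skip if ~/.ssh/id_rsa already exists)
ssh-keygen -t rsa -P "" -f ~/.ssh/id_rsa
# Authorize the new key for logins to this machine
cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
chmod 600 ~/.ssh/authorized_keys
# Verify: this should log in and exit without prompting for a password
ssh localhost exit
```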
3. Download Hadoop:
• Download Hadoop from the official Apache website:
wget https://downloads.apache.org/hadoop/common/hadoop-3.3.1/hadoop-3.3.1.tar.gz
• Extract the downloaded tar file:
tar -xzvf hadoop-3.3.1.tar.gz
• Move the extracted folder to /usr/local/hadoop:
sudo mv hadoop-3.3.1 /usr/local/hadoop
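Before extracting, the archive can optionally be verified against the checksum Apache publishes alongside each release (the .sha512 filename follows the mirror's convention; depending on the checksum file's format, a manual comparison of the hashes may be needed instead of -c):

```shell
# Fetch the published SHA-512 checksum for this release
wget https://downloads.apache.org/hadoop/common/hadoop-3.3.1/hadoop-3.3.1.tar.gz.sha512
# Compute the local hash; compare it with the value in the .sha512 file
sha512sum hadoop-3.3.1.tar.gz
cat hadoop-3.3.1.tar.gz.sha512
```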
4. Configure Hadoop Environment Variables:
• Open the .bashrc file to add Hadoop-related environment variables:
nano ~/.bashrc
• Add the following lines at the end of the file:
export HADOOP_HOME=/usr/local/hadoop
export PATH=$PATH:$HADOOP_HOME/bin:$HADOOP_HOME/sbin
export HADOOP_CONF_DIR=$HADOOP_HOME/etc/hadoop
export YARN_HOME=$HADOOP_HOME
• Save and close the file. Then, apply the changes:
source ~/.bashrc
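Hadoop's daemon scripts also read JAVA_HOME from hadoop-env.sh; if it is not set there, startup fails with an "ERROR: JAVA_HOME is not set" message. A minimal sketch, assuming the OpenJDK 8 path that apt uses on Ubuntu amd64 (adjust the path for your system):

```shell
# Point Hadoop at the JDK installed in step 1 (path is the Ubuntu amd64 default)
echo 'export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64' >> $HADOOP_HOME/etc/hadoop/hadoop-env.sh
# Confirm the hadoop command is on PATH and reports its version
hadoop version
```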
5. Configure Hadoop Files:
Hadoop requires several configuration files to be set up for proper functioning.
• core-site.xml: Navigate to $HADOOP_HOME/etc/hadoop and open core-site.xml:
nano $HADOOP_HOME/etc/hadoop/core-site.xml
Add the following configuration:
<configuration>
<property>
<name>fs.defaultFS</name>
<value>hdfs://localhost:9000</value>
</property>
</configuration>
• hdfs-site.xml: Edit hdfs-site.xml:
nano $HADOOP_HOME/etc/hadoop/hdfs-site.xml
Add the following configuration:
<configuration>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
</configuration>
• mapred-site.xml: Edit mapred-site.xml. In Hadoop 3.x this file already exists; in older Hadoop 2.x releases it must first be copied from the template:
cp $HADOOP_HOME/etc/hadoop/mapred-site.xml.template $HADOOP_HOME/etc/hadoop/mapred-site.xml
nano $HADOOP_HOME/etc/hadoop/mapred-site.xml
Add the following configuration:
<configuration>
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
</configuration>
• yarn-site.xml: Edit yarn-site.xml:
nano $HADOOP_HOME/etc/hadoop/yarn-site.xml
Add the following configuration (the mapreduce_shuffle auxiliary service is required for MapReduce jobs to run on YARN):
<configuration>
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.resourcemanager.address</name>
<value>localhost:8032</value>
</property>
</configuration>
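Once the configuration files are edited, the effective values can be read back with hdfs getconf, which parses the same configuration directory the daemons will use:

```shell
# Should print hdfs://localhost:9000 from core-site.xml
hdfs getconf -confKey fs.defaultFS
# Should print 1 from hdfs-site.xml
hdfs getconf -confKey dfs.replication
```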
6. Format the Hadoop File System:
• Format the Hadoop Distributed File System (HDFS) before first use. Do this only once: reformatting erases existing HDFS metadata.
hdfs namenode -format
7. Start Hadoop Services:
• Start the HDFS daemons (Namenode, Datanode):
start-dfs.sh
• Start YARN daemons (ResourceManager, NodeManager):
start-yarn.sh
8. Verify Hadoop Installation:
• Check if Hadoop is running by opening the ResourceManager and Namenode web UIs:
o ResourceManager UI: http://localhost:8088
o Namenode UI: http://localhost:9870
• Check the status of HDFS:
hdfs dfsadmin -report
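A quick way to confirm HDFS accepts reads and writes is to create a directory and copy a file into it (the /user/$USER path and test file name below are illustrative):

```shell
# Create a home directory in HDFS for the current user
hdfs dfs -mkdir -p /user/$USER
# Copy a local file into HDFS and list it back
echo "hello hadoop" > /tmp/hello.txt
hdfs dfs -put /tmp/hello.txt /user/$USER/
hdfs dfs -ls /user/$USER
```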
• You can also run a simple Hadoop job:
yarn jar $HADOOP_HOME/share/hadoop/mapreduce/hadoop-mapreduce-examples-3.3.1.jar pi 2 5
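The running daemons can be listed with jps (shipped with the JDK), and the cluster is shut down with the stop scripts matching those used in step 7:

```shell
# Should list NameNode, DataNode, SecondaryNameNode, ResourceManager, and NodeManager
jps
# Stop YARN first, then HDFS
stop-yarn.sh
stop-dfs.sh
```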