Practical 5

This document provides a step-by-step guide to install Hadoop 2.4.1 in pseudo distributed mode. It includes instructions for setting up Hadoop environment variables, configuring necessary files such as core-site.xml, hdfs-site.xml, yarn-site.xml, and mapred-site.xml, and applying changes to the system. The document emphasizes user-defined property values for customization according to the user's Hadoop infrastructure.

Uploaded by

tosam67394

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views3 pages

Practical 5

Uploaded by

tosam67394

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

Installing Hadoop in Pseudo Distributed Mode

Follow the steps given below to install Hadoop 2.4.1 in pseudo distributed
mode.

Step 1 − Setting Up Hadoop

You can set Hadoop environment variables by appending the following
commands to ~/.bashrc file.
export HADOOP_HOME=/usr/local/hadoop
export HADOOP_MAPRED_HOME=$HADOOP_HOME
export HADOOP_COMMON_HOME=$HADOOP_HOME

export HADOOP_HDFS_HOME=$HADOOP_HOME
export YARN_HOME=$HADOOP_HOME
export HADOOP_COMMON_LIB_NATIVE_DIR=$HADOOP_HOME/lib/native
export PATH=$PATH:$HADOOP_HOME/sbin:$HADOOP_HOME/bin
export HADOOP_INSTALL=$HADOOP_HOME

Now apply all the changes into the current running system.

$ source ~/.bashrc

Step 2 − Hadoop Configuration

You can find all the Hadoop configuration files in the location
$HADOOP_HOME/etc/hadoop. It is required to make changes in those
configuration files according to your Hadoop infrastructure.

$ cd $HADOOP_HOME/etc/hadoop
In order to develop Hadoop programs in java, you have to reset the java
environment variables in hadoop-env.sh file by
replacing JAVA_HOME value with the location of java in your system.
export JAVA_HOME=/usr/local/jdk1.7.0_71

The following are the list of files that you have to edit to configure
Hadoop.

core-site.xml
The core-site.xml file contains information such as the port number used
for Hadoop instance, memory allocated for the file system, memory limit
for storing the data, and size of Read/Write buffers.
Open the core-site.xml and add the following properties in between
<configuration>, </configuration> tags.

<configuration>
<property>
<name>fs.default.name</name>
<value>hdfs://localhost:9000</value>
</property>
</configuration>
hdfs-site.xml
The hdfs-site.xml file contains information such as the value of
replication data, namenode path, and datanode paths of your local file
systems. It means the place where you want to store the Hadoop
infrastructure.

Let us assume the following data.

dfs.replication (data replication value) = 1

(In the below given path /hadoop/ is the user name.

hadoopinfra/hdfs/namenode is the directory created by hdfs file
system.)
namenode path = //home/hadoop/hadoopinfra/hdfs/namenode

(hadoopinfra/hdfs/datanode is the directory created by hdfs file

system.)
datanode path = //home/hadoop/hadoopinfra/hdfs/datanode

Open this file and add the following properties in between the
<configuration> &lt/configuration> tags in this file.

<configuration>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>

<property>
<name>dfs.name.dir</name>
<value>file:///home/hadoop/hadoopinfra/hdfs/namenode
</value>
</property>

<property>
<name>dfs.data.dir&lt/name>
<value>file:///home/hadoop/hadoopinfra/hdfs/datanode
</value>
</property>
</configuration>
Note − In the above file, all the property values are user-defined and you
can make changes according to your Hadoop infrastructure.
yarn-site.xml

This file is used to configure yarn into Hadoop. Open the yarn-site.xml file
and add the following properties in between the <configuration>,
</configuration> tags in this file.

<configuration>
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle&lt/value>
</property>
</configuration>
mapred-site.xml
This file is used to specify which MapReduce framework we are using. By
default, Hadoop contains a template of yarn-site.xml. First of all, it is
required to copy the file from mapred-site.xml.template to mapred-
site.xml file using the following command.
$ cp mapred-site.xml.template mapred-site.xml
Open mapred-site.xml file and add the following properties in between
the <configuration>, </configuration>tags in this file.
<configuration>
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
</configuration>

Install Sqoop
No ratings yet
Install Sqoop
7 pages
Assignment Tanupriya BDDV
No ratings yet
Assignment Tanupriya BDDV
8 pages
Hive INstallation
No ratings yet
Hive INstallation
13 pages
Experiment-2 BDA Lab
No ratings yet
Experiment-2 BDA Lab
13 pages
Install Hadoop
No ratings yet
Install Hadoop
8 pages
Group A 1st
No ratings yet
Group A 1st
4 pages
Hadoop Configuration
No ratings yet
Hadoop Configuration
12 pages
Ex 1
No ratings yet
Ex 1
5 pages
Sqoop Tutorial: Sqoop: "SQL To Hadoop and Hadoop To SQL"
No ratings yet
Sqoop Tutorial: Sqoop: "SQL To Hadoop and Hadoop To SQL"
11 pages
TP2 - 3IM - en
No ratings yet
TP2 - 3IM - en
7 pages
Hadoop Installation Guide
No ratings yet
Hadoop Installation Guide
18 pages
Week 1 Lab
No ratings yet
Week 1 Lab
8 pages
BDA Lab Manual UPDATED
No ratings yet
BDA Lab Manual UPDATED
45 pages
DataVisuaization Lab
No ratings yet
DataVisuaization Lab
5 pages
Hadoop Setup Guide for Developers
No ratings yet
Hadoop Setup Guide for Developers
3 pages
BDA Practical Experiment 1
No ratings yet
BDA Practical Experiment 1
5 pages
Hadoop Installation
No ratings yet
Hadoop Installation
6 pages
Hadoop For Ubuntu 2
No ratings yet
Hadoop For Ubuntu 2
4 pages
Hadoop Setup Guide for Ubuntu 16.04/18.04
No ratings yet
Hadoop Setup Guide for Ubuntu 16.04/18.04
20 pages
Installing Standalone and Pseudocode Hadoop Cluster: 1. Setting Up Vmware Virtual Machine
No ratings yet
Installing Standalone and Pseudocode Hadoop Cluster: 1. Setting Up Vmware Virtual Machine
14 pages
Hbase Installationn
No ratings yet
Hbase Installationn
12 pages
Unit 3 PART 2
No ratings yet
Unit 3 PART 2
11 pages
Big Data Lab Record
No ratings yet
Big Data Lab Record
30 pages
Bda Lab Manual Print 3.6.24
No ratings yet
Bda Lab Manual Print 3.6.24
45 pages
Hadoop Installation Steps
100% (1)
Hadoop Installation Steps
6 pages
Big Data Analytics Lab Manual
No ratings yet
Big Data Analytics Lab Manual
33 pages
BDA Lab Manual-1
No ratings yet
BDA Lab Manual-1
60 pages
Installation of Hadoop
No ratings yet
Installation of Hadoop
6 pages
Computer Science & Engineering: Department of
No ratings yet
Computer Science & Engineering: Department of
6 pages
BDA Lab Manual
No ratings yet
BDA Lab Manual
34 pages
Big Data
No ratings yet
Big Data
5 pages
Hadoop Setup Guide for Developers
No ratings yet
Hadoop Setup Guide for Developers
7 pages
Bda Lab
No ratings yet
Bda Lab
37 pages
Hadoop Setup Guide for Linux Users
No ratings yet
Hadoop Setup Guide for Linux Users
23 pages
Amc Engineering College: Dept. of Computer Science and Engineering
No ratings yet
Amc Engineering College: Dept. of Computer Science and Engineering
6 pages
A Report On Distributed Computing
No ratings yet
A Report On Distributed Computing
25 pages
Week 1 in Terminal
No ratings yet
Week 1 in Terminal
10 pages
$ Sudo Apt-Get Install Oracle-Java8-Installer
No ratings yet
$ Sudo Apt-Get Install Oracle-Java8-Installer
4 pages
Hadoop Installation
No ratings yet
Hadoop Installation
4 pages
3 Hadoop
No ratings yet
3 Hadoop
40 pages
Big Data Record
No ratings yet
Big Data Record
69 pages
Hadoop Setup & File Management Guide
No ratings yet
Hadoop Setup & File Management Guide
16 pages
Hadoop Setup Guide for Windows Users
No ratings yet
Hadoop Setup Guide for Windows Users
29 pages
Hadoop Setup Guide for Developers
No ratings yet
Hadoop Setup Guide for Developers
50 pages
BDA Lab File
No ratings yet
BDA Lab File
4 pages
Hadoopfile PP
No ratings yet
Hadoopfile PP
83 pages
Exp 1-2
No ratings yet
Exp 1-2
9 pages
Install Single Node Hadoop on Ubuntu
No ratings yet
Install Single Node Hadoop on Ubuntu
13 pages
Hadoop Setup for CSE Students
No ratings yet
Hadoop Setup for CSE Students
17 pages
Hadoop Installation Step by Step
No ratings yet
Hadoop Installation Step by Step
8 pages
Bda Record
No ratings yet
Bda Record
27 pages
Hive Installation Guide
No ratings yet
Hive Installation Guide
15 pages
Hadoop Cluster Creation
No ratings yet
Hadoop Cluster Creation
8 pages
Hadoop Record 2024-Final
No ratings yet
Hadoop Record 2024-Final
59 pages
CLD 7
No ratings yet
CLD 7
3 pages
Hadoop Setup Guide for Developers
No ratings yet
Hadoop Setup Guide for Developers
3 pages
Big Data Analytics Lab Manual
No ratings yet
Big Data Analytics Lab Manual
80 pages
Lab Manual
No ratings yet
Lab Manual
27 pages
Camera Raw GPU Config
No ratings yet
Camera Raw GPU Config
1 page
Digital Image Processing - Chapter 7
No ratings yet
Digital Image Processing - Chapter 7
63 pages
Agnet Over Satcom
No ratings yet
Agnet Over Satcom
6 pages
Lecture 1-2 - Data and Database Basics
No ratings yet
Lecture 1-2 - Data and Database Basics
36 pages
Tesys T - LTMR08EFM
No ratings yet
Tesys T - LTMR08EFM
4 pages
50 Excel Practical Assignments Unsolved
No ratings yet
50 Excel Practical Assignments Unsolved
71 pages
Ethical Hacking and Network Security PDF
No ratings yet
Ethical Hacking and Network Security PDF
2 pages
Old Paper 2597164
No ratings yet
Old Paper 2597164
1 page
Topic-4 MCQ
No ratings yet
Topic-4 MCQ
22 pages
DHIMATLABToolboxUserGuide.1465646613 Unlocked
No ratings yet
DHIMATLABToolboxUserGuide.1465646613 Unlocked
20 pages
Annex A Mechanics of The Different Contests and Events 2
No ratings yet
Annex A Mechanics of The Different Contests and Events 2
32 pages
Simple Novel Manager (VNGE)
No ratings yet
Simple Novel Manager (VNGE)
2 pages
Wa0007.
No ratings yet
Wa0007.
7 pages
PSY309 Lecture Notes 1
No ratings yet
PSY309 Lecture Notes 1
5 pages
ROI of Infoblox IPAM (IP Address Management) For DNS and DHCP
No ratings yet
ROI of Infoblox IPAM (IP Address Management) For DNS and DHCP
9 pages
Department of Education: Caraga Region Schools Division of Surigao Del Sur Gamut National High School
No ratings yet
Department of Education: Caraga Region Schools Division of Surigao Del Sur Gamut National High School
2 pages
Constello 2K24-March 6
No ratings yet
Constello 2K24-March 6
15 pages
Answer B 1 2
No ratings yet
Answer B 1 2
7 pages
DP 700
100% (6)
DP 700
141 pages
Black Controller User Guide
No ratings yet
Black Controller User Guide
9 pages
Killer Deals
No ratings yet
Killer Deals
33 pages
BSNL's New Complaint Portal Guide
No ratings yet
BSNL's New Complaint Portal Guide
1 page
Requirements For Biometric Card Providers v1.0 FINAL
No ratings yet
Requirements For Biometric Card Providers v1.0 FINAL
15 pages
AB AF001 - X - EASY2 Old - GB
No ratings yet
AB AF001 - X - EASY2 Old - GB
65 pages
Spring Boot With MongoDB
No ratings yet
Spring Boot With MongoDB
16 pages
Master Data - Material Master Data
No ratings yet
Master Data - Material Master Data
82 pages
Carding Beginners Guide
No ratings yet
Carding Beginners Guide
3 pages
Karpatne A. Knowledge Guided Machine Learning... 2022
100% (1)
Karpatne A. Knowledge Guided Machine Learning... 2022
442 pages
Advanced View of Projects Raspberry Pi List - Raspberry PI Projects
No ratings yet
Advanced View of Projects Raspberry Pi List - Raspberry PI Projects
186 pages
Data Security Perspectives Quiz Answers NSE 1 Information Security Awareness Fortinet
100% (1)
Data Security Perspectives Quiz Answers NSE 1 Information Security Awareness Fortinet
3 pages

Practical 5

Uploaded by

Practical 5

Uploaded by

Installing Hadoop in Pseudo Distributed Mode

Step 1 − Setting Up Hadoop

Step 2 − Hadoop Configuration

Let us assume the following data.

dfs.replication (data replication value) = 1

(In the below given path /hadoop/ is the user name.

(hadoopinfra/hdfs/datanode is the directory created by hdfs file

You might also like