Practical: 1
Aim: Configure Hadoop cluster in pseudo distributed mode and run basic Hadoop
commands.
Installation of Hadoop 3.3.2 on Ubuntu 18.04 LTS
1. Installing Java
$ sudo apt update
$ sudo apt install openjdk-8-jdk openjdk-8-jre
$ java -version
Set JAVA_HOME in .bashrc
export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
export PATH=$PATH:/usr/lib/jvm/java-8-openjdk-amd64/bin
Apply the .bashrc changes to the current Ubuntu session either by rebooting the system or by running:
$ source ~/.bashrc
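To confirm the variables took effect, a quick check in the shell (using the paths configured above) should echo the JDK location back:

```shell
# Re-export the variables exactly as written in .bashrc,
# then confirm the shell sees them.
export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
export PATH=$PATH:/usr/lib/jvm/java-8-openjdk-amd64/bin
echo "$JAVA_HOME"
```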
2. Adding a dedicated Hadoop user
$ sudo addgroup hadoop
$ sudo adduser --ingroup hadoop hduser
3. Adding hduser to the sudoers file
$ sudo visudo
Add the following line (visudo opens the sudoers file safely as /etc/sudoers.tmp):
hduser ALL=(ALL:ALL) ALL
4. Now switch to hduser
$ su - hduser
5. Setting up SSH
Hadoop services such as the ResourceManager and NodeManager use SSH to exchange
node status between slave and master nodes (and master to master).
$ sudo apt-get install openssh-server openssh-client
After installing ssh, generate ssh keys and copy them in ~/.ssh/authorized_keys.
Generate Keys for secure communication:
$ ssh-keygen -t rsa -P ""
$ cat $HOME/.ssh/id_rsa.pub >> $HOME/.ssh/authorized_keys
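Passwordless login to localhost is worth verifying before continuing; a quick check, assuming the ssh service is already running:

```shell
# sshd rejects keys if authorized_keys is group- or world-writable.
chmod 0600 "$HOME/.ssh/authorized_keys"
# This should print the hostname without prompting for a password.
ssh localhost hostname
```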
6. Download the Hadoop 3.3.2 tar file and extract it into the /usr/local/hadoop folder.
$ sudo tar xvzf hadoop-3.3.2.tar.gz
$ sudo mv hadoop-3.3.2 /usr/local/hadoop
7. Change ownership to hduser and the hadoop group, and give them full permissions.
$ sudo chown -R hduser:hadoop /usr/local/hadoop
$ sudo chmod -R 777 /usr/local/hadoop
8. Hadoop Setup
This setup, also called pseudo-distributed mode, runs each Hadoop daemon in a
separate Java process on a single machine. The Hadoop environment is configured
by editing a set of configuration files:
bashrc hadoop-env.sh core-site.xml hdfs-site.xml mapred-site.xml yarn-site.xml
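Note that in each of the *-site.xml files, the `<property>` blocks shown in the steps below must be placed inside the file's single `<configuration>` element; as a sketch of the overall file shape (names and values here are placeholders):

```xml
<?xml version="1.0"?>
<configuration>
  <!-- property blocks from the steps below go here -->
  <property>
    <name>some.property.name</name>
    <value>some-value</value>
  </property>
</configuration>
```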
8.1 bashrc
$ sudo gedit ~/.bashrc
Add following lines at the end:
#Hadoop Related Options
export HADOOP_HOME=/usr/local/hadoop
export HADOOP_INSTALL=$HADOOP_HOME
export HADOOP_MAPRED_HOME=$HADOOP_HOME
export HADOOP_COMMON_HOME=$HADOOP_HOME
export HADOOP_HDFS_HOME=$HADOOP_HOME
export YARN_HOME=$HADOOP_HOME
export HADOOP_COMMON_LIB_NATIVE_DIR=$HADOOP_HOME/lib/native
export PATH=$PATH:$HADOOP_HOME/sbin:$HADOOP_HOME/bin
export HADOOP_OPTS="-Djava.library.path=$HADOOP_HOME/lib/native"
$ source ~/.bashrc
8.2 hadoop-env.sh
Let's change the working directory to the Hadoop configuration location:
$ cd /usr/local/hadoop/etc/hadoop/
$ sudo gedit hadoop-env.sh
Add this line:
export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64
8.3 yarn-site.xml
$ sudo gedit yarn-site.xml
Add following lines:
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.nodemanager.aux-services.mapreduce.shuffle.class</name>
<value>org.apache.hadoop.mapred.ShuffleHandler</value>
</property>
8.4 hdfs-site.xml
$ sudo gedit hdfs-site.xml
Add following lines:
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
<property>
<name>dfs.namenode.name.dir</name>
<value>file:/usr/local/hadoop/yarn_data/hdfs/namenode</value>
</property>
<property>
<name>dfs.datanode.data.dir</name>
<value>file:/usr/local/hadoop/yarn_data/hdfs/datanode</value>
</property>
8.5 core-site.xml
$ sudo gedit core-site.xml
Add following lines:
<property>
<name>hadoop.tmp.dir</name>
<value>/home/hduser/hadoop/tmp</value>
</property>
<property>
<name>fs.defaultFS</name>
<value>hdfs://localhost:9000</value>
</property>
(fs.defaultFS replaces the deprecated fs.default.name in Hadoop 2.x and later.)
8.6 mapred-site.xml
$ sudo gedit mapred-site.xml
Add following lines:
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
<property>
<name>mapreduce.jobhistory.address</name>
<value>localhost:10020</value>
</property>
(mapreduce.framework.name replaces the deprecated mapred.framework.name.)
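With Hadoop 3.x, MapReduce jobs submitted to YARN also need the MapReduce classes on the container classpath; the official single-node setup guide adds a property along these lines to mapred-site.xml (adjust the paths if your layout differs):

```xml
<property>
  <name>mapreduce.application.classpath</name>
  <value>$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/*:$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/lib/*</value>
</property>
```

The same guide whitelists environment variables for containers via yarn.nodemanager.env-whitelist in yarn-site.xml; without these, example jobs may fail to find the MapReduce classes.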
9. Create the temp directory and the directories for the NameNode and DataNode
$ sudo mkdir -p /home/hduser/hadoop/tmp
$ sudo chown -R hduser:hadoop /home/hduser/hadoop/tmp
$ sudo chmod -R 777 /home/hduser/hadoop/tmp
$ sudo mkdir -p /usr/local/hadoop/yarn_data/hdfs/namenode
$ sudo mkdir -p /usr/local/hadoop/yarn_data/hdfs/datanode
$ sudo chmod -R 777 /usr/local/hadoop/yarn_data/hdfs/namenode
$ sudo chmod -R 777 /usr/local/hadoop/yarn_data/hdfs/datanode
$ sudo chown -R hduser:hadoop /usr/local/hadoop/yarn_data/hdfs/namenode
$ sudo chown -R hduser:hadoop /usr/local/hadoop/yarn_data/hdfs/datanode
10. Format the Hadoop NameNode for a fresh start
$ hdfs namenode -format
Start all Hadoop services by executing the commands one by one:
$ start-dfs.sh
$ start-yarn.sh
or
$ start-all.sh
Type this simple command to check if all the daemons are active and running as Java
processes:
$ jps
Following output is expected if all went well:
6632 NameNode
6766 DataNode
6960 SecondaryNameNode
7244 ResourceManager
7380 NodeManager
11066 Jps
Access Hadoop UI from Browser
The default port number 9870 gives you access to the Hadoop NameNode UI:
http://localhost:9870
The NameNode user interface provides a comprehensive overview of the entire cluster.
The default port 9864 is used to access individual DataNodes directly from your
browser:
http://localhost:9864
The YARN Resource Manager is accessible on port 8088: http://localhost:8088
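With the daemons up, the "basic Hadoop commands" from the aim can be exercised against HDFS. A short sample session (run as hduser on the running cluster; sample.txt and the directory names are illustrative):

```shell
# Create a home directory and a test directory in HDFS.
hdfs dfs -mkdir -p /user/hduser/input
# Copy a local file into HDFS.
echo "hello hadoop" > sample.txt
hdfs dfs -put sample.txt /user/hduser/input/
# List the directory and read the file back from HDFS.
hdfs dfs -ls /user/hduser/input
hdfs dfs -cat /user/hduser/input/sample.txt
# Remove the test directory when done.
hdfs dfs -rm -r /user/hduser/input
```

hdfs dfsadmin -report is also useful here: it summarizes live DataNodes and capacity, confirming the pseudo-distributed cluster is healthy.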