0% found this document useful (0 votes)

10 views9 pages

Bigdatahbase

HBase is an open-source, column-oriented database built on Hadoop that manages structured and semi-structured data, offering features like scalability and fault-tolerance. The installation process involves prerequisites such as Java and Hadoop, followed by downloading HBase, setting environment variables, configuring settings, and verifying the installation. Users can interact with HBase through the shell to create tables, insert data, and access the HBase Web UI.

Uploaded by

Beesula Vishnu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views9 pages

Bigdatahbase

Uploaded by

Beesula Vishnu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

3. Process BigData using HBase.

HBase:

HBase is a column-oriented database that’s an open-source implementation of Google’s Big

Table storage architecture.

It can manage structured and semi-structured data and has some built-in features such as
scalability, versioning, compression and garbage collection.

Since its uses write-ahead logging and distributed configuration, it can provide fault-tolerance
and quick recovery from individual server failures.

HBase built on top of Hadoop / HDFS and the data stored in HBase can be manipulated using
Hadoop’s MapReduce capabilities.

HDFS vs. HBase

HDFS is a distributed file system that is well suited for storing large files. It’s designed to
support batch processing of data but doesn’t provide fast individual record lookups. HBase is
built on top of HDFS and is designed to provide access to single rows of data in large tables.

HBase Architecture

The HBase Physical Architecture consists of servers in a Master-Slave relationship. Typically, the
HBase cluster has one Master node, called HMaster and multiple Region Servers called
HRegionServer. Each Region Server contains multiple Regions – HRegions.

Just like in a Relational Database, data in HBase is stored in Tables and these Tables are stored
in Regions. When a Table becomes too big, the Table is partitioned into multiple Regions. These
Regions are assigned to Region Servers across the cluster. Each Region Server hosts roughly the
same number of Regions. The HMaster in the HBase is responsible for Performing
Administration Managing and Monitoring the Cluster Assigning Regions to the Region Servers
Controlling the Load Balancing and Failover.

23
HBase Installation steps:
Step 1: Prerequisites

i.Java 8 or higher (OpenJDK or Oracle)

ii.SSH (if you plan to run in pseudo

pseudo-distributed mode)

iii.hadoop installed.

Step 2: Download HBase

Get it from the official Apache website:

24
https://hbase.apache.org/downloads.html

vaagdevi:~/hdoop$ wget https://downloads.apache.org/hbase/2.4.17/hbase-2.6.2-bin.tar.gz

vaagdevi:~/hdoop$ tar -xzf hbase-2.6.2-bin.tar.gz

vaagdevi:~/hdoop$ mv hbase-2.6.2 hbase

Step 3: Set Environment Variables

vaagdevi:~/hdoop$ nano ~/.bashrc

export HBASE_HOME=~/hbase

25
export PATH=$PATH:$HBASE_HOME/bin

*Once you add the variables, save and exit the .bashrc file. ctrl+s & ctrl+x.

*Run the command below to apply the changes to the current running environment:

vaagdevi:~$ source ~/.bashrc

Step 4: Setting java path for hbase

Now copy java home path by following command

vaagdevi:~/hdoop$ echo $JAVA_HOME

/usr/lib/jvm/java-8-openjdk-amd64

a. Use the previously created $HBASE_HOME variable to access the hbase-env.sh file:

vaagdevi:~/hdoop$ nano $HBASE_HOME/conf/hbase-env.sh

b. Uncomment the $JAVA_HOME variable and replace the following.

export JAVA_HOME=/usr/lib/jvm/java-8-openjdk-amd64

Step 5: Configure HBase (Standalone Mode):

26
Navigate to the HBase configuration directory and edit the configuration files.

vaagdevi:~/hdoop$ cd /hbase/conf

a. Edit hbase-site.xml

Configure the default file system (HBASE):

vaagdevi:~/hdoop/hbase/conf $ nano hbase-site.xml

Add the following configuration inside <configuration>:

//Here you have to set the path where you want HBase to store its files.

<name>hbase.rootdir</name>

<value>hdfs://localhost:9000/hbase</value>

</property>

//Here you have to set the path where you want HBase to store its built in zookeeper files.

<name>hbase.zookeeper.quorum</name>

<value>localhost</value>

</property>

<name>hbase.zookeeper.property.dataDir</name>

<value>/home/vaagdevi/hdoop/hbase/zookeeper</value>

</property>

27

<name>hbase.wal.provider</name>

<value>filesystem</value>

</property>

<name>hbase.tmp.dir</name>

<value>/home/vaagdevi/hdoop/hbase/HFiles</value>

</property>

*save and exit the hbase-site.xml

site.xml file. ctrl+s & ctrl+x.

(now create 2 directories with "HFiles" & "zookeeper" to store logs in hbase folder)

Step 7: Verify the Installation

28
vaagdevi:~/hdoop/hbase/bin$ start-hbase.sh

*check logs created in HFiles & zookeeper diectories.

Check if the Hadoop&Hbase daemons are running:

vaagdevi:~/hdoop/hbase/bin $ jps

You should see the following processes running:

NameNode

DataNode

ResourceManager

NodeManager

HMaster

Step 8: HBase Shell

vaagdevi:~/hdoop/hbase $ ./bin/hbase shell

**to check status of servers

shell> status

29
**to create a table (create <table name>,<column family> )

shell> create 'emp', 'personal data', 'professional data'

**to verify

shell> list

**to insert data into table(put <table name>,row1,<colfamily:colname>,<value>)

put 'emp','1','personal data:name','navi'

put 'emp','1','personal data:city','hyderabad'

put 'emp','1','professional data:designation','manager'

put 'emp','1','professional data:salary','50000'

**to show table data

shell> scan 'emp'

30
Step 8: Access the HBase Web UI

http://localhost:16010

bdcc-2 5
No ratings yet
bdcc-2 5
9 pages
Hadoop/Hbase Installation: Install Java
No ratings yet
Hadoop/Hbase Installation: Install Java
11 pages
1.2. Quick Start - Standalone HBase
No ratings yet
1.2. Quick Start - Standalone HBase
7 pages
rc159-HBase 7 PDF
No ratings yet
rc159-HBase 7 PDF
7 pages
Install HBase On Ubuntu 20.04
No ratings yet
Install HBase On Ubuntu 20.04
4 pages
Hbase Apache Org Book HTML
No ratings yet
Hbase Apache Org Book HTML
482 pages
Install and Configure HBase
No ratings yet
Install and Configure HBase
8 pages
Unit 5 Lecture No-3 (Hbase)
No ratings yet
Unit 5 Lecture No-3 (Hbase)
35 pages
HBase 1.1.2 Setup for Hadoop Users
No ratings yet
HBase 1.1.2 Setup for Hadoop Users
7 pages
Big Data Analytics Lab File
No ratings yet
Big Data Analytics Lab File
15 pages
Bda Unit 5
No ratings yet
Bda Unit 5
16 pages
Unit 5 Lecture No-3 (Hbase)
No ratings yet
Unit 5 Lecture No-3 (Hbase)
35 pages
Hbase Installationn
No ratings yet
Hbase Installationn
12 pages
Hbase - Quick Guide Hbase - Overview
No ratings yet
Hbase - Quick Guide Hbase - Overview
53 pages
Apache HBase Installation
No ratings yet
Apache HBase Installation
1 page
Hbase Tutorial
100% (1)
Hbase Tutorial
107 pages
Unit V Hadoop Related Tools
No ratings yet
Unit V Hadoop Related Tools
54 pages
HBase
No ratings yet
HBase
31 pages
HBase Installation Guide Linux
No ratings yet
HBase Installation Guide Linux
1 page
Big Data UNIT 5 Own
No ratings yet
Big Data UNIT 5 Own
18 pages
Unit-5 Notes
No ratings yet
Unit-5 Notes
61 pages
HBase
No ratings yet
HBase
27 pages
Apache HBase Tutorial & Setup Guide
No ratings yet
Apache HBase Tutorial & Setup Guide
19 pages
Bda - Unit 5
No ratings yet
Bda - Unit 5
30 pages
HBASE
No ratings yet
HBASE
11 pages
Hbase Installation Steps
No ratings yet
Hbase Installation Steps
13 pages
HBase - Tutorial
No ratings yet
HBase - Tutorial
14 pages
HBASE
No ratings yet
HBASE
18 pages
Hbase Java Client Api: For Live Hadoop Training, Please See Courses
No ratings yet
Hbase Java Client Api: For Live Hadoop Training, Please See Courses
21 pages
HBase
No ratings yet
HBase
4 pages
BDA Unit 5 HIVE HBASE
No ratings yet
BDA Unit 5 HIVE HBASE
33 pages
MapReduce Merged
No ratings yet
MapReduce Merged
18 pages
10 HBase
No ratings yet
10 HBase
13 pages
1.mrplab Intro
No ratings yet
1.mrplab Intro
18 pages
Unit 5 Hbase
No ratings yet
Unit 5 Hbase
15 pages
Ba Iift 17-18
No ratings yet
Ba Iift 17-18
40 pages
Hbase Tutorial
No ratings yet
Hbase Tutorial
22 pages
BDA1
No ratings yet
BDA1
42 pages
Unit 1 P2 HBase
No ratings yet
Unit 1 P2 HBase
22 pages
BDALAB Experiment08
No ratings yet
BDALAB Experiment08
15 pages
HBase: Data Management & Architecture
No ratings yet
HBase: Data Management & Architecture
36 pages
BDA Module 2-2023
No ratings yet
BDA Module 2-2023
30 pages
KCC Institute of Technology and Management: Big Data and Analytics Lab File BCDS651
No ratings yet
KCC Institute of Technology and Management: Big Data and Analytics Lab File BCDS651
30 pages
HBase & Hive Architecture Guide
No ratings yet
HBase & Hive Architecture Guide
10 pages
Bda Unit-4 Notes
No ratings yet
Bda Unit-4 Notes
15 pages
HBase Architecture and Its Important Components
No ratings yet
HBase Architecture and Its Important Components
11 pages
Hadoop Ecosystem PDF
No ratings yet
Hadoop Ecosystem PDF
55 pages
Hadoop Ecosystem PDF
No ratings yet
Hadoop Ecosystem PDF
55 pages
Unit 5 Big Data
No ratings yet
Unit 5 Big Data
34 pages
Hadoop Ecosystem Overview
No ratings yet
Hadoop Ecosystem Overview
55 pages
Hbase
No ratings yet
Hbase
3 pages
HBase Installation Guide for Hadoop
No ratings yet
HBase Installation Guide for Hadoop
3 pages
(Ebook) HBase: The Definitive Guide by Lars George ISBN 9781449396107, 1449396100 Instant Access 2025
No ratings yet
(Ebook) HBase: The Definitive Guide by Lars George ISBN 9781449396107, 1449396100 Instant Access 2025
103 pages
Bda 2
No ratings yet
Bda 2
25 pages
Hbase
No ratings yet
Hbase
6 pages
HBase Guide for Developers
No ratings yet
HBase Guide for Developers
33 pages
Isochretism and Style
No ratings yet
Isochretism and Style
12 pages
10 Reflection Lesson Plan 10 English
100% (2)
10 Reflection Lesson Plan 10 English
2 pages
Market Equilibrium
No ratings yet
Market Equilibrium
5 pages
Bacoor Cavite Eis Island C
No ratings yet
Bacoor Cavite Eis Island C
491 pages
Predicting Employees Performance Using Data Mining Techniques
No ratings yet
Predicting Employees Performance Using Data Mining Techniques
12 pages
Ficha Contactores HGC en
No ratings yet
Ficha Contactores HGC en
14 pages
Engineering Economics and Financial Analysis
No ratings yet
Engineering Economics and Financial Analysis
2 pages
Pharmaceutical Technology
No ratings yet
Pharmaceutical Technology
15 pages
List of Pharmaceuticals in Lahore
78% (18)
List of Pharmaceuticals in Lahore
3 pages
Veterinary Admission Guidelines
No ratings yet
Veterinary Admission Guidelines
47 pages
Titan: India's Leading Lifestyle Brand
No ratings yet
Titan: India's Leading Lifestyle Brand
4 pages
May Jun 2023
No ratings yet
May Jun 2023
2 pages
PDS - Primer - Multi-Gard-12-1088 PZ - en
No ratings yet
PDS - Primer - Multi-Gard-12-1088 PZ - en
3 pages
2016 - An Overview of Microgrid Protection Methods and The Factors Involved
No ratings yet
2016 - An Overview of Microgrid Protection Methods and The Factors Involved
13 pages
Necromancer List
No ratings yet
Necromancer List
5 pages
3D Printed Wind Turbines Part 1 Design Consid 2015 Sustainable Energy Techn
No ratings yet
3D Printed Wind Turbines Part 1 Design Consid 2015 Sustainable Energy Techn
8 pages
EASe Therapy for Sensory Disorders
No ratings yet
EASe Therapy for Sensory Disorders
9 pages
Dubinsky I Cryptography For Payment Professionals
No ratings yet
Dubinsky I Cryptography For Payment Professionals
204 pages
Endocrinología
No ratings yet
Endocrinología
19 pages
Adcote76P1 38TDS PDF
No ratings yet
Adcote76P1 38TDS PDF
3 pages
Major Physical Features in Our County
100% (1)
Major Physical Features in Our County
6 pages
Successful Orchard Growers in The Community 2
100% (2)
Successful Orchard Growers in The Community 2
4 pages
Geology Merit Badge Pamphlet 35904
No ratings yet
Geology Merit Badge Pamphlet 35904
100 pages
813 Magnetic Core Dimensioning Limits in Hydro Generators
No ratings yet
813 Magnetic Core Dimensioning Limits in Hydro Generators
77 pages
Key 1
No ratings yet
Key 1
3 pages
Whats New
No ratings yet
Whats New
26 pages
Chemistry Equation Basics
No ratings yet
Chemistry Equation Basics
23 pages
Practical Research 1 Module 1 3
No ratings yet
Practical Research 1 Module 1 3
75 pages
DLL - Mapeh-Health 9 - Q1 - W2
No ratings yet
DLL - Mapeh-Health 9 - Q1 - W2
13 pages
The Elements of Philosophy A C - William Wallace
100% (7)
The Elements of Philosophy A C - William Wallace
362 pages

Bigdatahbase

Uploaded by

Bigdatahbase

Uploaded by

3. Process BigData using HBase.

HBase is a column-oriented database that’s an open-source implementation of Google’s Big

HDFS vs. HBase

i.Java 8 or higher (OpenJDK or Oracle)

ii.SSH (if you plan to run in pseudo

Step 2: Download HBase

Get it from the official Apache website:

vaagdevi:~/hdoop$ wget https://downloads.apache.org/hbase/2.4.17/hbase-2.6.2-bin.tar.gz

vaagdevi:~/hdoop$ tar -xzf hbase-2.6.2-bin.tar.gz

vaagdevi:~/hdoop$ mv hbase-2.6.2 hbase

Step 3: Set Environment Variables

vaagdevi:~/hdoop$ nano ~/.bashrc

vaagdevi:~$ source ~/.bashrc

Step 4: Setting java path for hbase

Now copy java home path by following command

vaagdevi:~/hdoop$ echo $JAVA_HOME

vaagdevi:~/hdoop$ nano $HBASE_HOME/conf/hbase-env.sh

b. Uncomment the $JAVA_HOME variable and replace the following.

Step 5: Configure HBase (Standalone Mode):

Configure the default file system (HBASE):

vaagdevi:~/hdoop/hbase/conf $ nano hbase-site.xml

Add the following configuration inside <configuration>:

*save and exit the hbase-site.xml

Step 7: Verify the Installation

*check logs created in HFiles & zookeeper diectories.

Check if the Hadoop&Hbase daemons are running:

You should see the following processes running:

Step 8: HBase Shell

vaagdevi:~/hdoop/hbase $ ./bin/hbase shell

**to check status of servers

shell> create 'emp', 'personal data', 'professional data'

**to insert data into table(put <table name>,row1,<colfamily:colname>,<value>)

put 'emp','1','personal data:name','navi'

put 'emp','1','personal data:city','hyderabad'

put 'emp','1','professional data:designation','manager'

put 'emp','1','professional data:salary','50000'

**to show table data

shell> scan 'emp'

You might also like