Unit-5 DBMS

Unit 5 covers various aspects of storage and data security, including types of storage, file organization, RAID levels, indexing methods like B+ Trees and hashing, and database security measures. It also discusses data mining techniques, distributed databases, and Geographic Information Systems (GIS). The document emphasizes the importance of query optimization, data accessibility, and security in database management systems.

Uploaded by

gdrivee515

Unit 5:

Storage & Data Security

• Storage structure, file organization, recovery and
atomicity, performance measures of disks, RAID
levels, indices, B+ tree, hashing, bitmap indices,
query optimization, database security, data
mining models and techniques, distributed
databases, GIS.
Storage System
• Several types of storage are available for
holding data. They differ from one another in
speed and accessibility. The following types of
storage devices are used:
Types of Data Storage

• Primary Storage- Main Memory, Cache

• Secondary Storage- Flash Memory, Magnetic
Disk Storage
• Tertiary Storage- Optical Storage, Tape Storage
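System Issues: How to Build a DBMS
[Figure: layers of a DBMS, from top to bottom: Query Optimization and
Execution; Relational Operators; Files and Access Methods; Buffer
Management; Disk Space Management; DB. The figure marks some layers as
"Discussed so far" and others as "New topic".]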
Data on External Storage
• Disks: Can retrieve random page at fixed cost
– But reading several consecutive pages is much cheaper than reading
them in random order
• Tapes: Can read pages only in sequence
– Cheaper than disks; used for archival storage
• File organization: Method of arranging a file of records on
external storage.
– Record id (rid) is sufficient to physically locate record
– Indexes are data structures that allow us to find the record ids of
records with given values in index search key fields
• Architecture: Buffer manager stages pages from external
storage to main memory buffer pool. File and index layers
make calls to the buffer manager.
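A minimal sketch of the buffer-manager idea described above, assuming a least-recently-used (LRU) replacement policy. The class name `BufferPool`, the `get_page` method, and the dictionary standing in for the disk are illustrative assumptions, not part of the slides; dirty-page write-back and pinning are left out.

```python
from collections import OrderedDict

class BufferPool:
    """Toy buffer manager: stages pages from 'disk' into a fixed number of frames."""

    def __init__(self, disk, capacity):
        self.disk = disk              # dict page_id -> contents (stands in for external storage)
        self.capacity = capacity      # number of frames in the buffer pool
        self.frames = OrderedDict()   # page_id -> contents, least recently used first
        self.disk_reads = 0           # how many times we had to go to disk

    def get_page(self, page_id):
        if page_id in self.frames:
            # Hit: the page is already in memory; mark it most recently used.
            self.frames.move_to_end(page_id)
            return self.frames[page_id]
        if len(self.frames) >= self.capacity:
            # Pool is full: evict the least recently used page (assumed policy).
            self.frames.popitem(last=False)
        # Miss: read the page from disk into a frame.
        self.disk_reads += 1
        self.frames[page_id] = self.disk[page_id]
        return self.frames[page_id]

disk = {pid: f"page-{pid}" for pid in range(5)}
pool = BufferPool(disk, capacity=2)
for pid in [0, 1, 0, 2, 0, 1]:
    pool.get_page(pid)
print(pool.disk_reads)  # 4 of the 6 requests needed a disk read
```

The file and index layers would call `get_page` rather than touching the disk directly, which is why repeated requests for a hot page cost no extra I/O.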
Alternative File Organizations
Many alternatives exist, each ideal for some
situations, and not so good in others:
– Heap (random order) files: Suitable when typical
access is a file scan retrieving all records.
– Sorted files: Best if records must be retrieved in
some order, or only a 'range' of records is needed.
– Indexes: Data structures to organize records via
trees or hashing.
• Like sorted files, they speed up searches for a subset of
records, based on values in certain (“search key”) fields
• Updates are much faster than in sorted files.
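The trade-off between heap and sorted files can be sketched as follows, using in-memory lists of keys as a stand-in for files of records. The function names and sample keys are assumptions made for illustration only.

```python
from bisect import bisect_left, bisect_right

def heap_range_scan(records, lo, hi):
    """Heap file: records are in no particular order, so every one is examined."""
    return sorted(r for r in records if lo <= r <= hi)

def sorted_range_scan(records, lo, hi):
    """Sorted file: binary search for the first match, then read sequentially."""
    start = bisect_left(records, lo)
    end = bisect_right(records, hi)
    return records[start:end]

heap_file = [42, 7, 19, 88, 3, 61, 25]
sorted_file = sorted(heap_file)

print(heap_range_scan(heap_file, 10, 50))      # [19, 25, 42]
print(sorted_range_scan(sorted_file, 10, 50))  # [19, 25, 42]
```

Both calls return the same records; the difference is that the heap scan inspects all entries while the sorted scan touches only the matching range after a logarithmic search.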
RAID
• Redundant Array of Independent Disks
(RAID) combines multiple small,
inexpensive disk drives into an array that
yields better performance than a Single
Large Expensive Drive (SLED). RAID is
also expanded as Redundant Array of
Inexpensive Disks.
Levels of RAID
1. RAID-0 (Striping)
2. RAID-1 (Mirroring)
3. RAID-2 (Bit-Level Striping with Hamming-Code Error Correction)
4. RAID-3 (Byte-Level Striping with Dedicated Parity)
5. RAID-4 (Block-Level Striping with Dedicated Parity)
6. RAID-5 (Block-Level Striping with Distributed Parity)
7. RAID-6 (Block-Level Striping with Dual Distributed Parity)
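The parity used by the parity-based levels can be illustrated with a short sketch: the parity block is the byte-wise XOR of the data blocks, so any single lost block can be rebuilt from the survivors. The helper name `xor_blocks` and the sample blocks are assumptions for illustration; real arrays operate on disk stripes, not byte strings.

```python
from functools import reduce

def xor_blocks(blocks):
    """Byte-wise XOR of equally sized blocks."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))

# Three data blocks striped across three disks, plus one parity block.
data_blocks = [b"DBMS", b"RAID", b"DISK"]
parity = xor_blocks(data_blocks)

# Simulate losing the second disk: XOR the surviving blocks with the parity.
survivors = [data_blocks[0], data_blocks[2], parity]
rebuilt = xor_blocks(survivors)
print(rebuilt)  # b'RAID'
```

This works because XOR is its own inverse: XOR-ing the parity with every surviving block cancels them out and leaves the missing one.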
Indexes

• An index on a file speeds up selections on the
search key fields for the index.
– Any subset of the fields of a relation can be the
search key for an index on the relation (e.g., age or
colour).
– Search key is not the same as key (minimal set of
fields that uniquely identify a record in a relation).
• An index contains a collection of data entries,
and supports efficient retrieval of all data entries
k* with a given key value k.
– A data entry k* can be stored in one of three ways:
Alternative (1) the data record itself, Alternative (2) a
<key, rid> pair, or Alternative (3) a <key, rid-list> pair.
• Example of Index: Essentials of Game Theory
Example of Alternative 2
6 data entries, sorted by colour:

Location   Colour
1          Red
2          Red
3          Red
4          Blue
5          Blue
6          Blue
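A small sketch of how such <key, rid> data entries might be built and searched, assuming the "location" plays the role of the record id and the file is a plain dictionary. The function names are hypothetical.

```python
from bisect import bisect_left

# Data file: location (used here as the rid) -> colour of the record stored there.
data_file = {1: "red", 2: "red", 3: "red", 4: "blue", 5: "blue", 6: "blue"}

def build_data_entries(records):
    """Build <key, rid> data entries, sorted by search key."""
    return sorted((colour, rid) for rid, colour in records.items())

def lookup(entries, key):
    """Return the rids of all data entries with the given key value."""
    i = bisect_left(entries, (key,))   # first entry whose key is >= key
    rids = []
    while i < len(entries) and entries[i][0] == key:
        rids.append(entries[i][1])
        i += 1
    return rids

entries = build_data_entries(data_file)
print(lookup(entries, "blue"))   # [4, 5, 6]
print(lookup(entries, "green"))  # []
```

The index holds only keys and rids; the records themselves stay in the data file and are fetched by rid after the lookup.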
Index Classification
• Primary vs. secondary: If search key contains primary key,
then called primary index.
– Unique index: Search key uniquely identifies record.
• Clustered vs. unclustered: If order of data records is the same
as, or 'close to', order of data entries, then called clustered
index.
– Alternative 1 implies clustered; in practice, clustered also implies
Alternative 1 (since sorted files are rare).
– A file can be clustered on at most one search key.
– Cost of retrieving data records through index varies greatly based on
whether index is clustered or not!
Clustered vs. Unclustered Index
• Suppose that Alternative (2) is used for data entries, and that the
data records are stored in a Heap file.
– To build clustered index, first sort the Heap file (with some free space on
each page for future inserts).
– Overflow pages may be needed for inserts. (Thus, order of data records is
'close to', but not identical to, the sort order.)

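[Figure: clustered vs. unclustered index. In both cases, index entries
direct the search to the data entries (index file), which point to the data
records (data file). In the clustered case the data records are in nearly
the same order as the data entries; in the unclustered case they are not.]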
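Why the cost differs so much can be seen by counting the distinct data pages a range selection touches. The sketch below assumes 1000 records, 10 per page, and models the unclustered layout with an arbitrary scattering function; all names and numbers are illustrative assumptions, not measurements.

```python
RECORDS_PER_PAGE = 10
NUM_RECORDS = 1000

def clustered_position(key):
    """Clustered: records are stored in search-key order."""
    return key

def unclustered_position(key):
    """Unclustered: records are scattered (37 is coprime with 1000, so this is a permutation)."""
    return (key * 37) % NUM_RECORDS

def pages_touched(position_of, lo, hi):
    """Number of distinct data pages fetched for keys in [lo, hi]."""
    return len({position_of(k) // RECORDS_PER_PAGE for k in range(lo, hi + 1)})

clustered_pages = pages_touched(clustered_position, 100, 149)
unclustered_pages = pages_touched(unclustered_position, 100, 149)
print(clustered_pages, unclustered_pages)
```

With the clustered layout the 50 matching records sit on 5 adjacent pages; with the scattered layout the same 50 records are spread over many more pages, each of which may need its own I/O.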

Hash-Based Indexes
• Good for equality selections.
• Index is a collection of buckets. Bucket = primary
page plus zero or more overflow pages.
• Hashing function h: h(r) = bucket in which record r
belongs. h looks at the search key fields of r.
• If Alternative (1) is used, the buckets contain
the data records; otherwise, they contain
<key, rid> or <key, rid-list> pairs.
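A minimal sketch of a hash-based index that stores <key, rid> pairs, with each bucket made of a primary page plus overflow pages. The class name, the page capacity of 2, and the use of (page, slot) tuples as rids are assumptions for illustration.

```python
class HashIndex:
    """Toy static hash index holding <key, rid> data entries."""

    def __init__(self, num_buckets=4, page_capacity=2):
        self.num_buckets = num_buckets
        self.page_capacity = page_capacity
        # Each bucket is a list of pages: page 0 is the primary page,
        # any further pages are overflow pages.
        self.buckets = [[[]] for _ in range(num_buckets)]

    def _h(self, key):
        """Hash function: maps a search key value to a bucket number."""
        return hash(key) % self.num_buckets

    def insert(self, key, rid):
        pages = self.buckets[self._h(key)]
        if len(pages[-1]) >= self.page_capacity:
            pages.append([])           # last page is full: chain an overflow page
        pages[-1].append((key, rid))

    def lookup(self, key):
        """Equality selection: scan only the pages of one bucket."""
        pages = self.buckets[self._h(key)]
        return [rid for page in pages for k, rid in page if k == key]

idx = HashIndex()
for age, rid in [(20, (0, 0)), (24, (0, 1)), (28, (1, 0)), (20, (1, 1)), (21, (2, 0))]:
    idx.insert(age, rid)

print(idx.lookup(20))        # [(0, 0), (1, 1)]
print(len(idx.buckets[0]))   # 2 pages: the primary page plus one overflow page
```

An equality lookup touches a single bucket, which is why hashing suits equality selections; a range such as "age between 20 and 25" would have to probe every bucket.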
B+ Tree Indexes

[Figure: B+ tree with non-leaf pages above a bottom level of leaf pages.]

• Leaf pages contain data entries, and are chained (prev & next)
• Non-leaf pages contain index entries; they direct searches:

Index entry layout: P0 | K1 | P1 | K2 | P2 | ... | Km | Pm
Example B+ Tree

                         Root
                          17
      Entries < 17                   Entries >= 17
         5   13                         27   30

2* 3* | 5* 7* 8* | 14* 16* | 22* 24* | 27* 29* | 33* 34* 38* 39*

• Find 28*? 29*? All entries > 17* and < 30*

• Insert/delete: Find data entry in leaf, then
change it. Need to adjust parent sometimes.
– And change sometimes bubbles up the tree
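The searches on the example tree can be sketched as below. The sketch assumes that a key equal to a separator follows the pointer to its right (as the placement of 5* and 27* in the leaves suggests), and it covers lookup and range scan only, not insert or delete. The class and function names are illustrative.

```python
from bisect import bisect_right

class Leaf:
    def __init__(self, entries):
        self.entries = entries   # data entries, kept sorted
        self.next = None         # link to the next leaf page

class Node:
    def __init__(self, keys, children):
        self.keys = keys          # separator keys K1..Km
        self.children = children  # child pointers P0..Pm

# Build the example tree by hand.
leaves = [Leaf(e) for e in ([2, 3], [5, 7, 8], [14, 16],
                            [22, 24], [27, 29], [33, 34, 38, 39])]
for left_leaf, right_leaf in zip(leaves, leaves[1:]):
    left_leaf.next = right_leaf
root = Node([17], [Node([5, 13], leaves[:3]), Node([27, 30], leaves[3:])])

def find_leaf(node, key):
    """Follow index entries from the root down to the leaf that could hold key."""
    while isinstance(node, Node):
        node = node.children[bisect_right(node.keys, key)]
    return node

def search(root, key):
    return key in find_leaf(root, key).entries

def range_scan(root, lo, hi):
    """All data entries k with lo < k < hi, using the leaf chain."""
    leaf, result = find_leaf(root, lo), []
    while leaf is not None:
        for k in leaf.entries:
            if k >= hi:
                return result
            if k > lo:
                result.append(k)
        leaf = leaf.next
    return result

print(search(root, 28), search(root, 29))  # False True
print(range_scan(root, 17, 30))            # [22, 24, 27, 29]
```

The range scan descends the tree once and then walks the chained leaves, which is what makes B+ trees suitable for range selections as well as equality searches.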
Comparing File Organizations
• Heap files (random order; insert at EOF)
• Sorted files, sorted on <age, sal>
• Clustered B+ tree file, Alternative (1), search key
<age, sal>
• Heap file with unclustered B+ tree index on search
key <age, sal>
• Heap file with unclustered hash index on search key
<age, sal>
Hashing
• In this technique, each record is stored in the
data block whose address is computed by
applying a hash function to the record's search
key. The location where the records are stored
is known as a data bucket or data block.
Database Security
• Database security refers to the array of
controls, tools, and procedures designed to
safeguard confidentiality, integrity, and
availability. This unit concentrates on
confidentiality because it is the component
most at risk in data security breaches.
Security for databases must cover and
safeguard the following aspects:

• The data held in the database.

• The database management system (DBMS).
• Any applications associated with it.
• The physical or virtual database server, and
the hardware it runs on.
• The computing or network infrastructure
used to connect to the database.
Data Mining
• Data mining refers to extracting or mining
knowledge from large amounts of data. In
other words, data mining is the science, art,
and technology of exploring large and
complex bodies of data in order to discover
useful patterns.
Data Mining Techniques
Distributed Database System
• A distributed database is a database that is
not limited to one system; it is spread over
different sites, i.e., over multiple computers
or a network of computers.
• A distributed database system is located on
various sites that don't share physical
components.
Types:

1. Homogeneous Database:
In a homogeneous distributed database, all sites store the database identically. The operating
system, database management system, and data structures used are the same at all sites. Hence,
such databases are easy to manage.
2. Heterogeneous Database:
In a heterogeneous distributed database, different sites can use different schemas and software,
which can lead to problems in query processing and transactions. Also, a particular site might be
completely unaware of the other sites. Different computers may use different operating systems
and different database applications. They may even use different data models for the database.
Hence, translations are required for different sites to communicate.

GIS: Geographic Information System

• GIS stands for Geographic Information System.

• It is a system designed to collect, analyze, manipulate,
manage, and display all types of geographical and
spatial data and information.
• It allows you to perform spatial analysis and manage
large data and display the information in maps or
graphical form for analysis and presentation.
• These benefits make GIS a valuable tool to visualize
spatial data or to build decision support systems for an
organization.
Thank you!!
