Introduction to Information
Storage and Management
Chapter Objectives
• Describe who is creating data and the amount
of data being created
• Describe the value of data to business
• List the solutions available for data storage
• List and explain the core elements of data
center
• Describe the ILM strategy
• Describe storage evolution
Lesson : Information Storage
• Describe the importance of information to
individuals and to businesses
• Define data and information
• Discuss the categories of data
• Describe the storage architectures and their
evolution
Why Information Storage
• “Digital universe – The Information Explosion”
– 21st Century is information era
– Information is being created at ever increasing rate
– Information has become critical for success
• We live in an on-command, on-demand world
– Example: Social networking sites, e-mails, video and
photo sharing website, online shopping, search
engines etc
Nearly a quarter of the world's population – roughly
1.4 billion people – will use the Internet on a regular
basis in 2009.
50 billion photos taken every year
Online Video
4,700,000,000 video streams monthly
England has approximately 4 million surveillance
cameras
1 for approximately every 14 Britons
Storage requirements: Facebook
• 10,000,000,000 photos
• 2-3 Terabytes of photos
are being uploaded to
the site every day
• One petabyte of photo
storage
• Serve over 15 billion
photo images per day
• Photo traffic now peaks
at over 300,000 images
served per second
Information management is a big challenge
Store Protect Optimize Leverage
Infrastructure
Information
Store Protect Optimize Leverage
AVocabulary for Measuring Information
If a Grain of Sand were One Byte of Information . . .
1 Megabyte =
1 million bytes
a tablespoon of sand
1 Gigabyte =
1 billion bytes
patch of sand—
9” square, 1’ deep
1 Terabyte =
1 trillion bytes
a sandbox—
24’ square, 1’ deep
1 Petabyte =
1,000 terabytes
a mile long beach—
100’ wide , 1’ deep
A NewVocabulary for Measuring Information
If a Grain of Sand were One Byte of Information . . .
1 Exabyte =
1 Megabyte = 1,000 petabytes
1 million bytes the same beach—
a tablespoon of sand from Maine to North Carolina
1 Gigabyte = 1 Zetabyte =
1 billion bytes 1,000 exabytes
patch of sand— the same beach—
9” square, 1’ deep along the entire US coast
1 Terabyte = 1 Yottabyte =
1 trillion bytes 1,000 zetabytes
a sandbox— enough info to bury the entire
24’ square, 1’ deep US under 296 feet of sand
1 Petabyte =
1,000 terabytes
a mile long beach—
100’ wide , 1’ deep
What do you Think ?
• What is your contribution to the digital Universe
( how many Mb’s have you generated till date ?? )
a) <100 GB
b) 100 GB - 500 GB
c) 500 GB – 1 TB
d) > 1 TB
What is Data
“Collection of raw facts from which conclusions may be drawn”
• Data is converted into more
convenient form i.e. Digital
Data Video
– Increase in data processing 01010101010
capabilities Photo 10101011010
– Lower cost of digital storage 00010101011
01010101010
– Affordable and faster Book
10101010101
communication technology 01010101010
• Who creates data? Letter Digital Data
– Individuals
– Businesses
Categories of Data
• Data can be categorized as either
structured or unstructured data
• Over 80% of enterprise
information is unstructured E-Mail Attachments PDFs
X-Rays
Check Unstructured (80%)
Manuals Instant Messages
Images
Documents
Forms
Web Pages
Contracts
Rich Media
Invoices
Audio Video
Structured (20%)
Rows and Columns
Define Information
• What do individuals/businesses
do with the data they collect?
– They turn it into “information”
Centralized information
– “Information is the intelligence Network
storage and processing
Network
and knowledge derived from
data” Wired Wireless Wireless Wired
Uploading Accessing
information information
• Businesses analyze raw data in
order to identify meaningful Creators of
Users of
trends
Information
information
– For example:
• Buying habits and patterns of
customers
• Health history of patients Demand for more
Information
Virtuous cycle of information
Value of Information to a Business
• Identifying new business opportunities
– Buying/spending patterns
• Internet stores, retail stores, supermarkets
– Customer satisfaction/service
• Tracking shipments, and deliveries
• Identifying patterns that lead to changes in existing business
– Reduced cost
• Just-in-time inventory, eliminating over-stocking of products, optimizing shipment
and delivery
– New services
• Security alerts for “stolen” credit card purchases
– Targeted marketing campaigns
• Communicate to bank customers with high account balances about a special
savings plan
• Creating a competitive advantage
Storage
• Data created by individuals/businesses must be stored
for further processing
• Type of storage used is based on the type of data and
the rate at which it is created and used
• Examples:
– Individuals: Digital camera, Cell phone, DVD’s, Hard disk
– Businesses: Hard disk, external disk arrays, tape library
• Storage model: An evolution
– Centralized: mainframe computers
– Decentralized: Client –server model
– Centralized: Storage Networking
Storage
Technology
DAS NAS SAN
Storage Type sectors shared files blocks
Data TCP/IP,
IDE/SCSI Fibre Channel
Transmission Ethernet
clients or clients or
Access Mode servers
servers servers
Capacity (bytes) 109 109 - 1012 1012
Complexity Easy Moderate Difficult
Management
High Moderate Low
Cost (per GB)
19
Storage Technology and Architecture
Evolution
Multi Protocol
Router
LAN FC SAN
IP SAN
RAID Array SAN / NAS
JBOD
Internal DAS
Time
Data Center Infrastructure
• Data Centre
Components:
Applications
Database
Operating System/Server
Network
Storage Device
Example of an Order Processing System
Server/ OS
Client
LAN
Storage
network
Application
User
Interface DBMS
Storage Array
Key Requirements for Data Center Elements
Availability
Data Integrity Security
Manageability
Performance Capacity
Scalability
Managing Storage Information
• Monitoring
– Security, Performance, Accessibility, Capacity
• Reporting
– Resource performance, Capacity, Utilization
• Provisioning
– Providing necessary h/w, s/w and other resources
needed to run the data center
– Capacity, resource planning
Challenges in Managing Information
• Exploding digital universe
– Multifold increase of information growth
• Increasing dependency on information
– The strategic use of information plays
• Changing value of information
– Information that is valuable today may become less
important tomorrow.
Information Lifecycle Management
“CHANGE IN THE VALUE OF INFORMATION OVER TIME”
Protect
New Process Deliver Warranty
order order order claim
Time
Value
Fulfilled Aged Warranty
order data Voided
Create Access Migrate Archive Dispose
A proactive strategy that enables an IT organization
to effectively manage the data throughout its lifecycle
ILM strategy – characteristics
• Business-centric
• Centrally managed
• Policy-based
• Heterogeneous
• Optimized
Information Lifecycle Management Process
Policy-based Alignment of Storage Infrastructure with Data Value
AUTOMATED
Classify Implement policies Integrated Organize
data / with information management storage resources
applications based management tools of storage to
on business rules environment align with data
classes
FLEXIBLE
IMPLEMENTATION OF ILM
Benefits of Implementing ILM
• Improved utilization
– Tiered storage platforms
• Simplified management
– Processes, tools and automation
• Simplified backup and recovery
– A wider range of options to balance the need for business continuity
• Maintaining compliance
– Knowledge of what data needs to be protected for what length of time
• Lower Total Cost of Ownership
– By aligning the infrastructure and management costs with information value
Lesson Summary
Key points covered in this lesson:
• The five core elements of a Data Center infrastructure
• Key requirements of storage systems to support
business activities, as well as some of the constraints
• ILM strategy
– Importance
– Characteristics
– Activities in developing ILM strategy
– IML implementation
– Benefits of ILM
Chapter Summary
Key points covered in this chapter:
• Importance of data, information, and storage
infrastructure
• Types of data, its value, and key management
requirements of a storage system
• Evolution of storage architectures
• Core elements of a data center
• Importance of the ILM strategy