Data Warehouse Architecture Properties
A data warehouse system should exhibit the following architectural properties.
Analytical and transactional processing should be kept apart as much as possible.
The solution should scale: it must be able to process huge volumes of data and stream them to
different destinations, at high speed and in various formats. Each data stream should be processed
and presented in the required format, at the right time and place, with minimal impact on the existing
infrastructure, and it must be protected and managed with a high level of confidentiality and
integrity. The size of a data stream and the rate at which data is generated are determined by the
business requirements, and the available hardware and software resources should be used as fully as
possible.
The architecture should also be extensible: new functionality can be added to an existing service by
extending the service's APIs. For example, an insurance company could extend its customer service
platform with a feature that lets customers obtain a personalized quote based on their preferences.
Newer technologies, such as artificial intelligence, can be introduced the same way, and the core
services can be extended for new business functions, such as customer relationship management.
Data security is a critical aspect of the data governance strategy. Security controls at the source
include data access controls and data encryption; controls at the perimeter include data security
policies and monitoring of access to the data.
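As a minimal sketch of what source-side controls can look like in practice, the following Python snippet pairs a role-based read check with encryption of a sensitive field before it is stored. It assumes the third-party cryptography package is installed; the role list and field names are illustrative, not part of any particular warehouse product.

```python
# Minimal sketch of source-side security controls: a role check before
# reads, and symmetric encryption of a sensitive column before storage.
# Assumes the third-party `cryptography` package; names are illustrative.
from cryptography.fernet import Fernet

READ_ROLES = {"analyst", "dba"}          # hypothetical access-control list

def read_allowed(role: str) -> bool:
    """Access control at the source: only approved roles may read."""
    return role in READ_ROLES

key = Fernet.generate_key()              # in practice, fetched from a key vault
cipher = Fernet(key)

def encrypt_field(value: str) -> bytes:
    """Encrypt a sensitive field (e.g. a national ID) before it is stored."""
    return cipher.encrypt(value.encode("utf-8"))

if __name__ == "__main__":
    assert read_allowed("analyst") and not read_allowed("guest")
    token = encrypt_field("123-45-6789")
    assert cipher.decrypt(token) == b"123-45-6789"
```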
Finally, the system should be simple and straightforward to administer, so that users can work with
the data in an efficient and effective manner. Data warehouse management should be easy to
understand and implement, not so complicated that beginners cannot find their way around it.
Types of Data Warehouse Architectures
There are basically three data warehouse architectures: single-tier, two-tier, and three-tier.
Single-Tier Architecture
A single-tier architecture is rarely deployed in practice. Its goal is to minimize the amount of data stored by
removing redundancy: the source layer is the only layer that physically exists, and the data warehouse is virtual,
implemented as a multidimensional view of the operational data that middleware generates on demand. The main
weakness of this design is that it does not separate analytical from transactional processing, so analysis queries
compete with operational workloads on the same data.
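The following hedged sketch illustrates the idea in plain Python: the "warehouse" is nothing but an aggregate view computed on demand from the operational rows, with no separate physical store. The row layout and dimension names are assumptions for illustration.

```python
# Sketch of single-tier middleware: the "warehouse" is a multidimensional
# view computed on demand from operational rows, with no separate store.
from collections import defaultdict

operational_orders = [
    {"customer": "acme", "product": "widget", "amount": 10.0},
    {"customer": "acme", "product": "gadget", "amount": 25.0},
    {"customer": "globex", "product": "widget", "amount": 10.0},
]

def multidimensional_view(rows):
    """Aggregate operational data by (customer, product) at query time."""
    cube = defaultdict(float)
    for r in rows:
        cube[(r["customer"], r["product"])] += r["amount"]
    return dict(cube)

print(multidimensional_view(operational_orders))
```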
The middleware that stores and processes the data should check data quality before records are accepted by the
analytical engine and transformed into relevant information. If these checks are skipped, the middleware becomes a
target for malicious or faulty code. Consider a credit score calculation: if an attacker controls the middleware,
they can modify the score and extract valuable data.
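A minimal sketch of such a validation gate, using the credit score example, might look like this in Python; the field names and the 300-850 score range are illustrative assumptions rather than a prescribed schema.

```python
# A hedged sketch of middleware-side validation: reject records before the
# analytical engine sees them, so a tampered credit score cannot slip through.
def validate_credit_record(record: dict) -> bool:
    """Accept a record only if required fields exist and values are sane."""
    required = {"customer_id", "score"}
    if not required <= record.keys():
        return False
    score = record["score"]
    return isinstance(score, int) and 300 <= score <= 850  # typical score range

assert validate_credit_record({"customer_id": "c1", "score": 710})
assert not validate_credit_record({"customer_id": "c2", "score": 9999})
```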
Two-Tier Architecture
In a two-tier data warehouse, analytical processing is separated from transactional business processing. This
allows for greater levels of control and efficiency, a better understanding of the data, and more informed
decisions.
Although the architecture is called two-tier to highlight the separation between the physically available sources
and the data warehouse, it actually describes a four-stage data flow: the sources, data staging, the warehouse
layer, and analysis.
The source of the data is critical to the data warehouse's integrity: the integrity of the data stored in the
warehouse must be guaranteed. Data integrity is the degree to which the data values in a database record are
true and accurate, and the warehouse itself is the system that stores that information in a database so it can
be searched and analyzed.
Data staging is a key step in the extract, transform, load (ETL) process, and one that can significantly reduce
the time it takes to move a large data set. ETL tools extract data from various storage sources, transform the
data with corporate-specific functions, and load it into the data warehouse; operational functions such as
monitoring the system and provisioning new data are driven by the same ETL pipeline.
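To make the flow concrete, here is a hedged Python sketch of a tiny ETL pipeline with an explicit staging step, extracting from a CSV file into an SQLite table; the file name, column names, and table name are all illustrative.

```python
# A minimal ETL sketch with an explicit staging step: extract rows from a
# CSV source, transform them in a staging list, then load into the warehouse.
import csv
import sqlite3

def extract(path: str) -> list[dict]:
    with open(path, newline="") as f:
        return list(csv.DictReader(f))

def transform(rows: list[dict]) -> list[tuple]:
    # Staging: cleanse and normalize before the warehouse ever sees the data.
    return [(r["id"], r["name"].strip().title(), float(r["amount"]))
            for r in rows if r.get("amount")]

def load(staged: list[tuple], conn: sqlite3.Connection) -> None:
    conn.execute("CREATE TABLE IF NOT EXISTS fact_sales "
                 "(id TEXT, name TEXT, amount REAL)")
    conn.executemany("INSERT INTO fact_sales VALUES (?, ?, ?)", staged)

conn = sqlite3.connect(":memory:")
load(transform(extract("sales.csv")), conn)  # assumes sales.csv exists
```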
Data warehouse metadata is a critical component of the warehouse: it is the information that helps an
administrator decide which data to delete, which data to retain, and which data to use in future reports.
Metadata is also important for maintaining warehouse consistency. Administrators must determine which data
should be updated or deleted when new data arrives, and which data should be left untouched; when consistency
is not guaranteed, application developers and users must be careful about which tables and reports they build
on.
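As an illustration, a retention decision driven by metadata can be as simple as the sketch below; the table names and retention windows are assumed values, not a standard.

```python
# A sketch of metadata-driven retention: the warehouse metadata records how
# long each table's data should be kept, and the cleanup job consults it.
from datetime import date, timedelta

RETENTION_DAYS = {"fact_sales": 365 * 3, "staging_raw": 30}  # metadata catalog

def should_delete(table: str, row_date: date, today: date) -> bool:
    """Delete only rows older than the table's retention window."""
    keep = RETENTION_DAYS.get(table)
    return keep is not None and (today - row_date) > timedelta(days=keep)

assert should_delete("staging_raw", date(2023, 1, 1), date(2024, 1, 1))
assert not should_delete("fact_sales", date(2023, 1, 1), date(2024, 1, 1))
```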
Data profiling is also very important at this level, as it helps validate data integrity and presentation
standards, and it supports advanced analytics such as real-time and batch reporting, visualization, and rating
functions. Keep in mind that this is not just a data warehouse but a live data platform that receives and
analyzes massive amounts of data, which is why data changes, scalability, and system performance must all be
tracked.
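A minimal profiling pass, for example, might report per-column null counts, distinct counts, and value ranges, as in this illustrative Python sketch:

```python
# A minimal data-profiling sketch: per-column null counts, distinct counts,
# and min/max, the kind of checks used to validate integrity at this layer.
def profile(rows: list[dict]) -> dict:
    columns = rows[0].keys() if rows else []
    report = {}
    for col in columns:
        values = [r[col] for r in rows]
        present = [v for v in values if v is not None]
        report[col] = {
            "nulls": len(values) - len(present),
            "distinct": len(set(present)),
            "min": min(present, default=None),
            "max": max(present, default=None),
        }
    return report

print(profile([{"id": 1, "amt": 10.0}, {"id": 2, "amt": None}]))
```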
Three-Tier Architecture
A three-tier structure consists of the source layer, the reconciled layer, and the data warehouse layer. The
reconciled layer sits between the source data and the data warehouse and holds operational data after it has been
integrated and cleansed, so its main focus is data integrity, accuracy, and consistency. For example, if the
warehouse tracks company data elements that are updated frequently, such as order book information, those elements
are refreshed through the reconciled layer before they reach the warehouse. This architecture is appropriate for
systems with a long life cycle: whenever a change occurs in the data, an extra round of review and analysis ensures
that no erroneous data is admitted. The structure is mainly used for large-scale systems. Note, however, that the
reconciled layer does consume extra space in the storage device, since the cleansed data is materialized alongside
the sources and the warehouse.
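To make the reconciled layer concrete, here is a hedged sketch that merges records from two hypothetical sources, a CRM feed and a billing feed, into one cleansed record with a common schema; all field names are illustrative.

```python
# A sketch of the reconciled layer: two operational sources with different
# conventions are cleansed and integrated into one common schema before the
# warehouse loads them.
def reconcile(crm_row: dict, billing_row: dict) -> dict:
    """Merge two source records for the same customer into one clean record."""
    return {
        "customer_id": crm_row["id"],
        "name": crm_row["name"].strip().title(),         # normalize casing
        "email": (crm_row.get("email") or "").lower(),   # canonical form
        "balance": round(float(billing_row["bal"]), 2),  # consistent precision
    }

print(reconcile({"id": "c1", "name": " ada LOVELACE "},
                {"bal": "19.996"}))
```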
Advantages of Data Warehouse Architecture
The data mart is an important building block for the data warehouse. It captures the data model at a high
level and provides a common data access strategy, along with consistency and governance, from one location
across diverse data sources. It gives teams a way to standardize data access, create a common strategy for
data integration, and make the data model available for data profiling and analytics. The data mart does not
create data; it only provides the access strategy.
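One common way to realize this, sketched below under illustrative names, is to define the mart as a view over warehouse tables, so it exposes a governed slice of the data without duplicating it.

```python
# A sketch of a data mart as a governed access layer: it creates no data of
# its own, only a departmental view over the warehouse tables.
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE fact_sales (region TEXT, product TEXT, amount REAL)")
conn.executemany("INSERT INTO fact_sales VALUES (?, ?, ?)",
                 [("emea", "widget", 10.0), ("apac", "widget", 7.5)])

# The marketing data mart: a stable, standardized slice of the warehouse.
conn.execute("""CREATE VIEW mart_marketing AS
                SELECT region, SUM(amount) AS revenue
                FROM fact_sales GROUP BY region""")

print(conn.execute("SELECT * FROM mart_marketing").fetchall())
```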
The process of change starts with identifying the problems and pain points of the current system, then
mapping out a plan to solve those problems using the new one. The system is then tested to make sure that
everything works as expected, and once it is deemed fit for purpose the rollout begins: first ensure that the
existing stakeholders are comfortable with the new system, then validate the change by conducting an
assessment. This is what makes the model appropriate for business transformation.
There is a good reason why so much of the data that businesses collect ends up in data warehouses. Data
warehouses are large collections of data, usually stored in a database, that are used to support business
decisions. Many data warehouses are designed to feed ETL processes and deliver data to a CRM system so that
business users can look at actual data and act on it.
With a data warehouse, you can also take advantage of ETL (extract, transform, and load) and ETL management
processes to connect your data sources and process them together. In other words, a data warehouse is a
central repository for your data that any of your analytic platforms can access.
Data warehouses have increased in speed and scale with the adoption of NoSQL databases such as MongoDB. When
implemented in conjunction with a BI platform, data warehouse technology enables real-time analytics,
streamlining decision making, reducing lead and invoice inquiries, and increasing profitability.
Disadvantages of Data Warehouse Architecture
Maintaining a data warehouse is a crucial task that needs to be done well. Maintenance requires collecting,
processing, and analyzing data, all within a certain timeframe, and the effort involved may not always be
justified by the return on investment. Even so, a data warehouse can be a critical component of an enterprise
data management system.
To speed up extraction and minimize the time it requires, you can use ETL tools to automate the process.
Automated extraction, however, does not guarantee that the data is properly cleaned and validated; it is best
to run automated and manually enforced tasks in sequence. Once the data has been validated and the cleanup
process automated, the data is ready for ingestion into the warehouse.
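A sketch of that sequence might look like the following: an automated cleaning step, then a validation gate that quarantines suspect rows for manual review instead of loading them silently. The cleaning and validation rules shown are illustrative assumptions.

```python
# Automated cleanup followed by a validation gate: valid rows proceed to the
# warehouse, suspect rows are held back for manual review.
def clean(row: dict) -> dict:
    return {k: (v.strip() if isinstance(v, str) else v) for k, v in row.items()}

def split_valid(rows: list[dict]) -> tuple[list[dict], list[dict]]:
    valid, quarantined = [], []
    for row in map(clean, rows):
        (valid if row.get("id") and row.get("amount") is not None
         else quarantined).append(row)
    return valid, quarantined

ok, held = split_valid([{"id": "1 ", "amount": 5}, {"id": "", "amount": None}])
print(len(ok), "loaded,", len(held), "held for manual review")
```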
An omission of this kind might result in an incorrect assessment of a property value, overestimated expenses,
or underestimated sales. Data integration is therefore essential for any organization that processes large
amounts of data: it must be ensured that all required data actually lands in the warehouse, for example by
using data trapping or data mining techniques.
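For instance, a simple completeness check, sketched below with hypothetical feed names, can verify that every required source actually landed in the warehouse before analysis begins.

```python
# An integration completeness check: confirm every required source feed
# made it into the warehouse before analysis starts.
REQUIRED_FEEDS = {"crm_customers", "billing_invoices", "sensor_readings"}

def missing_feeds(loaded_tables: set[str]) -> set[str]:
    """Return required feeds that never made it into the warehouse."""
    return REQUIRED_FEEDS - loaded_tables

gaps = missing_feeds({"crm_customers", "billing_invoices"})
if gaps:
    print("integration incomplete, missing:", sorted(gaps))
```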
The majority of the data will be stored in the warehouse and analyzed using data profiling tools. The
warehouse infrastructure must support the analysis of massive amounts of data and store that data in the most
cost-effective manner; the warehouse acts as the central repository for all the data and for the data analysis
tools that work on it.
However, with the right approach, an organization can achieve better results by working with data warehouse
tools in a disciplined and structured fashion. One important aspect of a data warehouse's architecture that
must be carefully considered is the data source: when data comes from multiple sources, such as external
sensors or authorized partners, data integration becomes even more important. An organization must first
decide which sources of data it wants to work with and then work to integrate them.