0% found this document useful (0 votes)

265 views6 pages

Test 8

This document discusses various topics related to data warehousing including: 1. The four modes of applying data to a data warehouse and reasons for selection. 2. Common data quality issues with legacy systems and suggestions for addressing them. 3. The differences in usage and value of data between operational systems and data warehouses. 4. An outline for a standards manual covering naming conventions for various data warehouse objects and why standards are important.

Uploaded by

Robert Kegara

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

265 views6 pages

Test 8

Uploaded by

Robert Kegara

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 6

Running Head: Data Warehouse 1

Data Warehouse

Professor’s Name

Students’ Name

Institution.

Date.
Data Warehouse 2

ASSIGNMENT-3

1. You are the staging area expert on the project team for a large toy manufacturer.

Discuss the four modes of applying data to the data warehouse. Select the modes you

want to use for your data warehouse and explain the reasons for your selection.

Modes of applying data will include:

Load: this applies when the data already exists in the target tables and there is need to

load the data warehouse with new incoming data to replace the old information. The reason

for selecting this mode for applying data to the data warehouse is that for example for a

manufacturing company it is easy to fully load the master data of items in that company in

terms of location, material etc. so it makes it easier to update you current items based on this

mode.

Append: it is an extension of the load process. The new data is added to the target table and

does not delete the pre-existing data.

Append mode of applying data to the data warehouse means that any operations can be

appended to a specific routing in a company therefore retaining very vital data of the

company.

Destructive merge: it applies the incoming data to the target table. The target data is updated

with the incoming data.

The reason for choosing this particular mode is that it’s the most suitable mode as it ensures

that every set of data is updated in real time based on the incoming data in the target table.

Constructive merge: here, the new incoming data does not overwrite the existing data if the

primary key is matched and will be marked as superseded.

This mode is used when items in a factory need to have the latest revisions updated over the

same key of items.

Data Warehouse 3

2. Assume that you are the data quality expert on the data warehouse project team for a

large financial institution with many legacy systems dating back to the 1970’s. Review

the types of data quality problems you are likely to have and make suggestions on how

to deal with those.

There are many inherent problems that are likely to be experienced when it comes to

managing the data quality inconsistencies with legacy systems that date back to 1970. Some

of the most significant problems will include byte-ordering inconsistencies from the

operating systems used during this era. Consequently, the portability of the data between

systems will be an issue. There is need therefore to use specialization translation applications.

Moreover, there are problems to do with the format of the data. In metadata, there are

relationships, entities and interrelationships that exist between data.

3. Compare the usage and value of information in the data warehouse with those in

operational systems. Explain the major differences. Discuss and give examples.

The major difference between operational systems and data warehouse systems is that the

operational systems are configured to deal with transaction processing while the data

warehouse systems are designed to support the online analytical processing.

For the operational systems, they are designed to support high volume transaction processing

and there is very little back-end reporting. In addition, they are mainly concerned with

current data, and it is generally updated in accordance to the need. A good example is where

there are purchase records that do not have any corresponding customer records to identify

who purchased what are clearly errors in source data. These errors could be corrected in the

source operational system before taking the data and loading it to the data warehouse.

When it comes to data warehousing, they are designed to support high volume analytical

processing and also elaborate report generation. They are concerned more with historical data
Data Warehouse 4

and data within them is non-volatile meaning data could be added but it is rare to change it.

This offers for an ever-growing history of information. The best example for such a

warehouse is Facebook.

4. Prepare an outline for a standards manual for your data warehouse. Consider all types

of objects and their naming conventions. Indicate why standards are important.

Produce a detailed table of contents.

Standards are conventions that every company employs so as to maintain uniformity.

Standards are used to ensure that there exists a level of consistency across any system in

terms of databases, processes or even objects this ensure that there is uniformity in

companies that may have many departments. Below is standard outline for various types of

objects that would be in the data warehouse.

Example
S.No. Object

1 Schema (In SQL) CREATE SCHEMA PRODUCT_DETAILS_NEW

2 Table PRODUCT_MASTER

3 Column PRODUCT_ID

4 Staging files EMPLOYEE_DAILY_STAGE,

EMPLOYEE_DAILY_UPDATE

5 Physical file (scripts) EMPLOYEE_P

6 Physical file (source) EMPLOYEE_SRC

7 Physical file (codec) EMPLOYEE_CDC

8 Physical file (Database EMPLOYEE_DB

file)

9 Logical File EMPLOYEE_L

Data Warehouse 5

10 Application document CUSTOMER_APP_DOC

11 Query ORDERS_DETAILS_QUERY

12 Report STORE_LOCATION_REPORT

Saudi Telecom – Questions for Discussion

5. Why do you think telecommunications companies are among the prime users of

information visualization tools?

From the case study, information visualization tools were important as they allowed the

managers to observe the trends and make the necessary corrections before things went out of

hand. These tools are important for the companies since they enable them to foresee any

likely problems and take the necessary measures to curb them. It also helps them to deal

with the large number of clients they have.

6. What were their challenges, the proposed solution, and the obtained results?

Challenges

The challenges were that data come from different kind of sources, and this might have

caused redundancy of this specific data. In addition to this it was very time consuming to

analyze the give data

Proposed solution

Use of TIBCO Tool

Use of this tool would enable to look at the specific data differently this would go a long

way in ensuring that we understand the given data.

Mining for Lies Case Study

7. How can text/data mining be used to detect deception in text?

Transcribing statements for processing and extracting cues and selecting them.
Data Warehouse 6

The text processing software identifies cues and generates quantified cues. The

classification models are trained and tested on quantified cues. The cues are then labeled as

true or deceptive.
8. What do you think are the main challenges for such an automated system?

Having such a system may sound easy theoretically but training a software to identify

human aspects creates problems such as terminologies, terms, references phrases and names

that could be used. Having a software that is capable of determining what is true or not

without having any human sensitivity is virtually impossible because we all have our

versions of truths and there is no standard way of identifying who or what determines it.

Big Data and Analytics in Politics Case Study

9. What is the role of analytics and Big Data in modern day politics? Do you think Big

Data analytics could change the outcome of an election?

In modern day politics, big data is essential in political campaigns. Characteristics of big

data such a variety velocity and volume are very much related to the data used in political

operations. Big data analytics can change an outcome of an election since it helps in the

forecast of the election results and also aims at the possible voters and contributors.
10. What do you think are the challenges, the potential solution, and the probable results

of the use of Big Data analytics in politics?

The main challenge would be the storage of the big data. It would be difficult to collect

and store such large volumes of data since data is increasing on a daily basis. Getting

efficient and well-equipped people to handle these large amounts of data is also another

problem. There is also a challenge of security since the data collected is too much and it

could also be very sensitive. The solutions to these challenges lie in the development of a

suitable code that would cater for all of these challenges at a go.

ISEC-655 Security Governance Management Assignment 1 Guidelines
No ratings yet
ISEC-655 Security Governance Management Assignment 1 Guidelines
2 pages
It Governance
No ratings yet
It Governance
12 pages
BigData Research Paper
No ratings yet
BigData Research Paper
22 pages
Agile Project Management For End User Information Systems Development
No ratings yet
Agile Project Management For End User Information Systems Development
10 pages
Big Data Security
No ratings yet
Big Data Security
4 pages
Hybrid Encryption For Cloud Database Security-Annotated
No ratings yet
Hybrid Encryption For Cloud Database Security-Annotated
7 pages
2020 Data Center Roadmap Survey PDF
No ratings yet
2020 Data Center Roadmap Survey PDF
16 pages
w07 Moss
No ratings yet
w07 Moss
16 pages
Decision Science Project Report On "Big Data"
No ratings yet
Decision Science Project Report On "Big Data"
9 pages
Er Diagram
No ratings yet
Er Diagram
4 pages
Cuestionario Resuelto Big Data
67% (6)
Cuestionario Resuelto Big Data
2 pages
Engineering Intern's Logistics Report
No ratings yet
Engineering Intern's Logistics Report
46 pages
Reasearch Proposal
No ratings yet
Reasearch Proposal
6 pages
Creating A Modern Analytics Architecture
No ratings yet
Creating A Modern Analytics Architecture
18 pages
Exodus Case Study - Ohio Department of Public Safety August 2012
100% (1)
Exodus Case Study - Ohio Department of Public Safety August 2012
18 pages
L14-15 Data Security
No ratings yet
L14-15 Data Security
29 pages
Emerging Spatial Information Systems and Applications PDF
100% (1)
Emerging Spatial Information Systems and Applications PDF
419 pages
Map Reduce
100% (1)
Map Reduce
33 pages
MDX Tutorial
100% (1)
MDX Tutorial
31 pages
Software Quality Assurance
No ratings yet
Software Quality Assurance
24 pages
Merkow - PPT - 02 F
No ratings yet
Merkow - PPT - 02 F
20 pages
Introduction To Data Management - Week 1 - 2024
No ratings yet
Introduction To Data Management - Week 1 - 2024
17 pages
NSE Quiz 1
No ratings yet
NSE Quiz 1
5 pages
Term Paper On Data Security
No ratings yet
Term Paper On Data Security
3 pages
Infrastructure As Code
No ratings yet
Infrastructure As Code
39 pages
PeopleSoft File Import Guide
No ratings yet
PeopleSoft File Import Guide
4 pages
Computer Security, Ethics and Privacy PDF
100% (2)
Computer Security, Ethics and Privacy PDF
55 pages
Product Data Mapping Guide
No ratings yet
Product Data Mapping Guide
1 page
Cyber Crimes and Laws Overview
33% (3)
Cyber Crimes and Laws Overview
50 pages
Best Practices For Securing Computer Networks
No ratings yet
Best Practices For Securing Computer Networks
2 pages
Tutorial Letter 101/0/2022: Ontology Engineering
No ratings yet
Tutorial Letter 101/0/2022: Ontology Engineering
10 pages
Computer Viruses
No ratings yet
Computer Viruses
62 pages
(Exam Outline) : Effective Date 1 January 2012
No ratings yet
(Exam Outline) : Effective Date 1 January 2012
43 pages
Teradata InfoSec Slides Defense in Depth Best Practices Pres December 2011 - FINAL
No ratings yet
Teradata InfoSec Slides Defense in Depth Best Practices Pres December 2011 - FINAL
134 pages
Big Data Analytical Tools
100% (1)
Big Data Analytical Tools
8 pages
Research in Cloud Security and Privacy
No ratings yet
Research in Cloud Security and Privacy
204 pages
Bussiness Intelligence
No ratings yet
Bussiness Intelligence
6 pages
Managing Information Resources and Security: Information Technology For Management 6 Edition
No ratings yet
Managing Information Resources and Security: Information Technology For Management 6 Edition
44 pages
IBA Karachi - VAPT - Revalidation Report
No ratings yet
IBA Karachi - VAPT - Revalidation Report
32 pages
Sourcefire Next-Generation IPS (NGIPS) White Paper
No ratings yet
Sourcefire Next-Generation IPS (NGIPS) White Paper
10 pages
SQL Injection Detection and Correction Using Machine
No ratings yet
SQL Injection Detection and Correction Using Machine
8 pages
UNIT 1 - Database Security
No ratings yet
UNIT 1 - Database Security
136 pages
D7.2 Data Managment Plan v1.04
No ratings yet
D7.2 Data Managment Plan v1.04
14 pages
Implementing ISO/IEC 27001 ISMS
100% (1)
Implementing ISO/IEC 27001 ISMS
33 pages
Explain The Concept of EISP With Example
No ratings yet
Explain The Concept of EISP With Example
2 pages
SAS Viya 3.5 New Features Updated 10082019
No ratings yet
SAS Viya 3.5 New Features Updated 10082019
38 pages
NY Cybersecurity Strategy Unveiled
No ratings yet
NY Cybersecurity Strategy Unveiled
15 pages
C-Level Guide to Data Security
No ratings yet
C-Level Guide to Data Security
10 pages
Code Is For Humans
No ratings yet
Code Is For Humans
142 pages
Name Construct and Structure
100% (2)
Name Construct and Structure
32 pages
CHAPTER 7 PKI and Cryptographic Applications
No ratings yet
CHAPTER 7 PKI and Cryptographic Applications
4 pages
BCOM ICT for Business Course
No ratings yet
BCOM ICT for Business Course
5 pages
Unit 38 DatabaseManagementSyst
No ratings yet
Unit 38 DatabaseManagementSyst
27 pages
Ssas Rolap For SQL Server
No ratings yet
Ssas Rolap For SQL Server
42 pages
Unit-I - Data and Network Security
No ratings yet
Unit-I - Data and Network Security
29 pages
MongoDB Security Architecture WP
No ratings yet
MongoDB Security Architecture WP
17 pages
Willowbrook School System Design
No ratings yet
Willowbrook School System Design
6 pages
Data Warehousing
No ratings yet
Data Warehousing
14 pages
Chapter 2
No ratings yet
Chapter 2
79 pages
Dominos Vs Cheasias
No ratings yet
Dominos Vs Cheasias
10 pages
DealReal Consulting Letter
No ratings yet
DealReal Consulting Letter
2 pages
Email Signature Gallery Template
No ratings yet
Email Signature Gallery Template
5 pages
Application For Registration of Company - Obinaco U. Enterprise Nigeria Limited
No ratings yet
Application For Registration of Company - Obinaco U. Enterprise Nigeria Limited
8 pages
Marketing To Children (By: Sharon Beder)
No ratings yet
Marketing To Children (By: Sharon Beder)
3 pages
Chicken Sisig La-Buffalo Projected Statement of Cash Flow For The Month Ended May 31, 2021
No ratings yet
Chicken Sisig La-Buffalo Projected Statement of Cash Flow For The Month Ended May 31, 2021
2 pages
Simontok 2.1 App 2020 Apk Download Latest Version
No ratings yet
Simontok 2.1 App 2020 Apk Download Latest Version
4 pages
Bank Islam Malaysia BHD V Aquasix Corp SDN BHD & Ors
No ratings yet
Bank Islam Malaysia BHD V Aquasix Corp SDN BHD & Ors
13 pages
TBC-WOP-EPC-GEN-MES.-ELE-00002-00 Superseded
No ratings yet
TBC-WOP-EPC-GEN-MES.-ELE-00002-00 Superseded
25 pages
Calabarzon ICT Plan 2018 2022
No ratings yet
Calabarzon ICT Plan 2018 2022
37 pages
Gear Materials, Properties, and Manufacture Reduced Size
No ratings yet
Gear Materials, Properties, and Manufacture Reduced Size
347 pages
Engineering Innovation Context
No ratings yet
Engineering Innovation Context
3 pages
Reg No.: Reg No.: Issue Date: Reg No.: Issue Date: Program: Program: Due Date: Program: Due Date: Semester: Session: Semester: Session: Semester: Session: Due Date: Issue Date
No ratings yet
Reg No.: Reg No.: Issue Date: Reg No.: Issue Date: Program: Program: Due Date: Program: Due Date: Semester: Session: Semester: Session: Semester: Session: Due Date: Issue Date
1 page
CHAPTER - 6 THEORETICAL MCQs
No ratings yet
CHAPTER - 6 THEORETICAL MCQs
17 pages
RAMS Requirements in Electronic Interlock Design
No ratings yet
RAMS Requirements in Electronic Interlock Design
9 pages
Financial System Technologies
No ratings yet
Financial System Technologies
13 pages
Junior John Ngulube: Sanlam & Munich Re Career
No ratings yet
Junior John Ngulube: Sanlam & Munich Re Career
2 pages
ATZ WorldWide 2019
No ratings yet
ATZ WorldWide 2019
84 pages
A Six Part Study Guide To Market Profile Part 6 - 190657
No ratings yet
A Six Part Study Guide To Market Profile Part 6 - 190657
84 pages
Earn with PaidVerts: Step-by-Step Guide
No ratings yet
Earn with PaidVerts: Step-by-Step Guide
15 pages
Monsterverse Omnibus Collection (2023)
100% (5)
Monsterverse Omnibus Collection (2023)
458 pages
Invoice Template for Welo Data
No ratings yet
Invoice Template for Welo Data
1 page
Top35 - 2020 Judging Pack PDF
No ratings yet
Top35 - 2020 Judging Pack PDF
689 pages
Written Submission
No ratings yet
Written Submission
4 pages
Dhaka-Barishal Bus E-Ticket Details
No ratings yet
Dhaka-Barishal Bus E-Ticket Details
1 page
The Power of Copywriting in PR
No ratings yet
The Power of Copywriting in PR
14 pages
Ais CH - 1
No ratings yet
Ais CH - 1
14 pages
Francisco Lorenzana Vs Atty. Cesar Fajardo
No ratings yet
Francisco Lorenzana Vs Atty. Cesar Fajardo
5 pages
MyGov 27th August 2024 & Agenda Kenya
No ratings yet
MyGov 27th August 2024 & Agenda Kenya
22 pages
LPOA - ATP - EN - SVG Royal - Bein Markets
No ratings yet
LPOA - ATP - EN - SVG Royal - Bein Markets
7 pages

Test 8

Uploaded by

Test 8

Uploaded by

Running Head: Data Warehouse 1

Modes of applying data will include:

does not delete the pre-existing data.

with the incoming data.

primary key is matched and will be marked as superseded.

same key of items.

to deal with those.

relationships, entities and interrelationships that exist between data.

warehouse systems are designed to support the online analytical processing.

Produce a detailed table of contents.

Standards are conventions that every company employs so as to maintain uniformity.

objects that would be in the data warehouse.

1 Schema (In SQL) CREATE SCHEMA PRODUCT_DETAILS_NEW

4 Staging files EMPLOYEE_DAILY_STAGE,

5 Physical file (scripts) EMPLOYEE_P

6 Physical file (source) EMPLOYEE_SRC

7 Physical file (codec) EMPLOYEE_CDC

8 Physical file (Database EMPLOYEE_DB

9 Logical File EMPLOYEE_L

10 Application document CUSTOMER_APP_DOC

Saudi Telecom – Questions for Discussion

information visualization tools?

with the large number of clients they have.

analyze the give data

Use of TIBCO Tool

way in ensuring that we understand the given data.

Mining for Lies Case Study

7. How can text/data mining be used to detect deception in text?

Big Data and Analytics in Politics Case Study

Data analytics could change the outcome of an election?

of the use of Big Data analytics in politics?

You might also like