0% found this document useful (0 votes)
41 views22 pages

Information Architecture

The CDMP Study Group session on Data Architecture, led by Laura Sebastian-Coleman, Ph.D., focuses on the importance of data architecture within enterprise architecture, including its role in guiding data integration and aligning data investments with business strategy. The session covers architectural frameworks, artifacts, and the necessity of a comprehensive approach to data architecture to drive innovation and support organizational goals. Future sessions will continue to explore various aspects of data management, including data modeling, storage, and security.

Uploaded by

Rafikul Islam
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
41 views22 pages

Information Architecture

The CDMP Study Group session on Data Architecture, led by Laura Sebastian-Coleman, Ph.D., focuses on the importance of data architecture within enterprise architecture, including its role in guiding data integration and aligning data investments with business strategy. The session covers architectural frameworks, artifacts, and the necessity of a comprehensive approach to data architecture to drive innovation and support organizational goals. Future sessions will continue to explore various aspects of data management, including data modeling, storage, and security.

Uploaded by

Rafikul Islam
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 22

CDMP Study Group

SESSION 5 – DATA ARCHITECTURE


April 1, 2020
Laura Sebastian-Coleman, Ph.D., CDMP
Advisor to the DAMA New England BOD
Email: lsebastiancoleman@gmail.com
AGENDA
• Facilitator Introduction
• Chapter 4: Data Architecture
• Overview
• Introductory Note
• What data architecture is, how it fits with other architectural domains, and why it is important
• Architectural frameworks and why they are important
• Architectural artifacts: Enterprise Data Model and Data Flow Diagrams
• Approach to studying
•Q&A
• Next Session

New England Data Management Community


Facilitator
Laura Sebastian-Coleman, Ph.D., CDMP
 DQ Lead, Finance DG COE, Aetna/CVS Health
 DAMA New England Board Advisor
 Production Editor, DMBOK2
 Author:
 Navigating the Labyrinth
 Measuring Data Quality for Ongoing Improvement

CONTACT INFO:
EMAIL: lsebastiancoleman@gmail.com
PHONE: 860-983-0399

New England Data Management Community


INTRODUCTORY NOTE

This study group is offered as a service of DAMA New England for DAMA New England
members. It not an official, DAMA International authorized training course because DAMA-I has
not yet created an authorized trainer program.

The purpose of this group is to help prepare members to take the CDMP. We will do so by
reviewing the content of chapters of the DMBOK2.

DAMA New England makes no claims for the effectiveness of the sessions or the ability of
participants to pass the CDMP exam after having attended. In fact, you should plan on doing a
lot of individual study to pass the exam.

All tables and images, unless otherwise noted, are copyrighted by DAMA or based on DAMA-
DMBOK2 tables and images.

New England Data Management Community


Thinking about Chapter 4: Data Architecture
• No organization starts from scratch, from a data perspective. Even if
one did, data dependencies arise quickly.

• Architecture enables a BIG PICTURE view of an organization and its


data. Architectural artifacts are the means by which an organization
represents its data to itself.

• It also enables a LONG-TERM view of an organization and its data.


Organizations need an architectural ROADMAP to understand where
they are, as well as to understand where they want to be and how to
get there.

• An architectural perspective is critical to how an organization


establishes its data strategy. Architecture can DRIVE INNOVATION.

• If we recognize, as Tom Redman says, that DATA IS DONG ITS WORK


when it is moving through the organization and BEING USED, we will
see why ideally, data architecture practices should be comprehensive,
strategic, and dynamic.

New England Data Management Community


Chapter 4: Data Architecture
Identifying the data needs of the enterprise and designing and maintaining the master blueprints to meet those
needs. Using master blueprints to guide data integration, control data assets, and align data investments with
business strategy.

The word Architecture can refer to:


• A description of the current state of systems
• The components of a set of systems
• The discipline of designing systems (architecture practice)
• The intentional design of a system or a set of systems (future state or
proposed architecture)
• The artifacts that describe a system (architecture documentation)
• The team that does the design work (the Architects or the Architecture
team)

New England Data Management Community


Data Architecture: Part of Enterprise Architecture
• Enterprise Architecture encompasses: • Successful Architecture depends on
• Business architecture • Good design
• Data architecture • Planning
• Application architecture • Ensuring that the designs and plans are
• Technology architecture executed effectively

• Enterprise architecture enables an • Data Architects seek to


organization to: • Design an optimal technical footprint
• Understand the current state of their systems • Improve operational and project efficiencies
• Promote desirable change toward future • Increase the ability of the organization to use
state its data.
• Enable regulatory compliance • Data Architecture is most valuable
• Manage data when it supports the needs of the
• Manage the systems in which data is stored enterprise.
and used

New England Data Management Community


Data Architecture: Part of Enterprise Architecture
Domain Enterprise Business Enterprise Data Architecture Enterprise Applications Enterprise Technology
Architecture Architecture Architecture

Purpose To identify how an To describe how data should To describe the structure and To describe the physical
enterprise creates value be organized and managed functionality of applications technology needed to
for customers and other in an enterprise enable systems to
stakeholders function and deliver
value

Elements Business models, Data models, data Business systems, software Technical platforms,
processes, capabilities, definitions, data mapping packages, databases networks, security,
services, events, specifications, data flows, integration tools
strategies, vocabulary structured data APIs

Dependencies Establishes Manages data created and Acts on specified data Hosts and executes the
requirements for the required by business according to business application architecture
other domains architecture requirements
Roles Business architects and Data architects and Applications architects Infrastructure architects
analysts, business data modelers, data stewards
© DAMA-International, 2017
stewards

New England Data Management Community


Data Architecture: Business Drivers

The problems Data Architecture is trying to Data architecture enables organizations to:
solve: • Plan for evolution: Prepare organizations to
• More data than the average bear! Most evolve products, services, and data to take
organizations have more data than a single advantage of business opportunities inherent
person can understand. in emerging technologies
• Different Levels Of detail for different • Translate requirements: Translate business
decision-makers. Data needs to be needs into data and system requirements so
represented at different levels of detail so that processes consistently have the data they
that people can make decisions about it. require
• Architectural Artifacts help with • Manage delivery: Manage complex data and
comprehension. Data architects create the information delivery throughout the
artifacts that enable people to understand enterprise
their organization’s data. • Facilitate alignment between Business and IT
• Act as agents for change, transformation, and
agility

New England Data Management Community


Data Architecture from different perspectives
• Chapter Four looks at Data • Data Architecture artifacts (master
Architecture from three blueprints) are used to:
perspectives: • Define data requirements
• Guide data integration
• Data Architecture outcomes / • Control data assets
artifacts: models, definitions and data • Align data investments with business
flows on various levels of abstraction strategy
• Data Architecture activities, to form, • Data Architects must
deploy and fulfill Data Architecture
intentions • Collaborate with, learn from and
influence various stakeholders that
• Data Architecture behavior: are engaged with improving the
collaborations, mindsets, and skills business or IT systems development
among the various roles that affect
the enterprise’s Data Architecture. • Help to establish the semantics of an
enterprise, via a common business
vocabulary

New England Data Management Community


Architectural frameworks are
important to understanding
architecture as a whole.

Of these, the most influential


is the Zachman Framework,
which is organized around
basic questions related to the
perspectives of stakeholders
across an enterprise.

Different stakeholders require


information at different levels
of abstraction.

This is wicked hard to read, so


the next slide has the
simplified version that appears
in DMBOK2.
What How Where Who When Why

Inventory Process Distribution Responsibility Timing Motivation


Executive Identification Identification Identification Identification Identification Identification Scope Context

Inventory Process Distribution Responsibility Timing Motivation


Business definition Definition Definition Definition Definition Definition Business
Management Concepts

Inventory Process Distribution Responsibility Timing Motivation


Architect Representation Representation Representation Representation Representation Representation System Logic

Inventory Process Distribution Responsibility Timing Motivation


Technology
Engineer Specification Specification Specification Specification Specification Specification
Physics

Inventory Process Distribution Responsibility Timing Motivation Tool


Technician Configuration Configuration Configuration Configuration Configuration Configuration Components

Inventory Process Distribution Responsibility Timing Motivation Operational


Enterprise Instantiations Instantiations Instantiations Instantiations Instantiations Instantiations Instances

Inventory Process Distribution Responsibility Timing Motivation © DAMA-International, 2017


Sets Flows Networks Assignments Cycles Intentions
Data Architecture: EDM and Data Flow Design
Data exists in and moves through an enterprise.
Future State Enterprise Future State
Data Architecture needs to account for both the current and Data Model Data Flows
future state of the data model and the data flows.
Enterprise Data
Enterprise Data Model (EDM): The EDM is a holistic, enterprise-level, Architecture
implementation-independent conceptual or logical data model providing a
common consistent view of data across the enterprise. It sets forth the
Current state Enterprise Current State
foundation for all data and data-related projects. … Any project-level data
Data Model (EDM) Data Flows
model must be based on the EDM. The EDM should be reviewed by
stakeholders, so that there is consensus that it effectively represents the
enterprise.

Data Flow Design: Defines the requirements and master blueprint for
storage and processing across databases, applications, platforms, and
networks (the components). These data flows map the movement of data
to business processes, locations, business roles, and to technical
components.
New England Data Management Community
The EDM Enterprise Data Model
1 diagram:
12-20 significant
Conceptual Model business subject
areas with
The EDM can be a stand-alone artifact, or it can be comprised of relationships
the overall set of models created by the enterprise.

Different types of models – at different levels of abstraction –


Subject Subject Subject Subject 1+ diagram per
must be related to each other. subject area:
Area Area Area Area
50+ significant
Model Model Model Model entities within
Conceptual models are ultimately linkable to physical subject area with
relationships
application data models

The diagram shows: For each Subject


Area Logical
• A conceptual overview over the enterprise’s subject areas Model:
• Views of entities and relationships for each subject area Logical Logical Logical Logical Increase detail by
adding attributes,
• Detailed, partially attributed logical views of these same Model Model Model Model and less-significant
entities and
subject areas relationships
• Logical and physical models specific to an application or
Limit scope to the
project subject and 1-step
LDM LDM LDM LDM LDM LDM LDM
external
relationships
PDM PDM PDM PDM PDM PDM PDM Limit scope to
the subject physical
objects and
Application or Project-Specific relationships
New England Data Management Community © DAMA-International, 2017
The Subject Area Model
As stated, the EDM can be a stand-alone artifact, or it can be Product Design Commercial Sales Subject
Subject Area Offer Subject Area
comprised of the overall set of models created by the Area
enterprise. Market
Product Sales
Product Group Offering
Platform Order
Portfolio
Establish a principle for division of subject areas (a subject area
discriminator). For example, Offering Range Occurs in
Uses
• Normalization rules
• Systems portfolios (i.e., funding)
• Data governance structure and data ownership
Sales Sales
(organizational) Belongs to Product
Item Order Item
• Top-level processes (based on the business value chains)
• Business capabilities (enterprise architecture-based)
Product Design
Bill-of-material Specifies
(BOM)
The Subject Area structure is usually most effective for Data Sales Bundle
Bill-of-material
(BOM)
Architecture work if it is formed using normalization rules.
Product
Part Sales
Details Order Item
Part Configuration
Structure © DAMA-International, 2017

New England Data Management Community


Data Flows Service & repair statistics © DAMA-International, 2017
Material returns

Data Flow diagrams depict how data


CRM
moves through an organization. Data
flows can be organized in different Service
ways. Customer data & repair
Customer statistics
agreement

This traditional high-level flow


diagram shows the movement of data
between different systems that Product Design BOM
Product properties Aftermarket
support business processes. PDM System Sales System
System

Product design BOM


Such a diagram can be created with Product properties Sales BOM
Order Contents
Sourcing data
different levels of detail, depending
on the audience and purpose.
Manufacturing Manufacturing
Routing System Planning Serial numbers
Manufacturing BOM System Individual product BOM
Production line routing
Lead times

New England Data Management Community


Data Flows
Business Processes
This matrix shows the relationship between
Product Marketing Industrial Order
Manufacturing Logistics Invoicing
business processes and data entities that development & Sales preparation management

create or use data. Product

Product Part
Benefits of a matrix depiction: Manufacturing
Plant
• Shows that data does not flow in just one Customer
direction

Major Entities
Sales Item
• Shows many-to-many relationships Assembly
Structure
• Can clarify data acquisition responsibilities Sales Order
• Can clarify data dependencies between Production
Order
processes Individual
Product
Shipping
Customer’s
Invoice

Read/use © DAMA-International, 2017


Create

New England Data Management Community


Data Architecture Activities: Establish the practice
Establish Data Architecture Practice
Relationship to projects
• Part of Enterprise Architecture OR adopt a
• Defining requirements
framework
• Reviewing project data design
• Determining data lineage impact
Account for
• Data replication control
• Strategy
• Enforce Architecture standards
• Culture
• Guide technology decisions
• Organizational accountabilities,
responsibilities
Evaluate Existing Specifications
• Work methods
• Results / relation to roadmap
Develop a roadmap

Manage enterprise requirements within


projects

New England Data Management Community


Data Architecture Activities
Evaluate and Update Existing Specifications
Manage enterprise requirements within
projects
Develop a roadmap
• Bring an enterprise perspective to project
• Manage dependencies
scope
• Make forward-looking decisions
• Understand requirements
• Evaluate trade-offs
• Ensure implementation makes sense in
• Formulate pragmatic plans, aligned with
terms of the overall architecture
business needs and opportunities, external
requirements, and available resources.
Note: The role of the enterprise data architect
• To develop a roadmap, start with the
will depend in part on the implementation
lowest dependency activities
methodology (e.g., waterfall, agile, hybrid0

New England Data Management Community


Discussion / Q&A: How to study …
Architectural Frameworks

Architectural Artifacts

The role of the Data Architect


Strategy / Projects
Innovation / Quality

New England Data Management Community


NEXT SESSION
Date Topic Facilitator
February 19th Chapter 1: Data Management Tony Mazzarella
March 4th Chapter 2: Data Handling Ethics Lynn Noel
March 18th Chapter 3: Data Governance Sandi Perillo-Simmons
April 1st Chapter 4: Data Architecture Laura Sebastian Coleman
April 15th Chapter 5: Data Modeling & Design Lynn Noel
April 29th Chapter 6: Data Storage & Operations Karen Sheridan
May 13th Chapter 7: Data Security Laura Sebastian-Coleman
May 27th Chapter 8: Data Integration & Interoperability Mary Early
June 10th Chapter 9: Document & Content Management Sandi Perillo-Simmons
June 24th Chapter 10: Reference & Master Data Mary Early
July 8th Chapter 11: Data Warehousing & Business Intelligence Tony Mazzarella
July 22nd Chapter 12: Metadata Management Karen Sheridan
August 5th Chapter 13: Data Quality Laura Sebastian-Coleman
August 19th Chapter 14: Big Data & Data Science Nupur Gandhi
September 2nd Chapter 15: Data Management Maturity Assessment Laura Sebastian-Coleman
September 16th Chapter 16: Data Management Organization & Role Expectations Agnes Vega
September 30th Chapter 17: Data Management & Organizational Change Management Tony Mazzarella
October 7th Final Review Tony Mazzarella

New England Data Management Community


HOMEWORK – Data Modeling

What are the primary benefits of data modeling to the


organization?
What behaviors help organizations realize these
benefits?
How can they get more value from data modeling?

New England Data Management Community

You might also like