Day 1 Consolidation 1695814006998
Day 1 Consolidation 1695814006998
2023
IDMC Associate Bootcamp Setup
A single-stop, comprehensive program to serve all levels of learning needs!
Key Facts
•   Key Documentation under Related
    Content
•   Q&A box available for live sessions
•   Access to speaker information                Presentation             Q&A   Docs
•   Resize and change layout as you wish
•   Toolbox always available
                                                                Toolbox
       Agenda - DAY 1
               IDMC
                                                   IDMC        Data             IPU
             Bootcamp
                                                    Intro   Integration   Pricing Model
                Intro
                                                                                Modernization
                 DAY 1                             Cloud Data      Cloud
                                                                                    Panel
                 Recap                             Governance   Modernization
                                                                                 Discussion
2023
Technology Challenges                                             Business Challenges
                DATA                           DATA                API & APP                    DATA                     MDM & 360         GOVERNANCE &      DATA
               CATALOG                     INTEGRATION           INTEGRATION                   QUALITY                  APPLICATIONS         PRIVACY      MARKETPLACE
A A H I O N V O T D D
                                                                                                      L                          U
Vendors
B M O J M O P P
                   C                         F      P         50%W organizations
                                                                              P  rely on 5+K tools                                              Q
D G K K S R
55% have 1,000+ data sources and 78% predict more in 2023
                                                              Statistics are based on 600 CDOs surveyed around the world – November 2022
          10   © Informatica. Proprietary and Confidential.
Achieving Business Outcomes with Data
ETL Developer Data Engineer Citizen Integrator Data Scientist Data Analyst Business Users
                DISCOVER &                 ACCESS &                    CONNECT &                                                                                GOVERN &                      SHARE &
                                                                                                 CLEANSE & TRUST             MASTER & RELATE
               UNDERSTAND                 INTEGRATE                    AUTOMATE                                                                                 PROTECT                     DEMOCRATIZE
                   DATA                      DATA                        API & APP                         DATA                   MDM & 360                    GOVERNANCE &                    DATA
                  CATALOG                INTEGRATION                   INTEGRATION                        QUALITY                APPLICATIONS                    PRIVACY                    MARKETPLACE
                                                                                             Connectivity
                                                                                        Metadata System of Record
Multi-Cloud Hybrid
                 The
                                                      reference architecture                 performance and resiliency
               49                           Trillion
                                          Transactions per month
                                                              18   Petabytes
                                                                   Metadata
     49
      Analysts andTransactions
                               Trillion
                    data scientists per month                                   Boosts productivity for data
          find trusted data faster                                              engineers and data stewards
                                                      AI-powered Metadata
                                                    Intelligence & Automation
                                                                     18            Petabytes
                                                                                  Metadata
15   © Informatica. Proprietary and Confidential.
 Simple Consumption Based Pricing
         I nformatica
                                                                      Pay for only what you use
       P rocessing
                                                                      Access to all platform services
       U nit
                                                                    9
                                                               of Fortune 10
                                                                  85
                                                               of Fortune 100
2023
                                                                                              DATA CONSUMERS
ETL Developer Data Engineer Citizen Integrator Data Scientist Data Analyst Business Users
                DISCOVER &                 ACCESS &                    CONNECT &                                                                                         GOVERN &                        SHARE &
                                                                                                  CLEANSE & TRUST                 MASTER & RELATE
               UNDERSTAND                 INTEGRATE                    AUTOMATE                                                                                          PROTECT                       DEMOCRATIZE
                   DATA                      DATA                        API & APP                          DATA                       MDM & 360                        GOVERNANCE &                      DATA
                  CATALOG                INTEGRATION                   INTEGRATION                         QUALITY                    APPLICATIONS                        PRIVACY                      MARKETPLACE
                                                                                              Connectivity
                                                                                            Metadata System of Record
DATA SOURCES
                                                                                                          Data Integration
                                                                 Google
                                             Azure Data Lake     Cloud Storage
                                              Storage Gen2
                                                                                                                                       Cloud Data Warehouse                                         Line of
                                                                                                                                                                                                   Business
                                       Landing                    Data             Enterprise                                                         Google
                                                                                                                                                      BigQuery
                                        Zone                   Enrichment            Zone                                                             Azure
                                                                                                                                                      Synapse
                                                                                                                                                      Analytics
        Cloud                                                                                                                                                                                       Data
                                                                                                                                        Data Science / AI                                          Engineer
                                                                 Storage
                                                 Elastic Compute - Spark
                                                                                                                                                      Azure
                                                                                                                                                      Machine
                                                                                                                                                      Learning
                                                                                                                                                                                                    Data
                                                                                                                                                                                                   Scientist
                                                                                     API & Application Integration
         IoT
                                                                                                Data Catalog
                                                                                                                                                                                                  Governance
                                                                                                                                                                                                   Manager
                                                                                       Governance and Privacy
3    Serverless Spark/Kubernetes        2
     Cluster managed by Informatica
     Push Down Optimization to
4    Cloud Lakehouses                                                      Metadata      Informatica hosted
                                                                                         Intelligent Cloud Services
                                                 4       Data
                                                                                         IDMC
                                                                1
                                                                    Cloud Applications
                                                     firewall
IDMC – A Secure Platform
       49                        Trillion
                               Transactions per month
                                                        18   Petabytes
                                                             Metadata
24     © Informatica. Proprietary and Confidential.
     Operational Insights
Hybrid Cloud
Integrated Dashboard
Hybrid Cloud
Integrated Dashboard
SAML Authentication
                                                                                                     IP Address Filtering
                                                                                                     Allowed trusted IP ranges to access tenant
                                                                 Resources
                                                                                         Permission
2023
          IDMC Security Architecture Diagram
                                           Host
                                                                                                           Business Data
                                                                                                             (HTTPS)
                                                        Cloud
                                                                Multi-Tenant
                                              Micro             Repositories
                                             Services
                           Front End
                                                                           Metadata
                                             Services
                                             Services
                                             Services                      Data
                                                                                                 Business Data                  Cloud Applications
                                                           AES Encryption (256 Bit)                (HTTPS)
      • Data Integration
        - Cloud Data Integration (CDI) – Batch mainly with a few RT-enabled connectors. Most closely resembles
          PowerCenter in feel and execution
        - Advanced Mappings – Almost all of the same functionality is Cloud Data Integration with much better
          Hierarchical data handling and execution on SPARK
      • Application Integration
        - Cloud Application Integration (CAI) – Designed for orchestration or transaction style patterns on an event-
          driven basis
      • Mass Ingestion (MI)
        - File Mass Ingestion – Database Ingestion – App Mass Ingestion – Steaming Mass Ingestion
        - Low touch wizard-driven tool for moving large amounts of data from source to target with no
          transformation. General the first step in a full ELT pattern
      • Data Quality
      • Data Profiling
     49                        Trillion
                             Transactions per month
                                                    DEMO
                                                    File Movement
                                                               18   Petabytes
                                                                    Metadata
38   © Informatica. Proprietary and Confidential.
     Cloud Mass Ingestion
                                                                       MI Metadata
      Transfer any file type with a
      high performance and                                                                            Cloud
      scalability                                                                                                                GCS   Redshift   S3
                                                                               1     MI Task                       4
                                                                                                              Update Job Log Azure DW, Blob, Data Lake
      Job and file level tracking and
      monitoring
                                                                                           Secure
                                                                                           Agent
      Orchestrate File transfer and
                                                          2                                                             3
                                                                                          File Mass
      ingestion in hybrid/cloud as                                                        Ingestion                Ingest Data
                                                                                           Service
      managed and secure service                       Advanced
                                                       FTP/SFTP/FTPS
                                                       Cconnector
41      © Informatica. Proprietary and Confidential.
Mass Ingestion Files
                                                                                                                  Data              Data
     Ingest relational database data                                                                            Warehouse          Lakes
     from Oracle, SQL-Server & MySQL.
     Also supporting Schema Drift on
     CDC supported Databases
                                                                   On-Premises
                                                                     Sources
45     © Informatica. Proprietary and Confidential.
          Benefits of Mass Ingestion Databases
                                                    WebLogs
                                                                 Data Lake
        Ingest streaming data: Logs,                 Social        & ML        Consumption
                                                     Media
        clickstream, social media,
        Kafka Kinesis, S3, ADLS,                    Messaging
                                                     Systems
        Firehose, etc.
        Real-time monitoring of
        ingestion jobs with lifecycle
        management and alerting in
        case of issues
     49                        Trillion
                             Transactions per month
                                                    DEMO
                                                    Data Transformation
                                                                  18      Petabytes
                                                                          Metadata
52   © Informatica. Proprietary and Confidential.
     Cloud Data Integration
                               Task Flows
Multi Cloud Integrations using CDI
                                                         • Ease of Use
                                                         • Templates and Wizards
                                                         • Micro-service Architecture
                                                         • Reusability
                                                         • Broad Hybrid and Multi-Cloud
                                                           Connectivity
                                                         • No coding across the platform
                                                         • Performance optimizations like
                                                           CDC, parallel processing,
                                                           pushdown optimization, Mass
                                                           Ingestion, etc
  • Data Integration: Build a template once – automate mapping execution for 1000’s of
    sources with different schemas automatically
  • Mapping self-adjusts dynamically to external schema changes and column characteristics
 Generic Source and Target           Varying logic, e.g., apply TRIM for varying
 with varying schemas                number of String fields in the Source
     Advanced Integration
     Previously CDI-Elastic
•   Single design time experience for                                  Advanced Design Experience for Data Integration
    all your data integration needs
•   250+ purpose-built, cloud-native                    Adva                                                                       CDI
                                                        nced
    connectors with purpose-built                        Map
    transformations for any type of                     pings
    workload, at any scale
•   Support for optimized mixed-mode
                                                                                CLAIRE FinOps Optimization Engine
    execution (part DTM, part Spark)
•   Intelligent (CLAIRE-driven)
                                                         Spark Processing                  ELT (PDO)                ETL (Secure Agent Processing)
    optimization @runtime for best
    cost-performance
                                        Execution
                                                    Auto-Scaling
                                                    Elastic Spark                   ETL                             ETL
                                                    Cluster
                                                                                                                                         Secure Agent
                                                                                                                                          Processing
Advanced Mappings
Enabling Kubernetes for auto-scaling and provisioning
IDMC
                                                        Same, familiar
                                                        Informatica Design-Time
         Manual work
         30% of your Engineers time
                                                 Pick new
                                                Parameters
         Frequent Outages
         Pager ringing at 3 AM
                                                    Developer
                                                                Analyze the
                                      Run the Job
                                                                  Logs
         Slow and expensive
         Missing SLA’s every week.
Advanced Mappings: What is tuned?
                                                                 Execute Advanced
                              Source File Directory
                                                                 Mapping Process
   • Solution:
      •   Advanced Mappings can track data that has been processed during a previous run of an MCT by
          persisting the state information of the job run.
      •   Incremental File Load is a feature of Advanced Mappings which will maintain the state
          information and prevent reprocessing of old data.
      •   Time travel will help to go back in time and re-process files
         Use Case
         Data will be aggregated into a summary table
     49                        Trillion
                             Transactions per month
                                                     DEMO
                                                    Leverage Investment In
                                                            CDW
                                                                    18       Petabytes
                                                                             Metadata
74   © Informatica. Proprietary and Confidential.
Advanced Pushdown Optimization
Server
                Features
 • Data pipeline logic gets translated into
   Cloud ecosystem based native SQL (SQL
   Based PDO) or native ecosystem API/
   commands (Ecosystem PDO) based on the
   Data integration pattern
 • Support for Full, Source, Partial PDO
 • Broadest array of connectors and support
   for all major ecosystems (CDL/CDWs)
 • Ecosystem agnostic
 • Simple drop-down option in GUI with no                                                S3/ ADLS/ GCS
   need to learn proprietary commands
                                                          Enable faster processing with zero data egress charges through
                                                                         advanced pushdown optimization
     ODBC PDO vs APDO
      ODBC Based Pushdown Optimization                      Advanced Pushdown Optimization
      Developed 15+ years ago. No further plans to expand   Specifically designed to support CDW/CDL patterns. Major
      transformation/function support                       expansion plan for transformations/function support, more
                                                            features in roadmap.
      An ODBC connection needs to be created and used in    Advanced Pushdown Optimization is a native connector feature.
      mappings                                              No separate ODBC connection required
      Classical CDW patterns only                           Multiple patterns within CDW, CDL, including classical CDW
      Supports only ODBC connection features                Existing connector features are supported (example: any
                                                            advanced authentication options)
      No separate license required                          Enabled with IPU based model. For Non-IPU, requires separate
                                                            license.
                                                 S3                                                                             S3
                                                                                                               COPY      $
                     $$$
                                              Redshift                                                                       Redshift
                                              AWS                                                                            AWS
   Loading data from Data lake to Data warehouse using                           Loading data from Data lake to Data warehouse using AWS
                    Informatica engine                                                                 commands
Use Case 2: Data warehouse Pushdown
CDW CDW
ODS ODS
$$$
                                                AWS                                                                                   AWS
  Loading data from staging to ODS in Snowflake using Informatica engine                  Loading data from staging to ODS in Snowflake using Snowflake engine
         Use Case
         Any errors must trigger a case in ServiceNow
     49                        Trillion
                             Transactions per month
                                                     DEMO
                                                    APIs and Error Handling
                                                                    18        Petabytes
                                                                              Metadata
83   © Informatica. Proprietary and Confidential.
     Cloud Application Integration
   …for sharing data using                    …for integrating business                                         … for app-to-app data
 no-code data access APIs                     processes that span applications                                  interchange in real-time using
                                              and automating user tasks                                         data events and messaging
API and Application Integration
Where things run                                                                                                                Cloud-based Design,
                                                                                                                        Deployment and Management
                                             iPaaS
Data Services
2023
                                             DATA CONSUMERS
 DATA         DATA        API & APP        DATA            DATA          MASTER DATA     CUSTOMER &        DATA       GOVERNANCE &
CATALOG   INTEGRATION   INTEGRATION        PREP           QUALITY        MANAGEMENT      BUSINESS 360   MARKETPLACE     PRIVACY
                                               DATA SOURCES
          Why is Data Management Hard and Complex?
                DATA                           DATA             API & APP          DATA          MDM & 360     GOVERNANCE &      DATA
               CATALOG                     INTEGRATION        INTEGRATION         QUALITY       APPLICATIONS     PRIVACY      MARKETPLACE
A A H I O N V O T D D
                                                                                       L                U
Vendors
B M O J M O P P
C F P W P K Q
D G K K S R
IDMC Service A
                                                                                   IPU Consumption
                                                                                     Measured by
                         Exchange
                                               IPU      Consume   IDMC Service B                     Flexible and Interchangeable
                                             Currency
                                                                  IDMC Service X
       CATALOG                INGEST              INTEGRATE             CLEANSE              RELATE             GOVERN              PROTECT              PREPARE           SHARE & DELIVER
     Discover, catalog,    Multi-latency data   Integrate all types   Make data fit for   Match and relate   Define and verify   Detect and protect     For analytics         Publish and
       and curate all     ingestion and edge         of data             purpose           identities and    data governance       sensitive data     and collaborate on    manage APIs and
      enterprise data         computing                                                       entities           policies                                  projects          Data Services
Azure
       CATALOG                INGEST              INTEGRATE             CLEANSE              RELATE             GOVERN              PROTECT              PREPARE           SHARE & DELIVER
     Discover, catalog,    Multi-latency data   Integrate all types   Make data fit for   Match and relate   Define and verify   Detect and protect     For analytics         Publish and
       and curate all     ingestion and edge         of data             purpose           identities and    data governance       sensitive data     and collaborate on    manage APIs and
      enterprise data         computing                                                       entities           policies                                  projects          Data Services
Azure
   Daily Assets
                    100/1k/100k Assets Number of Assets that are stored by the service on a single day
     Stored
70
60
50
40
30
20
10
                     -
                                                                                                 IPU
                                                                                            Volume Package
    Whether a customer pre-commits to 120 IPUs or 10K IPUs, they all get access to the same functionality
Informatica IPU Commercial Model
      49                        Trillion
                              Transactions per month
                                                     DEMO
                                                     IPU Metering
                                                                18   Petabytes
                                                                     Metadata
103   © Informatica. Proprietary and Confidential.
Wrap up Day 1
                 IDMC
                                                     IDMC        Data             IPU
               Bootcamp
                                                      Intro   Integration   Pricing Model
                  Intro
                                                                                  Modernization
                   DAY 1                             Cloud Data      Cloud
                                                                                      Panel
                   Recap                             Governance   Modernization
                                                                                   Discussion