Data: Information
BYTE: Binary term. A group of 8 BITs treated as a single entity or as an array of individual BITs.
- BIT: binary digit, either 0 or 1.
Barriers to study
1. Absence of mass:
Mb = megabits
MB = megabytes.
- 128 GB = 128.000.000.000 bytes = 1.024.000.000.000 bits (8 bits per byte)
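The conversion above can be checked with a few lines of Python. This is only a sketch: it assumes decimal units (1 GB = 10^9 bytes), and the function names are made up for illustration.

```python
BITS_PER_BYTE = 8


def gigabytes_to_bits(gigabytes: int) -> int:
    """Convert decimal gigabytes (1 GB = 10**9 bytes) to bits."""
    return gigabytes * 10**9 * BITS_PER_BYTE


def megabits_to_megabytes(megabits: float) -> float:
    """Convert megabits (Mb) to megabytes (MB): divide by 8."""
    return megabits / BITS_PER_BYTE


print(gigabytes_to_bits(128))      # 1024000000000
print(megabits_to_megabytes(100))  # 12.5
```

This is why a 100 Mb/s connection downloads at most 12.5 MB per second.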
Infrastructure:
Infra means “below”, so infrastructure is the “underlying structure” of a country and its economy, the
fixed installations….
Hardware (physical)
Computer memory: temporary storage for data and instructions that the CPU can access quickly
- Also known as Random Access Memory (RAM)
- Stores active data: holds data and programs currently in use
- Allows multitasking: allows multiple programs to run simultaneously
- Improves performance: provides fast access to frequently ….
Wifi
Wireless fidelity
most commonly used wireless communication technology that uses radio waves to provide
wireless high-speed internet access.
- Primary medium for global internet traffic
Wifi access points
device that creates a wireless local area network. Connects to a wired router, switch, or hub via
ethernet cable.
Router
device that connects networks to each other, forwarding data packets from one location to
another until they reach their destination.
27/03
Gradient: step.
The TCO (total cost of ownership) is managed by the managers.
❖ Fiber optics is a technology that uses thin strands of glass or plastic, known as optical
fibers, to transmit data as light signals over long distances, to connect the world. Here
are some key points about fiber optics:
➢ High-Speed Data Transmission: Fiber optics can transmit data at incredibly high
speeds, much faster than traditional copper cables. This makes it ideal for
internet connections, telecommunications, and networking
➢ Long-Distance Communication: Because light signals experience less loss over
long distances compared to electrical signals in copper wires, fiber optics is
perfect for long-distance communication.
➢ Immunity to Electromagnetic Interference: Unlike metal wires, optical fibers are
not affected by electromagnetic interference, which ensures a more stable and
reliable connection
- If there were no fiber cables, the world would be disconnected, and it would have an
impact on economies.
SpaceX’s goal is to provide high-speed, low-latency broadband internet connections to parts of
the planet that are traditionally underserved, like rural areas, and to theoretically offer more
competitive pricing in urban spaces.
● People who control the communication lines can control everything.
- Products are worthless if they can't reach the customer.
Program
Sequence of instructions for how data is processed.
- Those instructions are written in a way that the computer can understand.
- At the core of all software is logic and the principle of input > process > output.
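As a small illustration of input > process > output, here is a sketch of a tiny program. The temperature conversion and the function names are made up for this example.

```python
def process(celsius: float) -> float:
    """Process step: convert Celsius to Fahrenheit."""
    return celsius * 9 / 5 + 32


def run(raw_input: str) -> str:
    value = float(raw_input)      # input: text typed by the user
    result = process(value)       # process: apply the instructions
    return f"{result:.1f} F"      # output: text shown to the user


print(run("100"))  # 212.0 F
```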
Software
Computer programs (stored and executed by computer hardware) and associated data (stored
in the hardware) that may be dynamically written or modified during execution.
- System software, also known as the operating system
➔ The software “master control application” that runs the computer. It is the first
program loaded when the computer is turned on, and its main component, the
kernel, resides in memory at all times.
➔ The operating system sets the standards for all application programs (such as
the Web server) that run in the computer.
➔ The applications communicate with the operating system for most user interface
and file management operations.
Application software
➔ A system for collecting, saving, processing, and presenting data by means of a
computer.
➔ The term application is generally used when referring to a component of software
that can be executed.
➔ The terms application and software application are often used synonymously.
Virtualization
Virtual means that something is created by computer technology and appears to exist, but does not
exist in the physical world.
- To virtualize is to change something that exists in a real form into a virtual version (= one that is
created using a computer)
- Virtualization is an abstraction layer that decouples the physical hardware from the
operating system to deliver greater IT resource utilization and flexibility.
It makes it possible, for example, to run 2 virtual computers on a single physical machine.
- Virtualization is a process that allows for more efficient use of physical computer
hardware and is the foundation of cloud computing: many virtual computers run on one physical one.
- Virtualization allows cloud computing, reducing costs.
Cloud computing
Is a collection of computing services (compute, storage, network, applications, etc) accessible
on-demand over a network
- It is a business and technology model for delivering and acquiring IT services in a utility-
like fashion.
● Services are elastic and can shrink and grow in response to demand
● Services can be programmatically controlled on-demand
● Reliability and scalability logic remain hidden, giving the illusion of infinite capacity
● Charges are based on usage- no upfront investment
● Services are provided to multiple customers, typically using shared resources.
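Elasticity and usage-based charging can be sketched as follows. All numbers (capacity per instance, price, hourly demand) are hypothetical and only serve to show the idea: the number of instances follows demand, and the bill follows the instances actually used.

```python
import math

CAPACITY_PER_INSTANCE = 100        # hypothetical: requests/second one instance can serve
PRICE_CENTS_PER_INSTANCE_HOUR = 5  # hypothetical price, in cents to avoid rounding issues


def instances_needed(requests_per_second: int) -> int:
    """Grow or shrink the pool to match demand (never below one instance)."""
    return max(1, math.ceil(requests_per_second / CAPACITY_PER_INSTANCE))


hourly_demand = [80, 250, 900, 120]  # hypothetical requests/second, one value per hour
instances = [instances_needed(d) for d in hourly_demand]
cost_cents = sum(instances) * PRICE_CENTS_PER_INSTANCE_HOUR

print(instances)   # [1, 3, 9, 2]
print(cost_cents)  # 75
```

Sizing for the peak all day would need 9 instances for 4 hours (180 cents) instead of 75.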
Cloud computing is an approach to computing that leverages the efficient pooling of on-demand,
self-managed virtual infrastructure, consumed as a service.
1. Efficiency through utilization and automation
- Pooling: From machines to on-demand, highly elastic resource pools
- Zero-touch infrastructure: Policy-driven automation of provisioning, development
and management.
2. Agility with control
- Self service: Easy access with policy-based provisioning and development
- Control: Application-aware infrastructure with built-in availability, scalability,
security, and performance guarantees
3. Freedom of choice
- Open and interoperable: Application mobility between clouds, based on open
standards
- Leverage existing investment: Brings the benefits of cloud computing to existing applications
and datacenters.
Computers work with electricity, which allows only two possible states: 0 and 1. Humans, by
contrast, use words. So how do you program a computer?
Proprietary software
- is computer software whose source code is not publicly accessible. Only the company
that created it can make changes.
- This software is developed and tested by the individual or organization that owns it rather than
by the public.
- It’s managed by a private team responsible for its development.
- Users must purchase this software, and commercial support is available for its
maintenance.
- The company issues a specific license to users, allowing them to use the software.
To have a competitive advantage, they protect their code.
Open-source software
Open-Source Software is a computer program with its source code freely accessible online,
allowing programmers to enhance it by adding features at no cost.
This software is collaboratively developed and tested by a community of open-source
developers. It offers both community and commercial support for maintenance
- Anyone can modify the software, and it is free to install.
Perpetual licensing
- is the most traditional software licensing model. The customer purchases software once
and keeps it forever.
- Customers typically pay upfront for the license and sometimes pay a yearly
maintenance fee. One advantage of this license is that customers know the exact
amount they need to pay, without being surprised by pricing changes in a subscription
model. After purchasing a perpetual license, customers often host the software on their
servers. The money you pay upfront is the only investment needed.
- Today, many companies are moving away from traditional perpetual licenses to more
flexible subscription models (for example Microsoft Office Suite).
Concurrent license
- Based on the maximum number of users who will use it simultaneously.
For example, if a company purchases 5 concurrent licenses, that means that up to 5 of their
employees can use the software at any given time.
- The software can be used by that number of users wherever they are located. They could each be
in a different part of the world, or form 2 groups of that size working on different
schedules.
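A concurrent license behaves like a pool of seats. The sketch below is a hypothetical illustration (the class and method names are invented, and real license servers are more involved): a sixth user is refused until someone releases a seat.

```python
class ConcurrentLicensePool:
    """Toy model of a concurrent license: at most `seats` users at the same time."""

    def __init__(self, seats: int) -> None:
        self.seats = seats
        self.active = set()  # users currently holding a seat

    def check_out(self, user: str) -> bool:
        """Give the user a seat if one is free; return False when all are taken."""
        if user in self.active:
            return True
        if len(self.active) >= self.seats:
            return False
        self.active.add(user)
        return True

    def check_in(self, user: str) -> None:
        """Release the user's seat so someone else can use it."""
        self.active.discard(user)


pool = ConcurrentLicensePool(seats=5)
results = [pool.check_out(f"employee{i}") for i in range(1, 7)]
print(results)  # [True, True, True, True, True, False]

pool.check_in("employee1")          # employee1 closes the program
print(pool.check_out("employee6"))  # True
```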
Subscription-based licensing
- With a subscription license, instead of making a one-time purchase, you pay a monthly
or annual fee to use the software.
- has become one of the most popular software license types, specifically in today’s
consumer market.
Companies like Spotify, Adobe, Netflix, and others use this software licensing model with great
success.
Customer satisfaction is particularly high with this model due to customers having previous
familiarity and comfort with such a system.
Additionally, customers enjoy knowing exactly when they will be charged for use (monthly,
yearly, etc.).
You don't own the software for which you obtain a license to use.
Developing software
Pros:
- Customization: You can tailor the software to meet your exact needs.
- Innovation: Homegrown software can bring innovation to your market or create more
efficient operations.
Cons:
- Time and Resource Intensive: It can be a strain on time and resources.
- Requires Expertise: You need to have the necessary skills and knowledge to develop
the software.
10/04
How companies store data
A database (supports data) is an organized collection of data stored and accessed electronically
- It is a crucial tool used by organizations to store, retrieve, and manage information
efficiently.
3 types of databases
1. Online transaction processing, OLTP (stores the present)
- Class of system that manages transaction-oriented applications, typically for data entry
and retrieval transaction processing.
- To ensure a process is done correctly
- High transaction Volume: It can handle a large volume of transactions at the same time.
(insert, update, delete)
- Concurrency: Supports many users performing transactions simultaneously.
- Data integrity: Ensuring data accuracy and consistency is crucial, often achieved through
ACID (Atomicity, Consistency, Isolation, Durability) properties
- Real-time processing: transactions are processed in real-time, providing immediate
feedback to the end user.
- Examples: banking systems, order entry systems, retail sales systems, and airline reservation
systems
- It is based on the theory of intersections (set theory), because it brings information together in a single system.
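The atomicity part of ACID can be sketched with Python's built-in sqlite3 module. The accounts table, names, and amounts are hypothetical. The point is that a transfer either happens completely or not at all.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE accounts (name TEXT PRIMARY KEY, "
    "balance INTEGER CHECK (balance >= 0))"
)
conn.executemany("INSERT INTO accounts VALUES (?, ?)", [("alice", 100), ("bob", 50)])
conn.commit()


def transfer(conn, source, target, amount):
    """Move money in one transaction; roll everything back if any step fails."""
    try:
        with conn:  # commits on success, rolls back if an exception is raised
            conn.execute(
                "UPDATE accounts SET balance = balance + ? WHERE name = ?",
                (amount, target),
            )
            conn.execute(
                "UPDATE accounts SET balance = balance - ? WHERE name = ?",
                (amount, source),
            )
        return True
    except sqlite3.IntegrityError:  # the CHECK constraint rejected a negative balance
        return False


def balances(conn):
    return dict(conn.execute("SELECT name, balance FROM accounts ORDER BY name"))


print(transfer(conn, "alice", "bob", 30))   # True
print(balances(conn))                       # {'alice': 70, 'bob': 80}
print(transfer(conn, "alice", "bob", 500))  # False
print(balances(conn))                       # {'alice': 70, 'bob': 80}
```

In the failed transfer, the credit to bob had already been applied when the debit was rejected, and the rollback undid it.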
2. Data warehouse (stores info from the past)
- A centralized repository designed to store large volumes of data from various sources.
- It is primarily used for reporting and data analysis, supporting business intelligence
activities (decision making with the info you have).
- Integration: Data from different sources is integrated into a single, consistent data store.
- Historical Data: It stores historical data, allowing for trend analysis and long-term
reporting.
- Read-Optimized: Unlike OLTP systems, data warehouses are optimized for read-heavy
operations, such as complex queries and analysis (not for updating or inserting data,
because it is the past; it is just for reading).
- ETL Processes: Data is typically Extracted from source systems, Transformed to ensure
consistency and quality, and Loaded into the data warehouse (ETL process).
- Data warehouses enable organizations to perform powerful analytics, data mining, and
support decision-making processes
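The ETL process described above can be sketched in a few lines. The two source systems, their field names, and the sales table are hypothetical. An in-memory SQLite database stands in for the data warehouse.

```python
import sqlite3
from datetime import datetime
from decimal import Decimal

# Hypothetical source systems that describe the same kind of sale differently.
STORE_ROWS = [
    {"sku": "A1", "amount": "19.90", "date": "2024-03-01"},
    {"sku": "B2", "amount": "5.00", "date": "2024-03-02"},
]
WEB_ROWS = [{"product": "a1", "total_cents": 2500, "day": "01/03/2024"}]


def extract():
    """Extract: read the raw rows from each source."""
    return STORE_ROWS, WEB_ROWS


def transform(store_rows, web_rows):
    """Transform: one consistent shape (SKU, cents, ISO date, channel)."""
    rows = []
    for r in store_rows:
        cents = int(Decimal(r["amount"]) * 100)
        rows.append((r["sku"].upper(), cents, r["date"], "store"))
    for r in web_rows:
        day = datetime.strptime(r["day"], "%d/%m/%Y").date().isoformat()
        rows.append((r["product"].upper(), r["total_cents"], day, "web"))
    return rows


def load(conn, rows):
    """Load: write the cleaned rows into the warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales "
        "(sku TEXT, amount_cents INTEGER, sale_date TEXT, channel TEXT)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?, ?)", rows)
    conn.commit()


warehouse = sqlite3.connect(":memory:")
load(warehouse, transform(*extract()))
totals = warehouse.execute(
    "SELECT sku, SUM(amount_cents) FROM sales GROUP BY sku ORDER BY sku"
).fetchall()
print(totals)  # [('A1', 4490), ('B2', 500)]
```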
3. Online analytical processing, OLAP (stores the future)
- It is a technology used for performing complex queries and multidimensional analysis on
large volumes of data, typically stored in data warehouses or data lakes.
- Multidimensional Analysis: OLAP allows data to be analyzed from multiple perspectives,
often using OLAP cubes, which are multidimensional arrays of data.
- High-Speed Query Performance: It is optimized for fast retrieval of data, enabling quick
responses to complex queries
- Business Intelligence: OLAP is widely used in business intelligence for decision support,
reporting, and forecasting.
- Data Aggregation: It supports the aggregation of data, such as summing, averaging, and
other statistical operations, across multiple dimensions.
- For example, a retailer might use OLAP to analyze sales data by product, region, and
time period, allowing them to identify trends and make informed business decisions
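The retailer example can be sketched as aggregation over chosen dimensions. The sales rows below are invented, and a real OLAP engine would precompute and index these totals rather than loop over rows.

```python
from collections import defaultdict

DIMENSIONS = ("product", "region", "quarter")

# Hypothetical sales facts: (product, region, quarter, units sold)
SALES = [
    ("laptop", "north", "Q1", 10),
    ("laptop", "south", "Q1", 4),
    ("laptop", "north", "Q2", 7),
    ("phone", "north", "Q1", 20),
    ("phone", "south", "Q2", 15),
]


def aggregate(rows, by):
    """Sum units for every combination of the dimensions listed in `by`."""
    positions = [DIMENSIONS.index(name) for name in by]
    totals = defaultdict(int)
    for row in rows:
        key = tuple(row[p] for p in positions)
        totals[key] += row[3]
    return dict(totals)


print(aggregate(SALES, by=["product"]))
# {('laptop',): 21, ('phone',): 35}
print(aggregate(SALES, by=["region", "quarter"]))
# {('north', 'Q1'): 30, ('south', 'Q1'): 4, ('north', 'Q2'): 7, ('south', 'Q2'): 15}
```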
OLTP
SQL (Structured Query Language)
OLTP systems primarily use SQL for managing and manipulating data.
SQL is widely used because it is well-suited for handling the high volume of simple, interactive
transactions typical in OLTP environments.
It is fundamentally based on set theory, a branch of mathematical logic that deals with
collections of objects.
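The link between SQL and set theory shows up directly in the operators UNION, INTERSECT, and EXCEPT. A minimal sketch with Python's sqlite3 module follows. The two customer tables are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE online_customers (name TEXT);
    CREATE TABLE store_customers (name TEXT);
    INSERT INTO online_customers VALUES ('ana'), ('ben'), ('cho');
    INSERT INTO store_customers VALUES ('ben'), ('cho'), ('dan');
    """
)


def names(sql):
    """Run a query and return the first column as a list."""
    return [row[0] for row in conn.execute(sql)]


# Intersection: customers who buy through both channels.
both = names(
    "SELECT name FROM online_customers INTERSECT "
    "SELECT name FROM store_customers ORDER BY name"
)
# Union: every customer, without duplicates.
everyone = names(
    "SELECT name FROM online_customers UNION "
    "SELECT name FROM store_customers ORDER BY name"
)
# Difference: customers who only buy online.
online_only = names(
    "SELECT name FROM online_customers EXCEPT "
    "SELECT name FROM store_customers ORDER BY name"
)

print(both)         # ['ben', 'cho']
print(everyone)     # ['ana', 'ben', 'cho', 'dan']
print(online_only)  # ['ana']
```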
E-commerce websites:
Impact: These systems handle customer orders, inventory updates, and payment processing
- They ensure that transactions are processed quickly and accurately, enhancing
customer satisfaction and operational efficiency
Banking systems:
Impact: OLTP systems process financial transactions, update account balances, and manage
customer information
- They provide real-time access to account data, improving customer service and reducing
the risk of errors.
Airline reservation systems:
Impact: These systems manage flight booking, ticketing, and seat assignments. They enable
airlines to efficiently handle large volumes of transactions, ensuring accurate and timely updates
to flight schedules and passenger information.
Point of sale system:
Impact: POS systems facilitate sales transactions and inventory updates in real-time. They help
retailers manage stock levels, track sales data, and improve customer service by speeding up
the checkout process.
Healthcare system:
Impact: OLTP systems manage patient records, appointment scheduling, and billing processes.
They ensure that healthcare providers have accurate and up-to-date information, improving
patient care and operational efficiency.
Performance:
• The number of transactions an OLTP system can support per minute varies widely depending
on the hardware, software, and specific configuration used. However, modern OLTP systems
can handle thousands to millions of transactions per minute.
OLTP products
Oracle:
Products: Oracle Database, Oracle Autonomous Transaction Processing.
Impact: Widely used in various industries for its robust transaction processing capabilities and
scalability.
IBM:
Products: IBM Db2, IBM Informix.
Impact: Known for high reliability and performance, especially in banking and financial services.
Microsoft:
Products: Microsoft SQL Server, Azure SQL Database.
Impact: Popular for its integration with other Microsoft products and cloud services, making it a
versatile choice for many businesses.
SAP:
Products: SAP (System Analysis Program development) HANA.
Impact: Offers real-time data processing and analytics, widely used in enterprise resource
planning (ERP) systems.
MongoDB (humongous):
Products: MongoDB.
Impact: A NoSQL database known for its flexibility and scalability, suitable for modern
applications requiring high transaction throughput.
DATA WAREHOUSE
Retail: A retail company uses a data warehouse to consolidate sales data from multiple stores
and online channels.
- Impact: This enables the company to analyze sales trends, manage inventory more
effectively, and personalize marketing campaigns based on customer purchasing
behavior.
Finance: Banks and financial institutions use data warehouses to integrate data from different
branches and departments.
- This helps in risk management, fraud detection, and regulatory compliance by providing
a comprehensive view of financial transactions and customer activities
Telecommunications: Telecom companies use data warehouses to analyze call records, customer
data, and network performance.
- This helps in optimizing network operations, improving customer service, and developing
targeted marketing strategies
Manufacturing: Manufacturing companies use data warehouses to track production data, supply
chain information, and quality control metrics.
- Impact: This enables better production planning, inventory management, and
identification of process improvements.
OLAP
OLAP SLICE: A term for a dimension held constant for all cells, so that multidimensional
information can be shown in the two-dimensional physical space of a spreadsheet or pivot table.
OLAP CELL: Each cell of the cube holds a number that represents some measure of the
business, such as sales volume, profit, cost, budget, or forecast.
OLAP DICING: Creating a sub-cube from the main one.
SLICING: Extracting a rectangular part of the cube in which one dimension has the same single value.
ROLL-UP: Summarizing data along a dimension.
DRILL-DOWN: The operation opposite to roll-up.
PIVOTING (ROTATION): Allows you to see another perspective on the dataset by rotating the whole
cube in space.
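These operations can be sketched on a tiny cube stored as a Python dictionary. The data and the function names are hypothetical. OLAP products implement the same ideas with specialized storage.

```python
from collections import defaultdict

AXES = ("product", "region", "month")

# Hypothetical cube: each cell maps (product, region, month) to units sold.
CUBE = {
    ("laptop", "north", "2024-01"): 5,
    ("laptop", "north", "2024-02"): 5,
    ("laptop", "south", "2024-01"): 4,
    ("phone", "north", "2024-01"): 20,
    ("phone", "south", "2024-04"): 15,
}


def slice_cube(cube, axis, value):
    """Slice: fix one dimension to a single value and drop that dimension."""
    i = AXES.index(axis)
    return {k[:i] + k[i + 1:]: v for k, v in cube.items() if k[i] == value}


def dice(cube, **allowed):
    """Dice: keep a sub-cube by restricting several dimensions to sets of values."""
    return {
        k: v
        for k, v in cube.items()
        if all(k[AXES.index(axis)] in values for axis, values in allowed.items())
    }


def roll_up(cube, axis, to_coarser):
    """Roll-up: summarize along one dimension (e.g. month -> quarter)."""
    i = AXES.index(axis)
    totals = defaultdict(int)
    for k, v in cube.items():
        totals[k[:i] + (to_coarser(k[i]),) + k[i + 1:]] += v
    return dict(totals)


def pivot(cube, order):
    """Pivot: reorder the dimensions to look at the same cells another way."""
    positions = [AXES.index(axis) for axis in order]
    return {tuple(k[p] for p in positions): v for k, v in cube.items()}


def month_to_quarter(month):
    year, number = month.split("-")
    return f"{year}-Q{(int(number) - 1) // 3 + 1}"


north = slice_cube(CUBE, "region", "north")
laptops = dice(CUBE, product={"laptop"}, region={"north", "south"})
by_quarter = roll_up(CUBE, "month", month_to_quarter)
rotated = pivot(CUBE, ("month", "region", "product"))

print(north[("laptop", "2024-01")])                 # 5
print(len(laptops))                                 # 3
print(by_quarter[("laptop", "north", "2024-Q1")])   # 10
print(rotated[("2024-04", "south", "phone")])       # 15
```

Drill-down is the reverse direction: going from the quarterly totals in by_quarter back to the monthly cells in CUBE.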
OLAP (Online Analytical Processing) products used for multidimensional data analysis:
IBM Cognos:
Features: Provides comprehensive BI capabilities, including reporting, analysis, dashboards,
and scorecards.
Microsoft SQL Server Analysis Services (SSAS):
Features: Offers powerful OLAP and data mining capabilities, integrated with the Microsoft BI
stack.
Oracle Essbase:
Features: A multidimensional database management system that supports complex analytical
queries and data modeling.
SAP BW/4HANA:
Features: Combines OLAP capabilities with SAP’s in-memory database technology for real-time
analytics.
MicroStrategy:
Features: Provides robust OLAP capabilities along with advanced analytics, mobile BI, and data
visualization.
Apache Kylin:
Features: An open-source distributed analytics engine designed to provide OLAP on Hadoop,
supporting extremely large datasets
Pentaho BI:
Features: An open-source BI suite that includes OLAP services, data integration, and reporting
tools.
Google BigQuery:
Features: A fully managed, serverless data warehouse that supports OLAP operations and
integrates with other Google Cloud services.
These products help organizations perform complex data analysis, enabling better decision-
making and strategic planning.
DATA ANALYSIS
Business analysts use tools like Microsoft Excel or Power BI to connect to the OLAP
cube.
They perform various analyses, such as:
Slice and Dice: Analyzing sales data by different dimensions (e.g., sales by product category
and region).
Drill Down: Exploring detailed data (e.g., drilling down from yearly sales to monthly and daily
sales).
Pivot Tables: Creating pivot tables to summarize and visualize sales data.
By using Microsoft SQL Server Analysis Services (SSAS) for OLAP, the retail company can:
1. Improve Decision-Making: Gain deeper insights into sales performance and customer
preferences.
2. Enhance Efficiency: Quickly generate reports and dashboards for various stakeholders.
3. Optimize Inventory: Ensure optimal stock levels and reduce overstock or stockouts.
16/04
Supply chain management (SCM) systems
Software solutions that manage the flow of goods, data, and
finances related to a product or service from raw material
sourcing to final product delivery
- Integrates various processes including procurement,
production, and distribution
7. Returns Management
- Handles product returns and reverse logistics
- Processes refunds, repairs, and restocking (when you give it back and they give you another)
Characteristics of TPS (transaction processing systems)
- Rapid processing
- Reliability
- Standardization
- Controlled access
Types of Transactions
- Order entry
- Billing
- Payroll
- Inventory management
In modern use, the word “seamless” usually has the figurative sense “without flaws” or “without
interruption” rather than “without seams.”
Types of integration
1. Data integration:
- Combines data from various sources into a unified view
- Example: Data warehouses, data lakes
2. Application Integration
- Connects different software applications to share functionalities
- Example: API-based integrations, middleware solutions
3. Process Integration
- Aligns and coordinates business processes across different systems
- Example: Workflow automation tools
4. Network Integration
- Connects different networks to enable seamless communication
- Example: Virtual Private Networks (VPNs)
Bus Architecture
a system in which multiple components share a common communication pathway to transfer
data, addresses, and control signals. It is widely used in computer hardware and networking to
facilitate efficient communication between different parts of a system.
- The term "bus" is inspired by public transportation buses, which carry passengers
between stops along a predefined route. Similarly, in computing, a bus carries data
between different components, ensuring structured and efficient communication.
- Interconnects systems.
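The idea of a shared pathway carrying data, addresses, and control signals can be sketched as below. This is a toy software model with invented class names, not how a hardware bus is built: every device is attached to the same bus and is reached through its address.

```python
class Device:
    """A component attached to the bus (memory, disk, ...)."""

    def __init__(self, name):
        self.name = name
        self.received = []

    def on_bus_message(self, control, data):
        self.received.append((control, data))


class Bus:
    """One shared pathway: every transfer carries an address, a control signal, and data."""

    def __init__(self):
        self._devices = {}  # address -> device

    def attach(self, address, device):
        self._devices[address] = device

    def transfer(self, address, control, data):
        """Deliver to the device at `address`; return False if nothing is attached there."""
        device = self._devices.get(address)
        if device is None:
            return False
        device.on_bus_message(control, data)
        return True


bus = Bus()
memory, disk = Device("memory"), Device("disk")
bus.attach(0x01, memory)
bus.attach(0x02, disk)

bus.transfer(0x01, "WRITE", 42)
bus.transfer(0x02, "WRITE", 7)
print(memory.received)                  # [('WRITE', 42)]
print(bus.transfer(0x09, "READ", None)) # False
```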
Integration Process:
1. Implement an ESB to act as a central hub for communication between systems
2. Develop APIs for each system to enable data exchange
3. Use HL7 (Health Level Seven) standards for healthcare data interoperability
4. Implement data transformation services to ensure consistent data formats
5. Set up real-time event-driven integration for critical updates (e.g., lab results)
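Steps 1, 4, and 5 can be sketched with a toy hub. Everything here is hypothetical: the ServiceBus class, the topic name, and the field names are invented, and the messages are plain dictionaries rather than real HL7 messages. The hub normalizes each event and pushes it to every subscribed system as soon as it is published.

```python
from collections import defaultdict


class ServiceBus:
    """Toy central hub: systems publish events; the hub transforms and forwards them."""

    def __init__(self):
        self._subscribers = defaultdict(list)  # topic -> handler functions
        self._transformers = {}                # topic -> normalizing function

    def set_transformer(self, topic, transform):
        self._transformers[topic] = transform

    def subscribe(self, topic, handler):
        self._subscribers[topic].append(handler)

    def publish(self, topic, message):
        """Normalize the message, then notify every subscriber immediately."""
        transform = self._transformers.get(topic, lambda m: m)
        normalized = transform(message)
        for handler in self._subscribers[topic]:
            handler(normalized)
        return normalized


def normalize_lab_result(raw):
    """Data transformation: consistent field names and types for all systems."""
    return {
        "patient_id": str(raw["PatientID"]).strip(),
        "test": raw["Test"].lower(),
        "value": float(raw["Value"]),
    }


patient_records = defaultdict(list)  # stands in for the patient record system
billing_queue = []                   # stands in for the billing system

hub = ServiceBus()
hub.set_transformer("lab.result", normalize_lab_result)
hub.subscribe("lab.result", lambda e: patient_records[e["patient_id"]].append(e))
hub.subscribe("lab.result", lambda e: billing_queue.append((e["patient_id"], e["test"])))

# The lab system publishes one result; both subscribers receive it.
hub.publish("lab.result", {"PatientID": " 123 ", "Test": "Glucose", "Value": "5.4"})
print(patient_records["123"])  # [{'patient_id': '123', 'test': 'glucose', 'value': 5.4}]
print(billing_queue)           # [('123', 'glucose')]
```

The lab system never calls the record or billing systems directly, so a new system can be added by subscribing to the hub without changing the others.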
Benefits achieved
1. Unified patient records accessible across all departments
2. Improved diagnosis and treatment with comprehensive patient data
3. Streamlined workflows (e.g., automated lab order and result reporting)
4. Enhanced billing accuracy and reduced administrative errors
5. Better data analytics for hospital management and research