0% found this document useful (0 votes)

8 views9 pages

MOOC PART 3 in Gndu

Data analytics

Uploaded by

dont.waste.time.only.study

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views9 pages

MOOC PART 3 in Gndu

Data analytics

Uploaded by

dont.waste.time.only.study

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

INTRODUCTION TO

MACHINE LEARNING

Machine learning is a type of AI (Artificial Intelligence) where

computers learn from data and get better at making predictions and
decisions over time.

In machine learning, systems called Artificial Neural Networks

(ANNs) are trained to find patterns in large amounts of data, helping
them make predictions based on new information. The better the
algorithm, the more accurate the predictions.

Here are some real-life examples of machine learning:

1. Digital assistants that understand voice commands and play

music or search the web.
2. Websites recommending products, movies, or songs based on
your past choices.
3. Spam detectors that block unwanted emails.
4. Medical systems that help doctors diagnose conditions using
images.
5. Self-driving cars that recognize their surroundings and make
decisions while driving.

As more data becomes available, computers get more powerful, and

scientists develop better algorithms, machine learning will play an
even bigger role in our lives.
Step 1: Preparing the Training Data Set

The training data set is sample data used to teach the machine
learning model how to solve a problem. It can be either labeled (with
features and classifications) or unlabeled (the model has to find
patterns on its own).

Whether labeled or not, the training data should be:

• Randomized
• Balanced
• Unbiased

The data is split into two parts:

• Training subset: Used to train the model.

• Evaluation subset: Used to test and improve the model.

Step 2: Choosing the Training Algorithm

The choice of algorithm depends on:

• Type of data (labeled or unlabeled)

• Amount of data
• The problem you want to solve

There are different machine learning algorithms available for both

labeled and unlabeled data.
For labeled training data:

• Regression algorithms: Used to find relationships in data.

o Linear regression predicts a value based on one variable
(e.g., predicting salary based on student records).
o Logistic regression is used for binary outcomes (yes/no).
o Support vector machines are good for complex
classification problems.
• Decision trees: Use rules to make recommendations (e.g.,
recommending a stock to buy based on data).
• Instance-based algorithms: Like K-Nearest Neighbor (k-
NN), classify data points by comparing them to nearby points.

For unlabeled training data:

• Clustering: Groups similar data points without prior

knowledge. Popular methods include K-means, Two-Step, and
Kohonen clustering.
• Association algorithms: Extract "if-then" rules from data
patterns, similar to data mining.
• Multilayer Feedforward Neural Networks: An ANN with
multiple layers where data moves through layers to reach
conclusions. Uses Backpropagation for learning. Deep neural
networks have many hidden layers to refine results.
Step 3: Training the System

Training an Artificial Neural Network (ANN) is done in multiple

rounds (epochs). Each epoch involves:

1. Input data is fed into the system.

2. Data moves through layers of the network.
3. The output is compared with the target output, and errors are
calculated.
4. Weights and biases are adjusted backward through the layers.

Step 4: Applying to Practical Data

5. Once trained, the ANN is used to solve real-world problems.

Over time, it can continue learning and improve based on new
data, like medical images or user browsing history, depending
on the task.

Supervised Machine Learning

In supervised learning, the system learns from labeled data, where

each input has a correct output. The system compares its actual output
with the correct one and adjusts when there's a mismatch. While it's
easier to use, preparing the training data is challenging, and there’s a
risk of overfitting, where the system becomes too specific to the
training data and struggles with new data.

Unsupervised Machine Learning

Unsupervised learning finds patterns and relationships in large

amounts of raw data without labeled examples. It groups similar data
into clusters, discovering hidden patterns instead of making decisions
or predictions.
Reinforcement Machine Learning

Reinforcement learning involves learning by trial and error without

using sample training data. Successful actions are reinforced to find
the best solution or policy.

Neural Networks (ANNs)

Artificial Neural Networks (ANNs) are inspired by the human brain.

They consist of many connected processing units called neurons.
Each neuron processes input, sums it, and based on a rule, either
sends an output or doesn’t. When many neurons work together, they
can perform complex tasks like classification and clustering. ANNs
can learn, which makes them very useful in solving problems.
Machine Learning (ML) and Deep Learning (DL) are based on ANN
structures.

Deep Learning (DL)

Deep Learning (DL) uses complex neural networks with many layers
to automatically learn features from large data sets, like images, text,
or sound. It mimics human learning by examples. DL models are
trained with lots of labeled data and can learn without manual feature
extraction. Unlike regular machine learning, DL improves with more
data. DL's rise is due to the availability of big data and powerful
hardware like GPUs, which speed up training significantly.

Deep Learning Architecture

Different types of neural networks (ANNs) are used for various tasks.
For example, recurrent networks are good for language and speech,
while Convolutional Neural Networks (CNNs) are best for image
processing and classification.
In Deep Learning, the network has many layers of neurons. The first
is the input layer, followed by hidden layers, and the final output
layer. Ordinary machine learning uses 2-3 hidden layers, but deep
learning can have hundreds.

Convolutional Neural Networks (CNNs)

CNNs are specialized neural networks that excel at processing

images, speech, and audio. They have three main types of layers:

1. Convolutional Layer: This is the first layer that extracts basic

features like colors and edges.
2. Pooling Layer: This layer reduces the size of the data, helping
the network focus on important features.
3. Fully-Connected (FC) Layer: This is the final layer that
combines all the learned features to identify the target object.

Best Practices in Machine Learning (ML) and Deep Learning

(DL)

1. Choosing Between ML and DL:

o Use Deep Learning (DL) when you have a large amount
of data (thousands of images) and powerful GPUs for
processing.
o If you lack these conditions, stick to Machine Learning
(ML).
2. Common Uses of DL:
o DL is mainly used for object classification.
3. Methods to Work with DL:
o Training from Scratch: Requires a large set of labeled
data and can take days or weeks. Not usually
recommended unless necessary.
o Transfer Learning: Fine-tunes an existing model (like
AlexNet) with new data for a specific task (e.g.,
recognizing bicycles). It’s less intensive and needs less
data.
o Feature Extraction: Uses DL to extract features from
data, which are then input into an ML model, like Support
Vector Machines (SVM).
4. Efficiency:
o Combining GPUs with software tools (like MATLAB)
can drastically reduce training time from days to hours or
even minutes.

A Black-box Approach to Regression Analysis

Regression Analysis is a statistical method used to find relationships

between variables and to predict the value of one variable (dependent)
based on another variable (independent).

Examples of Regression Analysis:

1. Advertisement Duration vs. Production Cost:

o Data: Duration of an ad film (in seconds) and its
production cost (in lakhs).
o Example:

Duration (sec) Cost (Lakh Rs)

10 8
25 22
30 25
60 47
Customers vs. Biryani Sales:

• Task: Predict the number of biryanis expected to be sold based

on customer visits.
• Data:

No. of Customers No. of Biryani Packets

517 215
410 189
630 230
285 122

Simple Regression

Simple Regression is a method that uses one independent variable (x)

to predict another dependent variable (y). The formula for simple
linear regression is:

y=β0+β1x+ϵy = \beta_0 + \beta_1 x + \epsilony=β0+β1x+ϵ

• x: Independent variable (predictor)

• y: Dependent variable (response)
• β0: Y-intercept (where the line crosses the y-axis)
• β1: Slope of the line (how much y changes for each unit change
in x)
• ε: Random error (difference between the predicted and actual
values)

Multiple Regression

Multiple Regression uses two or more independent variables to

predict a dependent variable (y). It can be linear or nonlinear.

Linear Multiple Regression is expressed with the formula:

y=β0+β1x1+β2x2+…+βkxk+ϵy = \beta_0 + \beta_1 x_1 + \beta_2
x_2 + \ldots + \beta_k x_k + \epsilony=β0+β1x1+β2x2+…+βkxk+ϵ

• y: Dependent variable (what you're trying to predict)

• x₁, x₂, ... , xₖ: Independent variables (predictors)
• β0: Y-intercept (where the line crosses the y-axis)
• β1, β2, ... , βk: Slopes for each independent variable (how much
y changes for each predictor)
• ε: Random error (difference between predicted and actual
values)

Popular Data Analytic Tools

There are many tools for data analysis. Here are some popular ones:

1. Excel: A widely used spreadsheet software for calculations and

graphs. It’s user-friendly and accessible for everyone.
2. R: An open-source programming language great for statistical
analysis and data visualization. It has a simple syntax and is
ideal for complex computations.
3. Python: A versatile and readable programming language with a
rich library for various data analytics tasks. It's essential for data
analysts.
4. Apache Spark: Designed for analyzing large, unstructured data.
It can efficiently process vast amounts of data and distribute
tasks across multiple computers.
5. Tableau: A user-friendly tool that allows easy manipulation of
large datasets using a drag-and-drop interface.

Other tools include MS Power BI, KNIME, SAS, and Jupyter

Notebook.

Cbsyllabus Bda 1
No ratings yet
Cbsyllabus Bda 1
4 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
19 pages
Machine Learning
No ratings yet
Machine Learning
12 pages
Fundamentals of AI and ML
No ratings yet
Fundamentals of AI and ML
5 pages
Department of Emerging Technology (SB) III B.Tech - I Semester
No ratings yet
Department of Emerging Technology (SB) III B.Tech - I Semester
12 pages
ML 1
No ratings yet
ML 1
79 pages
Machine Learning
No ratings yet
Machine Learning
24 pages
ML Unit 1
No ratings yet
ML Unit 1
20 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
9 pages
Presenttion 33
No ratings yet
Presenttion 33
2 pages
Machine Learning Basics for Beginners
No ratings yet
Machine Learning Basics for Beginners
26 pages
Machine Learning PPT For Students
73% (11)
Machine Learning PPT For Students
18 pages
Bca ML I
No ratings yet
Bca ML I
26 pages
Maharana Pratap Group of Institutions, Mandhana, Kanpur: Department of Computer Science Engineering)
No ratings yet
Maharana Pratap Group of Institutions, Mandhana, Kanpur: Department of Computer Science Engineering)
115 pages
Machine Learning Lecture-01
No ratings yet
Machine Learning Lecture-01
37 pages
Chapter 01 Machine Learning
No ratings yet
Chapter 01 Machine Learning
22 pages
Machine Learning Overview
100% (2)
Machine Learning Overview
42 pages
Supervised & Deep Learning Guide
No ratings yet
Supervised & Deep Learning Guide
83 pages
Introduction to Machine Learning
No ratings yet
Introduction to Machine Learning
23 pages
Machine Learning: Louis Fippo Fitime
No ratings yet
Machine Learning: Louis Fippo Fitime
37 pages
Deep Learning Stock Prediction Report
No ratings yet
Deep Learning Stock Prediction Report
37 pages
Machine: Learning ATO Z - I
No ratings yet
Machine: Learning ATO Z - I
131 pages
Unit 1
No ratings yet
Unit 1
62 pages
Unit 1
No ratings yet
Unit 1
112 pages
ML Unit 1
No ratings yet
ML Unit 1
21 pages
Unit 1
No ratings yet
Unit 1
38 pages
Machine Learning?
100% (5)
Machine Learning?
114 pages
Updated Unit 1
No ratings yet
Updated Unit 1
57 pages
Introduction To Machine Learning Basics
No ratings yet
Introduction To Machine Learning Basics
12 pages
Machine Learning Basics
No ratings yet
Machine Learning Basics
9 pages
Unit-1 Part-1 Material
No ratings yet
Unit-1 Part-1 Material
45 pages
Machine Learning
No ratings yet
Machine Learning
39 pages
Artificial Intelligence Lec 1 PDF
No ratings yet
Artificial Intelligence Lec 1 PDF
15 pages
ML Notes
No ratings yet
ML Notes
101 pages
ML, DL, DS
No ratings yet
ML, DL, DS
23 pages
Machine Learning
No ratings yet
Machine Learning
25 pages
Machine Learning - 1
No ratings yet
Machine Learning - 1
19 pages
Machine Learning
No ratings yet
Machine Learning
11 pages
Karthik
No ratings yet
Karthik
10 pages
Fundamentals of Machine Learning II
No ratings yet
Fundamentals of Machine Learning II
13 pages
Machine Learning
100% (3)
Machine Learning
47 pages
Ai Faheem
No ratings yet
Ai Faheem
16 pages
Pds Notes ML Unit4
No ratings yet
Pds Notes ML Unit4
13 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
27 pages
Intro to Machine Learning Concepts
No ratings yet
Intro to Machine Learning Concepts
35 pages
Mehakreport
No ratings yet
Mehakreport
23 pages
Aws ML
No ratings yet
Aws ML
125 pages
Report On Machine Learning-Jyoti Poddar-EC084
No ratings yet
Report On Machine Learning-Jyoti Poddar-EC084
70 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
5 pages
ML Module I
No ratings yet
ML Module I
71 pages
Unit I
No ratings yet
Unit I
8 pages
CE469 - Introduction To Machine Learning: Lecturer Contact
No ratings yet
CE469 - Introduction To Machine Learning: Lecturer Contact
33 pages
ME3435E ADDTE Lect27 Machine Learning For Signal Processing 19.03.25
No ratings yet
ME3435E ADDTE Lect27 Machine Learning For Signal Processing 19.03.25
34 pages
SEng5305-chap-1-Introduction To ML
No ratings yet
SEng5305-chap-1-Introduction To ML
85 pages
ML-Unit 1 Merged
No ratings yet
ML-Unit 1 Merged
151 pages
ML-Unit 1
No ratings yet
ML-Unit 1
43 pages
Unit I
No ratings yet
Unit I
48 pages
Unit 1
No ratings yet
Unit 1
55 pages
Review of Underlying Technology-IV
No ratings yet
Review of Underlying Technology-IV
18 pages
Unit 7 Data Visualization in Punjab
No ratings yet
Unit 7 Data Visualization in Punjab
6 pages
History - of - Internet
No ratings yet
History - of - Internet
12 pages
Unit 5 Educational Data Analyticsz
No ratings yet
Unit 5 Educational Data Analyticsz
13 pages
Data Analytics in Gnu of India
No ratings yet
Data Analytics in Gnu of India
10 pages
Mooc Part 2
No ratings yet
Mooc Part 2
8 pages
AdmitCard (1) - 1
No ratings yet
AdmitCard (1) - 1
1 page
Btech Question Papers
No ratings yet
Btech Question Papers
7 pages
The Spirit of Leadership
No ratings yet
The Spirit of Leadership
16 pages
Summative Computer7 Q2
No ratings yet
Summative Computer7 Q2
4 pages
Pathfinder 1998 1999
No ratings yet
Pathfinder 1998 1999
1,818 pages
02 Task Performance 1
No ratings yet
02 Task Performance 1
4 pages
Full (Ebook PDF) Introduction To Health Policy, Second Edition PDF All Chapters
100% (2)
Full (Ebook PDF) Introduction To Health Policy, Second Edition PDF All Chapters
41 pages
Unacompanied Minors
No ratings yet
Unacompanied Minors
2 pages
Nature &amp Process of Decision Making
No ratings yet
Nature &amp Process of Decision Making
15 pages
Criminal 1705525662
No ratings yet
Criminal 1705525662
5 pages
Data Flow & Entity Relationship Diagrams
100% (1)
Data Flow & Entity Relationship Diagrams
7 pages
L1 - MATH1143 Intro and Fundamentals I
No ratings yet
L1 - MATH1143 Intro and Fundamentals I
19 pages
Executive Customer Service Offer Letter
No ratings yet
Executive Customer Service Offer Letter
4 pages
Store Supervisor Resume
No ratings yet
Store Supervisor Resume
2 pages
HIC Student Handbook Rev2023 2024 041820 8
No ratings yet
HIC Student Handbook Rev2023 2024 041820 8
99 pages
Structural Icons: Global Skyscrapers
No ratings yet
Structural Icons: Global Skyscrapers
16 pages
Aldeguer v. Hoskyn, G.R. No. 1164, (September 17, 1903), 2 PHIL 500-503
No ratings yet
Aldeguer v. Hoskyn, G.R. No. 1164, (September 17, 1903), 2 PHIL 500-503
2 pages
Design For Serviceability and Design For The Environments
No ratings yet
Design For Serviceability and Design For The Environments
6 pages
Iso Dis 9073-18 (E)
No ratings yet
Iso Dis 9073-18 (E)
16 pages
2023-AD-Optimizing The Performance of The RF Signal Chain
No ratings yet
2023-AD-Optimizing The Performance of The RF Signal Chain
62 pages
Ifs Pacsecure: Standard For Auditing Quality and Safety of Packaging Materials
No ratings yet
Ifs Pacsecure: Standard For Auditing Quality and Safety of Packaging Materials
150 pages
Bose Lifestyle 28 Repair - Schematics PDF
86% (7)
Bose Lifestyle 28 Repair - Schematics PDF
32 pages
Diesel Engine PLC Configuration
No ratings yet
Diesel Engine PLC Configuration
6 pages
Jurnal Kinetika Kimia
No ratings yet
Jurnal Kinetika Kimia
7 pages
Multi-Pin Plug Connections Overview
No ratings yet
Multi-Pin Plug Connections Overview
45 pages
Leo99 Manual PDF
100% (2)
Leo99 Manual PDF
100 pages
Tax Dispute: McDonald's vs. CIR
No ratings yet
Tax Dispute: McDonald's vs. CIR
37 pages
Intention: The Power of Clear Intention
100% (1)
Intention: The Power of Clear Intention
17 pages
Chery International After-Sales Guide
No ratings yet
Chery International After-Sales Guide
66 pages
Nandakumar & Anr V State of Kerala
No ratings yet
Nandakumar & Anr V State of Kerala
6 pages
Bali Land for Luxury Resort Development
No ratings yet
Bali Land for Luxury Resort Development
12 pages
Resum MD Mosharof Hossain - FM
No ratings yet
Resum MD Mosharof Hossain - FM
3 pages

MOOC PART 3 in Gndu

Uploaded by

MOOC PART 3 in Gndu

Uploaded by

INTRODUCTION TO

Machine learning is a type of AI (Artificial Intelligence) where

In machine learning, systems called Artificial Neural Networks

Here are some real-life examples of machine learning:

1. Digital assistants that understand voice commands and play

As more data becomes available, computers get more powerful, and

Whether labeled or not, the training data should be:

The data is split into two parts:

• Training subset: Used to train the model.

Step 2: Choosing the Training Algorithm

The choice of algorithm depends on:

• Type of data (labeled or unlabeled)

There are different machine learning algorithms available for both

• Regression algorithms: Used to find relationships in data.

For unlabeled training data:

• Clustering: Groups similar data points without prior

Training an Artificial Neural Network (ANN) is done in multiple

1. Input data is fed into the system.

Step 4: Applying to Practical Data

5. Once trained, the ANN is used to solve real-world problems.

Supervised Machine Learning

In supervised learning, the system learns from labeled data, where

Unsupervised Machine Learning

Unsupervised learning finds patterns and relationships in large

Reinforcement learning involves learning by trial and error without

Neural Networks (ANNs)

Artificial Neural Networks (ANNs) are inspired by the human brain.

Deep Learning (DL)

Deep Learning Architecture

Convolutional Neural Networks (CNNs)

CNNs are specialized neural networks that excel at processing

1. Convolutional Layer: This is the first layer that extracts basic

Best Practices in Machine Learning (ML) and Deep Learning

1. Choosing Between ML and DL:

A Black-box Approach to Regression Analysis

Regression Analysis is a statistical method used to find relationships

Examples of Regression Analysis:

1. Advertisement Duration vs. Production Cost:

Duration (sec) Cost (Lakh Rs)

• Task: Predict the number of biryanis expected to be sold based

No. of Customers No. of Biryani Packets

Simple Regression is a method that uses one independent variable (x)

y=β0+β1x+ϵy = \beta_0 + \beta_1 x + \epsilony=β0+β1x+ϵ

• x: Independent variable (predictor)

Multiple Regression uses two or more independent variables to

Linear Multiple Regression is expressed with the formula:

• y: Dependent variable (what you're trying to predict)

Popular Data Analytic Tools

1. Excel: A widely used spreadsheet software for calculations and

Other tools include MS Power BI, KNIME, SAS, and Jupyter

You might also like