Activation Functions
An activation function is a mathematical function applied to the output of a neuron.
It introduces non-linearity into the model, allowing the network to learn and
represent complex patterns in the data.
Without this non-linearity, a neural network would behave like a linear
regression model, no matter how many layers it has.
The activation function decides whether a neuron should be activated: the neuron
computes the weighted sum of its inputs and adds a bias term, and the activation
function is then applied to that result. This introduces non-linearity into the
output of each neuron and helps the model make complex decisions and predictions.
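As a minimal illustrative sketch in NumPy (the function and variable names here are my own, not from the original), a neuron's weighted sum plus bias is passed through an activation function:

import numpy as np

def neuron_output(x, w, b, activation):
    # weighted sum of inputs plus bias, then the activation is applied
    z = np.dot(w, x) + b
    return activation(z)

# any of the activations discussed below could be passed in;
# np.tanh is used here only as an example
y = neuron_output(np.array([1.0, 2.0]), np.array([0.5, -0.3]), 0.1, np.tanh)
print(y)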
Sigmoid Activation Function
Sigmoidal functions are frequently used in machine learning, specifically in
artificial neural networks, as a way of understanding the output of a node or
"neuron."
A sigmoid function is a type of activation function, and more specifically a
squashing function: it limits the output to a range between 0 and 1.
Sigmoid function: f(x) = 1 / (1 + exp(-x))
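A minimal NumPy sketch of the sigmoid (illustrative, not from the original slides):

import numpy as np

def sigmoid(x):
    # squashes any real input into the range (0, 1)
    return 1.0 / (1.0 + np.exp(-x))

print(sigmoid(np.array([-5.0, 0.0, 5.0])))  # approx. [0.007, 0.5, 0.993]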
Pros And Cons of Sigmoid Activation function
Pros:
1. Performance on binary classification is very good compared to other activation functions.
2. Clear predictions, i.e. outputs very close to 1 or 0.
Cons:
1. The calculation in the sigmoid function is relatively expensive (it involves an exponential).
2. It is not useful for multiclass classification.
3. For large negative inputs the output saturates close to 0.
4. It becomes nearly constant, giving outputs close to 1, for large positive inputs.
5. The function output is not zero-centered.
Hyperbolic Tangent (Tanh) Activation Function
This function is defined as the ratio between the hyperbolic sine and the
hyperbolic cosine functions:
tanh(x) = (exp(x) - exp(-x)) / (exp(x) + exp(-x))
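A minimal NumPy sketch of tanh as the ratio of hyperbolic sine to hyperbolic cosine (illustrative; in practice np.tanh would be used directly):

import numpy as np

def tanh(x):
    # ratio of hyperbolic sine to hyperbolic cosine; output lies in (-1, 1) and is zero-centered
    return np.sinh(x) / np.cosh(x)

print(tanh(np.array([-2.0, 0.0, 2.0])))  # approx. [-0.964, 0.0, 0.964]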
Pros And Cons of Tanh Activation function
Pros:
1. The gradient is stronger for tanh than for sigmoid (its derivatives are steeper).
2. The output of tanh lies in the interval (-1, 1), and the whole function is zero-centered, which is better than sigmoid.
Cons:
1. Tanh also suffers from the vanishing gradient problem.
ReLU Activation Function
The ReLU function takes the maximum of zero and its input:
ReLU(x) = max(0, x)
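A minimal NumPy sketch of ReLU (illustrative):

import numpy as np

def relu(x):
    # max(0, x): passes positive inputs through unchanged and zeros out negative inputs
    return np.maximum(0.0, x)

print(relu(np.array([-3.0, 0.0, 2.5])))  # [0.  0.  2.5]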
Pros And Cons of ReLU Activation Function
Pros:
1. When the input is positive, there is no gradient saturation problem.
2. The calculation speed is much faster.
3. The ReLU function involves only a simple linear relationship (no exponentials).
4. Whether forward or backward, it is much faster than sigmoid and tanh.
Cons:
1. When the input is negative, ReLU is completely inactive; once a neuron only receives negative inputs, it can "die" (the dying ReLU problem).
2. The output of the ReLU function is either 0 or a positive number, which means the ReLU function is not zero-centered.
Leaky ReLU Function
It is an attempt to solve the dying ReLU problem. Instead of outputting 0 for
negative inputs, it outputs a small "leaked" value:
LeakyReLU(x) = x for x > 0, and a·x otherwise
The leak helps to increase the range of the ReLU function.
Usually, the value of a is around 0.01.
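A minimal NumPy sketch of Leaky ReLU, assuming a leak coefficient of 0.01 as mentioned above (illustrative):

import numpy as np

def leaky_relu(x, a=0.01):
    # like ReLU, but negative inputs are scaled by a small slope a instead of being zeroed
    return np.where(x > 0, x, a * x)

print(leaky_relu(np.array([-3.0, 0.0, 2.5])))  # [-0.03  0.    2.5 ]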
Pros And Cons of Leaky ReLU Activation Function
Pros:
1. There will be no problem with dead ReLU neurons.
2. A parameter-based variant, Parametric ReLU, uses f(x) = max(alpha·x, x), where alpha can be learned through backpropagation.
Cons:
1. It has not been fully proved that Leaky ReLU is always better than ReLU.
ELU (Exponential Linear Units) Function
ELU is very similar to ReLU except for negative inputs. Both act as the identity
function for non-negative inputs. For negative inputs, ELU smoothly saturates
towards -α, whereas ReLU bends sharply at zero:
ELU(x) = x for x > 0, and α·(exp(x) - 1) otherwise
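A minimal NumPy sketch of ELU; the default α = 1.0 is an assumption here, common in practice but not stated in the original:

import numpy as np

def elu(x, alpha=1.0):
    # identity for x > 0; for x <= 0 the output smoothly saturates towards -alpha
    return np.where(x > 0, x, alpha * (np.exp(x) - 1.0))

print(elu(np.array([-5.0, 0.0, 2.0])))  # approx. [-0.993, 0.0, 2.0]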
Pros And Cons of ELU Activation function
Pros:
1. For negative inputs, ELU smoothly saturates towards -α, whereas ReLU bends sharply at zero.
2. ELU is a strong alternative to ReLU.
3. Unlike ReLU, ELU can produce negative outputs.
Cons:
1. For x > 0, it can blow up the activation, since the output on that side is unbounded in [0, ∞).
Softmax Function
The softmax function calculates the probability distribution of an event over
n different events. In other words, it calculates the probability of each
target class over all possible target classes:
softmax(x_i) = exp(x_i) / Σ_j exp(x_j)
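A minimal NumPy sketch of softmax, with the usual max-subtraction for numerical stability (an implementation detail I am assuming, not stated in the original):

import numpy as np

def softmax(x):
    # exponentiate (shifted by the max for numerical stability) and normalise so the outputs sum to 1
    e = np.exp(x - np.max(x))
    return e / e.sum()

print(softmax(np.array([2.0, 1.0, 0.1])))  # approx. [0.659, 0.242, 0.099]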
Pros And Cons of Softmax Activation function
Pros:
1. It mimics one-hot encoded labels better than absolute values would.
2. If we used the absolute (modulus) values we would lose information, while the exponential intrinsically takes care of this.
Cons:
1. The softmax function should not be used for multi-label classification; the sigmoid function (discussed above) is preferred there.
2. The softmax function should not be used for a regression task either.
Swish Function
Swish's design was inspired by the use of sigmoid functions for gating in LSTMs
and highway networks. It uses the same value for the gate and for the gated
input, which simplifies the gating mechanism and is called self-gating:
Swish(x) = x · sigmoid(x)
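A minimal NumPy sketch of Swish in its common self-gated form x · sigmoid(x) (illustrative):

import numpy as np

def swish(x):
    # self-gating: the input is multiplied by a sigmoid of itself
    return x / (1.0 + np.exp(-x))

print(swish(np.array([-3.0, 0.0, 3.0])))  # approx. [-0.142, 0.0, 2.858]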
Pros And Cons of Swish Activation function
Pros:
1. No dying ReLU problem.
2. Increase in accuracy over ReLU.
3. Outperforms ReLU at every batch size.
Cons:
1. Slightly more computationally expensive.
2. More problems with the function may arise over time, since it is relatively new.
Maxout Function
The Maxout activation function is defined as the maximum over a set of learned
affine pieces, for example:
Maxout(x) = max(w1·x + b1, w2·x + b2)
The Maxout activation is a generalization of the ReLU and the leaky ReLU
functions. It is a learnable activation function.
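A minimal NumPy sketch of a single maxout unit with two affine pieces (the shapes and example values are assumptions for illustration):

import numpy as np

def maxout(x, W, b):
    # maximum over k affine pieces: max_j (W[j] . x + b[j])
    return np.max(W @ x + b)

# k = 2 pieces over a 3-dimensional input
W = np.array([[1.0, -0.5, 0.2],
              [-1.0, 0.5, -0.2]])
b = np.array([0.0, 0.1])
print(maxout(np.array([1.0, 2.0, 3.0]), W, b))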
Pros And Cons of Maxout Activation function
Pros:
1. It is a learnable activation function.
Cons:
1. It doubles the number of parameters for each neuron, so a higher total number of parameters needs to be trained.
Softplus Activation Function
The softplus function is similar to the ReLU function, but it is relatively
smooth. Like ReLU, it performs unilateral suppression, and it has a wide
acceptance range (0, +∞).
Softplus function: f(x) = ln(1 + exp(x))
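A minimal NumPy sketch of softplus (using log1p for slightly better numerical behaviour; that choice is mine, not from the original):

import numpy as np

def softplus(x):
    # smooth approximation of ReLU: ln(1 + exp(x)), always positive
    return np.log1p(np.exp(x))

print(softplus(np.array([-3.0, 0.0, 3.0])))  # approx. [0.049, 0.693, 3.049]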
Pros And Cons of Softplus Activation function
Pros:
1. It is relatively smooth.
2. It performs unilateral suppression like ReLU.
3. It has a wide acceptance range (0, +∞).
Cons:
1. Leaky ReLU is a piecewise linear function, just like ReLU, and is therefore quicker to compute than softplus. ELU has the advantage over softplus and ReLU that its mean output is closer to zero, which improves learning.