0% found this document useful (0 votes)

105 views7 pages

Floating Point 6up

1) Floating point numbers represent fractions in computers using scientific notation, with a mantissa and exponent. The IEEE 754 standard defines common floating point representations. 2) Floating point numbers use a sign bit, exponent field, and mantissa field. The exponent is stored using bias to represent both positive and negative exponents. 3) The IEEE 754 standard defines single and double precision floating point number formats. It allows for consistent representation of floating point values across systems.

Uploaded by

edemkv

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

105 views7 pages

Floating Point 6up

Uploaded by

edemkv

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

Outline

  Fractional numbers
  Floating point scientific notation
Floating Point Representation
  Floating point in binary
  IEEE Floating Point Standard
DCS111 Computer Architecture   Behaviour of Floating Point Numbers

Recap: fractions
  Decimal 5.6710 is
  5 x 100 plus
Fractional Numbers   6 x 10-1 plus
  7 x 10–2
… not whole numbers   Binary 11.0112 is
  1 x 21 plus
  1 x 20 plus
  0 x 2-1 plus Quiz: what is
  1 x 2–2 plus 11.0112 in decimal?
  1 x 2–3

Recap: fractions Recap: fractions

Quiz: what is a third as a Quiz: what is a third as a
decimal: N.NNNNN? decimal: N.NNNNN?

  Third is 0.33333…
  Not all numbers can be represented exactly
(with limited digits)

1
Problem Solution 1 – Fixed Point
  How to hold fractions in computers?   Divide bits between whole and fractional parts

0 0 1 1 1 1 0 1

integer bits fractional bits integer bits fractional bits

Point always Quiz: what is this in

in the same decimal?
place

Solution 1 – Fixed Point Evaluation of Fix Point

  Divide bits between whole and fractional parts   Range versus Accuracy
  High accuracy means low range
  High range means low accuracy
  Has uses

integer bits fractional bits

Quiz:
•  What is maximum number?
  Really just scaled integers
range
•  What is difference between   Software library for fixed point numbers
successive numbers? accuracy   No need for special hardware

Scientific (Exponent) Notation Scientific (Exponent) Notation

3.21 x 105 6.54 x 10-5 3.21 x 105 6.54 x 10-5

Mantissa   321,000 and 0.0000654

Exponent
5 -5
  Same accuracy
  Mantissa is a fraction
  Different magnitude   Exponent is an integer
  Both mantissa and exponent can be negative
Quiz: Write these number as decimal, without exponents

2
Normalisation
Advantage of Scientific Notation

}
  Large range   0.002 x 100
  Constant proportional accuracy (… with   0.2 x 10-2
exceptions)   2.0 x 10-3 all the same value
  20 x 10-4

  Normalised number has 1 digit before the point

Binary Floating Point

  1.01 x 22
  1.1 x 2-2
Floating Point in Binary
  Exponent: positive or negative
  Mantissa: positive or negative

Quiz:
•  Effect of negative mantissa?
•  Effect of negative exponent?

Normalised Binary FP Representation (32 bits)‫‏‬

  Sign bit S
  In normalised binary scientific notation
  Exponent E
  1.mmmm…mmm x 2E
  Mantissa M
  unless the number is 0
  1.mmm…mmm is the mantissa
  E is the exponent

exponent fraction (mantissa)‫‏‬

sign

First digit
always 1

3
Representation (32 bits)‫‏‬ Negative exponents - how?
  Sign bit S – 1 bit
  Aim: ALU (Arithmetic Logic Unit) can reuse
  Exponent E – 8 bits integer machinery
  Mantissa M – 23 bits BUT   Eg, comparison with zero: x > 0
  Easy because of sign bit
  Floating point numbers can be easily classified as
negative or positive
exponent fraction (mantissa)‫‏‬
sign
  Comparison of two floating point numbers x<y
not so straightforward...
  (-1)S x 1.M x 2E   choose exponent representation to help
First digit always 1, so
not included

Exponent in 2's Comp ?? Representation of Exponents

  Consider: 1/2 < 1   We want:
  half: 0.1 = 1.0 x 2-1 (normalised)‫‏‬   FP number order to follow (unsigned) bit order
  one: 1.0 = 1.0 x 20 (normalised)‫‏‬   11111111 to represent the highest positive exponent

0 11111111 000 …   Use biased representation

0 00000000 000 …

Bad Design

Bias by N (Excess N)‫‏‬ Bias by N (Excess N)‫‏‬

  Representation of negative numbers used in   Excess 7
floating point numbers
  Numbers in ‘correct’ order 0000 -7 1000 1
0001 -6 1001 2
0010 -5 1010 3
excess-N-rep(X) = unsigned-rep(X + N) 0011 -4 1011 4
0100 -3 1100 5
  Excess 7 0101 -2 1101 6
0110 -1 1110 7
excess-7-rep(-3) = unsigned-rep(-3 + 7)‫‏‬ 0111 0 1111 8
= 0100
excess-7-rep(-7) = 0000 E.g –2 is represented as unsigned(7-2)
excess-7-rep(4) = unsigned-rep(4 + 7)‫‏‬ = unsigned(5)‫‏‬
= 1011 = 0101

4
IEEE 754-1985
  What is IEEE?
  Standard important for
IEEE Standard   exchange of data
  portability of code

  Representation for FP numbers in

  32-bit (single precision)‫‏‬
  64-bit (double precision)‫‏‬

IEEE 32-bit FP IEEE 32-bit FP

  Sign bit S – 1 bit   Sign bit S – 1 bit
  Mantissa M – 23 bits   Mantissa M – 23 bits
  Exponent E – 8 bits
S E M
exponent fraction (mantissa)‫‏‬
sign
  Exponent E – 8 bits
  Bias is 127 (-1)S x (1.M) x 2E-127
  Exponents –126 (00000001) to +127 (11111110)‫‏‬
  Exponents 00000000 and 11111111 special

Example 1 – Convert to FP Example 2 – Convert from FP

  Represent 0.312510 = 5/16   What number is represented by:
  5/16 = 1/4 + 1/16 = 0.01012= 1.01*2-2
0 01111101 010000 ... 000
 S = 0
 S = 0
  E = -2 + bias = -2 + 127 = 12510=01111101
  E = 0111 1101 = 12510
  M = 010....000
  Real exponent = E-bias = 125-127 = -2
  M = 1/4
  (-1)S x (1+M) x 2E-bias
0 01111101 010000 ... 000 = (1 + 1/4) x (1/4)
= 5/16

5
Quiz IEEE FP Extra’s
  What are   Zero
  Both E and M = zero
0 10000001 111000 ... 000   Can be positive or negative

1 01111001 011000 ... 000   +/- Infinity (exponent all 1's)‫‏‬

  De-normalised numbers
  E=0
  Convert to 32 FP using IEEE
  close to zero, exponent is -126
  4.125
  -7.625

Overflow and Underflow

  Overflow
Behaviour of Floating Point   Results too large (positive or negative) to be
Numbers represented
  Underflow
  Result too close to zero (positive or negative) to be
represented

Range – 32 bit FP Range – 32 bit FP

negative zero positive negative zero positive

smallest smallest positive (>0) largest smallest smallest positive (>0) largest
largest negative largest negative

  Quiz: find the largest and smallest FP in IEEE   Largest/smallest +/- (2 – 223) x 2127 ≈ 1038
32-bit   Near zero (normalised numbers)‫‏‬
  +/- 1.0 x 2-126

6
How do they behave? Summary
  If x, y are positive is:   FP scientific notation
  x+y>x ?   Normalised representation in binary
  If x and y are different can:   Bias to represent -ve to +ve range in exponent
  x–y=0?   Notice how a 32-bit binary number can
  Do these rules hold: represent many different entities in memory
  (x + y) + z = x + (y + z) ?   Underflow as well as overflow
  (x * y) * z = x * (y * z) ?
  x * (y + z) = x*y + x*z ?

Different evaluation orders have different rounding errors

Floating - Point - Number
No ratings yet
Floating - Point - Number
36 pages
IEEE 754 Floating Point Guide
No ratings yet
IEEE 754 Floating Point Guide
38 pages
Floating Points
No ratings yet
Floating Points
31 pages
Cacc
No ratings yet
Cacc
106 pages
Computer Architecture Basics
No ratings yet
Computer Architecture Basics
64 pages
Neural Network Quantization Guide
No ratings yet
Neural Network Quantization Guide
150 pages
Unit 2
No ratings yet
Unit 2
16 pages
Module 2 - PART D Floating
No ratings yet
Module 2 - PART D Floating
30 pages
Floating Point & Fixed Point Representation - BCA II
No ratings yet
Floating Point & Fixed Point Representation - BCA II
24 pages
Floating Point Representation of Numbers: Wide Range
No ratings yet
Floating Point Representation of Numbers: Wide Range
11 pages
Floating Point Arithmetic Guide
No ratings yet
Floating Point Arithmetic Guide
42 pages
Floating Point Numbers: CS031 September 12, 2011
No ratings yet
Floating Point Numbers: CS031 September 12, 2011
22 pages
Lecture 14 - Arithmetic Subsystems - Numbering Systems and Floating Point Unit (FPU)
No ratings yet
Lecture 14 - Arithmetic Subsystems - Numbering Systems and Floating Point Unit (FPU)
32 pages
Week 5: IEEE Floating Point Revision Guide For Phase Test
No ratings yet
Week 5: IEEE Floating Point Revision Guide For Phase Test
23 pages
ENSC254 - Floating Point Computation
No ratings yet
ENSC254 - Floating Point Computation
29 pages
Lec 08
No ratings yet
Lec 08
36 pages
Floating Point Number
No ratings yet
Floating Point Number
28 pages
Floating Point
No ratings yet
Floating Point
33 pages
Floating-Point Representation Guide
No ratings yet
Floating-Point Representation Guide
14 pages
Data Representation
No ratings yet
Data Representation
28 pages
IEEE 754: Floating Point Guide
No ratings yet
IEEE 754: Floating Point Guide
10 pages
Fixed Point and Floating Point Number Representations
No ratings yet
Fixed Point and Floating Point Number Representations
7 pages
Computer Architecture: Data Types
No ratings yet
Computer Architecture: Data Types
25 pages
16-Algorithms For Floating Point Arithmetic Operations and Numericals-01-02-2024
No ratings yet
16-Algorithms For Floating Point Arithmetic Operations and Numericals-01-02-2024
21 pages
Lec 4
No ratings yet
Lec 4
15 pages
Floating-Point Representation in Computing
No ratings yet
Floating-Point Representation in Computing
6 pages
CH03 Data II
No ratings yet
CH03 Data II
31 pages
IEEE Floating Point Conversion Guide
No ratings yet
IEEE Floating Point Conversion Guide
34 pages
Floating Point Sept 6, 2006 15-213: "The Course That Gives CMU Its Zip!"
No ratings yet
Floating Point Sept 6, 2006 15-213: "The Course That Gives CMU Its Zip!"
34 pages
Fixed vs. Floating Point in Computing
No ratings yet
Fixed vs. Floating Point in Computing
24 pages
4.4 - 1 New Floating Point
No ratings yet
4.4 - 1 New Floating Point
22 pages
01 DigitalNumericalFormats
No ratings yet
01 DigitalNumericalFormats
27 pages
EE 109 Unit 20: IEEE 754 Floating Point Representation Floating Point Arithmetic
No ratings yet
EE 109 Unit 20: IEEE 754 Floating Point Representation Floating Point Arithmetic
31 pages
Chapter3 3
No ratings yet
Chapter3 3
13 pages
The World Is Not Just Integers: Programming Languages Support Numbers With Fraction
No ratings yet
The World Is Not Just Integers: Programming Languages Support Numbers With Fraction
51 pages
Binary Number Representations
No ratings yet
Binary Number Representations
14 pages
Computer Arithmetic Representations
No ratings yet
Computer Arithmetic Representations
24 pages
Single Precision Floating Point
No ratings yet
Single Precision Floating Point
24 pages
Computer Arithmetic Representations
No ratings yet
Computer Arithmetic Representations
24 pages
HW 4 Sol
No ratings yet
HW 4 Sol
10 pages
Lec 3
No ratings yet
Lec 3
20 pages
"The Course That Gives CMU Its Zip!": Topics
No ratings yet
"The Course That Gives CMU Its Zip!": Topics
30 pages
Machine Level Representation of Data Part 3
100% (1)
Machine Level Representation of Data Part 3
32 pages
Floating Point
No ratings yet
Floating Point
16 pages
Number Representation Explained
No ratings yet
Number Representation Explained
5 pages
4 Floating Point Inclass
No ratings yet
4 Floating Point Inclass
33 pages
Floating Point Representation: Reading: B&O 2.4
No ratings yet
Floating Point Representation: Reading: B&O 2.4
44 pages
Floating Point Numbers
No ratings yet
Floating Point Numbers
27 pages
COA - Unit2 Floating Point Arithmetic 3
No ratings yet
COA - Unit2 Floating Point Arithmetic 3
19 pages
"The Course That Gives CMU Its Zip!": Topics
No ratings yet
"The Course That Gives CMU Its Zip!": Topics
31 pages
2.4 Floating Points
No ratings yet
2.4 Floating Points
36 pages
Floating Point Numbers
No ratings yet
Floating Point Numbers
8 pages
IEEE Standard 754 Floating Point Numbers
No ratings yet
IEEE Standard 754 Floating Point Numbers
7 pages
Finite Word Length Effects
No ratings yet
Finite Word Length Effects
31 pages
Fixed vs Floating Point Numbers
No ratings yet
Fixed vs Floating Point Numbers
31 pages
What Are Floating Point Numbers?
No ratings yet
What Are Floating Point Numbers?
7 pages
9-Algorithms For Floating Point Arithmetic Operations-22-01-2024
No ratings yet
9-Algorithms For Floating Point Arithmetic Operations-22-01-2024
49 pages
L-5 Floating Point Representation of Numbers
No ratings yet
L-5 Floating Point Representation of Numbers
21 pages
COA Unit 2
No ratings yet
COA Unit 2
23 pages
Strategic Analysis of United Bank Limited
80% (5)
Strategic Analysis of United Bank Limited
75 pages
LGBT Rights
No ratings yet
LGBT Rights
6 pages
Exp 2.1-Free Fall
100% (1)
Exp 2.1-Free Fall
2 pages
ÔN TẬP CUỐI KÌ I LỚP 6
No ratings yet
ÔN TẬP CUỐI KÌ I LỚP 6
4 pages
Gen-Y Women's Online Fashion Buying Motivations
No ratings yet
Gen-Y Women's Online Fashion Buying Motivations
17 pages
Tpa Project Sale
No ratings yet
Tpa Project Sale
6 pages
Ssbdoc PDF
No ratings yet
Ssbdoc PDF
25 pages
Subject and Object Pronouns Possessive Adjectives 93842
No ratings yet
Subject and Object Pronouns Possessive Adjectives 93842
1 page
India's Independence Day Significance
No ratings yet
India's Independence Day Significance
2 pages
Public Private Partnership in Urban Services: - Special Reference To Tamilnadu On Solid Waste Management Projects
No ratings yet
Public Private Partnership in Urban Services: - Special Reference To Tamilnadu On Solid Waste Management Projects
50 pages
Bài tập câu if, wish
No ratings yet
Bài tập câu if, wish
9 pages
Effective Communication in AS9100D Audits
No ratings yet
Effective Communication in AS9100D Audits
2 pages
Mastering Old Earth
No ratings yet
Mastering Old Earth
185 pages
STA 116-Chapter 1 - Introduction To Statistics
No ratings yet
STA 116-Chapter 1 - Introduction To Statistics
49 pages
1 - Unit 4 - Notes & Quizzes
No ratings yet
1 - Unit 4 - Notes & Quizzes
29 pages
Bye Bye Birdie Cuts
No ratings yet
Bye Bye Birdie Cuts
1 page
RBI Strategy-Digvijay Chand
No ratings yet
RBI Strategy-Digvijay Chand
3 pages
Productos Veterinarios Registrados Vigentes Julio 2024
No ratings yet
Productos Veterinarios Registrados Vigentes Julio 2024
8 pages
Introduction To Business Management Lesson 4
No ratings yet
Introduction To Business Management Lesson 4
3 pages
Story I Wrote
No ratings yet
Story I Wrote
4 pages
Aratra Pentelici Illustrated Edition John Ruskin - Download The Entire Ebook Instantly and Explore Every Detail
100% (17)
Aratra Pentelici Illustrated Edition John Ruskin - Download The Entire Ebook Instantly and Explore Every Detail
89 pages
Autumn Scene Writing PDF
No ratings yet
Autumn Scene Writing PDF
4 pages
An Overview of The Financial System: © 2005 Pearson Education Canada Inc
No ratings yet
An Overview of The Financial System: © 2005 Pearson Education Canada Inc
12 pages
An Analysis of The Characters in A Streetcar Named Desire: Yang Zhao
No ratings yet
An Analysis of The Characters in A Streetcar Named Desire: Yang Zhao
4 pages
ANIRA
No ratings yet
ANIRA
51 pages
2024 Kcse Examination Essential Statistics
No ratings yet
2024 Kcse Examination Essential Statistics
15 pages
Progress Test 4 For Fulltime 2B
No ratings yet
Progress Test 4 For Fulltime 2B
4 pages
Learning English With CBC Calgary
No ratings yet
Learning English With CBC Calgary
25 pages
A Guide To Employee Engagement Surveys 1697457165
100% (1)
A Guide To Employee Engagement Surveys 1697457165
24 pages
The 10 Step KM Roadmap
No ratings yet
The 10 Step KM Roadmap
8 pages

Floating Point 6up

Uploaded by

Floating Point 6up

Uploaded by

Outline

Recap: fractions Recap: fractions

integer bits fractional bits integer bits fractional bits

Point always Quiz: what is this in

Solution 1 – Fixed Point Evaluation of Fix Point

integer bits fractional bits

Scientific (Exponent) Notation Scientific (Exponent) Notation

Mantissa 321,000 and 0.0000654

Normalised number has 1 digit before the point

Binary Floating Point

Normalised Binary FP Representation (32 bits)‫‏‬

exponent fraction (mantissa)‫‏‬

Exponent in 2's Comp ?? Representation of Exponents

0 11111111 000 … Use biased representation

Bias by N (Excess N)‫‏‬ Bias by N (Excess N)‫‏‬

Representation for FP numbers in

IEEE 32-bit FP IEEE 32-bit FP

Example 1 – Convert to FP Example 2 – Convert from FP

1 01111001 011000 ... 000 +/- Infinity (exponent all 1's)‫‏‬

Overflow and Underflow

Range – 32 bit FP Range – 32 bit FP

Different evaluation orders have different rounding errors

You might also like

Mantissa   321,000 and 0.0000654

  Normalised number has 1 digit before the point

0 11111111 000 …   Use biased representation

  Representation for FP numbers in

1 01111001 011000 ... 000   +/- Infinity (exponent all 1's)‫‏‬