Optical Character Recognition

Optical character recognition (OCR) is a process that converts scanned images of text documents into machine-readable text. The OCR process involves pre-processing the image, recognizing characters using pattern matching or feature extraction, and post-processing the text using spelling and grammar checks. OCR has applications in scanning documents like passports, bills, and business cards so they can be electronically edited. Variants include customized OCR for business cards and invoices, and server-based OCR for large volumes of documents.

Uploaded by

Gopal Savaliya

Tutorial 5 Optical Character Recognition

CE406 - Computer Peripherals Workshop

Introduction
Optical character recognition (OCR) is the process of converting text
captured in an image into a machine-readable electronic format. It is
widely used to scan passports, bank statements, business cards, bills and
similar documents so that they can be edited electronically. OCR falls
within the fields of computer vision, artificial intelligence and pattern
recognition.

Components
An OCR system requires the following components:
• Scanner
• OCR software/hardware
• Output Interface

Recognition process
The OCR process involves three basic steps:

1. Pre-Processing:
• Optimizing the image for character recognition. This involves
converting the image to grayscale, followed by deskewing, despeckling,
binarization and normalization, and then sampling.
• Denoising - removing noise in the image, including noise that may
have been introduced during the scanning process.
• Thinning - used in handwriting recognition; the strokes of the letters
are thinned to a width of one pixel to ease recognition.
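
The grayscale, binarization and despeckling steps above can be sketched in
a few lines of Python. This is a minimal illustration, not the method of
any particular OCR product: the function names, the nested-list image
format, the luminance weights and the fixed threshold of 128 are all
assumptions made for the example.

```python
def to_grayscale(rgb_image):
    """Convert rows of (r, g, b) tuples to rows of 0-255 luminance values.

    The 0.299/0.587/0.114 weights are one common luminance formula,
    assumed here for illustration.
    """
    return [[round(0.299 * r + 0.587 * g + 0.114 * b) for (r, g, b) in row]
            for row in rgb_image]


def binarize(gray_image, threshold=128):
    """Map each pixel to 1 (ink) if darker than the threshold, else 0 (paper).

    A fixed global threshold is an assumption; real scans often need an
    adaptive one.
    """
    return [[1 if px < threshold else 0 for px in row] for row in gray_image]


def despeckle(binary_image):
    """Clear isolated ink pixels, i.e. those with no ink among their 8 neighbours."""
    height, width = len(binary_image), len(binary_image[0])
    cleaned = [row[:] for row in binary_image]
    for y in range(height):
        for x in range(width):
            if binary_image[y][x] != 1:
                continue
            neighbours = sum(
                binary_image[ny][nx]
                for ny in range(max(0, y - 1), min(height, y + 2))
                for nx in range(max(0, x - 1), min(width, x + 2))
                if (ny, nx) != (y, x)
            )
            if neighbours == 0:
                cleaned[y][x] = 0
    return cleaned


# Hypothetical 2x2 colour scan: dark pixels become ink, light ones paper.
scan = [[(0, 0, 0), (255, 255, 255)],
        [(200, 200, 200), (30, 30, 30)]]
ink = binarize(to_grayscale(scan))
```

Deskewing, normalization and thinning are left out of the sketch because
they need more machinery than fits in a short example.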
2. Character Recognition:
The core stage of OCR is applied in this step. There are two methods:
• Matrix Matching - each symbol is compared with a database of
character matrices; the character whose matrix most closely matches
the symbol is chosen, and its ASCII value is output. This is also
known as pattern matching.
• Feature Extraction - this method is considered better than matrix
matching because it looks for curves, closed loops and other general
features of a character instead of comparing whole matrices.
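
Matrix matching as described above can be sketched as a nearest-template
search. The tiny 5x3 bitmaps, the `TEMPLATES` table and the use of the
count of differing pixels as the closeness measure are assumptions chosen
for illustration; a real system would use a much larger database and
size-normalized symbols.

```python
# Hypothetical database of character matrices: 5 rows of 3 pixels each,
# where "1" is ink and "0" is paper.
TEMPLATES = {
    "I": ["010", "010", "010", "010", "010"],
    "L": ["100", "100", "100", "100", "111"],
    "T": ["111", "010", "010", "010", "010"],
    "O": ["111", "101", "101", "101", "111"],
}


def pixel_distance(a, b):
    """Count the pixels that differ between two equally sized bitmaps."""
    return sum(pa != pb for row_a, row_b in zip(a, b)
               for pa, pb in zip(row_a, row_b))


def match_symbol(symbol, templates=TEMPLATES):
    """Return the closest template character and its ASCII value."""
    best = min(templates, key=lambda ch: pixel_distance(symbol, templates[ch]))
    return best, ord(best)


# An "L" with one stray ink pixel in the middle row still matches "L".
noisy_l = ["100", "100", "110", "100", "111"]
print(match_symbol(noisy_l))  # ('L', 76)
```

Because the closest matrix is always returned, a symbol that is not in the
database is still assigned some character; a practical matcher would also
reject matches whose distance is too large.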

3. Post-Processing:
This step applies spelling, context and grammar based corrections to the
output text in order to increase its accuracy.
• Manual Correction - errors are removed by hand, but some mistakes
may remain because of human error.
• Dictionary Based Correction - words are looked up in a dictionary
and corrected automatically.
• Context Based Correction - advanced language models are applied to
understand and correct the text.
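
Dictionary based correction can be sketched with the standard-library
`difflib` module, which finds the dictionary word most similar to an OCR
output word. The small `VOCABULARY` list, the function names and the 0.8
similarity cutoff are assumptions for the example, and lowercase input is
assumed for simplicity.

```python
import difflib

# Hypothetical dictionary; a real one would hold many thousands of words.
VOCABULARY = ["optical", "character", "recognition", "scanner"]


def correct_word(word, vocabulary=VOCABULARY, cutoff=0.8):
    """Replace a word with its closest dictionary entry, if one is similar enough."""
    if word in vocabulary:
        return word
    candidates = difflib.get_close_matches(word, vocabulary, n=1, cutoff=cutoff)
    # Leave the word unchanged when nothing in the dictionary is close.
    return candidates[0] if candidates else word


def correct_text(text, vocabulary=VOCABULARY):
    """Apply dictionary based correction to each whitespace-separated word."""
    return " ".join(correct_word(w, vocabulary) for w in text.split())


print(correct_text("optcal charactor recogniton"))
# optical character recognition
```

This approach looks at each word in isolation, so it cannot fix a
misrecognized word that happens to be a valid dictionary entry; that is
the case context based correction is meant to handle.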

Applications
OCR applications are classified by the platform they run on into the
following types:
• Application oriented OCR
• Server based OCR
• Desktop based OCR

As OCR systems grew in popularity, they began to face a variety of
problems arising from the original format of documents, such as corrupted
images, paper skew, interference from frames and lines, special kinds of
text, etc. All of these affect OCR accuracy. To improve recognition
accuracy, various techniques tied to the specific kind of image are
combined, such as standard expressions, dictionaries and the rich
information contained in the colors of the image. This is called
Application oriented OCR or Customized OCR. It is used for business card
OCR, invoice OCR, licence OCR, etc.

Server based OCR is used for larger volumes of documents and large groups
of users. Here, further processing of the scanned documents is handled by
server based OCR software, which can achieve high accuracy through better
features and functionality.
