0% found this document useful (0 votes)
23 views19 pages

Review

Uploaded by

Anusha Kandula
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
23 views19 pages

Review

Uploaded by

Anusha Kandula
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 19

Mini Project Seminar

On
Digitalization of Handwritten Text
using Neural Networks
Gayatri Vidya Parishad College of Engineering
(Autonomous)
Madhurawada,Visakhapatnam-530 048
Under the esteemed guidance of
Dr.Ch.Sita Kumari
Sr. Assistant Professor
Department of Information Technology

Project Team Members


K.Anusha 17131A1251
A.Sailaja 17131A1206
Ch.Anjani Nookambica 17131A1223
N.Amulya 17131A1207
ABSTRACT

Handwritten Text Recognition has been one of the active and challenging research areas
in the field of image processing and pattern recognition. It has numerous applications
which include reading aid for partially blind, bank cheques and conversion of any
handwritten document into structural text. In this project, an attempt is made to recognise
handwritten characters for English alphabets . We use a NN for our task. It consists of
convolutional NN (CNN) layers, recurrent NN (RNN) layers and a final Connectionist
Temporal Classification (CTC) layer. The dataset contains text and digits. In this project,
each text image is resized into 128X32 pixels which is directly subjected to Training. That
is, each resized image has 4096 pixels and these pixels are given as input to the neural
network. The trained network is used for classification and recognition.
INTRODUCTION:

The objective of this project is to take handwritten English text or digit


image as input,process the text, train the neural network algorithm,to
recognize the text.

This project is aimed at developing software which will be helpful in


recognizing characters of English language and digits.This project is
restricted to English characters only.It can be further developed to
recognize the characters of different languages.It engulfs the concept
of neural network

PURPOSE:

• Document Reading
• Conversion of any handwritten document into structural text form.
Input layer Hidden layers Output layer

79 neurons
neurons
4096

Neural network
Architecture
TECHNOLOGIES USED:

• CNN(Convolutional Neural Networks)

• RNN(Recurrent Neural Networks)

PLATFORMS USED:

• Pycharm

• Python 3.6
EXISTING SYSTEM:

• Optical Character Recognition(OCR) is the existing system


for character recognition.

• It is an electronic translation of images of hand-written ,


type-written or printed text into machine editable text.

Drawbacks:
• It doesn’t have noise reduction.

• Direct use of OCR remains difficult problem to resolve,as it


leads to low reading accuracy.
PROPOSED SYSTEM:
• We use a NN for our task. It consists of convolutional NN (CNN)
layers, recurrent NN (RNN) layers and a final Connectionist
Temporal Classification (CTC) layer.

• The input image is fed into the CNN layers. These layers are
trained to extract relevant features from the image.

• The RNN output sequence is mapped to a matrix of size 32×80.

• While training the NN, the CTC is given the RNN output matrix and
the ground truth text and it computes the loss value.
REQUIREMENTS SPECIFICATION:
HARDWARE REQUIREMENTS:

• Ram:4GB or higher

• Disc Space:1TB

• Processor:Intel i5 or higher

SOFTWARE REQUIREMENTS:

• Operating System:WINDOWS

• Python3

• Packages :TensorFlow,numpy,opencv,keras

• Pycharm
FUNCTIONAL REQUIREMENTS:

• The system should process the input given by the user only if it is
an image file.

• System will show the error message to the user when the input
given is not in the required format.

• System should detect the characters present in the image.

• System should retrieve characters present in the image and


display them to the user.
NON-FUNCTIONAL REQUIREMENTS:

• Performance: Handwritten characters in the input image will


be recognized with high accuracy.
• Functionality: This software will deliver on the functional
requirements mentioned in this document.
• Availability: This system will retrieve the handwritten character
regions only if the image contains written characters in it.
• Recognition Ability: The software is very easy to use and
recognizes the characters from the image.
• Reliability: This software will work reliably for any type of
character images.
SYSTEM ARCHITECTURE/FLOW CHART:
Start

Real
Image
Noise
Removal
Classification of
image

Extraction of text from image

Text contained in the image


will be displayed

Stop
Image Acquisition

Preprocessing

Segmentation

Classification and
Recognition

Post processing

Process Flow
Image Acquisition:

• In Image acquisition,the recognition system acquires a scanned


image as an input image.

• The image should be in png format.


Pre-processing:

• The pre-processing is a series of operations performed on


scanned input image .

• Generally, noise filtering , smoothing should be done in this step.

• The pre-processing also defines a compact representation of the


character .

• Binarization process converts a gray scale image into a binary


image.

• Dilation of edges in the binarized image is done.


Segmentation:

• In this stage, an image of sequence of characters is decomposed into


sub-images of individual characters.
• The pre-processed input device is segmented into isolated characters by
assigning a number to each character using labelling process.
• Labelling process provides information about number of characters in
image.
• Each individual character is uniformly resized into pixels.
Classification And Recognition:

• The classification stage is the decision making part of the recognition


system.

• A feed forward back propagation neural network is used in this work


for classifying and recognizing the handwritten characters .

• The total number of neurons in the output layer is 79 as the proposed


system is designed to recognize English alphabets and digits.

Post-Processing:

• Post-Processing stage is the final stage of the proposed recognition


system.

• It prints the corresponding recognized character in the structured text


form.
UML DIAGRAM:

USE CASE DIAGRAM:

Upload
Image
Cance
l <<include User
Convert Image-
Initialize >> Gray Scale
<<include
>>
Pre-Process Image
<<include Gray Scale to
>> Binary format
Syste Recogniz
m e

Normalizatio
Generate n
Output
STATUS:

o Literature survey is completed.

o Studied various datasets for a fair selection of a feasible


dataset which could be applied for predicting all the
activities of the user.

o Installed required packages.

o Completed Model building


Thank You

You might also like