Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
I would like to express my deepest gratitude and thanks to my supervisor, muazzam maqsood, who
suggested the idea of the An Efficient Optical Character Recognition For urdu script and who gave me
the needed information to
start working on the project. Also, I would like to thank him for being supportive and for his guidance
through this semester and for giving me the necessary advices to be able to realize this project. I am
really grateful to his contribution. Moreover, I would like to thank him for his supervising
methodology that made my tasks easier and motivated me through this period.
Table of Content :
1 Abstract
2. Introduction
2.1. Objectives
3. Requirements
4 System descriptions
5 Conceptual diagrams
Introduction:
Urdu is the national language of Pakistan, and is understood by well over 300 million people around the
world. There is a need to convert historical database of Urdu literature into electronic form, so that
Urdu can prosper in the age of computers. Urdu text recognition endeavors to convert scanned Urdu
documents automatically into computerized text files.
Objectives
To develop an efficient OCR system for URDU
To develop post-processing algorithms for output generation and error correction of Urdu OCR
Existing systems
Optical Character Recognition (OCR) still remains a challenging task in image processing and machine
learning. There is little work on urdu as compare to other many languages. The accuracy of the existing
systems is not very high.
Proposed system
In proposed system we will develop the OCR that should improve the accuracy rate of the existing
system We will develop algorithm that reduce the working of manual system and convert the desired
image in soft form .The desired output can be used in word document file.
Requirements
As aforementioned, we need to retrieve text from scanned documents or any text
image and make it editable to reuse it and read it word by word. For instance, there are plenty
of books that are only available on printed format, so even if we scan them, they will be
Stored only as images. With the use of the Optical Character Recognition (OCR), these scanned
Documents will be available for later editing and can be reused by the user.
Functional Requirements
The system will perform the following functionality
Product Requirements:
Usability Requirements:
The system shall be user friendly and doesn't require any guidance to be used. In
Other words, the system has to be as simple as possible, so its users shall use it
easily.
Reliability Requirements:
The system should not have any unexpected failure. In order to avoid any failure's
Occurrence, the specifications have been respected and followed correctly. The only
problem that may occur in some cases is that the system do not get 100% of the
Efficiency Requirements:
Performance:
The system response time shall be adequate and sufficient enough, that's
why the time required for this system to response to its user's actions has Been managed and
controlled…