PRE-SYNOPSIS REPORT

PROPOSED TITLE- Optical Character Recognition (OCR) Based Speech Synthesis System
using LabVIEW
AREA OF WORK- LabVIEW and OCR based Speech Synthesis.
TYPE OF MODEL- Hybrid (hardware + software-LabVIEW)
PROJECT CONCEPT- Speech is a more effective means of communication than text for blind and
visually impaired persons, because they can respond to sound. This project aims to develop a
cost-effective and user-friendly OCR based speech synthesis system. Such a system is expected
to significantly improve the access that visually impaired persons have to printed text by
delivering it as sound.
The project has two parts: Optical Character Recognition and text to speech conversion.
An OCR based speech synthesis system is a computer-based system that should be able to
read any text and give voice output once the text has been scanned and submitted to the OCR
system.
PROJECT OBJECTIVES- The main objectives of this project are:
1. Develop Optical Character Recognition using LabVIEW- In the optical character
recognition process, an image of printed text is used as input to the OCR system. The
following steps are involved in Optical Character Recognition:
   Image Acquisition- The image will be captured using a digital HP scanner. The
   flap of the scanner will be kept open during acquisition to obtain a
   black background. Image configuration will be done with the help of the IMAQ
   Create subVI of LabVIEW.
   Binarization- This is the process of converting a greyscale image into a binary
   image using a threshold value of 175. Pixels lighter than the threshold are
   converted to white and all others to black.
   Segmentation Process- The input to this step is the binary image obtained from the
   previous step. Segmentation involves three steps: line detection and
   segmentation, word segmentation, and character segmentation.
   Template Matching- In this step, the correlation between the stored templates and
   each segmented character will be computed using a correlation VI.
   Recognition- After a character is segmented, its image is stored in a structure. The
   character then has to be identified against a pre-defined character set. Preliminary
   data will be stored for all characters of the identified font and size, and this
   information will be captured for every recognized character.
2. Develop text to speech module- In the text to speech module, the text recognized by the
OCR system is the input to the speech synthesis system, which converts it into speech in
.wav file format. The module creates a wave file named output.wav, which can be played
using a wave file player.

3. Combine the OCR and text to speech modules to obtain the desired result.
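
The OCR steps listed under objective 1 can be illustrated outside LabVIEW with a short Python
sketch. This is not the project's LabVIEW implementation; it only mirrors the described logic
under stated assumptions. The threshold of 175 comes from the report. The function names, the
use of projection profiles for segmentation, the normalized correlation formula, and the
assumption that character pixels are the white (1) pixels after binarization are all
illustrative choices that the report does not specify.

```python
from math import sqrt

THRESHOLD = 175  # threshold value stated in the report


def binarize(gray, threshold=THRESHOLD):
    """Pixels lighter than the threshold become white (1), all others black (0)."""
    return [[1 if px > threshold else 0 for px in row] for row in gray]


def segment_runs(profile):
    """Return (start, stop) pairs for each run of non-zero values in a 1-D profile."""
    runs, start = [], None
    for i, value in enumerate(profile):
        if value and start is None:
            start = i                      # a run begins
        elif not value and start is not None:
            runs.append((start, i))        # a run ends just before index i
            start = None
    if start is not None:
        runs.append((start, len(profile)))
    return runs


def segment_lines(binary):
    """Assumed approach: split the page into text lines using row sums."""
    return segment_runs([sum(row) for row in binary])


def segment_characters(line_rows):
    """Assumed approach: split one text line into characters using column sums."""
    return segment_runs([sum(col) for col in zip(*line_rows)])


def correlation(a, b):
    """Normalized correlation coefficient between two equally sized images."""
    xs = [px for row in a for px in row]
    ys = [px for row in b for px in row]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    num = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    den = sqrt(sum((x - mx) ** 2 for x in xs) * sum((y - my) ** 2 for y in ys))
    return num / den if den else 0.0


def recognize(char_img, templates):
    """Return the label of the stored template that correlates best with char_img."""
    return max(templates, key=lambda label: correlation(char_img, templates[label]))


# Hypothetical 3x3 templates, used only to exercise the functions above.
TEMPLATES = {
    "I": [[0, 1, 0], [0, 1, 0], [0, 1, 0]],
    "L": [[1, 0, 0], [1, 0, 0], [1, 1, 1]],
}
print(recognize(TEMPLATES["I"], TEMPLATES))
```

If the scanned text turns out dark on a light page, the binary image would need to be inverted
before segmentation so that character pixels are the non-zero ones. Real templates would also
have to be resized to match each segmented character before correlation.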
BLOCK DIAGRAM-

Fig. Block Diagram Text to Speech Synthesis

Fig. Flow Chart OCR System


APPLICATIONS-
1. The most important application of speech synthesis is in reading and communication aids
for the blind.

2. Synthesized speech can also be used in some educational situations. A computer with a
speech synthesizer can teach 24 hours a day, 365 days a year.
REFERENCES-
1. T. Dutoit, "High quality text-to-speech synthesis: a comparison of four candidate
algorithms," IEEE International Conference on Acoustics, Speech, and Signal Processing,
1994, pp. 19-22.
2. H. Li et al., "Automatic Text Detection and Tracking in Digital Video," IEEE Trans. on
Image Processing, vol. 9(1), pp. 147-156, 2000.
NAME OF THE TEAM MEMBERS- Ashita Srivastava (1312231040), Deeksha Anand
(1312231056), Deepak Kumar Tiwari (1312231057)
