Text to speech conversion in Gujarati

TEXT TO SPEECH CONVERSION Prepared By : Nakum Ashvin P.

Upload: ashvin-nakum

Post on 02-Dec-2014

DESCRIPTION

text to speech conversion using MATLAB.

TRANSCRIPT

Page 1: Text to speech conversion in Gujarati

TEXT TO SPEECH CONVERSION

Prepared By : Nakum Ashvin P.

Page 2: Text to speech conversion in Gujarati

Introduction:

• The system converts scanned text into equivalent speech.

• The simulation is done in MATLAB.

• The input image is in .jpeg format.

• The output speech is in .wav format.

• The language of the scanned text is English.

Page 3: Text to speech conversion in Gujarati

Introduction to MATLAB:

• MATLAB stands for Matrix Laboratory.

• It is a high-performance language for technical computing.

• Its basic data element is a matrix.

• Typical uses:

1). Math and computation

2). Algorithm development

3). Modeling and simulation

4). Data analysis, exploration, and visualization

5). Scientific and engineering graphics

6). Application development, including building graphical user interfaces

Page 4: Text to speech conversion in Gujarati

What is Image Processing?

• Image processing is the study of any algorithm that takes an image as input and returns an image as output.

• An image may be defined as a two-dimensional function, f(x, y).

• x and y are spatial coordinates, and the amplitude of f at any pair of coordinates (x, y) is called the intensity of the image at that point.

• When x, y, and the values of f are all finite, discrete quantities, the image is called a digital image.

• A digital image is composed of a finite number of elements, each of which has a particular location and value. These elements are referred to as picture elements, image elements, or pixels.

Page 5: Text to speech conversion in Gujarati

Brief Description:

The project can be divided into three parts:

(1.) Scanning the text to be converted

(2.) Optical character recognition

(3.) Conversion of the recognized words to speech

Step 1:

• The text to be converted is scanned by a scanner into JPEG format.

• All further processing is then done in MATLAB.

Page 6: Text to speech conversion in Gujarati

Step 2: (Optical Character Recognition)

• Optical Character Recognition, usually abbreviated to OCR, is the electronic translation of scanned images of handwritten, typewritten or printed text into machine-encoded text.

• It is widely used to convert books and documents into electronic files, to computerize a record-keeping system in an office, or to publish the text on a website.

Block diagram of OCR:

TEXT AREA DETECTION → TEXT AREA PROCESSING → BINARY OUTPUT

Page 7: Text to speech conversion in Gujarati

Algorithm used in OCR

1). Convert the RGB image to a grayscale image and display it

2). Convert the image into a binary image after defining its threshold

3). Open a text file to write the text

4). Load the templates

5). Crop lines one by one from the image file

6). Match each letter with the templates using correlation

7). Place the best match in the text file

Page 8: Text to speech conversion in Gujarati

Working

First, the RGB image is converted into a grayscale image, which is then converted into a binary image.

[Figure: RGB image, GRAYSCALE image, BINARY image]

Page 9: Text to speech conversion in Gujarati

All of these images are stored in matrix form in MATLAB.
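
The two conversions can be sketched on a tiny image stored as a matrix. This is a minimal illustration in plain Python, not the project's MATLAB code (in MATLAB, built-ins such as `rgb2gray` and `im2bw` would typically do this work). The function names, the sample pixels, and the threshold of 128 are assumptions chosen for the example; the luminance weights are the common ITU-R BT.601 values.

```python
def rgb_to_gray(rgb):
    """Convert rows of (R, G, B) pixels (0-255) to gray levels (0-255)."""
    # Weighted sum of the three channels (BT.601 luminance weights).
    return [[round(0.299 * r + 0.587 * g + 0.114 * b) for (r, g, b) in row]
            for row in rgb]


def gray_to_binary(gray, threshold=128):
    """Pixels at or above the threshold become 1 (white), the rest 0 (black)."""
    return [[1 if value >= threshold else 0 for value in row] for row in gray]


# Hypothetical 2x3 scan: white paper with two dark pixels.
rgb = [
    [(255, 255, 255), (10, 10, 10), (255, 255, 255)],
    [(255, 255, 255), (255, 255, 255), (30, 20, 25)],
]

gray = rgb_to_gray(rgb)        # matrix of gray levels
binary = gray_to_binary(gray)  # matrix of 0s and 1s

for row in binary:
    print(row)
```

Each stage is just a matrix of numbers: triples for RGB, one value per pixel for grayscale, and 0 or 1 per pixel for the binary image.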

Page 10: Text to speech conversion in Gujarati

How is it decided whether it is a character or not?

• Black and white images are stored in MATLAB as a two-dimensional array containing 1s and 0s, where 0 represents black and 1 represents white. The values in a single row are all added together.

• A row with all zeroes is a row with only blank space and without any text, and a row with some non-zero elements is a row with some portion of the text. (This requires the text pixels to be 1 and the background 0, so a scan of dark text on light paper has to be inverted first.)
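
The row-sum test can be sketched as follows. This is an illustrative Python version under the assumption that text pixels are 1 and background pixels are 0; the helper names `find_runs` and `split_lines` and the sample page are hypothetical.

```python
def find_runs(sums):
    """Return (start, end) pairs of consecutive non-zero sums; end is exclusive."""
    runs, start = [], None
    for i, s in enumerate(sums):
        if s != 0 and start is None:
            start = i                  # first row of a text line
        elif s == 0 and start is not None:
            runs.append((start, i))    # blank row closes the line
            start = None
    if start is not None:
        runs.append((start, len(sums)))
    return runs


def split_lines(binary):
    """Cut a binary page (text = 1, background = 0) into text lines."""
    row_sums = [sum(row) for row in binary]
    return [binary[a:b] for a, b in find_runs(row_sums)]


# Hypothetical page: two text lines separated by a blank row.
page = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 0, 1],
    [0, 1, 0, 0, 1],
    [0, 0, 0, 0, 0],
    [1, 1, 0, 1, 0],
]

lines = split_lines(page)
print(len(lines))  # number of text lines found
```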

Page 11: Text to speech conversion in Gujarati

Separation of characters from a line

• The operations that were performed on the rows while extracting lines are now performed on the columns. The values in a single column are all added together.

• If the sum is not equal to zero, there is at least one non-zero value in the column. If the sum is equal to zero, the column contains only zero elements.

• A column with all zeroes is a column with only blank space and without any text, and a column with some non-zero elements is a column with some portion of the text.
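
The column-sum test can be sketched in the same way. This is an illustrative Python version, again assuming text pixels are 1 and background pixels are 0; the function name `split_characters` and the sample line are hypothetical.

```python
def split_characters(line):
    """Split one text line (rows of 0/1 pixels, text = 1) at all-zero columns."""
    # zip(*line) walks the matrix column by column.
    col_sums = [sum(col) for col in zip(*line)]
    chars, start = [], None
    # The extra trailing 0 closes a character that touches the right edge.
    for x, s in enumerate(col_sums + [0]):
        if s != 0 and start is None:
            start = x
        elif s == 0 and start is not None:
            chars.append([row[start:x] for row in line])
            start = None
    return chars


# Hypothetical line holding two characters separated by one blank column.
line = [
    [0, 1, 1, 0, 1],
    [0, 1, 0, 0, 1],
]

chars = split_characters(line)
print(len(chars))  # number of characters found
```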

Page 12: Text to speech conversion in Gujarati

Identifying individual characters

• Once the characters have been extracted from the image, they are passed to the read letter block for identification.

• A two-dimensional correlation function is then used to find the best match between the extracted character and the characters stored in the templates file. The template character that gives the highest correlation is taken to be the extracted character, and it is written into a text file.
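
The matching step can be sketched as below. The `corr2` function here computes the two-dimensional correlation coefficient (the same quantity MATLAB's `corr2` returns) in plain Python. The 3x3 templates, the `read_letter` helper, and the assumption that the extracted character has already been resized to the template size are illustrative choices, not details taken from the slides.

```python
import math


def corr2(a, b):
    """2-D correlation coefficient between two equally sized matrices."""
    fa = [v for row in a for v in row]
    fb = [v for row in b for v in row]
    mean_a, mean_b = sum(fa) / len(fa), sum(fb) / len(fb)
    num = sum((x - mean_a) * (y - mean_b) for x, y in zip(fa, fb))
    den = math.sqrt(sum((x - mean_a) ** 2 for x in fa)
                    * sum((y - mean_b) ** 2 for y in fb))
    return num / den if den else 0.0   # constant images have no correlation


def read_letter(glyph, templates):
    """Return the name of the template with the highest correlation."""
    return max(templates, key=lambda name: corr2(glyph, templates[name]))


# Hypothetical 3x3 templates (text = 1, background = 0).
templates = {
    "I": [[0, 1, 0], [0, 1, 0], [0, 1, 0]],
    "L": [[1, 0, 0], [1, 0, 0], [1, 1, 1]],
    "T": [[1, 1, 1], [0, 1, 0], [0, 1, 0]],
}

# An extracted character: an "L" with one pixel missing.
noisy_l = [[1, 0, 0], [1, 0, 0], [1, 1, 0]]

print(read_letter(noisy_l, templates))
```

Because the coefficient is normalized, a character with a few wrong pixels still correlates most strongly with its own template, which is why the highest score is used rather than an exact match.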

Page 13: Text to speech conversion in Gujarati

Step 3: (Text To Speech Conversion)

• This is done by programming in MATLAB.

• For each recognized character, the MATLAB program outputs the pre-recorded voice associated with that character.

• Thus the conversion is done.
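
The lookup-and-play idea can be sketched with Python's standard `wave` module. The slides do not say how the recordings are stored or played, so everything specific here is an assumption: the `VOICE_BANK` dictionary, the 8000 Hz sample rate, and the sine tones that stand in for real recorded clips.

```python
import io
import math
import struct
import wave

RATE = 8000  # samples per second (assumed)


def tone(freq_hz, seconds=0.1):
    """Stand-in for a recorded clip: 16-bit mono sine-wave frames as bytes."""
    n = int(RATE * seconds)
    return b"".join(
        struct.pack("<h", int(12000 * math.sin(2 * math.pi * freq_hz * i / RATE)))
        for i in range(n)
    )


# Hypothetical voice bank: one clip per character, silence for a space.
VOICE_BANK = {"A": tone(440), "B": tone(494), " ": bytes(2 * 800)}


def speak(text, out):
    """Write the clip of each known character of `text`, in order, as one WAV."""
    with wave.open(out, "wb") as wav:
        wav.setnchannels(1)   # mono
        wav.setsampwidth(2)   # 16-bit samples
        wav.setframerate(RATE)
        for ch in text.upper():
            if ch in VOICE_BANK:          # unknown characters are skipped
                wav.writeframes(VOICE_BANK[ch])


buffer = io.BytesIO()   # in-memory file; a path such as "out.wav" also works
speak("ab a", buffer)
```

In a real system the dictionary would map each character to a clip loaded from a recorded .wav file, and all clips would need the same sample rate and sample width to be joined this way.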

Page 14: Text to speech conversion in Gujarati

APPLICATION:

• The blind use Braille script to read or write any text. This scheme uses a pattern of pockmarks or indentations to represent each letter of the alphabet. Reading is done by feeling these notches with the fingers. This makes reading a very slow and laborious task for them.

• Using this system, a blind person can find out the content of any letter.

Page 15: Text to speech conversion in Gujarati

THANK YOU