phonet presentation

PHONET

Presented by: RASHMI R (4AD12EC063)
Guide: Guruprasad K N, Assistant Professor, ECE Dept.


CONTENTS
ABSTRACT
INTRODUCTION
MOTIVATION
BLOCK DIAGRAM
WORKING PRINCIPLE
PROPOSED METHODOLOGY
ADVANTAGES AND DISADVANTAGES
CONCLUSION
BIBLIOGRAPHY


ABSTRACT

Voice-based web access is a rapidly developing technology. PHONET is one platform that provides it.

PHONET makes information accessible to users who may not be able to read or write, or who do not have access to the Internet.


Unlike a computer interface, a voice interface needs no keyboard and no mouse, which removes these barriers to Internet access.

It requires no training.

It is accessible to anyone with a telephone.

INTRODUCTION

PHONET combines several complex technologies: Speech Recognition (SR), Text-to-Speech (TTS) conversion, and Artificial Intelligence (AI).

SR, TTS, and AI are integrated into an intelligent platform (PHONET) that provides voice-based web access, which involves document processing and document rendering.


Document processing consists of two approaches: telephone browsing and transcoding.

In document rendering, the major problem is text rendering.

Companies that deliver content such as news, weather, horoscopes, and stock quotes over the phone are called voice portals.


Voice portals were the first web applications that tried to integrate websites with voice.

With the Voice Internet technology PHONET, anyone can surf, search, send and receive email, and conduct e-commerce transactions.

PHONET technology is faster and cheaper.


The PHONET platform acts as an Intelligent Agent (IA) located between the user and the Internet.

The IA automates the process of rendering information from the Internet to the user in an audio format that is meaningful, precise, and pleasant to listen to.


Language Translation

The IA includes a language translation engine that dynamically translates web contents from one language into another in real time.

Thus, a Chinese-speaking person can ask, in Chinese, to surf an English website: the Intelligent Agent accesses the English website, extracts its content, translates it into Chinese, and reads it back to the user in Chinese.
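The translation flow described above (access, extract, translate, read back) can be sketched as a chain of pluggable stages. This is only an illustrative sketch, not PHONET's actual implementation: the function `render_in_user_language` and the `fake_*` stand-ins are hypothetical names, and a real system would plug in an HTTP client, a content extractor, a translation engine, and a TTS engine.

```python
import re
from typing import Callable


def render_in_user_language(
    url: str,
    user_lang: str,
    fetch: Callable[[str], str],
    extract_text: Callable[[str], str],
    translate: Callable[[str, str], str],
    synthesize: Callable[[str, str], bytes],
) -> bytes:
    """Run the four stages in order and return audio for the caller."""
    html = fetch(url)                         # access the website
    text = extract_text(html)                 # extract its content
    translated = translate(text, user_lang)   # translate into the user's language
    return synthesize(translated, user_lang)  # read it back as audio


# Toy stand-ins (assumptions) so the sketch runs without a network or real engines.
def fake_fetch(url: str) -> str:
    return "<p>Hello</p>"


def fake_extract(html: str) -> str:
    return re.sub(r"<[^>]+>", "", html).strip()


def fake_translate(text: str, lang: str) -> str:
    return f"[{lang}] {text}"


def fake_synthesize(text: str, lang: str) -> bytes:
    return text.encode("utf-8")


audio = render_in_user_language(
    "http://example.com", "zh",
    fake_fetch, fake_extract, fake_translate, fake_synthesize,
)
```

Keeping each stage behind a plain callable makes it possible to swap a translation or speech engine without touching the rest of the flow.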


Grammar processing continues with compilation and optimization where redundancies are eliminated. The word vocabulary associated with the grammar is further processed by a Text-To-Speech (TTS) pronunciation module that generates phonetic transcriptions for each word of the grammar.

Since the TTS engine uses pronunciation rules, it is not limited to dictionary words. The grammar and vocabulary are then loaded into the speech recognizer. This process typically takes about a second. At the same time, the Web document is described to the user.
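The grammar steps above (eliminate redundancies, then generate a rule-based phonetic transcription for each vocabulary word) can be illustrated with a small sketch. Everything here is an assumption for illustration: the letter-to-sound table is a toy rule set, and `compile_grammar` and `transcribe` are hypothetical names, not part of any real recognizer or TTS engine.

```python
import re

# Toy letter-to-sound rules (assumption); real TTS rule sets are far larger.
DIGRAPHS = {"ch": "CH", "sh": "SH", "th": "TH", "oo": "UW", "ee": "IY"}
LETTERS = {
    "a": "AE", "b": "B", "c": "K", "d": "D", "e": "EH", "f": "F", "g": "G",
    "h": "HH", "i": "IH", "j": "JH", "k": "K", "l": "L", "m": "M", "n": "N",
    "o": "AA", "p": "P", "q": "K", "r": "R", "s": "S", "t": "T", "u": "AH",
    "v": "V", "w": "W", "x": "K S", "y": "Y", "z": "Z",
}


def transcribe(word: str) -> str:
    """Apply the rules left to right, trying two-letter rules first."""
    phones, i = [], 0
    while i < len(word):
        pair = word[i:i + 2]
        if pair in DIGRAPHS:
            phones.append(DIGRAPHS[pair])
            i += 2
        else:
            phones.append(LETTERS[word[i]])
            i += 1
    return " ".join(phones)


def compile_grammar(phrases):
    """Normalize phrases, drop duplicates, and build a pronunciation table."""
    unique = []
    for phrase in phrases:
        cleaned = " ".join(re.sub(r"[^a-z ]+", " ", phrase.lower()).split())
        if cleaned and cleaned not in unique:
            unique.append(cleaned)
    vocabulary = {w: transcribe(w) for p in unique for w in p.split()}
    return unique, vocabulary


phrases, vocabulary = compile_grammar(["Sports News", "sports  news!", "Weather"])
```

Because the transcription is rule-based, a word that appears in no dictionary still receives a pronunciation, which is the property the slide highlights.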


MOTIVATION

When we are in the car or away from the office or computer, accessing the Web is difficult.

An increasing number of people prefer an interface that allows them to hear and speak rather than see and click or type.

Some existing Internet users have also identified problems with the visual Internet experience.


Proposed System

Fig1:- PROPOSED PLAN FOR THE WORKING OF PHONET

WORKING PRINCIPLE

Subscribers dial a toll-free number and start accessing the Internet using voice commands.

Speech recognition technology in the company's system allows users to give simple commands, such as "go to Yahoo" or "read my email", to get to the Net-based information they want.

When the user sends a request to access the Internet, the request goes to the voice browser.

If the request is spoken, speech recognition converts the voice into text.

Users will be able to quickly locate information such as breaking news, traffic reports, directions, or anything else of interest on the World Wide Web.


Using text-to-speech technology, an "intelligent agent" reads the requested information aloud in a computerized voice and processes the user's voice commands.
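Once speech recognition has produced text, the agent must decide what the caller asked for. The sketch below shows one simple way to map a recognized transcript such as "go to Yahoo" or "read my email" to an action. The `Command` type, the pattern list, and the action names are all hypothetical; the source does not describe how PHONET interprets commands.

```python
import re
from typing import NamedTuple, Optional


class Command(NamedTuple):
    action: str
    target: Optional[str] = None


# Hypothetical command patterns, checked in order.
PATTERNS = [
    (re.compile(r"^go to (?P<target>.+)$"), "navigate"),
    (re.compile(r"^read my (?:email|e-mail|mail)$"), "read_email"),
    (re.compile(r"^search for (?P<target>.+)$"), "search"),
]


def parse_command(transcript: str) -> Command:
    """Turn recognized text into an action the voice browser can act on."""
    text = " ".join(transcript.lower().split())  # normalize case and spacing
    for pattern, action in PATTERNS:
        match = pattern.match(text)
        if match:
            return Command(action, match.groupdict().get("target"))
    # Unrecognized input is kept so the agent can ask the caller to repeat it.
    return Command("unknown", text)
```

For example, `parse_command("Go to Yahoo")` yields a `navigate` command with target `yahoo`, which the voice browser could then resolve to a site.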

Fig2:- INTELLIGENT AGENT AND KEY FEATURES OF PHONET

The technologies employed are:
1. Speech processing
2. Text-to-speech translation
3. Artificial Intelligence


Rendering is achieved by using Page Highlights (a method to find and speak the key contents of a page), finding only the relevant contents on a linked page, assembling those contents, and providing easy navigation. These key steps use the information available in the visual web page itself, together with algorithms that draw on features such as text content, color, font size, links, paragraphs, and amount of text. Artificial Intelligence techniques are used in this automated rendering process. This is similar to how a person reads a visual page: selecting the information of interest and then reading it.
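To make the idea of Page Highlights concrete, the sketch below scores blocks of a page using the kinds of features named above (font size, links, color emphasis, amount of text) and returns the highest-scoring blocks to be spoken first. The weights, the `Block` structure, and the function names are assumptions chosen for illustration; the source does not specify the actual algorithm.

```python
from dataclasses import dataclass


@dataclass
class Block:
    text: str
    font_size: float   # in pixels
    link_count: int    # number of links inside the block
    emphasized: bool   # e.g. set in a distinct color


def score(block: Block, body_font_size: float = 16.0) -> float:
    """Heuristic relevance score; all weights are illustrative assumptions."""
    words = len(block.text.split())
    if words == 0:
        return 0.0
    size_ratio = block.font_size / body_font_size      # headings score higher
    link_density = min(block.link_count / words, 1.0)  # menus are mostly links
    length_score = min(words, 50) / 50                 # favor real paragraphs
    emphasis = 0.25 if block.emphasized else 0.0
    return size_ratio * (1.0 - link_density) + length_score + emphasis


def page_highlights(blocks, top_n: int = 3):
    """Return the text of the top-scoring blocks, best first."""
    ranked = sorted(blocks, key=score, reverse=True)
    return [b.text for b in ranked[:top_n]]
```

With these weights, a navigation bar made up entirely of links scores close to zero, while a large headline or a long paragraph ranks near the top.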


1. Speech Processing

Speech processing is the study of speech signals and of the methods used to process them. The speech signals are processed in a digital representation.

Fig3:- BASIC FUNCTION OF SPEECH PROCESSING AND DIGITAL SPEECH PROCESSING REPRESENTATION

2. Text To Speech Translation

Text-to-speech translation is the process of converting text data into speech signals.

Fig4:- TEXT TO SPEECH CONVERSION

3. Artificial Intelligence

Artificial Intelligence is the intelligence exhibited by software.

Fig5:- ARTIFICIAL INTELLIGENCE

PHONET offers the promise of allowing everyone to access web-based services from any phone.

Users will be able to choose whether to respond by a key press or a spoken command.
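One way to support both input styles is to resolve a key press and a spoken word to the same menu option. The sketch below assumes a hypothetical three-item menu and hypothetical function names; it is not taken from PHONET itself.

```python
# Hypothetical menu: telephone keypad digit -> option name.
MENU = {"1": "news", "2": "weather", "3": "email"}


def menu_prompt(menu=MENU) -> str:
    """Build the prompt that would be read to the caller."""
    return " ".join(f"Press {key} or say {name}." for key, name in menu.items())


def resolve_choice(user_input: str, menu=MENU):
    """Accept either a keypad digit or the spoken option name."""
    token = user_input.strip().lower()
    if token in menu:            # key press, e.g. "2"
        return menu[token]
    if token in menu.values():   # spoken command, e.g. "weather"
        return token
    return None                  # not understood; the caller can be re-prompted
```

Both `resolve_choice("2")` and `resolve_choice("Weather")` select the weather option, so the rest of the system does not need to know which input style the caller used.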

The main plan is:
Accept the voice commands
Output in audio format


ADVANTAGES

The possibility of accessing the web through an ordinary phone

Email (send, receive, compose, copy, forward, reply, delete and more)

Airline reservations and tracking


DISADVANTAGES

Complexity in Hardware interface

All users must know English, as the user interface is provided in English


CONCLUSION

PHONET is a new technology that provides a true audio Internet experience. Using an ordinary telephone and simple voice commands, users will be able to surf the Internet and hear the information they want.

Any web page will be accessible, including but not limited to sites written with the Wireless Application Protocol (WAP).

Fig6:- GOOGLE VOICE SEARCH

Fig7:- SPEECH RECOGNITION

Fig8:- SIMPLE REPRESENTATION OF VOICE RECOGNITION

BIBLIOGRAPHY

http://www.w3.org/Voice/

http://www.voicexml.org/

Internet speech Inc.

http://www.lhs.com/

http://www.dcp.ucla.edu/
