creating speaking web pages: the text-to-speech integrated development environment (tts-ide)

27
Creating Speaking Web Pages: The Text-to-Speech Integrated Development Environment (TTS-IDE) David C. Gibbs Department of Mathematics and Computing University of Wisconsin-Stevens Point Stevens Point, WI 54481 [email protected]

Upload: alec-nieves

Post on 03-Jan-2016

37 views

Category:

Documents


0 download

DESCRIPTION

Creating Speaking Web Pages: The Text-to-Speech Integrated Development Environment (TTS-IDE). David C. Gibbs Department of Mathematics and Computing University of Wisconsin-Stevens Point Stevens Point, WI 54481 [email protected]. Introduction. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

Creating Speaking Web Pages: The Text-to-Speech Integrated

Development Environment (TTS-IDE)

David C. Gibbs

Department of Mathematics and Computing University of Wisconsin-Stevens PointStevens Point, WI 54481

[email protected]

Page 2: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

2

Introduction

University of Wisconsin - Stevens Point (USA) Department of Mathematics and

Computing Web Speech Research Group

• Undergraduate Computing students

Page 3: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

3

This Presentation What motivated development of the

TTS-IDE? What? Browsers speak? Which browsers? How do browsers speak? How do I create a speaking

presentation?

Page 4: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

4

The Motivation Online Course in JavaScript

programming : Spring 2004 Multiple learning styles, as applied

to online instruction Text presentation Online discussion Graphics Audio

Page 5: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

5

The Motivation, cont’d. First three are “easy” (text,

discussion, graphics) Audio delivery (then) was

problematic PowerPoint, saved as HTML

• large files – inaccessible to dial-up users• Clumsy to edit, maintain

Page 6: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

6

The Motivation, cont’d. Difficulties with audio

Recording file size 0.5 MB/minute Cannot easily edit the text

Updating presentations essential in technology disciplines

Page 7: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

7

A timely coincidence? Opera introduced “speaking

browser” – March 2004

Investigated Text-To-Speech (TTS)

Page 8: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

8

A Possible Solution Speech Synthesis

“The process of automatic generation of speech output from data input which may include plain text, marked up text or binary objects.” (w3.org, 2004)

For this paper, speech synthesis within a web browser

Page 9: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

9

Competing Technologies Two HTML-like markup languages

Microsoft SALT• Speech Application Language Tags

VoiceXML from W3C

Page 10: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

10

Microsoft SALT:Speech Application Language Tags

Uses O.S. “default voice” XP ships with Microsoft Mary, Mike,

Sam, LH Michael and Michelle Purchase voices: NeoSpeech Kate and

Paul ($30 US street price) DEMONSTRATION – speech properties

dialog (control panel) Browser use requires download

voice add-in for I.E. (65MB)

Page 11: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

11

VoiceXML Opera Browser (v8 and beyond)

Partnered with IBM ViaVoice Download Opera – follow Voice

instructions DEMONSTRATION

• Speak the text on any page – select text and right-click | “Speak”

Page 12: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

12

Code SamplesHello World: html doc

1 <html>2 <head>3 <title>Hello World</title>4 </head>5 <body>6 Hello World! <!-- displayed on the web page -->

7 </body>8 </html>

Page 13: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

13

Code SamplesHello World: SALT doc (p. 1 of 2)

1 <html xmlns:salt="http://www.saltforum.org/2002/SALT">2 <head>3 <title>HelloWorld</title>

4 <!-- SALT Add-in to Internet Explorer object -->5 <object id="SpeechTags" CLASSID="clsid:33cbfc53-a7de-491a-90f3-

0e782a7e347a" VIEWASTEXT>6 </object>

7 <!-- salt: Importing the namespace for implementation -->8 <?import namespace="salt“ implementation="#SpeechTags" />9 </head>

Page 14: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

14

Code SamplesHello World: SALT doc (p. 2 of 2)

1 <body onload="hello.start()">

2 <salt:prompt id="hello">3 Hello World <!-- this text is spoken -->4 </salt:prompt>

5 Hello World! <!-- text displayed on the web page -->

6 </body>7 </html>

SALT-HelloWorld.htm

Page 15: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

15

Code SamplesHello World: VoiceXML doc (p. 1 of 2)

1 <!DOCTYPE html PUBLIC "-//VoiceXML Forum//DTD XHTML+Voice 1.2//EN“ "http://www.voicexml.org/specs/multimodal/x+v/12/dtd/xhtml+voice12.dtd">

2 <html xmlns=http://www.w3.org/1999/xhtml xmlns:ev = "http://www.w3.org/2001/xml-events">

3 <head>4 <title>Hello World</title>

5 <form xmlns="http://www.w3.org/2001/vxml" id="sayHello">6 <block>7 Hello World! <!-- this text is spoken -->8 </block>9 </form>

1 </head>

Page 16: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

16

Code SamplesHello World: VoiceXML doc (p. 2 of 2)

1 <body ev:event="load" ev:handler="#sayHello">

2 Hello World! <!-- text displayed on the web page -->

3 </body>4 </html>

OPERA-HelloWorld.xml(open the Opera browser)

Page 17: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

17

Text-to-Speech IDE

Page 18: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

18

What does the IDE do? (SALT)

Modifies existing page by adding the necessary tags Creates the salt namespace

<html xmlns:salt="http://www.saltforum.org/2002/SALT"> Adds SALT Add-in

<object id="SpeechTags" CLASSID="clsid:33cbfc53-a7de-491a-90f3-0e782a7e347a" VIEWASTEXT>

</object>

Adds namespace <?import namespace="salt“ implementation="#SpeechTags" />

Adds SALT prompt tags around “notes”<salt:prompt id=“lecture">

Page 19: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

19

Preparatory tasks (I.E.)

Download and install the Internet Explorer Speech Add-in

Download the TTS-IDE

Page 20: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

20

Creating an “Online Lecture”

Run the TTS-IDE Add the text to be spoken Convert the page to use TTS Example: my home page

Page 21: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

21

Creating an “Online Lecture”:from PowerPoint slides

1. Open the PPT presentation in PowerPoint

2. Then save the PowerPoint file as a Web page (File | Save as Web Page).

3. From the TTS-IDE, open the <filename>.htm file.

Page 22: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

22

Creating an “Online Lecture”

4. Add the text to be spoken to each page.

5. Convert the page.6. Test the pages.7. Copy the pages to the course

website or CD.

Page 23: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

23

Demonstration Example file:

E:\WBE2007-SpeakingBrowserPPT.ppt

Page 24: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

24

Further Work on TTS-IDE Speech Recognition

For basic navigation • “Next Page” “Previous Page” “Links”

Custom Voices FestVox, Carnegie Mellon

Additional Language Support Client browser, speech engine

Page 25: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

25

Further Work Speech Recognition Speaking and Listening Browser

Hands-free browsing (via speech recognition)

RSS news feeds, by topic

Page 26: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

26

Conclusion – TTS-IDE Add text to create speaking online

“lectures.” Can use HTML or PPT – saved as web Can create for Internet Explorer

(SALT) or Opera (VoiceXML)

Page 27: Creating Speaking Web Pages:  The Text-to-Speech Integrated Development Environment  (TTS-IDE)

WBE 2007 March 16, 2007 Chamonix, France

27

Contact Information This presentation and download of IDE

(available 2007/03/26)

http://www.uwsp.edu/cis/dgibbs/WBE2007/

e-Mail

[email protected]