wired for speech: how voice activates and advances the human-computer relationship clifford nass...

56
Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Upload: daniela-mcdaniel

Post on 18-Dec-2015

220 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Wired for Speech:

How Voice Activates and Advances the Human-Computer Relationship

Clifford Nass

Stanford University

Page 2: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Speaking is Fundamental

Fundamental means of human communication Everyone speaks

IQs as low as 50 Brains as small as 400 grams

Humans are built for words Learn new word every two hours for 11 years

Page 3: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Listening to Speech is Fundamental

Womb: Mother’s voice differentiation One day old: Differentiate speech vs. other sounds

Responses Brain hemispheres

Four day olds: Differentiate native language vs. other languages

Adults: Phoneme differentiation at 40-50 phonemes per second Cope with cocktail parties

Page 4: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Listening Beyond Speech is Fundamental

Humans are acutely aware of para-linguistic cues Gender Personality Accent Emotion Identity

Page 5: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Humans are Wired for Speech

Special parts of the brain devoted to Speech recognition Speech production Para-linguistic processing Voice recognition and discrimination

Page 6: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Therefore …

Voice interface should be the most

Enjoyable,

Efficient, &

Memorable

method for providing and acquiring information

Page 7: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Are They? No!Why Not?

Machines are different than humans Technology is insufficient

But are these good reasons?

Page 8: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Critical Insights

Voice = Human

Technology Voice = Human Voice

Human-Technology Interaction =

Human-Human Interaction

Page 9: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Where’s the Leverage?

Social sciences can give us What’s important What’s unimportant Understanding Methods Unanswered questions

Page 10: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Male or Female Voice?

Is gender important? Can technology have gender?

Page 11: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

The Case of BMW

Page 12: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Brains are Built to Detect Voice Gender

First human category Infants at six months Self-identification by 2-3 years old Within seconds for adults

Multiple ways to recognize gender in voice Pitch Pitch range Variety of other spectral characteristics

Page 13: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Once Person Identifies Gender by Voice

Guides every interaction Same-gender favoritism

Trust Comfort

Gender stereotyping

Page 14: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Gender and Products

Gender should match product More appropriate More credible

Mutual influence of voice and product gender Female voices feminize products (and conversely) Female products feminize voices (and conversely) “Match principle”

Page 15: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Research Context

“Gender” of voice (synthetic) Gender of user “Gender” of product E-Commerce website

Page 16: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Examples of Advertisements

“Female” voice; female product

“Male” voice; female product

“Male” voice; male product

Page 17: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Appropriateness of the Voice

2

3

4

Female Product Male Product

Female Voice

Male Voice

Page 18: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Voice/Product Gender Influences

Female voices feminize products;Male voices masculinize products Strongest for opposite gender products

Female products feminize voices;Male products maculinize voices

Strong preference when voice matches product

Page 19: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Results for User Gender

People trust voices that match themselves Females conform more with “female” voices Males conform more with “male” voices

People like voices that match themselves Females like the “female” voice more Males like the “male” voice more

Page 20: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Other Results

Participants denied stereotyping technology Participants denied harboring stereotypes!

Page 21: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

People stereotype voices by gender

Voice “gender” should match content “gender” Product descriptions Teaching Praise Jokes

Page 22: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Gender is Marked by Word Choice Female speech More “I,” “you,” “she,” “her,” “their,” “myself” Less “the,” “that,” these,” “one,” “two,” “some

more” More compliments More apologies More relationships between things Less description of particular things “They” for living things only

Voices should speak consistently with their “gender”

Page 23: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Selecting Voices

Voices manifest many traits Gender Personality Age Ethnicity

Voice traits should match content traits Content Language style Appearance (e.g., accent and race) Context

Voice traits should match user traits

Page 24: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

If Only One Voice

Consider stereotypes Masculine vs. feminine (same voice)

Boost high frequencies (feminine) Boost low frequencies (masculine)

Page 25: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Emotions

Page 26: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Emotion and Voice

Voice is the first indicator of emotion Voice emotion has many markers

Pitch Value Range Change rate

Amplitude Value Range Change rate

Words per minute

Page 27: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Emotion is always relevant

User has initial emotion Interactions create emotions

Voice is particularly powerful Frustration is particularly powerful

Page 28: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Emotion and Technology

Could technology-based voices exhibit emotion?

Could technology-based voice emotion influence people?

Page 29: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Research Context

Create upset or happy drivers Have them “drive” for 15 minutes Female voice gives information and makes suggestions

Upbeat

Subdued

Page 30: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Number of Accidents

1

5

9

Happy Driver Upset Driver

Upbeat Voice

Subdued Voice

Page 31: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Results

People speak to car much more when emotion is consistent

People like car much more when emotion is consistent

Page 32: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Implications

User emotion is a critical part of any interaction

Emotion must match content Perception of voice

Trust Intelligence

User Performance Comfort Enjoyment

Page 33: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

One Voice Emotion: Select for Goal

Overall liking Slightly happy voice

Attention-getting Anger Sadness

Trust and vulnerability Sadness (mild)

Page 34: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

If You Can’t Manipulate Voice Emotion

Manipulate content Manipulate music

Page 35: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Using the First Person: Should IT say “I”

Page 36: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Should Voice Interfaces say “I”?

When should a voice interface say “I”? Does synthetic vs. recorded speech affect the

answer to the previous question?

Page 37: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

The Importance of “I”

“I” is the most basic claim to humanity “I think, therefore I am” “I, Robot” Dobby and monsters don’t say “I”

“I” is the marker of responsibility “I made a mistake” vs.

“Mistakes were made”

Page 38: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Research Context Auction site Telephone interface with speech recognition Recorded bidding behavior Online questionnaire

Page 39: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Average Bidding Price

20

22

24

26

'I' No ''I'

Recorded Voice

Synthetic Voice

Page 40: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Results

When “I”+Recorded or “No I”+Synthetic System is higher quality Users were much more relaxed

“No I” is more objective “I” is more “present”

Page 41: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Results

“I” is right for embodiments Robots Characters Autonomous intelligence (“KITT”)

“I” is wrong when voice is second fiddle to technology Traditional car Heavily-branded products

Page 42: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Design

Text-to-Speech is a machine voice Recorded speech is a human voice Design questions are

Not philosophical questions Not judgment questions Experimentally verifiable

Page 43: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Mistakes are Tough to Talk About

Page 44: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Who is Responsible for Errors?

Recognition is not perfect When system fails, who should be assigned

responsibility? System User No one

Page 45: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Responding to Errors

Modesty Likable Unintelligent (people believe modesty!)

Criticism Isn’t really constructive Unpleasant Intelligent

Scapegoating Effective Safe

Page 46: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

System Responses to Errors

System blame (most common)

No blame

User blame

Page 47: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Research context

Amazon-by-phone Numerous planned interaction errors

Page 48: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

2

3

4

Likelihood of Purchase

No blame

System blame

User blame

Book Buying

Page 49: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Results

Neutral and system blame Sell much better than user blame

Neutral blame Easier to use than system blame Nicer than system blame

User blame is most intelligent! System blame is least intelligent

Page 50: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Results for Errors

Take responsibility when unavoidable Increases trust Increases liking Weak negative effect on intelligence

Ignore errors whenever possible Duck responsibility to third party if needed

Blame the phone line Blame the road

Page 51: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Results for Errors

Show commitment to the interaction Make guesses Show concern Griceian maxims

Quantity Relevance Clarity

Page 52: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Design

Error recovery is critically important Negative experiences are more memorable Adaptation is crucially important

Flattery is effective Note times when interaction is successful

Design to avoid errors Alignment (good repetition) Air quotes

Scripting is important at all stages of the interaction

Page 53: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Other Key Findings

Personality Accents Multiple voices and mixing voices Input vs. output modality Microphone type

Page 54: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Tying it All Together

Voice interfaces can be the most enjoyable, efficient, and memorable method for acquiring and providing information

Voice interfaces turn up the volume knob in user responses

The key is leveraging social aspects of speech

Page 55: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Summary – Part 1

Humans are wired for speech Interactions with voice interfaces are

fundamentally social Same social rules Same social expectations

Page 56: Wired for Speech: How Voice Activates and Advances the Human-Computer Relationship Clifford Nass Stanford University

Summary – Part 2

Social aspects of voice interfaces can be beneficial Users perform better Users feel better Users understand better

Social aspects of voice interfaces cannot be ignored Social audit is critical Social design is critical

Design psychology can be leveraged Less expensive than technology More effective than technology Broader impact than technology