human computer interaction

22
Human Computer Interaction

Upload: verda

Post on 19-Jan-2016

34 views

Category:

Documents


0 download

DESCRIPTION

Human Computer Interaction. Outline. Multimodal HCI Issues More than 1 sensory channel in interaction e.g. Sound, text, hypertext, animation, video, gestures, vision Used in a range of applications: Particularly good for users with special needs, and virtual reality Will cover - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Human Computer Interaction

Human Computer Interaction

Page 2: Human Computer Interaction

Outline Multimodal HCI Issues More than 1 sensory channel in interaction

e.g. Sound, text, hypertext, animation, video, gestures, vision

Used in a range of applications: Particularly good for users with special needs, and

virtual reality

Will cover General terminology Speech Non-speech sounds Handwriting Text and hypertext Animation and video

Considering applications as well as principles

Page 3: Human Computer Interaction

Usable Senses

The 5 senses (sight, sound, touch, taste & smell) are used by us everyday Each is important on its own

Together, they provide a fuller interaction with the natural world

Computers rarely offer such a rich interaction

Can we use all the available senses? Ideally, YES

Practically, NO

We can use Sight, Sound & Touch (sometimes)

We cannot (yet) use Taste & Smell

3

Page 4: Human Computer Interaction

Multi-modal vs. Multimedia

Multi-modal systemsUse more than 1 sense (or mode) of interactione.g. Visual and Aural senses: a text processor

may speak the words as well as echoing them to the screen

Multimedia SystemsUse a number of different media to communicate

informationE.g. a computer-based teaching system:

May use video, animation, text and still images: Different media all using the visual mode of interaction

May also use sounds, both speech and non-speech: Two more media, now using a different mode

4

Page 5: Human Computer Interaction

Speech

Human beings have great & natural mastery of speech Makes it difficult to appreciate the complexity, but It’ an easy medium for communication

Structure of Speech Phonemes

40 of them: basic, atomic units

Sound slightly different depending on the context they are in

This larger set of sounds are:Allophones

All the sounds in the language: between 120 & 130 of them.

These are formed intoMorphemes: smaller unit of language that has meaning

5

Page 6: Human Computer Interaction

Speech (cont’d)

Other terminologies:

Prosody Alterations in tone and quality Variations in emphasis, stress, pauses and pitches Impart more meaning to sentences

Co-articulation The effect of context on the sound Co-articulation transforms the phonemes into allophones.

Syntax – structure of sentences

Semantics – meaning of sentences6

Page 7: Human Computer Interaction

Speech Recognition Problems

Different people speak different: accent, intonation, stress, idioms, volume and so can all vary.

The syntax of semantically similar sentences may vary

Background noises can interfere

People often “umm…” and “errrr…”

Words not enough – semantics needed as well Requires intelligence to understand a sentence Context of the utterance often has to be known also information about the subject and speaker

E.g: Even if “Errr…I, um, don’t like this” is recognized, it is fairly useless

piece of information on it’s own7

Page 8: Human Computer Interaction

Speech recognition: Currently Useful?

Single user, limited vocabulary systems widely available e.g. computer dictionary

Open use, limited vocabulary systems can work satisfactorily e.g. some voice activated telephone systems

No general user, wide vocabulary systems are commercially successful, yet

Large potential, however When users hands are already occupied – e.g. driving,

manufacturing For users with physical disabilities Lightweight, mobile devices

8

Page 9: Human Computer Interaction

Speech Synthesis Speech synthesis: the generation of speech

Useful – natural and familiar way of receiving information

Problems – similar to recognition: prosody particularly

Additional Problems Instructive – needs headphones, or creates noise in the

workplace Transient – harder to review and browse

Successful in certain constrained applications, usually when the user is particularly motivated to overcome the problems and has few alternatives Screen reader – read the textual display to the user, utilized by

visually impaired people Warning Signals – spoken information sometimes presented to

pilots whose visual and haptic skills are already fully occupied9

Page 10: Human Computer Interaction

Non-speech Sounds

Bongs, bangs, squeaks, clicks etc. Commonly used in interfaces to provide warnings and alarms

Evidence to show they are useful Fewer typing mistakes with key clicks Video games harder without sound

Dual mode display: information presented along two different sensory channels

Allows for redundant presentation of information

Allows resolution of ambiguity in one mode through information in another

Sound especially good for transient information, and background status information language/culture independent, unlike speech10

Page 11: Human Computer Interaction

Non-speech Sounds (cont’d)

Example: Sound can be used as a redundant mode in the Apple Macintosh: almost any user action (file selection, window active, disk insert, search error, copy complete, etc.) can have a different sound associated with it.

11

Page 12: Human Computer Interaction

Handwriting Recognition

Handwriting is another communication mechanism which we are used to

Technology: Handwriting consists of complex strokes and spaces Captured by digitizing tablet – strokes transformed

to sequence of dots Large tablets available – suitable for digitizing maps and

technical drawings

Smaller devices, some incorporated thin screens to display the information e.g. PDA’s such as Palm Pilot

12

Page 13: Human Computer Interaction

Handwriting Recognition (cont’d)

RecognitionProblems

Personal differences in letter formation

Co-articulation effects

Some success for systems trained on a few users, with separated letters

Generic multi-user naturally-written text recognition systems….still some way off!!

13

Page 14: Human Computer Interaction

Auditory Icons

Use natural sounds to represent different types of object or action

Natural sounds have associated semantics which can be mapped onto similar meanings in the interaction

e.g. Throwing something away ~ the sound of smashing glass

Problem: not all things have associated meaninge.g. Copying Applications: SonicFinder for the Macintosh

14

Page 15: Human Computer Interaction

Auditory Icons (cont’d)

Items & actions on the desktop have associated sounds Folders have a papery noise

Moving files is accomplished by dragging sound

Copying – a problem

Big files have a louder sound than smaller ones

Additional information can also be presented Muffled sounds if object is obscured or action is in the

background

Use of stereo allows positional information to be added

15

Page 16: Human Computer Interaction

Text & Hypertext

Text is a common form of output, and very useful in many situations Imposes a strict linear progression on the reader The authors ideas of what is best – this may not be ideal

Hypertext structures blocks of text into a mesh or network that can be traversed in many different ways

Allows a user to follow their own path through information

Hypertext systems comprise: A number of pages, and Links, that allow one page to be accessed from another

e.g: technical manual for a photocopier All the technical words linked to their definition in a glossary Links between similar photocopiers16

Page 17: Human Computer Interaction

Hypermedia

Hypermedia systems are hypertext systems that incorporate additional media, such as illustrations, photographs, video and sound

Particularly useful for educational purposes Animation & graphics allow user to see things happen Hypertext structure allows users to explore at their own pace

Problems “lost in hypothesis” – users unsure where in the web they are

maps of the hypertext are a partial solution Incomplete coverage of information

Some routes through the hypertext miss critical chunks Difficult to print out and take away

Printed documents require a linear structure17

Page 18: Human Computer Interaction

Animation

The additional of motion to image – they change and move in time

Example: Clocks

Digital faces – seconds flick pastAnalogue face – seconds hand sweeps round constantlySalvador Dali clock – digits warp and melt into each other

CursorHourglass/watch/spinning disc indicates the systems is

busy flashing cursor indicates typing position

18

Page 19: Human Computer Interaction

Animation (cont’d)

Animation used to indicate temporally-varying information.

Useful in education and training: allow users to see things happening, as well as being interesting and entertaining images in their own right

Example: data visualisation Abrupt and smooth changes in multi-dimensional data

visualized using animated, colored surfaces

Complex molecules and their interactions more easily understood when they are rotated and viewed on the screen

19

Page 20: Human Computer Interaction

Utilising animation and video

Animation and video are potentially powerful tools

Notice the success of television and arcade games

However, the standard approaches to interface design do not take into account the full possibilities of such experience

We will probably only start to reap the full benefits from this technology when we have much more experience

We also need to learn from the masters of this new art form: interface designers will need to acquire the skills of film makers and cartoonist as well as artist and writers20

Page 21: Human Computer Interaction

Applications

Users with special needs have specialised requirements which are often well-served by multimedia and/or multimodal systems

Visual impairment – screen readers, SonicFinder

Physical disabilities (e.g. dyslexia) – speech input, output

Virtual Reality

Multimedia multimodal interaction at its most extreme, VR is the computer simulation of a world in which the user is immersed

Headsets allow user to “see” the virtual world

Gesture recognition achieved with DataGlove (lycra glove with optical sensors that measure hand and finger positions)

Eyegaze allows users to indicate direction with eyes alone

21

Page 22: Human Computer Interaction

Applications (cont’d)

examples:

VR in chemistryUsers can manipulate molecules in space, turning them

and trying to fit different ones together to understand the nature of reactions and bonding

Flight SimulatorsScreens show the “world” outside, whilst cockpit

controls are faithfully reproduced inside a hydraulically-animated box

22