psychology of perception and music education … · web viewif we turn to experiments concerned...

THE COGNITIVE PEDAGOGY OF AURAL TRAINING

Orlando Musumeci - Conservatorio Alberto Ginastera - Buenos Aires

“. . . to look at rather than through the lens that is shaping the materials . . . “

Jeanne Bamberger: The Mind behind the Musical Ear

INTRODUCTION

Music psychology and aural training

This paper is concerned with the ways in which research in cognition in general and in

music psychology in particular can inform the pedagogy of aural training/music theory in

conservatories and music schools. A review of the scarce literature on the topic shows no

more than just polite complaints and good wishes, both from researchers and educators, that

the psychology of music should inspire aural training, but almost nothing that might be

considered as a direct undertaking of that task. An increasing number of people within the

conservatories and schools of music is slowly becoming aware of the tremendous importance

that psychological and cognitive findings may have for music teaching (cf. Butler 1997,

Cook 1994, Covington and Lord 1994, Musumeci 1998a-b). Common sense and musical

experience strongly suggest that plenty of links should exist between both domains. Almost

everybody agree in that “taking into account some basic properties of human brain and

senses would be of much help in taking decisions concerning music pedagogy” (Rakowski

1999: 33); aural training has been equated with “brain training” (Henson 1987: 73) and both

psychologists and pedagogues readily acknowledge that “what is aural training, after all, if it

is not the acquisition of cognitive skills in music?” (Butler 1997: 47). But all the thrust

provided by this opinions vanishes when it comes to make such links explicit, and as a

consequence such endeavours, the development of a “cognitive pedagogy of music”, are late

to arrive. Psychologists are strongly reluctant, short of interest, or unable, to speak of the

didactic implications of their findings, and the literature shows scarce efforts to relate both

disciplines in an explicit manner.

The teaching of aural training in conservatories and music schools - strongly linked to the

teaching of notation - is consistently regarded by teachers and students as a challenging task.

There are lots of obscure points in the pedagogic theory that are left to be explained by

imprecise concepts like musical gifts, or the mysterious possession or absence of a “musical

ear”. Aural training courses, like the whole milieu of the music schools, are still explicitly or

1

implicitly very reliant on the myth of giftedness, and the inefficacy of the pedagogical

procedures is masked by the belief that “talent” will fill in the didactic gaps. Even at the

most elementary level nobody can be sure that he/she will pass an aural education course. It

is not like other subjects like history, language, or even mathematics: in ear training

education, no matter the effort invested, the learner is always in danger of bumping into an

insurmountable obstacle: to run short of “ear”.

In this paper I will try to put together existing findings in the psychological literature and

anecdotal evidence gathered in the conservatory classroom. Drawing on some ideas from the

research literature I will attempt to find a cognitive explanation - though perhaps somewhat

preliminary and broad - for didactic processes which I believe are effective for teaching

aural training. Many times it can be found that psychological findings indeed confirm what

particularly insightful ear training teachers noticed. We have indeed some excellent books

and pedagogic treatises (i.e. Aguilar 1978), mainly based upon musical intuitions, that many

times coincide with some principles corroborated by psychological research. In that cases the

educational implications are not foreseen but confirmed by later psychological findings. The

further gathering of empirical psychological data should be guided by a framework that

should include theoretical and practical issues pertaining to music pedagogy. Only after this

preliminary work has been performed is that further suggestions could be made about the

areas where music education can profit from past and present findings in music perception

research. I will try to show how the intuitions of the accomplished music teacher can be

explained by some extant findings in music psychology, since I am strongly convinced that

there is already large enough empirical data on music perception to allow the outline of a

broad theoretical perspective taking into account elements from both fields.

Aural training and Standard Musical Notation

The term aural training has been applied in relation to many different skills from singing

solfeo lessons, tapping rhythms, singing scales, writing down dictations, to discriminating

musical textures, forms and timbres (Henson 1987). In fact aural awareness can be seen as

an ability concomitant with any musical activity, but in this work I am concerned with the

concept of aural training focused in the teaching of Standard Musical Notation (henceforth

SMN).

The use of notation is a constitutive feature of Western music. In its origin notation was

simply a tool for remembering the tunes, a mnemonic devise that really did work: all music

prior to this century has reached us by notational means; and it is precisely that usage what

2

still mainly justifies its teaching in the conservatories: to have access to the vast “written”

repertory accumulated from the Renaissance to our days. Important as it goal may be, its

status has been erroneously exaggerated in conservatories’ academic circles to the point of

being considered the main goal of music theory courses1.

Western-tonal music grew increasingly dependent on symbols, both for compositive and

performance terms, not only for needs of recording and transmission but to achieve high

levels of complexity. If in its origin notation was only a kind of by-product of music, along

the centuries notational symbols in turn also shaped the way of thinking about music (cf.

López Puccio 1978). If we consider notation as the final output of the cognition of music, as

the formalization of abstract relations among sounds, it is made evident the usefulness of any

notation for music teaching and learning. When we put musical ideas on paper, they “hold

still” and “talk back” (Bamberger 1991: 118), and this two-way confrontation is central for

the development of musical understanding, as it was during the history of Western music.

The above arguments justify my view that music literacy - by means of any notational

system - is paramount for the study of music, and not just an accessory skill: it is essentially

“an act of reflection . . . of which features are most salient and in need of recording”. . . [a]

“post hoc consideration of a piece of music” (Serafine 1988: 37). Accordingly, we can

assume that any attempt to teach notation should start by re-constructing this sequence in the

novice music learner, guiding the reflection over the features that are prominent in his/her

representation of the music. To assess that we must consider first how the illiterate listener

listens to music.

FIGURAL AND FORMAL KNOWLEDGE

How people listen to music

Musical listening is a multidimensional experience. The physical stimulus offers us many

interacting dimensions - rhythm, melody, harmony, timbre, text - integrated in a perceptual

continuum. To this wholeness our cognition adds even more layers of “meaning”, ranging

from low level grouping strategies involving a few events to abstract relations at higher

structural levels, and also aesthetic and emotional responses:

“the sense made of phenomena is always a construction . . each of the

individuals finds in the material and thereby gives existence to aspects that

simply does not exist for the other. For the person who attends the metric aspects

of rhythm, figures [in the sense of figural knowledge] remain unrecognised; for

3

the person who attends to figures, the classification of events according to their

shared duration remains inscrutable” (Bamberger 1991: 29)

Figural knowledge, or sometimes functional, refers to that kind of global, continuous

apprehension of the musical phenomenon that gives instantaneous access to a holistic

representation of the music. Our cognition keeps track simultaneously of all music

parameters, assigning each surface event a function inside a figural representation that is

highly dependent on the surrounding context, “events within a figure are contextually

bound” (Upitis 1987: 41). Usually a minor alteration in a single dimension, or the tiniest

attempt to break down this functional relation into its different constituents, is enough to

alter or lose altogether the musical identity of the figure.

Formal knowledge or sometimes metrical, stands for a kind of music apprehension that is

completely different. It is concerned with those aspects of the musical stimulus than can be

counted, measured, classified, mostly in terms of proportional durations and frequency

ratios, and is put to play when a conventional representation of music, as in SMN, is needed.

Figural apprehension is the kind of knowledge involved in several different responses to

music - aesthetic, emotional, kinaesthetic - that involve some kind of meaning. The transit

from a figural mode to a formal one may be conceived in relative terms: any time that we

integrate a formal concept in a context, we are moving up in the hierarchy and giving rise to

a figural concept. Where structural components are involved, it could be said that figural

knowledge allows the access to a higher level hierarchy that shapes the meaning of

individual events, and viceversa, to formalise in the context of this work means to access to a

lower level in a hierarchical representation of music. However, it is not that the figural and

formal modes are mutually exclusive, nor can we imagine each one of them in a pure form.

While most of the times both figural and formal modes alternatively contribute to music

cognition, their contribution may be considered asymmetrical. If a “purely” figural approach

is in a sense limited because it has a restricted access to formal knowledge, this fact does not

diminish its musical value. On the contrary, a purely formal approach is limited in a more

important way, since no formal knowledge is musical, without the meaning added by the

figural dimension. Presumably a musically literate musician can switch from one mode of

perception to the other at will, though I suspect that there is always a mismatch between both

modes; our figural perceptions always staying beyond what we are able to formalise; we

always ‘know more than we can tell’.

The figural - formal transaction

4

The teaching of notation - certainly a complex formalization - consists in allowing the

learner to perform a fluid interplay between both modes. At the very beginning the learner

needs to switch from his/her figural representation to a formal one, a process that we call the

figural - formal transaction. The crucial point is how to integrate both dimensions in the

learner’s representation: one is intuitive, continuous and holistic, the other is rational,

discrete and analytical. When we are perceiving in a figural mode we do not hear each note

in the music, but rather “a non-linear relationship [exists] between the notes in the score and

what people hear when they listen to a performance of it” (Cook 1994: 79). The

formalization needed for notation follows exactly the opposite way: it replaces a figural

experience with a formal description, “a meaningful continuity with a meaningless

particularity” (Cole 1974, cited by Terry 1994: 104). It is this wholeness what renders so

difficult to determine exactly all the processes involved in the figural experience of music,

but certainly grouping, beat and meter abstraction, perception of regularities like pattern

alternation or repetition, segmentation, and awareness of tension-relaxation schemas can be

considered kinds of figural apprehension. One central contention of this work is that when

we “rescue” our students from that undifferentiated sea of music, when we guide the first

transactions between their figural and formal knowledge, if we fail in establishing the right

first link between both modes of representation it probably will never be easily and naturally

restored. Or otherwise, it will be restored just by some exceptional individuals who are able

to find their way alone in that tempestuous sea2.

Pedagogical implications: modelling the learner’s cognition

The formalization required for SMN necessarily needs some kind of categorisation of this

figural perceptions - to make discrete something that is naturally continuos -, and

categorisation in turn involves a finite number of categories within a given continuous

dimension. If we assume that possible figural hearings are infinite and formal

representations are finite, we conclude with the very obvious assertion that two or more

figural representations may have, depending on the level of analysis, a common formal

representation; and also the opposite, that a formal representation - no matter how detailed it

may be - will have more than only one functional representation3. This is so because the

figural experience is holistic and will be automatically affected by the context. A higher

hierarchical organisation can always be preserved with a theoretically infinite number of

surface (low level) features. Conversely, a formal construct may preserve its integrity while

5

its figural function may be affected by an infinite number of contextual, higher level frames

of interpretation. In Example 1 are exemplified two cases of formalizations coupled with

different figural interpretations. In both cases the figural interpretation involves the

consideration of a context operating at a level higher than the one at which the formalization

operates.

What puzzles the learner in his/her first contacts with SMN is the finite, relatively limited

number of concrete means of notation (musical figures) - a characteristic inherent to any

symbolisation - and the consequent infinite number of musical “faces” that that symbols may

adopt in different contexts:

“mistake[s] can be traced to a mismatch between internal mental representation

and conventional descriptions, [and so] it is useful to help students confront

these differences. In this instance, that would mean helping students to move

back and forth between metric and figural hearings and between metric and

figural descriptions of a rhythm or melody” (Bamberger 1991: 66-7).

The usual strategy of the ear training pedagogy is to present as many examples of formal

configurations as possible, each one tied to a unique figural representation. This one-to-one

relationship obscures the listener’s active role in the process of construction of these figural

meanings from a neutral notation, and consequently the student achieves, at best, a figural -

formal transaction that is rigid and inflexible. He/she finds it difficult to distinguish one

mode of representation from the other, and has problems to differentiate among the aspects

of his/her figural experience that can be reflected by notation and those that notes cannot

portray, no matter the degree of detail in the score’s specifications.

This dialogue between the specifications present in the notation and the sense that the

learner makes of it is of crucial importance and must be carefully aided. It could be said that

the role of the teacher is to model the learner’s cognition by helping to disentangle both

kinds of representations while allowing a fluid interplay between them. As few as a single

error, only a small mismatch between this two dimensions, may be catastrophic for any

further learning. The didactic strategy proposed here is in a sense opposed to the traditional

one: it consists in presenting the learner with as many different figural representations as

possible that a single formal representation can stand for.

To learn what is already known

6

How can extant psychological studies inform an ear training pedagogy guided by the

conceptual framework outlined so far? One assumption of this approach to the teaching of

notation is that formal knowledge can only be constructed from figural knowledge. In this

conception the figural mode is not an imperfect or preliminary form of knowledge but a

foundation and in turn also a goal of the formal one. If we found that every member of a

culture is able to construct a figural representation of the music, that would imply that every

person within the range of standard perceptual and cognitive abilities should be able to learn

to read and write music.

Cultured adults process automatically enormous amounts of musical information. The

exposure to the music of the culture, together with innate cognitive machinery to structure

the environment, originates an ‘implicit knowledge’ that can be found in most people. This

ability consists mainly in grouping musical sounds and in giving them meaning inside a

culture-driven pattern (cf. Dowling 1999). This musical competence is relevant here because

it seems to involve similar processes in both musicians and no musicians. If we turn to

experiments concerned with some of the different processes involved - like grouping (cf.

Deliège 1987), the perception of tensing-relaxing patterns (cf. Bigand 1993) and of

underlying hierarchic structures (cf. Serafine, Glassman and Overbeeke 1989) - we find that

naive listeners indeed show a performance very similar to that of musicians. It seems that

musical training does not affect substantially the ability of normal adults to respond to the

figural aspects of the music, but rather that “music perception is fundamentally similar in

listeners with varying degrees of sophistication” (Trehub, Schellenberg & Hill 1997: 104).

There is also enough evidence to assume that the processes involved in this music

competence develop gradually from the pre-natal stage (cf. Lecanuet 1996) and through

childhood as a result of normal exposure to music. Infants group auditory information

according to Gestalt principles of proximity and similarity in ways similar to those used by

adults (Demany 1982; Fassbender 1993; Thorpe and Trehub 1989; Thorpe, Trehub &

Morrongiello 1988), can discriminate between different rhythmic and melodic patterns

(Chang and Trehub 1977a-b), and are sensitive to phrase structure (Krumhansl & Jusczyc

1990; Jusczyc & Krumhansl 1993). These and other cognitive processes involved in the

figural apprehension of music - like the understanding of tonal closure and harmonic

relationships - develop steadily through childhood and “are generally well in place in human

cognition by the age of 10 or 11 years” (Serafine 1988: 224).

This challenges the traditional conception of the aural training pedagogy that treats the

learner as if his/her first contact with the music took place at the conservatory classroom. If

the music theory teacher assumes a valuable pre-existing musical knowledge in the learner,

7

the process of formalization will consist simply in learning “what he/she already knows”

(Bamberger 1991: 259). The transition from figural to formal will consist simply in putting

names to “things” that the novice already perceives and notices. In that sense the job of the

teacher is not to teach but just to show.

When a person listens to a given piece she recognises - understands - several

characteristics of the music in a straightforward manner. The cruder, most basic

formalization of that understanding is the discrimination between that precise piece and

another, even if she could not rationalise the difference. In that case she “knows more than

she can tell”. In a very direct way that situation of the novice learner differs only in degree

from the experience that any musician knows: to listen to some sounds or music passage, to

perceive and understand the function, the “meaning”, but not being able to write it down or

to play it in the piano: “that sounds like a cadence but I don’t know exactly which the chords

or the inversions are”. Musicians and aural training teachers, no matter their audio-analytic

sophistication, continuously are faced with passages, chords, musical relations that they

“understand”, but are not able to formalise.

But if the average listener can indeed make all the subtle discriminations involved in the

“musical” apprehension of sounds, why is it that not everybody can learn to formalise this

knowledge in the form of SMN? I argue that when people fail to understand the basis of

music notation it is because aural training didactics fails when coming to formalise their pre-

existing musical knowledge: the “units of perception” do not match the “units of

description” (cf. Bamberger 1991: 8).

One of the central contentions of this work is that with the adequate teaching almost

every musically cultured adult should be able to learn the basics of music reading and

writing. We only need to find means in which the learner could profit from what he/she

already knows. If they do not find bonds between their percepts and the formalizations

required for notation the music teacher will simply be, as the saying goes, “answering

questions that the learners have never asked themselves” and that knowledge will never

become operative. I propose that the best way to establish links between figural and formal

percepts in music is to draw on the extant figural knowledge of the learner and to exploit any

previously acquired abilities in general auditory pattern processing. In the following section

I present some brief comments and suggestions of how a didactic approach to the teaching of

rhythm could take into account that premises.

A DIDACTIC APPROACH TO RHYTHMIC TEACHING

8

The key didactic issue is to find out the kind of formal description of rhythm that the

novice can more easily access, the one that is closer to his/her figural description. The

crucial question for the aural training teacher is how to lead the learner’s rhythmic

processing beyond the basic cognitive mechanisms like regularity extraction and

segmentation into groups (cf. Drake 1998). What is the next step that the learner should take

towards a formalization of his/her perceptions of uniformity and change in the rhythmic

flow? At this point the usual focus in ear training courses is to start from the metric aspects

of rhythm. Traditional pedagogy has assumed that, given that everyone can perceive an

underlying beat, they can easily use it as a temporal ruler to measure the length of events

that group over the pulsed field. But, as we have seen, that is an “after the fact” reasoning

that assumes a metrical “skill” inexistent in the learner. I argue that the next stage in the

formalization of rhythmic perception should rely not on the metric aspects of groups of

events, but on the functional relations among them. Now let’s examine what are the

functional relations in a group than can be more easily formalised (I use the word functional

instead of figural because it reflects more accurately the idea of relatedness of one event to

other, or others, in a group).

Speech and music prosody

The grouping of events - a functional perception - takes place in the framework of “a

regular pattern of strong and weak beats to which he [the listener] relates the actual musical

sounds” (Lerdahl & Jackendoff 1983: 12) called metrical hierarchy. It seems evident that the

first novices’ attempts at the formalization of rhythmic groups should be more conveniently

done at a single level of the metrical hierarchy and without loading on long term memory,

that is, weak beats grouped around strong ones, and precisely speech prosody offers an

example of that kind of grouping. Since most music in the Western tradition can be

interpreted in terms of figural units that can be described in prosodic terms, I propose that

the figural-formal transaction should start by the formalization of rhythmic groups in terms

of prosodic units.

Prosody is the overall acoustical profile of a spoken/chanted utterance that results from

the organisation of weak discrete elements - syllables - around stressed ones; the term stress

designates perceptual salience achieved by any mean like dynamic changes, lengthening of

sounds, and pause, etc. In speech comprehension language prosody plays a paramount role

(cf. Cutler, Dahan & Donselaar 1997, Pynte 1998), and the same is true for music. We can

imagine the musical parallel in the case when we chant a melody in a monotone, at half way

9

between speech and singing. In the absence of accurate melodic or rhythmic information, we

can still keep track of its structure relying solely in the information carried by the prosodic

profile of the output: almost every music can be reproduced, or mimicked, with variable

degrees of precision by the human vocal apparatus.

Both speech utterances and music involve the perception of auditory patterns in the

framework of sequential and hierarchical cognitive processing, where continuous sound

strings are analysed into discrete phonemes or notes. For language and music reading the

opposite way is followed: a symbol string is decoded to produce a spoken/sung/performed

output. This continuous strings of syllables or sounds, as they unfold in time, are structured

around metrical stresses that determine the prosodic profile of the phrase. The similar nature

of constituents marking in speech and music (cf. Carlson, Friberg, Frydén, Granström. &

Sundberg, 1989) indeed suggests the existence of some shared cognitive mechanisms for

grouping in language and music (cf. Pinker 1997: 535, Fassbender 1996: 80)4.

The first didactic work with prosody must start with the distinction between stressed and

unstressed events in a sound string. This is not a trivial point, since we have seen that in the

multidimensionality of the listener’s experience it may be difficult to keep track of the

salience of events along a single dimension. But the formal processing of speech sentences -

a well learned process in most schooled adults - constitutes a clear example of the grouping

of events (syllables) along a single temporal dimension. The syllable is the basic rhythmic

unit of Spanish, and the prosodic profile of an utterance is shaped by the way in which weak

syllables group around stressed ones. One of the most characteristic features of the spoken

language is that all the syllables in a rhythmic group, whether stressed or unstressed, tend to

follow each other at more or less evenly spaced intervals of time5. The identification of

stressed syllables in words and sentences may be used as a common cognitive framework for

the processing of musical events. By splitting words into syllables, and imposing a steady

beat, we can bring easily our students to the perception of the prosodic profile of an

utterance, and to transfer it to SMN. In a typical didactic sequence we can ask the student:

to recite a sentence following a steady beat and assigning one syllable to each beat

to find the stressed syllables that organise the meter

to place bars before the stressed syllables, and finally

to transfer to music notation the metrical structure of the utterance - establishing a

correspondence between strong and weak syllables and strong and weak musical beats - by

replacing the syllables by any conventional music symbol (Example 2).

Once the metrical structure is established, the further identification and description of

prosodic constituents follows. Each prosodic unit, or foot, is organised around a metrical

10

stress. The formalization of this feet in the learner’s representation must consist in the

distinction between the different ways in which syllables/events organise around this stress.

Since the same metrical structure allows several prosodic interpretations, depending on the

grouping, by interchanging different words to match a given metrical structure, the different

groups/feet that unstressed syllables/events may form around metrical stresses can be made

evident for the learner (Example 3)6.

In a typical analytic-reading activity the learner will be asked to search for all the possible

different figural interpretations, that is, all possible prosodic feet that could be formed,

following the rule that each of them must contain one, and no more than one, metrical stress

(or head). In that way, for instance, a 4 bar rhythmic phrase will present 4 different groups,

each “wrapped” around a head. The feet can be kept similar among themselves, within the

possibilities of the phrase structure, or different feet may be combined (Examples 4 and 5).

The concept of duration

People is not very good at comparing durations because the formal operation involved in

comparison of temporal lengths is strongly influenced by the figural organisation of the

events (i.e., the detection of the lengthenings of events between groups are “harder to detect

and identify than similar lengthenings in other positions within the group” [Drake 1998: 18;

cf. also Fitzgibbons, Pollatsek & Thomas 1974]). Usually a notated duration stands not for

the metrical length of an event, but mainly for the rate of attacks between events that in turn

will determine its figural grouping. From experimental data and musical experience it is well

known that “musically calibrated durations [are not] the fundamental processing units of

rhythm” (Serafine 1988: 60). Indeed notational figures used in rhythmic SMN are very

limited even at specifying relative durations: a crude arithmetic relation of twice as much, or

half, does not reflect the timing of even the simplest tune when it is performed musically.

More important, a tune can be perfectly recognised when the absolute durations of the events

are changed (as with changing articulations) provided that the attack points are preserved. A

primary role of rhythmic notation is not to specify the exact relative durations of events, but

rather the relative time spans between attacks, leaving the performer an “enormous latitude

[for] . . . the manner (legato-staccato) of filling them with sounds” (Monahan 1993: 127).

Even when absolute durations have an important role in determining grouping behaviour,

that comes as a mere consequence, or by-product, of the change in the rate of attacks (cf.

Povel & Okkerman 1981).

11

In spite of this, most music syllabus approach the issue of durations from the stand point

of the durations of events. In conservatory courses all music figures (crochets, quavers,

semiquavers, etc.) are “taught” fairly quickly - usually in the first year of formal training (cf.

Musumeci 1998b) - relying in the common sense assumption that for teaching relative

durations as those represented by rhythmic notation, one needs at least two different

durations to compare between each other. But what that conceptions promote is the close

(and erroneous) association - in the learners representation of the event - of the different

dimensions of event attack point and event duration. In the teaching of notation the

distinction between this two dimensions must be carefully aided. If it is not so, the learner

will not understand what a notational figure - i. e. a quaver - stands for (it is usual to be

asked by first year students “how long exactly is a quarter note”).

It is for that reason that I propose to avoid the concept of figures’ duration at the

beginning of rhythmic notation teaching. In the examples presented so far it should be noted

that the syllables/quavers stand for isochronous beats in which events are attacked, with not

special concern with its duration. However, when chanting a rhythmic phrase students are

encouraged to perform so many subtle deviations in absolute and relative durations as

necessary for achieving the “sense” of the phrase. When chanting “like speaking” all

variations in timing (as well as in loudness, pause and pitch like in speech prosody) are

performed intuitively under the illusion of beat’s isochrony. By eliminating the factor of

figure’s durations the learner can concentrate in the “important differences between notes

that look the same on the page” (Bamberger 1991: 39). The simple figural percepts that arise

from different groupings at a single level (Example 6) - like if the quaver marked * can be

thought of as “forward looking” or “backward looking” - are basic and underlie all further

perceptual organisation involving different rates in the attacks of events, and in consequence

their formalization must precede the work with the different proportional durations

represented by the whole range of conventional musical figures.

But rhythmic teaching must also go on to help the grasping of that groupings that do

depend in gross variations in the rate of appearance of events, like the relations expressed by

conventional figures. This relations are indeed very few and simple: patterns composed of a

few events with attack points holding relations multiples of two and three, and further

rhythmic complexity is achieved through a hierarchical arrangement of these basic

“rhythmic contours” (Monahan 1993: 127). In traditional aural-training settings the

confusion between absolute and relative duration is perpetuated by means of the explicit

taxonomy of contents, that presents as different several groupings-of-figures that have in fact

a common figural-grouping when transposed to another metrical hierarchy (i.e., the groups

12

half note - two crochets and crochet - two quavers are metrically (formally) different but

figurally identical.

The conflict arising from this ambiguity surely is one of the factors responsible for the

mismatch between novices’ figural and formal representations mentioned in previous

sections. I argue that the formalization of these hierarchical transpositions should come only

after the basic relationships of rhythmic contours have been grasped at a single metric level.

By introducing whole-beat rests to our ‘elemental’ prosodic phrases we can convey a whole

range of time-span proportions without the burden of introducing different durational

symbols. Since the goal is not to teach relations among figures but among time spans, this

method allows the learner to grasp the concept of duration as an abstract relation between

time spans, rather than a property of figures themselves. A whole range of rhythmic contours

can be conveyed without using more than just one metrical level (Example 7). Only after the

formalization of this rhythmic contours is that the “transposition” to other hierarchical levels

symbolised by the different range of durational figures makes sense.

A COGNITIVE PEDAGOGY OF AURAL TRAINING

The kind of training proposed so far has not the traditional goal of just teaching to read

rhythmic notation, but rather that the student should become an analyst of the notation to

find which factors will lead his/her cognition to reverse the process of formalization and

give rise to figural concepts. In a former section I argued that traditional teaching presents

formal descriptions (notations) tied to single figural interpretations, usually forgetting that

the sense made of music is not determined completely by the score, but rather is govern by

several factors with variable strengths and that may be in concordance or in conflict. Each

passage of music may have different preferred hearings - or figural interpretations. Lerdahl

and Jackendoff (1983) formalised the interplay of these factors in a set of preference rules

that establish “not inflexible decisions about structure, but relative preferences among a

number of logically possible analysis” (p. 42). It is paramount to show the learner how this

several preferred hearings originate from a single formal representation.

I consider a useful pedagogic tool for aural training classes to encourage the identification

of all “logically possible analysis” - in terms of prosodic feet - that may be performed over a

rhythmic surface. The simplest rhythmic phrase may be a source of enlightenment regarding

exactly what characteristics of musical understanding are reflected by conventional notation

and what others are ignored (Example 8).

13

By constructing and deconstructing the different figural representations corresponding to

different preferred hearings learners may gain a conscious access to the factors that govern

the formation of that representations. This factors, like Gestalt principles of temporal

proximity, similarity, and good continuation, and abstract concepts like balance between

phrases, symmetry, and motivic repetition, must be made evident to the students from the

beginning of teaching. This elemental, crudely stated, “well formedness rules” are perfectly

accessible to the music novice and constitute a basic premise for all further learning. The

educative power of the concept of preferences among competing organising principles

cannot be overestimated. Such concept does not play an accessory role in music reading but

on the contrary, good music reading consists in evaluating in real time the odds of each

different musical figure to arise from the current notation. This ability to foreseen the

chances of alternative groupings must not be considered just one more didactic devise to

gain proficiency in note reading but a fundamental component of musical understanding.

From this standpoint the ambiguity inherent to music notation represents not a marginal

effect of an inefficient system, but rather the space left for a “cognitive” interpreter.

Rhythmic complexity in most music is achieved by competing simple preferences at

multiple levels. But, regrettably, the prevailing didactic approach to rhythmic training in

conservatories and music schools indeed reflects a noteworthy lack of contact with the

principles of structural organisation that are at the heart of “musical intuitions”. I argument

that this lack of musicality is the consequence of confounding structural complexity with

sheer difficulty, confusion that in turn reveals a profound ignorance of the ways in which

human cognition “composes” and in turn interprets music pieces. If it is possible “to explain

artistically interesting aspects of musical structure in terms of principles that account for

simpler musical phenomena” (Lerdahl & Jackendoff 1983: 2-3), that imply that artistic

music teaching should always remain close to that simpler phenomena: “structural simples”

(Bamberger 1991) like Gestalt principles, temporal symmetry or balance, alternation

between stability and tension, and so forth.

CONCLUDING REMARKS

Aural training pedagogues must take into account what is now widely accepted in music

cognition literature: that artistic content - or musical meaning - is not an unexplainable

achievement of humans, but rather the output of very refined cognitive processes that are

common to the species and that are revealed by the widespread receptive music competence

14

of people. What that implies is that formal music learning - involving SMN or not - is

accessible to any person with standard cognitive abilities and exposure to music. The lack of

“ear” can be seen only as the failure to develop spontaneously the links between the inborn

and acquired cognitive processes underlying musical understanding and a formal system of

description. Keeping this in mind would reverse the charge of responsibility for learning that

we found in typical music settings, where the success of the learner mainly depends on his

ability, or talent, to pick up the musical knowledge provided by the teacher. A failure to do

so is considered most of the time as beyond the reach of music pedagogy. On the contrary, a

“cognitive teaching” would be attentive to the way in which learners make sense of the

musical phenomenon, and their success would heavily depend on how that implicit

knowledge is didactically developed - transformed from figural into formal - by the teacher.

What can be done to improve the communication between pedagogues and

psychologists?, and who can do it? Within the great diversity of lines and approaches, it

could be said that researchers in perception have done their job well. They have been

improving their methods, technologies and - more important - their theoretical frameworks

steadily during the last three decades, and there is a growing concern with achieving a

musical significance in their works. Perhaps music educators would be grateful if music

psychologists focused more on the evolutive aspects of the phenomena that they investigate,

and indeed it is very rare to find in the academic papers a explicit reference to the

transference of their findings to music pedagogy. But with regard to the teaching profession,

I am inclined to think that after all psychological findings are there waiting for music

educators to make sense of them. If somehow these findings may be too many for the

professional teacher to handle, it seems useful to outline a framework - articulating

pragmatic and theoretical concerns - inside which it could be assessed the relevance of the

psychological data for pedagogic ends. This work is an attempt to foreseen how that kind of

synthetic framework could enrich professional music teaching.

REFERENCES

Aguilar, M. C. (1978). Método para leer y escribir música a partir de la percepción. Bs.

As.: María del Carmen Aguilar Ed.

Bamberger, J. (1991). The Mind behind the Musical Ear. Cambridge, MA: Harvard

University Press.

15

Bigand, E. (1993). The influence of implicit harmony, rhythm and musical training on the

abstraction of “tension - relaxation schemas” in tonal musical phrases.

Contemporary Music Review, 9 (1-2), 123-37.

Butler, D. (1997). Why the Gulf Between Music Perception Research and Aural Training?

Bulletin of the Council for Research in Music Education, 123, 38-48.

Carlson, R., Friberg, A., Frydén, L., Granström, B. & Sundberg, J. (1989). Speech and

music performance: parallels and contrasts. Contemporary Music Review, 4, 389-402.

Chang, H. W. & Trehub, S. E. (1977a). Auditory processing of relational information by

young infants. Journal of Experimental Child Psychology, 24, 324-31.

Chang, H. W. & Trehub, S. E. (1977b). Infants’ perception of temporal grouping in auditory

patterns. Child Development, 48, 1666-70.

Clarke, E. F. (1988). Generative principles in music performance. In J. A. Sloboda (Ed.)

Generative Processes in Music, pp. 1-26. Oxford: Clarendon Press

Cole, H. (1974). Sound and Signs, aspects of music notation. London: Oxford University

Press.

Cook, N. (1994). Perception: A Perspective from Music Theory. In R. Aiello and J. Sloboda

(Eds.) Musical Perceptions, pp. 64-95. Oxford: Oxford University Press.

Covington, K & Lord, C. H. (1994). Epistemology and Procedure in Aural Training: In

Search of a Unification of Music Cognitive Theory with its Applications. Music

Theory Spectrum, 16 (2), 159-70.

Cutler, A., Dahan, D. & Donselaar, W. van (1997). Prosody in the Comprehension of

Spoken Language: A Literature Review. Language and Speech, 40 (2), 141-201.

Deliège, I. (1987). Grouping Conditions in Listening to Music: An Approach to Lerdahl &

Jackendoff’s Grouping Preference Rules. Music Perception, 4 (4), 325-60.

Demany, L. (1982). Auditory stream segregation in infancy. Infant Behaviour and

Development, 5, 261-76.

Dowling, W. J. (1999). The Development of Music Perception and Cognition. In D. Deutsch

(Ed.) The Psychology of Music [1st. ed. 1982], pp. 603-25. San Diego: Academic

Press.

Drake, C. (1998). Psychological Processes Involved in the Temporal Organisation of

Complex Auditory Sequences: Universal and Acquired Processes. Music Perception,

16 (1), 11-26.

Fassbender, C. (1993). Auditory grouping and segregation processes in infancy. Kaste

Verlag, Nordestedt.

16

Fassbender, C. (1996). Infant’s auditory sensitivity towards acoustic parameters of speech

and music. In I. Deliège and J. A. Sloboda (Eds.) Musical Beginnings, pp. 56-87.

Oxford: Oxford University Press.

Fitzgibbons, P. J., Pollatsek, A. & Thomas, I. B. (1974). Detection of temporal gaps within

and between perceptual tonal groups. Perception & Psychophysics, 16, 522-8.

Gérard, C. & Auxiette, C. (1992). The Processing of Musical Prosody by Musical and

Nonmusical Children. Music Perception, 10 (1), 93-126.

Henson, M. (1987). Musical Awareness. Huddersfield: Music Department of Huddersfield

Polytechnic.

Jusczyc, P. W. & Krumhansl, C. L. (1993). Pitch and rhythm patterns affecting infant’s

sensitivity to musical phrase structure. Journal of Experimental Psychology: Human

Perception and Performance, 19, 627-40.

Krumhansl, C. L. & Jusczyc, P. W. (1990). Infant’s perception of phrase structure in music.

Psychological Science, 1, 70-3.

Lecanuet, J-P. (1996). Prenatal auditory experience. In I. Deliège and J. A. Sloboda (Eds.)

Musical Beginnings, pp. 3-34. Oxford: Oxford University Press.

Lerdahl, F. and Jackendoff, R. (1983). A generative theory of tonal music. Cambridge, MA:

MIT Press.

López Puccio, C. (1978). Prólogo. In María del Carmen Aguilar (Author) Método para leer

y escribir música a partir de la percepción. Bs. As.: María del Carmen Aguilar Ed.

Monahan, C. B. (1993). Parallels Between Pitch and Time and How They Go Together. In

Thomas J. Tighe and W. Jay Dowling (Eds.) Psychology and Music: The

Understanding of Melody and Rhythm, pp. 121-54. Hillsdale, NJ: Lawrence Erlbaum

Associates.

Musumeci, O. (1998a). Should we change conservatory to “renovatory”? Proceedings of the

1998 ISME Seminar on the Education of the Professional Musician. Harare: ISME.

Musumeci, O. (1998b). La didáctica de los cursos de teoría y solfeo: el material musical

(The didactics of theory and solfeo courses: the musical material). Boletín del CIEM.

12, 47-51.

Papousek, M. (1987). Melodies in motherese in tonal and nontonal languages: Mandarin

Chinese, Caucasian American, and German. Presentation at the Ninth Biennial

Meeting of the International Society for the Study of Behavioural Development,

Tokyo, Japan, July 1987.

Pinker, S. (1997). How the Mind Works. Allen Lane The Penguin Press.

17

Povel, D. J. & Okkerman, H. (1981). Accents in equitone sequences. Perception &

Psychophysics, 30, 565-72.

Pynte, J. (1998). The Role of Prosody in Semantic Interpretation. Music Perception, 16 (1),

79-98.

Rakowski, A. (1999). Perceptual dimensions of pitch and their appearance in the

phonological system of music. Musicae Scientiae¸ 3 (1), 23-39.

Serafine, M. L. (1988). Music as Cognition. New York: Columbia University Press.

Serafine, M. L., Glassman, N. & Overbeeke, C. (1989). The Cognitive Reality of Hierarchic

Structure in Music. Music Perception, 6 (4), 397-430.

Terry, P. (1994). Musical Notation in Secondary Education: Some Aspects of Theory and

Practice. British Journal of Music Education, 11, 99-111.

Thorpe, L. A. & Trehub, S. E. (1989). Duration illusion and auditory grouping in infancy.

Developmental Psychology, 25, 122-7.

Thorpe, L. A., Trehub, S. E., Morrongiello, B. A. & Bull, D. (1988). Perceptual grouping by

infants and preschool children. Developmental Psychology, 24, 484-91.

Trehub, S., Schellenberg, E. & Hill, D. (1997). The origins of music perception and

cognition: A developmental perspective. In I. Deliège and J. A. Sloboda (Eds.)

Perception and Cognition of Music, pp. 103-28. Hove: Psychology Press.

Upitis, R. (1987). Children’s Understanding of Rhythm: The Relationship between

Development and Music Training. Psychomusicology, 7 (1), 41-60.

18

Example 1

FORMALIZATION DIFFERENT FIGURAL INTERPRETATIONS

three events stress context

three heights articulation context

Example 2Es - ta ca - si - ta tie - ne ven - ta - nas

| | | | | | | | | | a steady beat

Es - ta ca - si - ta tie - ne ven - ta - nas

| | | | | | | | | |

Es - ta ca - si - ta tie - ne ven - ta - nas

| | | | | | | | | |

Example 3Es - ta ca - si - ta tie - ne ven - ta - nas

1 |________| |_______________| |_________| |________________|

Un ju - ga - dor ca yó des - ma - ya - do

2 |___| |________________| |_________| |______________________|

Ár - bo - les al - tos pó - da - los to - dos

3 |________________| |_________| |_________________| |__________|

Example 4

|_______| |_________| |_________| |____________|

|_________| |_________| |_________| |_________|

|____________| |_________| |_________| |_______|

|_______________| |________| |_______| |____|

Example 5

|_______| |___________| |_______| |____________|

|_________| |_______| |___________| |__________|

|____________| |____| |______________| |________|

Example 6 *

|____| |_______|

|_| |__________|

Example 7

= or

=

Example 8

1 The teaching of notation as a hard and fast rule for decoding a score, specially for instrumental reading, perhaps enables the musician to perform some very complex tasks from an orchestral stand, but if acquired at the expense of other music cognitive abilities - as it usually happens - it turns the music reader into a merely mechanistic “note reader”. The score should not be considered a mere prescription of the notes to be played or sang, but rather a description of the music that needs the interpretation of human cognition. The word description involves an active participation of the performer - or singer - who makes his/her own abstract representation of the music from the myriad of symbols in the score.

2 This figural - formal transaction is also paramount for instrumental performance. For a performer it is very important to keep in mind the figural aspect of music notation, because when reading a score he/she will have to reverse the process: a performance is the enactment of figural designs. When a poor sight reader reads a score for the first time, he/she is performing formal operations like counting, labelling, etc. Later the automation of this behaviour will help he/she to achieve a musical rendition, that will consist in the integration of that surface features in successively higher level arrangements, from motives and phrases to the whole piece (cf. Clarke 1988). A good sight reader who can make musical sense of a score the first time he/she reads it is in a sense performing the transaction formal - figural at a better rate. He/she is accessing alternatively, in quick succession, to different hierarchical levels of knowledge.

3 This procedure is at the core of any attempt to describe and analyse music. Its limitation is at the same time its strength. Any attempt to describe music needs a symbol system, discrete and finite (be it SMN or not) that could stand for, or represent, infinite musical relations. ¿Could a notation be devised with an increased number of categories? ¿Could a system be developed where, for instance, a specific pitch - that has a unique formal representation in SMN - may have many formal representations depending on its figural function within the musical context? In view of the success of Western written music to be transmitted by notation, we may at first think that the number of categories used for formalization in SMN has been optimal, but of course this is an issue like the chicken-and-egg argument.

4 It has been proposed that motherese - the infant-directed speech - exhibits universal prosodic features that are not only brought about by cultural tradition and education but also by genetic predisposition (Papousek 1987). The fact that “there is considerable similarity across languages in the prosodic correlates of utterance position” (Cutler et al. 1997: 159) suggests that the proverbial “universality” of musical language may lie in its prosody-dependence.

5 “The subjective perception of isochrony between spoken sounds correspond[s] . . . neither to equal interonset intervals, nor to equal durations of the sounds and/or silences between them, but to equal intervals between internal points of the syllables which play the role of ‘gravity centres’. . . isochrony in any language is not determined by acoustical properties of speech but by the isochrony illusion of the perceiver” (Gérard & Auxiette 1992: 101). Spanish speakers tend to assign a regular beat to each syllable, a type of timing that is called “syllable-timed”, in contrast to English language that is “stress-timed”, that is, not all the syllables but only the stressed ones tend to be evenly spaced in time. In syllable-timed languages “the metric organisation of music can be paralleled with the eurythmic principle in speech: in the two domains, small cells . . . are signalled by accents and concatenated in longer strings” (Gérard & Auxiette 1992: 100).

6 Words are specially relevant for this purpose because in Spanish each polysyllabic word is a foot with a nucleus (the accented syllable) and different unaccented components that may act as upbeats (co - ra - zón), afterbeats (tém - pa - no) or a combination of both (ca - lien - te).

|_| |___________| |__________| |___________|

|_____| |__________| |__________| |_______|

psychology of perception and music education … · web viewif we turn to experiments concerned...

Documents