polyphonic audio key finding using the spiral array ceg algorithm

18
RESEARCH BY ELAINE CHEW AND CHING-HUA CHUAN UNIVERSITY OF SOUTHERN CALIFORNIA PRESENTATION BY SEAN SWEENEY DIGIPEN INSTITUTE OF TECHNOLOGY CS 582 / APRIL 17, 2011 DR. DIMITRI VOLPER Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm

Upload: lonna

Post on 23-Feb-2016

46 views

Category:

Documents


0 download

DESCRIPTION

Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm. Research by Elaine Chew and Ching-Hua Chuan University of Southern California Presentation by Sean Sweeney DigiPen Institute of Technology CS 582 / April 17, 2011 Dr. Dimitri Volper. Presentation Flow. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm

RESEARCH BYELAINE CHEW AND CHING-HUA CHUAN

UNIVERSITY OF SOUTHERN CALIFORNIA

PRESENTATION BYSEAN SWEENEY

DIGIPEN INSTITUTE OF TECHNOLOGY

CS 582 / APRIL 17 , 2011DR. DIMITRI VOLPER

Polyphonic Audio Key Finding Using the Spiral Array CEG

Algorithm

Page 2: Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm

Presentation Flow

Musical Pitch and KeyHuman Perception of PitchThe Spiral Array Model

Pitches Chords Keys

The CEG Algorithm Algorithm Visualization

Page 3: Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm

Musical Pitch and Key

Pitch The perceived value of a tone, “Low” to “High” Psycho-acoustic (subjective) perception of Frequency

Frequency (Hz) is a scientific measurement of period

Key (Western music) Labels the “center” tone in a section of music Standard smallest interval: Semitone or “half-step” Standard pattern of semitones around “center”

Ascending: 2,2,1,2,2,2,1

Page 4: Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm

Human Perception of Pitch

Limited range of perception Typically 20Hz – 20,000Hz Range tends to decrease with age

Noticable Difference is coarser at low Hz Less distance (Hz) between lower sounds Around 1400 perceivable intervals

Certain frequency distances sound relatively close Thirds, Fifths, Octaves

Page 5: Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm

The Spiral Array Model

Page 6: Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm

The Spiral Array Model

Helical Structure

Toroidal across Octaves

Distance in 3D model approximates perceived closeness between pitch

Pitch, chord and key can all map to the same space

Page 7: Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm

Chords in the Spiral Array

Standard chords are based on three supporting tones

Create Triangles in 3D relative to the model

Triangles are effectively continuous, as pitch is

Major and Minor chords’ centers thus form helixes

Page 8: Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm

Key in the Spiral Array

Simple keys are based on three supporting chords

Creates triangles in 3D, based on supporting chords’ triangular centers

Triangles are effectively continuous, as chords are

Major and Minor keys’ centers thus form helixes

Page 9: Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm

Center of Effect

Center of Effect (CE) Relative location of a chord based on its supporting

tones

Notes of different strength change the CE location Complex chord CE’s will not line up exactly on the

model

Page 10: Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm

Center of Effect Generator (CEG) Key-Finding

Center of Effect relates position of multiple pitches in model

Spatially closest chord is most likely key Correlates input music

to standard key structure

Page 11: Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm

Helping Visualize the CEG Algorithm

Keys exist as a triangle in 3-space

Keys’ centers-of-effect make up two helixes in the 3D model

In standard intonation, keys are discrete (12 minor, 12 major)

Page 12: Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm

Helping Visualize the CEG Algorithm

From a complex audio signal, weighted values are calculated for bins on each discrete tone

The weighted values approximate the current key’s location on the model

The spatially-closest key is the most likely match

Page 13: Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm

CEG Key-Finding Algorithm

Pitch detection Extract pitch class and strength from signal

Key finding Nearest Neighbor Search in Spiral Array

Page 14: Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm

Fast Fourier Transform

Efficient algorithm to compute Discrete Fourier Transform O(n log n) vs O(n2)

Transforms function into its Frequency Domain representation

Widely used across many fields Solving Partial Differential Equations

Data Compression

Polynomial Multiplication

Spectral Analysis Frequency bands

Page 15: Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm

Algorithm for Pitch Class/Strength from FFT

For each frequency spectrum in a 0.37 second period:

1. For each frequency band find peak value2. For each pitch-class, k, and its strength at time

j: Fjk, is the sum of all peak values for that frequency band (and others related by octaves)

3. Normalize1. Divide all pitch-strength values by the largest:

2. Divide all pitch-strength values by their sum:(k = 0, 1, …, 11)

Page 16: Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm

CEG Key-Finding Algorithm

Pitch detection Extract pitch class and strength from signal

Key finding Nearest Neighbor Search in Spiral Array

Page 17: Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm

CEG Algorithm

For pitch class and strength from each 0.37 seconds:

1. Assign pitch-names to pitch classes:1. Generate CE for previous 5 seconds; and2. Assign pitch-names to current pitch-classes by

nearest neighbor search in Spiral Array Space2. Determine Key based on pitch names:

1. Generate the cumulative CE from beginning to current

2. Perform nearest-neighbor search to find closest key

Page 18: Polyphonic Audio Key Finding Using the Spiral Array CEG Algorithm

BIBLIOGRAPHY:

• Po lyphon i c Aud i o Key F ind i ng Us i ng the Sp i r a l Ar r ay C EG A lgor i thm

Chuan , C. and Chew, E .IEEE I n te r na t i ona l Con fe rence on Mu l t i med i a & Expo

2005

• Towards a Mathemat i ca l Mode l o f Tona l i t y Chew, E .Doc tor a l d i s se r ta t i on , MIT 2000

Questions?