identifying the classical music composition of an unknown performance with wavelet dispersion vector...

17
Identifying the Classical Music Composition of an Unknown Performance with Wavelet Dispersion Vector and Neural Nets V. CONTENT DESCRIPTION WITH WAVELETS

Upload: francisca-velazquez-valenzuela

Post on 02-Feb-2016

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Identifying the Classical Music Composition of an Unknown Performance with Wavelet Dispersion Vector and Neural Nets V. CONTENT DESCRIPTION WITH WAVELETS

Identifying the Classical Music Composition

of an Unknown Performancewith Wavelet Dispersion Vector

andNeural Nets

V. CONTENT DESCRIPTION WITH WAVELETS

Page 2: Identifying the Classical Music Composition of an Unknown Performance with Wavelet Dispersion Vector and Neural Nets V. CONTENT DESCRIPTION WITH WAVELETS

CONTENT DESCRIPTION WITH WAVELETS

Representación compacta que permita la búsqueda y recuperación eficientes de la información deseada.•Extracción del contenido de la información•Los datos extraídos deben permitir una generalización de la información

Se considera el movimiento 4 de Sonata No. 1 registrado por Y. Menuhin y N. Milstein.

La descripción de los datos permitirá una medida de la semejanza que identifique la composición audio.

Page 3: Identifying the Classical Music Composition of an Unknown Performance with Wavelet Dispersion Vector and Neural Nets V. CONTENT DESCRIPTION WITH WAVELETS

CONTENT DESCRIPTION WITH WAVELETSEn este capítulo se presentan dos procedimientos para la extracción de información:•Descriptor de envolvente•A través de herramientas estadísticas

A. Gaussian Wavelet Envelope Descriptor: Descripción exacta del contenido de audio teniendo en cuenta una posible generalización del mismo a través de la estimación de la energía media de los coeficientes

N = 320, f=8Khz, T=40ms

Se fijan todos los valores de la energía más bajos que un umbral t a cero

Se elegido un valor de umbral limitador t de 0,05 que representa la intensidad de coeficientes apenas visibles.

Page 4: Identifying the Classical Music Composition of an Unknown Performance with Wavelet Dispersion Vector and Neural Nets V. CONTENT DESCRIPTION WITH WAVELETS

CONTENT DESCRIPTION WITH WAVELETS

Se petrende que en motorde la búsqueda pueda identificar la pieza midiendo la semejanzaentre estas dos funciones.

Page 5: Identifying the Classical Music Composition of an Unknown Performance with Wavelet Dispersion Vector and Neural Nets V. CONTENT DESCRIPTION WITH WAVELETS

CONTENT DESCRIPTION WITH WAVELETS

Se propone un algoritmo de sincronización

El algoritmo corrige retardos de tiempo en un archivo de voz distorsionado para obtener una correlación máxima con referencia al archivo.

Este algoritmo tendría que ser empleado para sincronizar todas las funciones en la base de datos del motor de búsqueda con la función de la pregunta de usuario. De tal modo, las funciones en la base de datos son corregidas insertando o suprimiendo muestras para obtener una semejanza máxima a la función del usuario.

Durante el procedimiento sincronización el algoritmo mide finalmente la correlación obtenida con relación con una función de referencia.

Page 6: Identifying the Classical Music Composition of an Unknown Performance with Wavelet Dispersion Vector and Neural Nets V. CONTENT DESCRIPTION WITH WAVELETS

CONTENT DESCRIPTION WITH WAVELETSPara obtener una buena representación de las señales se empleaUna función gausiana:

Page 7: Identifying the Classical Music Composition of an Unknown Performance with Wavelet Dispersion Vector and Neural Nets V. CONTENT DESCRIPTION WITH WAVELETS

CONTENT DESCRIPTION WITH WAVELETS

B. Statistical Wavelet Analysis for Content Description

The first set (set I) contains three different recordings of the firstmovement of Partita 3:1) Partita 3, movement i, Y. Menuhin (Pa3iMen57)2) Partita 3, movement i, J. Heifetz (Pa3iHei52)3) Partita 3, movement i, N. Milstein (Pa3iMil73)The second set (set II) contains the movements ii, iii, and iv of Partita 3, each one recorded by a different player:1) Partita 3, movement ii, Y. Menuhin (Pa3iiMen57)2) Partita 3, movement iii, J. Heifetz (Pa3iiiHei52)3) Partita 3, movement iv, N. Milstein (Pa3ivMil73)

Page 8: Identifying the Classical Music Composition of an Unknown Performance with Wavelet Dispersion Vector and Neural Nets V. CONTENT DESCRIPTION WITH WAVELETS

CONTENT DESCRIPTION WITH WAVELETS

Herramientas Estadísticas

Page 9: Identifying the Classical Music Composition of an Unknown Performance with Wavelet Dispersion Vector and Neural Nets V. CONTENT DESCRIPTION WITH WAVELETS

CONTENT DESCRIPTION WITH WAVELETS

Page 10: Identifying the Classical Music Composition of an Unknown Performance with Wavelet Dispersion Vector and Neural Nets V. CONTENT DESCRIPTION WITH WAVELETS

CONTENT DESCRIPTION WITH WAVELETS

Medida de Escala de frecuencia

Page 11: Identifying the Classical Music Composition of an Unknown Performance with Wavelet Dispersion Vector and Neural Nets V. CONTENT DESCRIPTION WITH WAVELETS

CONTENT DESCRIPTION WITH WAVELETS

Porcentaje de correlación

Page 12: Identifying the Classical Music Composition of an Unknown Performance with Wavelet Dispersion Vector and Neural Nets V. CONTENT DESCRIPTION WITH WAVELETS

Identifying the Classical Music Composition

of an Unknown PerformancewithWavelet Dispersion Vector

andNeural Nets

VI. A NOVEL WAVELET DISPERSION MEASURE

Page 13: Identifying the Classical Music Composition of an Unknown Performance with Wavelet Dispersion Vector and Neural Nets V. CONTENT DESCRIPTION WITH WAVELETS

A NOVEL WAVELET DISPERSION MEASURE

A. Wavelet Dispersion Classifier

El vector de dispersión wavelet permite la extracción de características especiales de un archivo audio 8 Khz5 segundos18 escalas

Los coeficientes obtenidos se almacenan en una matriz de 18 x (5 x 8000). Para cada escala (representada por una fila en la matriz de coeficientes)

Ejemplo:Sea C r la matriz de coeficiente s

Page 14: Identifying the Classical Music Composition of an Unknown Performance with Wavelet Dispersion Vector and Neural Nets V. CONTENT DESCRIPTION WITH WAVELETS

A NOVEL WAVELET DISPERSION MEASURE

Page 15: Identifying the Classical Music Composition of an Unknown Performance with Wavelet Dispersion Vector and Neural Nets V. CONTENT DESCRIPTION WITH WAVELETS

A NOVEL WAVELET DISPERSION MEASURE

Vector de dispersión

Para cada archivo audio este vector se puede construir para representar las características audio especiales. La base de datos del proyecto contiene 128 archivos audio.

Page 16: Identifying the Classical Music Composition of an Unknown Performance with Wavelet Dispersion Vector and Neural Nets V. CONTENT DESCRIPTION WITH WAVELETS

A NOVEL WAVELET DISPERSION MEASURE

B. Wavelet Dispersion Measure Dimension Reduction

Page 17: Identifying the Classical Music Composition of an Unknown Performance with Wavelet Dispersion Vector and Neural Nets V. CONTENT DESCRIPTION WITH WAVELETS

A NOVEL WAVELET DISPERSION MEASURE

C. Wavelet Dispersion Measure Performance Indicator