2003 international conference on multimedia and expo · ss-l9.3: information fusion and decision...

PROCEEDINGS

2003 International Conference on Multimedia and Expo

Volume III of III
6-9 July 2003

Baltimore Marriott Waterfront Hotel
Baltimore, Maryland, USA

Sponsored by

The Institute of Electrical and Electronics Engineers
Signal Processing Society
Computer Society
Circuits and Systems Society
Communications Society

VOLUME III

SS-L9: THEORETICAL INSIGHTS AND IMPROVEMENTS FOR MULTIMODAL BIOMETRICS

SS-L9.1: SPEECH & FACE BASED BIOMETRIC AUTHENTICATION AT IDIAP III - 1
Conrad Sanderson, Samy Bengio, Herve Bourlard, Johnny Mariethoz, Ronan Collobert, Mohamed BenZeghiba, Fabien Cardinaux, Sebastien Marcel, IDIAP, Switzerland

SS-L9.2: FUSION STRATEGIES IN MULTIMODAL BIOMETRIC VERIFICATION III - 5
Julian Fierrez-Aguilar, Javier Ortega-Garcia, Joaquin Gonzalez-Rodriguez, Universidad Politecnica de Madrid, Spain

SS-L9.3: INFORMATION FUSION AND DECISION CASCADING FOR AUDIO-VISUAL SPEAKER RECOGNITION BASED ON TIME-VARYING STREAM RELIABILITY PREDICTION III - 9
Upendra Chaudhari, Ganesh Ramaswamy, Gerasimos Potamianos, Chalapathy Neti, IBM T. J. Watson Research Center, United States

SS-L9.4: COMBINING CLASSIFIERS FOR FACE RECOGNITION III - 13
Xiaoguang Lu, Michigan State University, United States; Yunhong Wang, Chinese Academy of Sciences, China; Anil Jain, Michigan State University, United States

SS-L9.5: A CLASSIFICATION OF BIOMETRIC SIGNATURES III - 17
Arslan Bromme, Magdeburg University, Germany

MD-L5: SUMMARIZATION

MD-L5.1: ENHANCED ACCESS TO DIGITAL VIDEO THROUGH VISUALLY RICH INTERFACES III - 21
Michael Christel, Chang Huang, Carnegie Mellon University, United States

MD-L5.2: MULTIMODAL SUMMARIZATION OF MEETING RECORDINGS III - 25
Berna Erol, Dar-Shyang Lee, Jonathan Hull, Ricoh California Research Center, United States

MD-L5.3: UNSUPERVISED DISCOVERY OF MULTILEVEL STATISTICAL VIDEO STRUCTURES USING HIERARCHICAL HIDDEN MARKOV MODELS III - 29
Lexing Xie, Shih-Fu Chang, Columbia University, United States; Ajay Divakaran, Huifang Sun, Mitsubishi Electric Research Laboratories, United States

MD-L5.4: MERGING RESULTS OF DISTRIBUTED IMAGE LIBRARIES III - 33
Stefano Berretti, Alberto Del Bimbo, Pietro Pala, Universita di Firenze, Italy

MD-L5.5: HIGHLIGHT SOUND EFFECTS DETECTION IN AUDIO STREAM III - 37
Rui Cai, Tsinghua University, China; Lie Lu, Hong-Jiang Zhang, Microsoft Research Asia, China; Lian-Hong Cai, Tsinghua University, China

SS-L10: MULTISTREAM AUDIO AND VIDEO PROCESSING FOR TELEPRESENCE

SS-L10.1: FOUR-DIMENSIONAL SOUND SOURCE RECOVERY FROM ARBITRARY ACOUSTIC ARRAYS III - 41
Douglas Jones, University of Illinois, United States

SS-L10.2: MULTICHANNEL VIDEO/AUDIO ACQUISITION FOR IMMERSIVE CONFERENCING III - 45
Qiong Liu, Don Kimber, Jonathan Foote, FX Palo Alto Laboratory, Inc., United States; Chunyuan Liao, University of Maryland, United States

SS-L10.3: FULL-DUPLEX MULTICHANNEL COMMUNICATION: REAL-TIME IMPLEMENTATIONS IN A GENERAL FRAMEWORK III - 49
Wolfgang Herbordt, Herbert Buchner, Walter Kellermann, University Erlangen-Nuremberg, Germany; Rudolf Rabenstein, University of Erlangen-Nuremberg, Germany; Sascha Spors, Heinz Teutsch, University Erlangen-Nuremberg, Germany

SS-L10.4: SCENE RECONSTRUCTION USING DISTRIBUTED MICROPHONE ARRAYS III - 53
Parham Aarabi, Bob Mungamuru, University of Toronto, Canada

SS-L10.5: USING COMPUTER VISION TO GENERATE CUSTOMIZED SPATIAL AUDIO III - 57
Ankur Mohan, Ramani Duraiswami, Dmitry N. Zotkin, Daniel DeMenthon, Larry S. Davis, University of Maryland, College Park, United States

AIVP-L7: VIDEO/IMAGE TRACKING

AIVP-L7.1: SHAPE AND MOTION DRIVEN PARTICLE FILTERING FOR HUMAN BODY TRACKING III - 61
Takashi Yamamoto, Rama Chellappa, University of Maryland, United States

AIVP-L7.2: OBJECT TRACKING USING ADAPTIVE BLOCK MATCHING III - 65
Karthik Hariharakrishnan, Dan Schonfeld, University of Illinois, Chicago, United States; Philippe Raffy, Fathy Yassa, NeoMagic Corp., United States

AIVP-L7.3: OBJECT TRACKING IN CLUTTER AND PARTIAL OCCLUSION THROUGH RULE-DRIVEN UTILIZATION OF SNAKES III - 69
Gabriel Tsechpenakis, Kostas Rapatzikos, Nicolas Tsapatsoulis, Stefanos Kollias, National Technical University of Athens, Greece

AIVP-L7.4: TRACKING OF MOVING OBJECTS BASED ON GRAPH EDGES SIMILARITY III - 73
Ofer Miller, Ety Navon, Amir Averbuch, Tel-Aviv University, Israel

AIVP-L7.5: SHADOW-RESISTANT TRACKING IN VIDEO III - 77
Hao Jiang, Mark Drew, Simon Fraser University, Canada

MSCP-P2: MULTIMEDIA SECURITY AND CONTENT PROTECTION IV

MSCP-P2.1: HIGH PERFORMANCE ELLIPTIC CURVE GF(2^K) CRYPTOPROCESSOR ARCHITECTURE FOR MULTIMEDIA III - 81
Adnan Abdul-Aziz Gutub, Mohammad K. Ibrahim, King Fahd University of Petroleum and Minerals, Saudi Arabia

MSCP-P2.2: SCRAMBLING OF ENGINEERING DRAWINGS III - 85
Wei-Qi Yan, Mohan Kankanhalli, National University of Singapore, Singapore

MSCP-P2.3 *: NONLINEAR SEPARATION OF SIGNATURE TRAJECTORIES FOR ON-LINE PERSONAL AUTHENTICATION III - 89
Mitsuru Kondo, Daigo Muramatsu, Masahiro Sasaki, Takashi Matsumoto, Waseda University, Japan

MSCP-P2.4: AN ACCURATE BILLING MECHANISM FOR MULTIMEDIA COMMUNICATIONS III - 93
Jose Gabriel Gomes, Mylene de Farias, Sanjit Mitra, University of California, Santa Barbara, United States; Marco Carli, University of Rome TRE, Italy

MSCP-P2.5: ROBUST BUYER AUTHENTICATION SCHEME FOR MULTIMEDIA OBJECT III - 97
Dipti Prasad Mukherjee, Subhamoy Maitra, Indian Statistical Institute, India

MSCP-P2.6: BINARY IMAGE WATERMARKING THROUGH BIASED BINARIZATION III - 101
Haiping Lu, Alex C. Kot, Nanyang Technological University, Singapore; Rahardja Susanto, Institute for Infocomm Research, Singapore

MSCP-P2.7: 3D POLYGONAL MESHES WATERMARKING USING NORMAL VECTOR DISTRIBUTIONS III - 105
Suk-Hwan Lee, Tae-Su Kim, Byung-Ju Kim, Kyungpook National University, Republic of Korea; Seong-Geun Kwon, Samsung Electronics Co. Ltd., Republic of Korea; Ki-Ryong Kwon, Pusan University of Foreign Studies, Republic of Korea; Kuhn-Il Lee, Kyungpook National University, Republic of Korea

MSCP-P2.8: A SECURE REGISTRATION PROTOCOL FOR MEDIA APPLIANCES IN WIRELESS HOME NETWORKS III - 109
Nut Taesombut, Vineet Kumar, Rishi Dubey, P. Venkat Rangan, University of California, San Diego, United States

ICASSP-7: HUMAN MOVEMENT AND FACE ANALYSIS

ICASSP-7.1 *: COMBINING MULTIPLE EVIDENCES FOR GAIT RECOGNITION III - 113
Naresh Cuntoor, Amit Kale, Rama Chellappa, University of Maryland, United States

ICASSP-7.2 *: TRACKING HUMAN MOVEMENT PATTERNS USING PARTICLE FILTERING III - 117
Richard Green, University of Sydney, Canada; Ling Guan, Ryerson University, Canada

ICASSP-7.3 *: A COMPARISON OF SUBSPACE ANALYSIS FOR FACE RECOGNITION III - 121
Jian Li, Shaohua Zhou, Chandra Shekhar, University of Maryland, United States

ICASSP-7.4 *: FACIAL FEATURE TRACKING COMBINING MODEL-BASED AND MODEL-FREE METHOD III - 125
JianYu Wang, Harbin Institute of Technology, China; Wen Gao, Harbin Institute of Technology / FRJDL, China; Shiguang Shan, FRJDL, China; XiaoPeng Hu, Wuhan University, China

ICASSP-7.5 *: SIMULTANEOUS TRACKING AND RECOGNITION OF HUMAN FACES FROM VIDEO III - 129
Shaohua Zhou, Rama Chellappa, University of Maryland, United States

ICASSP-7.6 *: AUTOMATIC 3D FACE VERIFICATION FROM RANGE DATA III - 133
Gang Pan, Zhaohui Wu, Yunhe Pan, Zhejiang University, China

ICASSP-7.7 *: ROTATED FACE DETECTION IN COLOR IMAGES USING RADIAL TEMPLATE (RT) III - 137
Heng Liu, Harbin Institute of Technology, China; Shengye Yan, Beijing Polytechnic University, China; Xilin Chen, Harbin Institute of Technology, China; Wen Gao, Institute of Computing Technology, China

ICASSP-7.8 *: NOVEL EXAMPLE-BASED SHAPE LEARNING FOR FAST FACE ALIGNMENT III - 141
Xiujuan Chai, Harbin Institute of Technology, China; Shiguang Shan, Wen Gao, Bo Cao, Institute of Computing Technology, China

ICASSP-7.9 *: REAL-TIME FACE VERIFICATION USING MULTIPLE FEATURE COMBINATION AND A SUPPORT VECTOR MACHINE SUPERVISOR III - 145
Do-Hyung Kim, Jae-Yeon Lee, Jung Soh, Yun-Koo Chung, ETRI, Republic of Korea

ICASSP-7.10 *: VIRTUAL FACE IMAGE GENERATION FOR ILLUMINATION AND POSE INSENSITIVE FACE RECOGNITION III - 149
Wen Gao, Shiguang Shan, Institute of Computing Technology, China; Xiujuan Chai, Xiaowei Fu, Harbin Institute of Technology, China

ICASSP-8: IMAGE AND VIDEO CODING AND ANALYSIS

ICASSP-8.1 *: ERROR RESILIENT PRE-/POST-FILTERING FOR DCT-BASED BLOCK CODING SYSTEMS III - 153
Chengjie Tu, Trac Tran, Jie Liang, The Johns Hopkins University, United States

ICASSP-8.2 *: MULTIPLE, ARBITRARY SHAPE ROI CODING WITH ZEROTREE BASED WAVELET CODERS III - 157
Aysegul Cuhadar, Sinan Tasdoken, Carleton University, Canada

ICASSP-8.3 *: LOSSLESS AND LOSSY MINIMAL REDUNDANCY PYRAMIDAL DECOMPOSITION FOR SCALABLE IMAGE COMPRESSION TECHNIQUE III - 161
Marie Babel, Olivier Deforges, UMR CNRS 6164 IETR/INSA Rennes, France

ICASSP-8.4 *: SCHEMES FOR ERROR RESILIENT STREAMING OF PERCEPTUALLY CODED AUDIO III - 165
Jari Korhonen, Nokia Research Center, Finland; Ye Wang, National University of Singapore, Singapore

ICASSP-8.5 *: SPATIO-TEMPORAL VIDEO ERROR CONCEALMENT WITH PERCEPTUALLY OPTIMIZED MODE SELECTION III - 169
Stefano Belfiore, Marco Grangetto, Enrico Magli, Gabriella Olmo, Politecnico di Torino, Italy

ICASSP-8.6 *: ADAPTIVE SKIN SEGMENTATION IN COLOR IMAGES III - 173
Son Lam Phung, Douglas Chai, Abdesselam Bouzerdoum, Edith Cowan University, Australia

ICASSP-8.7 *: INVERTIBLE DEINTERLACING WITH VARIABLE COEFFICIENTS AND ITS LIFTING IMPLEMENTATION III - 177
Takuma Ishida, Shogo Muramatsu, Hisakazu Kikuchi, Niigata University, Japan; Tetsuro Kuge, NHK, Japan

ICASSP-8.8 *: STATISTICAL SHAPE THEORY FOR ACTIVITY MODELING III - 181
Namrata Vaswani, Amit Roy Chowdhury, Rama Chellappa, University of Maryland, United States

ICASSP-8.9 *: EVIDENCE-BASED OBJECT TRACKING VIA GLOBAL ENERGY MAXIMIZATION III - 185
John Carter, Pelo Lappas, Robert Damper, University of Southampton, United Kingdom

ICASSP-8.10 *: A NEW REAL-TIME PATTERN SELECTION ALGORITHM FOR VERY LOW BIT-RATE VIDEO CODING FOCUSING ON MOVING REGIONS III - 189
Manoranjan Paul, Manzur Murshed, Laurence Dooley, Monash University, Australia

ICASSP-9: SPEECH AND AUDIO PROCESSING IV

ICASSP-9.1 *: PARAMETRIC VECTOR QUANTIZATION FOR CODING PERCUSSIVE SOUNDS IN MUSIC III - 193
Ye Wang, National University of Singapore, Singapore; Jian Tang, Ali Ahmaniemi, Nokia Research Center, Finland; Markus Vaalgamaa, Nokia Mobile Phones, Finland

ICASSP-9.2 *: HMM-NEURAL NETWORK MONOPHONE MODELS FOR COMPUTER-BASED ARTICULATION TRAINING FOR THE HEARING IMPAIRED III - 197
Mukund Devarajan, Fansheng Meng, Penny Hix, Stephen Zahorian, Old Dominion University, United States

ICASSP-9.3 *: CONSTRAINT SATISFACTION MODEL FOR ENHANCEMENT OF EVIDENCE IN RECOGNITION OF CONSONANT-VOWEL UTTERANCES III - 201
Suryakanth V. Gangashetty, C. Chandra Sekhar, B. Yegnanarayana, Indian Institute of Technology, Madras, India

ICASSP-9.4 *: SUPPORT VECTOR MACHINE FUSION OF IDIOLECTAL AND ACOUSTIC SPEAKER INFORMATION IN SPANISH CONVERSATIONAL SPEECH III - 205
Daniel Garcia-Romero, Julian Fierrez-Aguilar, Joaquin Gonzalez-Rodriguez, Javier Ortega-Garcia, Universidad Politecnica de Madrid, Spain

ICASSP-9.5 *: AN EVALUATION OF ADAPTIVE BEAMFORMER BASED ON AVERAGE SPEECH SPECTRUM FOR NOISY SPEECH RECOGNITION III - 209
Takanobu Nishiura, ATR, Japan; Masato Nakayama, Wakayama University, Japan; Satoshi Nakamura, ATR, Japan

ICASSP-9.6 *: AUDITIVE LEARNING BASED CHINESE F0 PREDICTION III - 213
Jianhua Tao, Chinese Academy of Sciences, China; Xing Ni, Tsinghua University, China

ICASSP-9.7 *: MULTI-CHANNEL PSYCHOACOUSTICALLY MOTIVATED SPEECH ENHANCEMENT III - 217
Justinian Rosca, Siemens Corporate Research, United States; Radu Balan, Siemens Corporate Research, United States; Christophe Beaugeant, Siemens AG, Germany

ICASSP-9.8 *: A PROGRESSIVE TO LOSSLESS EMBEDDED AUDIO CODER (PLEAC) WITH REVERSIBLE MODULATED LAPPED TRANSFORM III - 221
Jin Li, Microsoft Research, United States

SS-L11: SIGNAL PROCESSING AND TESTING IN MULTIMODAL BIOMETRICS

SS-L11.1: A TEST TOOL TO SUPPORT BRUT-FORCE ONLINE AND OFFLINE SIGNATURE FORGERY TESTS ON MOBILE DEVICES III - 225
Frank Zoebisch, Claus Vielhauer, University of Magdeburg, Germany

SS-L11.2: INCREMENTAL UPDATING OF ADVANCED CORRELATION FILTERS FOR BIOMETRIC AUTHENTICATION SYSTEMS III - 229
Marios Savvides, Krithika Venkataramani, B. V. K. Vijayakumar, Carnegie Mellon University, United States

SS-L11.3: A REAL TIME AUTOMATIC ACCESS CONTROL SYSTEM BASED ON FACE AND EYE CORNERS DETECTION, FACE RECOGNITION AND SPEAKER IDENTIFICATION III - 233
Ziyou Xiong, Yunqiang Chen, Roy Wang, Thomas Huang, University of Illinois, Urbana-Champaign, United States

SS-L11.4: MULTIMEDIA CONTENT PROTECTION VIA BIOMETRICS-BASED ENCRYPTION III - 237
Umut Uludag, Anil Jain, Michigan State University, United States

SS-L11.5: ON TESTING METHODS FOR BIOMETRIC AUTHENTICATION III - 241
Enrico Grosso, Massimo Tistarelli, University of Sassari, Italy

MCN-L6: MULTIMEDIA CODING AND TRANSPORT

MCN-L6.1: A DELAY-EFFICIENT RE-ROUTING SCHEME FOR VOIP TRAFFIC III - 245
Narasinha Kamat, Ju Wang, Jonathan Liu, University of Florida, United States

MCN-L6.2: A NEW CLUSTER-BASED DISTRIBUTED VIDEO RECORD SERVER III - 249
Xiaofei Liao, Hai Jin, Huazhong University of Science and Technology, China

MCN-L6.3: EXTENDING PROGRESSIVE MESHES FOR USE OVER UNRELIABLE NETWORKS III - 253
Zhihua Chen, Bobby Bodenheimer, J. Fritz Barnes, Vanderbilt University, United States

MCN-L6.4: A SCALABLE VIRTUAL PROGRAMMABLE REAL-TIME TESTBED FOR RAPID MULTIMEDIA SERVICE CREATION AND EVALUATION III - 257
Christian Bachmeir, Peter Tabery, Serdar Üzümcü, Eckehard Steinbach, Munich University of Technology, Germany

MCN-L6.5: REAL-TIME ADAPTIVE FORWARD ERROR CORRECTION FOR MPEG-2 VIDEO COMMUNICATIONS OVER RTP NETWORKS III - 261
Bulent Cavusoglu, Dan Schonfeld, Rashid Ansari, University of Illinois, Chicago, United States

MHMII-L3: MULTIMEDIA STANDARDS

MHMII-L3.1: MODELING OF THE NON-DETERMINISTIC SYNCHRONIZATION BEHAVIORS IN SMIL2.0 DOCUMENTS III - 265
Chun-Chuan Yang, Chih-Wen Tien, Yung-Chi Wang, National Chi Nan University, Taiwan

MHMII-L3.2: EXTENDING MPEG-7 DESCRIPTION SCHEME OF MOVING REGIONS BY THE SEMANTIC VISUAL-SPATIO-TEMPORAL RELATIONSHIPS III - 269
Zaher Aghbari, Akifumi Makinouchi, Kyushu University, Japan

MHMII-L3.3: PERFORMANCE OF MPEG-7 LOW LEVEL AUDIO DESCRIPTORS WITH COMPRESSED DATA III - 273
Jason Lukasiak, David Stirling, Nick N. Harders, Shane Perrow, University of Wollongong, Australia

MHMII-L3.4: EFFICIENT RATE CONTROL TECHNIQUE FOR JPEG2000 IMAGE CODING USING PRIORITY SCANNING III - 277
Yick Ming Yeung, Oscar C. Au, Andy Chang, Hong Kong University of Science and Technology, Hong Kong SAR of China

MHMII-L3.5: CONTENT-ADAPTIVE UTILITY-BASED VIDEO ADAPTATION III - 281
Jae-Gon Kim, Electronics and Telecommunications Research Institute, Republic of Korea; Yong Wang, Shih-Fu Chang, Columbia University, United States

AIVP-L8: FACE ANALYSIS AND MODELING

AIVP-L8.1: FACE REPRESENTATION UNDER DIFFERENT ILLUMINATION CONDITIONS III - 285
Haitao Wang, Chinese Academy of Sciences, China; Hong Wei, University of Reading, United Kingdom; Yangsheng Wang, Chinese Academy of Sciences, China

AIVP-L8.2: 3D FACE MODELING USING TWO ORTHOGONAL VIEWS AND A GENERIC FACE MODEL III - 289
A-Nasser Ansari, Mohamed Abdel-Mottaleb, University of Miami, United States

AIVP-L8.3: FACE TRACKING IN VIDEO WITH HYBRID OF LUCAS-KANADE AND CONDENSATION ALGORITHM III - 293
Chong Luo, Tat Seng Chua, Teck Khim Ng, National University of Singapore, Singapore

AIVP-L8.4: FACE IMAGE RESTORATION BASED ON STATISTICAL PRIOR AND IMAGE BLUR MEASURE III - 297
Xin Fan, Xian JiaoTong University, China; Qi Zhang, Dequn Liang, Ling Zhao, Dalian Maritime University, China

AIVP-L8.5: FAST HIERARCHICAL FACE DETECTION III - 301
Yao-Hong Tsai, Yea-Shuan Huang, Industrial Technology Research Institute, Taiwan

MD-P4: SEGMENTATION, SUMMARIZATION, AND STRUCTURING

MD-P4.1: TOPIC-BASED INTER-VIDEO STRUCTURING OF A LARGE-SCALE NEWS VIDEO CORPUS III - 305
Ichiro Ide, Hiroshi Mo, Norio Katayama, Shin'ichi Satoh, National Institute of Informatics, Japan

MD-P4.2: HMM BASED STRUCTURING OF TENNIS VIDEOS USING VISUAL AND AUDIO CUES III - 309
Ewa Kijak, Thomson multimedia R&D, France; Guillaume Gravier, Patrick Gros, IRISA, France; Lionel Oisel, Thomson multimedia R&D, France; Frederic Bimbot, IRISA, France

MD-P4.3: FAST METHOD OF SEGMENTATION AND INDEXING MPEG1-2 FLOW III - 313
Lionel Brunel, Pierre Mathieu, Universite de Nice - Sophia Antipolis, France

MD-P4.4: BUILDING IMAGE MOSAICS: AN APPLICATION OF CONTENT-BASED IMAGE RETRIEVAL III - 317
Yue Zhang, Mario Nascimento, Osmar Zaiane, University of Alberta, Canada

MD-P4.5: A PROPOSAL FOR A VIDEO CONTENT GENERATION SUPPORT SYSTEM AND ITS APPLICATIONS III - 321
Wenli Zhang, Xiaomeng Xu, Shunsuke Kamijo, Masao Sakauchi, University of Tokyo, Japan

MD-P4.6: FAST SCENE SEGMENTATION USING MULTI-LEVEL FEATURE SELECTION III - 325
Yan Liu, John R. Kender, Columbia University, United States

MD-P4.7: SEMANTIC VIDEO SUMMARIZATION IN COMPRESSED DOMAIN MPEG VIDEO III - 329
Jek Charlson So Yu, Mohan Kankanhalli, Philippe Mulhem, National University of Singapore, Singapore

MD-P4.8: SEQUENTIAL ASSOCIATION MINING FOR VIDEO SUMMARIZATION III - 333
Xingquan Zhu, Xindong Wu, University of Vermont, United States

MD-P4.9 *: AN UNSUPERVISED APPROACH TO COLOR VIDEO THRESHOLDING III - 337
Eliza Yingzi Du, Chein-I Chang, University of Maryland, Baltimore County, United States; Paul D. Thouin, US Department of Defense, United States

MD-P4.10 *: REAL-TIME ADAPTIVE BACKGROUND SEGMENTATION III - 341
Darren Butler, Sridha Sridharan, Queensland University of Technology, Australia; V. Michael Bove, Jr., Massachusetts Institute of Technology, United States

MCN-P7: RATE CONTROL AND PACKET CLASSIFICATION FOR TRANSMISSION

MCN-P7.1: ANALYSIS-BY-SYNTHESIS DISTORTION COMPUTATION FOR RATE-DISTORTION OPTIMIZED MULTIMEDIA STREAMING III - 345
Enrico Masala, Politecnico di Torino, Italy; Juan Carlos De Martin, IEIIT-CNR, Italy

MCN-P7.2: A RATE CONTROL SCHEME FOR H.26L VIDEO TRANSMISSION III - 349
Yuh-Ching Wang, Jin-Jang Leou, National Chung Cheng University, Taiwan

MCN-P7.3: ENSURING FAIRNESS IN MULTIMEDIA MULTICAST STREAMING WITH OPTIMAL RATE ALLOCATION AND CLIENT BUFFER UTILIZATION III - 353
Mei-Ling Shyu, University of Miami, United States; Shu-Ching Chen, Florida International University, United States; Hongli Luo, University of Miami, United States

MCN-P7.4: A SCHEME FOR FAIR, RATE-BASED END-TO-END CONGESTION CONTROL OF MULTIMEDIA TRAFFIC IN PACKET SWITCHED NETWORKS III - 357
Srikantia R. Subramanya, Sarangapani Jagannathan, Mingsheng Peng, University of Missouri-Rolla, United States

MCN-P7.5: PERCEPTUAL RATE CONTROL FOR LOW-DELAY VIDEO COMMUNICATIONS III - 361
Chi-Wah Wong, Oscar C. Au, Bojun Meng, Hong-Kwai Lam, Hong Kong University of Science and Technology, Hong Kong SAR of China

MCN-P7.6: PER-CLASS QUEUE MANAGEMENT AND ADAPTIVE PACKET DROP MECHANISM FOR MULTIMEDIA NETWORKING III - 365
Mei-Ling Shyu, University of Miami, United States; Shu-Ching Chen, Florida International University, United States; Hongli Luo, University of Miami, United States

MCN-P7.7: ADAPTIVE PACKET CLASSIFICATION FOR CONSTANT PERCEPTUAL QUALITY OF SERVICE DELIVERY OF VIDEO STREAMS OVER TIME-VARYING NETWORKS III - 369
Davide Quaglia, Politecnico di Torino, Italy; Juan Carlos De Martin, IEIIT-CNR, Italy

MCN-P7.8: END-TO-END AVAILABLE BANDWIDTH ESTIMATION AND TIME MEASUREMENT ADJUSTMENT FOR MULTIMEDIA QOS III - 373
Qiang Liu, Jenq-Neng Hwang, University of Washington, United States

MCN-P7.9 *: BUFFER-CONSTRAINED R-D OPTIMIZED RATE CONTROL FOR VIDEO CODING III - 377
Lifeng Zhao, C.-C. Jay Kuo, University of Southern California, United States

ICASSP-10: AUDIO SIGNAL PROCESSING

ICASSP-10.1 *: PITCH AND TIMBRE MANIPULATIONS USING CORTICAL REPRESENTATION OF SOUND III - 381
Dmitry N. Zotkin, Shihab A. Shamma, Powen Ru, Ramani Duraiswami, Larry S. Davis, University of Maryland, College Park, United States

ICASSP-10.2 *: MULTIDIMENSIONAL HUMMING TRANSCRIPTION USING A STATISTICAL APPROACH FOR QUERY BY HUMMING SYSTEMS III - 385
Hsuan-Huei Shih, Shrikanth S. Narayanan, C.-C. Jay Kuo, University of Southern California, United States

ICASSP-10.3 *: APPLICATION OF PITCH TRACKING TO SOUTH INDIAN CLASSICAL MUSIC III - 389
Arvindh Krishnaswamy, Stanford University, United States

ICASSP-10.4 *: SCALABLE TO LOSSLESS AUDIO COMPRESSION BASED ON PERCEPTUAL SET PARTITIONING IN HIERARCHICAL TREES (PSPIHT) III - 393
Mohammed Raad, Alfred Mertins, Ian Burnett, University of Wollongong, Australia

ICASSP-10.5 *: COMPARING MFCC AND MPEG-7 AUDIO FEATURES FOR FEATURE EXTRACTION, MAXIMUM LIKELIHOOD HMM AND ENTROPIC PRIOR HMM FOR SPORTS AUDIO CLASSIFICATION III - 397
Ziyou Xiong, University of Illinois, Urbana-Champaign, United States; Regunathan Radhakrishnan, Ajay Divakaran, Mitsubishi Electric Research Laboratories, United States; Thomas Huang, University of Illinois, Urbana-Champaign, United States

ICASSP-10.6 *: AUDIO EVENTS DETECTION BASED HIGHLIGHTS EXTRACTION FROM BASEBALL, GOLF AND SOCCER GAMES IN A UNIFIED FRAMEWORK III - 401
Ziyou Xiong, University of Illinois, Urbana-Champaign, United States; Regunathan Radhakrishnan, Ajay Divakaran, Mitsubishi Electric Research Laboratories, United States; Thomas Huang, University of Illinois, Urbana-Champaign, United States

ICASSP-10.7 *: AUDIO RESTORATION BY CONSTRAINED AUDIO TEXTURE SYNTHESIS III - 405
Lie Lu, Microsoft Research Asia, China; Yi Mao, Zhejiang University, China; Liu Wenyin, City University of Hong Kong, Hong Kong SAR of China; Hong-Jiang Zhang, Microsoft Research Asia, China

ICASSP-10.8 *: MUSICAL INSTRUMENT IDENTIFICATION BASED ON F0-DEPENDENT MULTIVARIATE NORMAL DISTRIBUTION III - 409
Tetsuro Kitahara, Kyoto University, Japan; Masataka Goto, National Institute of Advanced Industrial Science and Technology, Japan; Hiroshi G. Okuno, Kyoto University, Japan

ICASSP-10.9 *: ON PSYCHOACOUSTIC NOISE SHAPING FOR AUDIO REQUANTIZATION III - 413
Dreten De Koning, Werner Verhelst, Vrije Universiteit Brussel, Belgium

ICASSP-11: ARCHITECTURE, IMPLEMENTATION, AND DESIGN

ICASSP-11.1 *: RAPID PROTOTYPING FOR AN OPTIMIZED MPEG4 DECODER IMPLEMENTATION OVER A PARALLEL HETEROGENEOUS ARCHITECTURE III - 417
Nicolas Ventroux, Jean Francois Nezan, Mickael Raulet, Olivier Deforges, IETR / INSA Rennes, France

ICASSP-11.2 *: HARDWARE ORIENTED RATE CONTROL ALGORITHM AND IMPLEMENTATION FOR REALTIME VIDEO CODING III - 421
Hung-Chi Fang, Tu-Chih Wang, Yu-Wei Chang, Liang-Gee Chen, National Taiwan University, Taiwan

ICASSP-11.3 *: FACE RECOGNITION COMMITTEE MACHINE III - 425
H. M. Tang, Michael R. Lyu, Irwin King, Chinese University of Hong Kong, Hong Kong SAR of China

ICASSP-11.4 *: ROBUST CEPHALOMETRIC LANDMARK IDENTIFICATION USING SUPPORT VECTOR MACHINES III - 429
Shantanu Chakrabartty, The Johns Hopkins University, United States; Masakazu Yagi, Tadashi Shibata, University of Tokyo, Japan; Gert Cauwenberghs, The Johns Hopkins University, United States

ICASSP-11.5 *: A METHOD OF GENERATING UNIFORMLY DISTRIBUTED SEQUENCES OVER [0,K] WHERE K+1 IS NOT A POWER OF TWO III - 433
Richard Kuehnel, US Department of Defense, United States; Yuke Wang, University of Texas, Dallas, United States

ICASSP-11.6 *: AN EFFICIENT IMPLEMENTATION OF MULTI-PRIME RSA ON DSP PROCESSOR III - 437
Anand Krishnamurthy, Yiyan Tang, Cathy Xu, Yuke Wang, University of Texas, Dallas, United States

ICASSP-11.7 *: AN IMPROVED PARALLEL ARCHITECTURE FOR MPEG-4 MOTION ESTIMATION IN 3G MOBILE APPLICATIONS III - 441
Donglai Xu, University of Teesside, United Kingdom; Hadj Batatia, ENSEEIHT, France; Rui Gao, University of Teesside, United Kingdom

ICASSP-11.8 *: AN ULTRA-FAST REED-SOLOMON DECODER SOFT-IP WITH 8-ERROR CORRECTING CAPABILITY III - 445
Toshiyuki Yamane, Yasunao Katayama, IBM Research, Tokyo Research Laboratory, Japan

SS-L12: MULTIMEDIA TECHNOLOGY IN BIOINFORMATICS

SS-L12.1: COMPUTATIONAL INTELLIGENCE APPROACH FOR GENE EXPRESSION DATA MINING AND CLASSIFICATION III - 449
Zuyi Wang, Catholic University of America, United States; Sun-Yuan Kung, Princeton University, United States; Junying Zhang, Catholic University of America, United States; Javed Khan, National Institutes of Health, United States; Jianhua Xuan, Yue Wang, Catholic University of America, United States

SS-L12.2: DYNAMIC QUERYING FOR PATTERN IDENTIFICATION IN MICROARRAY AND GENOMIC DATA III - 453
Harry Hochheiser, Eric Baehrecke, Stephen Mount, Ben Shneiderman, University of Maryland, United States

SS-L12.3: PROTEOMICS: APPROACHES AND IMAGE ANALYSIS TOOLS FOR DRUG DISCOVERY III - 457
Sophia R. He, Edmond J. Breen, Sybille M. N. Hunt, Proteome Systems Limited, Australia

SS-L12.4: INTERACTIVE COLOR MOSAIC AND DENDROGRAM DISPLAYS FOR SIGNAL/NOISE OPTIMIZATION IN MICROARRAY DATA ANALYSIS III - 461
Jinwook Seo, University of Maryland, United States; Marina Bakay, Po Zhao, Yi-Wen Chen, Children's National Medical Center, United States; Priscilla Clarkson, University of Massachusetts, Amherst, United States; Ben Shneiderman, University of Maryland, United States; Eric Hoffman, Children's National Medical Center, United States

SS-L12.5: REGISTERING ELECTROPHORESIS IMAGES FOR BIOINFORMATICS STUDY OF PROTEIN III - 465
Per Højte, Technical University of Denmark, Denmark; Xiaoxing Wang, University of Sydney, United States

MD-L3: VIDEO ANALYSIS AND MINING

MD-L3.1: A NOVEL MOTION-BASED REPRESENTATION FOR VIDEO MINING III - 469
Dong-Jun Lan, Tsinghua University, China; Yu-Fei Ma, Hong-Jiang Zhang, Microsoft Research Asia, China

MD-L3.2: IMPROVED TEXT OVERLAY DETECTION IN VIDEOS USING A FUSION-BASED CLASSIFIER III - 473
Belle L. Tseng, Ching-Yung Lin, IBM T. J. Watson Research Center, United States; Dongqing Zhang, Columbia University, United States; John R. Smith, IBM T. J. Watson Research Center, United States

MD-L3.3: MOTION INDEXING AND SYNTHESIS III - 477
Chih-Yi Chiu, Shih-Pin Chao, Jui-Hsiang Chao, Wen-Yen Chang, National Tsing Hua University, Taiwan; Hsin-Chih Lin, Chang Jung Christian University, Taiwan; Shi-Nine Yang, National Tsing Hua University, Taiwan

MD-L3.4: TIME INTERVAL MAXIMUM ENTROPY BASED EVENT INDEXING IN SOCCER VIDEO III - 481
Cees Snoek, Marcel Worring, University of Amsterdam, Netherlands

MD-L3.5: VIDEO CLASSIFICATION USING SPATIAL-TEMPORAL FEATURES AND PCA III - 485
Li-Qun Xu, Yongmin Li, BTexact Technologies, United Kingdom

MCSA-L1: MULTIMEDIA COMPUTING SYSTEMS AND APPLIANCES

MCSA-L1.1: EFFICIENT BUFFERING CONTROL FOR A SOFTWARE-ONLY, HIGH-LEVEL, HIGH-PROFILE MPEG-2 DECODER III - 489
Ju Wang, Jonathan Liu, Yishu He, University of Florida, United States

MCSA-L1.2: COMPARISON STUDY AND EVALUATION OF OVERLAY MULTICAST NETWORKS III - 493
Yan Zhu, Min-You Wu, Wei Shu, University of New Mexico, United States

MCSA-L1.3: ON DESIGNING END-USER MULTICAST FOR MULTIPLE VIDEO SOURCES III - 497
Yoshitaka Nakamura, Hirozumi Yamaguchi, Akihito Hiromori, Osaka University, Japan; Keiichi Yasumoto, Nara Institute of Science and Technology, Japan; Teruo Higashino, Kenichi Taniguchi, Osaka University, Japan

MCSA-L1.4: CHARACTERIZATION AND MODELING OF CAMPUS-LEVEL IP NETWORK TRAFFIC III - 501
Eugenio Costamagna, Lorenzo Favalli, Francesco Tarantola, University of Pavia, Italy

MCSA-L1.5: ATTENUATOR: TOWARDS PRESERVING THE ORIGINAL APPEARANCE OF LARGE DOCUMENTS WHEN RENDERED ON SMALL SCREEN MOBILE DEVICES III - 505
Stuart Goose, Rajanikanth Tanikella, Sreedhar Kodlahalli, Siemens Corporate Research, United States

AIVP-L9: FAST ALGORITHM FOR VIDEO PROCESSING

AIVP-L9.1: PRACTICAL REAL-TIME VIDEO CODEC FOR MOBILE DEVICES III - 509
Keman Yu, Jiangbo Lv, Jiang Li, Shipeng Li, Microsoft Research Asia, China

AIVP-L9.2: LOW-COMPLEXITY RATE-DISTORTION OPTIMAL MACROBLOCK MODE SELECTION FOR MPEG-LIKE VIDEO CODERS III - 513
Hyungjoon Kim, Yucel Altunbasak, Georgia Institute of Technology, United States

AIVP-L9.3: FAST MOTION ESTIMATION WITHIN THE H.264 CODEC III - 517
Hye-Yeon Cheong Tourapis, Microsoft Research Asia, China; Alexis Tourapis, Thomson multimedia Inc., United States

AIVP-L9.4: EFFICIENT INTRA-PREDICTION MODE SELECTION FOR 4X4 BLOCKS IN H.264 III - 521
Bojun Meng, Oscar C. Au, Chi-Wah Wong, Hong-Kwai Lam, Hong Kong University of Science and Technology, Hong Kong SAR of China

AIVP-L9.5: DIVERSITY-BASED FAST BLOCK MOTION ESTIMATION III - 525
Jun Xin, Ming-Ting Sun, University of Washington, United States; Vincent Hsu, CCL/ITRI, Taiwan

MHMII-P1: MULTIMEDIA HUMAN-MACHINE INTERFACE AND INTERACTION

MHMII-P1.1: SPEECH-ASSISTED FACIAL EXPRESSION ANALYSIS AND SYNTHESIS FOR VIRTUAL CONFERENCING SYSTEMS III - 529
Yao-Jen Chang, Industrial Technology Research Institute, Taiwan; Chao-Kuei Heish, Pei-Wei Hsu, Yung-Chang Chen, National Tsing Hua University, Taiwan

MHMII-P1.2 *: USING VISEME BASED ACOUSTIC MODELS FOR SPEECH DRIVEN LIP HI - 533SYNTHESISAshish Verma, Nitendra Rajput, Venkat Subramaniam, Indian Institute of Technology, India

MHMII-P1.3: DETECTING AUDITORY INFORMATION IN CONCENTRATION BASED ON EYE HI - 537MOVEMENTAtsuo Yoshitaka, Hirokazu Seki, Hiroshima University, Japan

MHMII-P1.4: A REALTIME SYSTEM FOR HAND GESTURE CONTROLLED OPERATION OF IN-CAR DEVICES III - 541
Martin Zobl, Michael Geiger, Institute for Human-Machine Interaction, Germany; Bjorn Schuller, Munich University of Technology, Germany; Manfred Lang, Gerhard Rigoll, Institute for Human-Machine Interaction, Germany

MHMII-P1.5: AIDED DESIGN OF FINITE-STATE DIALOGUE MANAGEMENT SYSTEMS III - 545
Olivier Pietquin, Thierry Dutoit, Faculte Polytechnique de Mons, Belgium

MHMII-P1.6: EMOTION DETECTION IN TASK-ORIENTED DIALOGS III - 549
Laurence Devillers, Lori Lamel, LIMSI-CNRS, France; Ioana Vasilescu, ENST-CNRS, France

MHMII-P1.7 *: EDITING BY VOICE AND THE ROLE OF SEQUENTIAL SYMBOL SYSTEMS FOR IMPROVED HUMAN-TO-COMPUTER INFORMATION RATES III - 553
Nils Klarlund, AT&T Labs - Research, United States

MHMII-P1.8: REAL TIME EYE TRACKING FOR HUMAN COMPUTER INTERFACES III - 557
Subramanya Amarnag, Raghunandan Kumaran, John Gowdy, Clemson University, United States

MHMII-P1.9 *: JOINT AUDIO-VIDEO PROCESSING FOR BIOMETRIC SPEAKER IDENTIFICATION III - 561
Alper Kanak, Engin Erzin, Yucel Yemez, Koc University, Turkey; A. Murat Tekalp, University of Rochester, United States; Koc University, Turkey

MHMII-P1.10 *: AUDIOVISUAL-BASED ADAPTIVE SPEAKER IDENTIFICATION III - 565
Ying Li, Shrikanth S. Narayanan, C.-C. Jay Kuo, University of Southern California, United States

HSMS-P2: ALGORITHMS AND ARCHITECTURES FOR MULTIMEDIA COMMUNICATIONS

HSMS-P2.1: ARCHITECTURE OF A MODULAR STREAMING MEDIA SERVER FOR CONTENT DELIVERY NETWORKS III - 569
Sumit Roy, John Ankcorn, Susie Wee, Hewlett-Packard Laboratories, United States

HSMS-P2.2: A DELIVERY METHOD OF VIDEOS WITH REQUIRED MINIMUM BAND WIDTHS III - 573
Hideaki Ito, Teruo Fukumura, Chukyo University, Japan

HSMS-P2.3: A CAPABLE LOCATION PREDICTION AND RESOURCE RESERVATION SCHEME IN WIRELESS NETWORKS FOR MULTIMEDIA III - 577
Shiang-Chun Liou, Hsuan-Chia Lu, Kuo-Hsien Yeh, Leader University, Taiwan

HSMS-P2.4: A DRIFT-FREE MOTION-COMPENSATED PREDICTIVE ENCODING TECHNIQUE FOR MULTIPLE DESCRIPTION CODING III - 581
Yen-Chi Lee, Yucel Altunbasak, Russell Mersereau, Georgia Institute of Technology, United States

HSMS-P2.5: LOW-COMPLEXITY VIDEO COMPRESSION FOR WIRELESS SENSOR NETWORKS III - 585
Enrico Magli, Massimo Mancin, Luca Merello, Politecnico di Torino, Italy

HSMS-P2.6: AN IMPROVED RM ALGORITHM FOR PREVENTING STREAMING MEDIA TASKS FROM STARVATION III - 589
Shuhua Peng, Xiaodong Liu, Qionghai Dai, Yu Cheng, Tsinghua University, China

HSMS-P2.7: A FRAMEWORK FOR VIDEO REPRESENTATION AND TRANSCODING USING APPEARANCE SPACES III - 593
Gaurav Harit, Santanu Chaudhury, Gaurav Garg, Pramod Kumar Sharma, IIT Delhi, India

HSMS-P2.8: SEMANTIC SEGMENTATION AND DESCRIPTION FOR VIDEO TRANSCODING III - 597
Andrea Cavallaro, Olivier Steiger, Touradj Ebrahimi, Swiss Federal Institute of Technology (EPFL), Switzerland

HSMS-P2.9 *: PERFORMANCE ANALYSIS OF HARDWARE ORIENTED ALGORITHM MODIFICATIONS IN H.264 III - 601
Tu-Chih Wang, Yu-Wen Huang, Hung-Chi Fang, Liang-Gee Chen, National Taiwan University, Taiwan

ICASSP-12: SPEECH RECOGNITION AND ENHANCEMENT

ICASSP-12.1 *: FRAME-DEPENDENT MULTI-STREAM RELIABILITY INDICATORS FOR AUDIO-VISUAL SPEECH RECOGNITION III - 605
Ashutosh Garg, University of Illinois, Urbana-Champaign, United States; Gerasimos Potamianos, Chalapathy Neti, IBM T.J. Watson Research Center, United States; Thomas Huang, University of Illinois, Urbana-Champaign, United States

ICASSP-12.2 *: IN-CAR SPEECH RECOGNITION USING DISTRIBUTED MICROPHONES - ADAPTING TO AUTOMATICALLY DETECTED DRIVING CONDITIONS - III - 609
Hideki Banno, Tetsuya Shinde, Kazuya Takeda, Fumitada Itakura, Nagoya University, Japan

ICASSP-12.3 *: AUTOMATIC SPEAKER RECOGNITION USING DYNAMIC BAYESIAN NETWORK III - 613
Lifeng Sang, Zhaohui Wu, Yingchun Yang, Wanfeng Zhang, Zhejiang University, China

ICASSP-12.4 *: TEMPORAL DECOMPOSITION: A PROMISING APPROACH TO VQ-BASED SPEAKER IDENTIFICATION III - 617
Phu Chien Nguyen, Masato Akagi, Tu Bao Ho, Japan Advanced Institute of Science and Technology, Japan

ICASSP-12.5 *: LOCATION BASED SPEAKER SEGMENTATION III - 621
Guillaume Lathoud, Iain McCowan, IDIAP, Switzerland

ICASSP-12.6 *: NON-NATIVE ENGLISH SPEECH RECOGNITION USING BILINGUAL ENGLISH LEXICON AND ACOUSTIC MODELS III - 625
Shoichi Matsunaga, Atsunori Ogawa, Yoshikazu Yamaguchi, Akihiro Imamura, NTT Cyber Space Labs., Japan

ICASSP-12.7 *: ROBUST DIGIT RECOGNITION USING PHASE-DEPENDENT TIME-FREQUENCY MASKING III - 629
Guangji Shi, Parham Aarabi, University of Toronto, Canada

ICASSP-12.8 *: A NOVEL SPECTRAL SUBTRACTION SCHEME FOR ROBUST SPEECH RECOGNITION: SPECTRAL SUBTRACTION USING SPECTRAL HARMONICS OF SPEECH III - 633
Jounghoon Beh, Hanseok Ko, Korea University, Republic of Korea

* denotes an ICASSP 2003 presentation.