advances in neural information processing …

9
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 10 Proceedings ofthe 1997 Conference edited by Michael I. Jordan, Michael J. Keams and Sara A. Solla A Bradford Book The MIT Press Cambridge, Massachusetts London, England

Upload: others

Post on 17-Nov-2021

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: ADVANCES IN NEURAL INFORMATION PROCESSING …

ADVANCES IN NEURAL INFORMATION

PROCESSING SYSTEMS 10

Proceedings ofthe 1997 Conference

edited by

Michael I. Jordan, Michael J. Keams and Sara A. Solla

A Bradford Book The MIT Press

Cambridge, Massachusetts London, England

Page 2: ADVANCES IN NEURAL INFORMATION PROCESSING …

Contents

Preface xiii

NIPS Committees xv

Reviewers xvii

Part I Cognitive Science

Synchronized Auditory and Cognitive 40 Hz Attentional Streams, and the Impact ofRhythmic Expectation on Auditory Scene Analysis, Bill Baird 3

On Parallel versus Serial Processing: A Computational Study of Visual Search, Eyal Cohen and Eytan Ruppin 10

Task and Spatial Frequency Effects on Face Specialization, Matthew N. Dailey and Garrison W. Cottrell 17

Neural Basis of Object-Centered Representations, Sophie Deneve and Alexandre Pouget 24

A Neural Network Model of Naive Preference and Filial Imprinting in the

Domestic Chick, Lucy E. Hadden 31

Adaptation in Speech Motor Control, John F. Houde and Michael I. Jordan . . . . 38

Learning Human-like Ifriowledge by Singular Value Decomposition: A Progress

Report, Thomas K. Landauer, Darrell Laham and Peter Foltz 45

Mülti-modular Associative Memory, NirLevy, David Hörn and Eytan Ruppin . . . 52

Serial Order in Reading Aloud: Connectionist Models and Neighborhood Structure, Jeanne C. Milostan and Garrison W. Cottrell 59 A Superadditive-Impairment Theory of Optic Aphasia, Michael C. Mozer, Mark Sitton and Martha Farah 66 A Hippocampal Model ofRecognition Memory, Randall C. O'Reilly, Kenneth A. Norman and James L. McClelland 73

Correlates of Attention in a Model of Dynamic Visual Recognition, Rajesh P N. Rao 80

Recurrent Neural Networks Can Learn to Implement Symbol-Sensitive Counting, Paul Rodriguez and JanetWiles . . . 87

Comparison of Human and Machine Word Recognition, Markus Schenkel, Cyril Latimer and Marwan Jabri 94

Part II Neuroscience

Coding of Naturalistic Stimuli by Auditory Midbrain Neurons, Hagai Attias and Christoph E. Schreiner . 103

Page 3: ADVANCES IN NEURAL INFORMATION PROCESSING …

vi Contents

Refractoriness and Neural Precision, Michael J. Berry II and Markus Meister . . . 110

Statistical Models of Conditioning, Peter Dayan and Theresa Long 117

Characterizing Neurons in the Primary Auditory Cortex ofthe Awake Primate Using Reverse Correlation, R. Christopher deChanns and Michael M. Merzenich . 124

Using Helmholtz Machines to Analyze Multi-channel Neuronal Recordings, Virginia R. de Sa, R. Christopher deChanns and Michael M. Merzenich 131

Instabilities in Eye Movement Control: A Model of Periodic Alternating Nystagmus, Ernst R. Dow and Thomas J. Anastasio 138

Hippocampal Model ofRat Spatial Abilities Using Temporal Difference

Learning, David J. Foster, Richard G. M. Morris and Peter Dayan 145

GradientsforRetinotectalMapping,Geof(RyJ.Goodtä\l 152

A Mathematical Model ofAxon Guidance by Diffusible Factors, Geoffrey J. Goodhill 159 Computing withAction Potentials (Invited Talk), John J. Hopfield, Carlos D. Brody and Sam Roweis 166

A Model ofEarly Visual Processing, Laurent M, Jochen Braun, Dale K. Lee and Christof Koch 173

Perturbative M-Sequences for Auditory Systems Identification, Mark Kvale and Christoph E. Schreiner 180

Effects of Spike Urning Underlying Binocular Integration andRivalry in a Neural Model ofEarly Visual Cortex, Erik D. Lumer 187

Dynamic Stochastic Synapses as Computational Units, Wolfgang Maass and Anthony M. Zador 194

Synaptic Transmission: An Information-Theoretic Perspective, Amit Manwani and Christof Koch 201

Toward a Single-Cell Account for Binocular Disparity Tuning: An Energy Model May Be Hiding in Your Dendrites, BartlettW. Mel, Daniel L. Ruderman and Kevin A. Archie 208

Just One View: Invariances in Inferotemporal Cell Tuning, Maximilian Riesenhuber and Tomaso Poggio 215

On the Separation of Signals from Neighboring Cells in Tetrode Recordings, Maneesh Sahani, John S. Pezaris and Richard A. Andersen 222

Independent Component Analysis for Identification ofArtifacts in Magnetoencephalographic Recordings, Ricardo Vigärio, Veikko Jousmäki, Matti Hämäläinen, Riitta Hari and Erkki Oja . 229

Modeling Complex Cells in an Awake Macaque during Natural Image Viewing, William E.Vinje and Jack L. Gallant 236

%

Page 4: ADVANCES IN NEURAL INFORMATION PROCESSING …

Contents vii

Part i n Theory

The Canonical Distortion Measure in Feature Space and 1-NN Classification,

Jonathan Baxter and Peter Bartlett 245

Multiple Threshold Neural Logic, Vasken Bohossian and Jehoshua Brück 252

Generalization in Decision Trees andDNF: Does Size Matter?

Mostefa Golea, Peter Bartlett, Wee Sun Lee and LlewMason 259

Selecting Weighting Factors in Logarithmic Opinion Pools, Tom Heskes 266

New Approximations of Differential Entropyfor Independent Component Analysis and Projection Pursuit, Aapo Hyvärinen 273 Boltzmann Machine Learning Using Mean Field Theory and Linear Response Correction, Hilbert J. Kappen and F. B. Rodriguez 280 Relative Loss Boundsfor Multidimensional Regression Problems, Jyrki Kivinen and Manfred K. Warmuth 287

Asymptotic Theory for Regularization: One-Dimensional Linear Case, Petri Koistinen 294

Two Approaches to Optimal Armealing, ToddK. Leen, Bernhard Schottky and David Saad 301

Structural Risk Minimizationfor Nonparametric Time Series Prediction, RonMeir f: 308

Analytical Study ofthe Interplay between Architecture and Predictability,

Avner Priel, Ido Kanter and David A. Kessler 315

Globally Optimal On-line Learning Rules, Magnus Rattray and David Saad . . . . 322

Minimax and Hamiltonian Dynamics of Excitatory-Inhibitory Networks, H. Sebastian Seung, Tom J. Richardson, Jeffrey C. Lagarias and John J. Hopfield . 329 Data-Dependent Structural Risk Minimizationfor Perceptron Decision Trees, John Shawe-Taylor and Nello Cristianini 336

From Regularization Operators to Support Vector Kernels, Alex J. Smola and Bernhard Schölkopf 343

The Rectified Gaussian Distribution, Nicholas D. Socci, Daniel D. Lee and H. Sebastian Seung 350

On-line Learning from Finite Training Sets in Nonlinear Networks,

Peter Sollich and David Barber 357

Competitive On-line Linear Regression, VolodyaVovk 364

On the Infeasibility of Training Neural Networks with Small Squared Errors, VanH.Vu 371 The Storage Capacity ofa Fully-Connected Committee Machine, Yuansheng Xiong, Chulan Kwon and Jong-Hoon Oh 378

Page 5: ADVANCES IN NEURAL INFORMATION PROCESSING …

vii'i Contents

The Efficiency and the Robustness of Natural Gradient Descent Learning Rule, Howard H. Yang and Shun-ichi Amari 385

Part IV Algorithms and Architecture

Ensemble Learning for Multi-Layer Networks, David Barberand Christopher M. Bishop 395

Radial Basis Functions: A Bayesian Treatment, David Barber and Bernhard Schottky 402

Shared Context Probabilistic Transducers, Yoshua Bengio, Samy Bengio, Jean-Francois Isabelle and Yoram Singer 409

Approximating Posterior Distributions in Belief Networks Using Mixtures, Christopher M. Bishop, Neil Lawrence, Tommi Jaakkola and Michael I. Jordan . . 416

Receptive Field Formation in Natural Scene Environments: Comparison of Single Cell Learning Rules, BrianS. Blais, Nathan Intrator, Harel Shouval and Leon N. Cooper 423

An Annealed Self-Organizing Map for Source Channel Coding, Matthias Burger, Thore Graepel and Klaus Obermayer 430

Incorporating Test Inputs into Learning, Zehra Cataltepe and Malik Magdon-Ismail 437

On Efficient Heuristic Ranking ofHypotheses, Steve Chien, Andre Stechert and Darren Mutz 444

Learning to Order Things, William W. Cohen, Robert E. Schapire and Yoram Singer 451

Regularisation in Sequential Learning Algorithms, Joäo F. G. de Freitas, Mahesan Niranjan and Andrew H. Gee 458

Agnostic Classification ofMarkovian Sequences, RanEl-Yaniv, ShaiFineandNaftaliTishby 465

Ensemble and Modular Approaches for Face Detection: A Comparison, Raphael Feraud and Olivier Bernier . 472

A Revolution: Belief Propagation in Graphs with Cycles, Brendan J. Frey and David J. C. MacKay 479

Hierarchical Non-linear Factor Analysis and Topographie Mops, Zoubin Ghahramani and Geoffrey E. Hinton 486

Regression with Input-dependent Noise: A Gaussian Process Treatment, Paul W. Goldberg, Christopher K. I. Williams and Christopher M. Bishop 493

Linear Concepts and Hidden variables: An Empirical Study, Adam J. Grove and Dan Roth 500

Classification by Pairwise Coupling, Trevor Hastie and Robert Tibshirani 507

Page 6: ADVANCES IN NEURAL INFORMATION PROCESSING …

Contents S /' üc

Unsupervised On-line Learning ofDecision Treesfor Hierarchical Data Analysis, Marcus Held and Joachim M. Buhmann 514

Nonlinear Markov Networks for ContinuoUs Variables,

Reimar Hof mann and Volker Tresp 521

Active Data Clustering, Thomas Hofmann and Joachim M. Buhmann 528

Function Approximation with the Sweeping Hinge Algorithm, Don R. Hush, Fernando Lozano and Bill Home 535 The Error Coding and Substitution PaCTs, Gareüi James and Trevor Hastie . . . . 542

S-Map: A Network with a Simple Self-Organization Algorithmfor Generative Topographie Mappings, Kimmo Kiviluoto and Erkki Oja . 549

Learning Nonlinear Overcomplete Representations for Efficient Coding,

Michaels.LewickiandTerrenceJ. Sejnowski 556

Factorizing Multivariate Function Classes, Juan K. Lin 563

A Framework for Multiple-Instance Learning, Oded Maron and Tomas Lozano-Perez 570 An Application of Reversible-Jump MCMC to Multivariate Spherical Gaussian Mixtures, Alan D. Marrs 577

Estimating Dependency Structure as a Hidden Variable,

Marina Meilä and Michael I. Jordan 584

Combining Classifiers Using Correspondence Analysis, Christopher J. Merz . . . 591

Learning Path Distributions Using Nonequilibrium Diffusion Networks, Paul Mineiro, Javier Movellan and Ruth J. Williams 598 Learning Generative Models with the Up-Propagation Algorithm, Jong-Hoon Oh and H. Sebastian Seung 605

An Incremental Nearest Neighbor Algorithm with Queries, JoelRatsaby 612

RCC Cannot Compute Certain FSA, Even with Arbitrary Transfer Functions, MarkRing 619

EMAlgorithmsfor PCA and SPCA, Sam Roweis 626

Locol Dimensionality Reduction, Stefan Schaal, Sethu Vijayakumar and Christopher G. Atkeson 633

Prior Knowledge in Support Vector Kernels, Bernhard Schölkopf, Patrice Simard, Alex J. Smola and Vladimir Vapnik 640

Training Methods for Adaptive Boosting of Neural Networks,

Holger Schwenk and Yoshua Bengio 647

Learning Continuous Attractors in Recurrent Networks, H. Sebastian Seung . . . 654

Monotonie Networks, Joseph Sill 661

Stacked Density Estimation, Padhraic Smyth and David Wolpert 668

Page 7: ADVANCES IN NEURAL INFORMATION PROCESSING …

X Contents

Bidirectional Retrievalfrom Associative Memory,

Friedrich T. Sommer and Günther Palm 675

Mapping a Manifold of Perceptual Observations, Joshua B. Tenenbaum 682

Graph Matching with Hierarchical Discrete Relaxation, Richard C. Wilson and Edwin R. Hancock 689 Multiplicative Updating Rulefor Blind Separation Derivedfrom the Method of Scoring, Howard H. Yang 696

PartV Implementation

A 1,000-Neuron System with One Million 7-bit Physical Interconnections,

YuzoHirai 705

Silicon Retina with Adaptive Filtering Properties, Shih-ChiiLiu 712

Analog VLSI Model oflntersegmental Coordination with Nearest-Neighbor Coupling, Girish N. Patel, Jeremy H. Holleman and Stephen P. DeWeerth 719 An Analog VLSI Neural Network for Phase-based Machine Vision, Bertram E.Shi and KwokFai Hui 726

Part VI Speech, Handwriting and Signal Processing

Analysis ofDrifting Dynamics with Neural Network Hidden Markov Models, Jens Kohlmorgen, Klaus-Robert Müller and Klaus Pawelzik 735

Bayesian Robustification for Audio Visual Fusion, '**' Javier Movellan and Paul Mineiro 742

Modeling Acoustic Correlations by Factor Analysis, Lawrence Saul and Mazin Rahim 749

Blind Separation of Radio Signals in Fading Channels, Kari Torkkola 756

Hybrid NN/HMM-Based Speech Recognition with a Discriminant Neural Feature Extraction, Daniel Willett and Gerhard Rigoll 763

Part VII Visual Processing

A Non-Parametric Multi-Scale Statistical Model for Natural Images, Jeremy S. De Bonet and Paul A. Viola 773

Recovering Perspective Pose with a Dual Step EMAlgorithm,

Andrew D. J. Gross and Edwin R. Hancock 780

Bayesian Model ofSurface Perception, William T. Freeman and Paul A. Viola . . 787

Features as Sufficient Statistics, Davi Geiger, Archisman Rudra and Laurance T. Maloney 794 Detection of First and Second Order Motion, Alexander Grunewald and Heiko Neumann 801

Page 8: ADVANCES IN NEURAL INFORMATION PROCESSING …

Contents xi

A Simple and Fast Neu'ral Network Approachto Stereovision, Rolf D.Henkel . . . 808

Inferring Sparse, Overcomplete Image Codes Using an Efficient Coding

Framework, Michael S. Lewicki and Bruno Ä. Olshausen 815

Visual Navigation in a Robot Using Zig-Zag Behavior, M. Anthony Lewis 822

2D Observersfor Human 3D Object Recognition? Zili Liu and Daniel Kersten . . 829

Self-similarity Properties of Natural Images, Antonio Turiel, Germän Mato, Nestor Parga and Jean-Pierre Nadal 836 Multiresolution Tangent Distance for Affine-invariant Classification, Nuno Vasconcelos and Andrew Lippman 843

Phase Transitions and the Perceptual Organization of Video Sequences, YairWeiss 850

PartVIÜ Applications

Using Expectation to Guide Processing: A Study ofThree Real-World Applications, Shumeet Baluja 859

Structure Driven Image Database Retrieval, Jeremy S. De Bonet and Paul A. Viola 866

A General Purpose Image Processing Chip: Orientation Detection, Ralph Etienne-Curnnüngs and Donghui Cai 873

An Analog VLSI Model ofthe Fly Elementary Motion Detector, ReidR. Harrison and Christof Koch 880

MELONETI: Neural Nets for Inventing Baroque-Style Chorale Variations, Dominik Hörnel 887

Extended ICA Removes Artifacts from Electroencephalographic Recordings, Tzyy-Ping Jung, Colin Humphries, Te-Won Lee, Scott Makeig, Martin J. McKeown, Vicente Iragui and Terrence J. Sejnowski 894

A Generic Approachfor Identification of Event Related Brain Potentials via a Competitive Neural Network Structure, Daniel H. Lange, Hava T. Siegelmann, Hillel Pratt and Gideon F. Inbar 901

A Neural Network Based Head Tracking System,

Daniel D. Lee and H. Sebastian Seung 908

Wavelet Models for Video Time-Series, ShengMa and ChuanyiJi 915

Reinforcement Leaming for Call Admission Control and Routing in Integrated Service Networks, Peter Marbach, Oliver Mihatsch, Miriam Schulte and John N. Tsitsiklis 922 Leaming to Schedule Straight-Line Code, Eliot Moss, Paul Utgoff, John Cavazos, Doina Precup, Darko Stefanovic, Carla Brodley and David Scheeff . 929

Enhancing Q-Learning for Optimal Asset Allocation, Ralph Neuneier 936

Page 9: ADVANCES IN NEURAL INFORMATION PROCESSING …

XU Contents

Intrusion Detection with Neural Networks, Jake Ryan, Meng-Jang Lin and Risto Miikkulainen 943

Incorporating Contextual Information in White Blood Cell Identification, Xubo Song, Yaser Abu-Mostafa, Joseph Sill and Harvey Kasdan 950

Bach in a Box—Real-Time Harmony, Randall R. Spangler, Rodney M. Goodman and Jim Hawkins 957

Experiences with Bayesian Learning in a Real World Application, Peter Sykacek, Georg Dorffher, Peter Rappelsbergerand Josef Zeitlhofer 964

A Solution for Missing Data in Recurrent Neural Networks with an Application to Blood Glucose Prediction, Volker Tresp and Thomas Briegel 971

Use ofa Multi-Layer Perceptron to Predict Malignancy in Ovarian Tumors,

Herman Verreist, Yves Moreau, Joos Vandewalle and Dirk Trmmerman 978

Modelling Seasonality and Trends in Daily Rainfall Data, Peter M. Williams . . . 985

The Observer-Observation Dilemma in Neuro-Forecasting, Hans Georg Zimmermann and Ralph Neüneier 992

Part IX Control, Navigation and Planning

Generalized Prioritized Sweeping, David Andre, Nir Friedman and Ronald Parr . . 1001

Nonparametric Model-Based Reinforcement Learning, Christopher G. Atkeson . . 1008

An Improved Policy Iteration Algorithmfor Partially Observable MDPs, Eric A. Hansen > 1015

Automated Aircraft Recovery via Reinforcement Learning: Initial Experiments, Jeffrey F. Monaco, David G. Ward and Andrew G. Barto 1022

Reinforcement Learning for Continuous Stochastic Control Problems,

Remi Munos and Paul Bourgine 1029

Adaptive Choice ofGrid and Time in Reinforcement Learning, Stephan Pareigis . . 1036

Reinforcement Learning with Hierarchies of Machines, Ronald Parr and Stuart Russell 1043 Multi-time Models for Temporally Abstract Planning, Doina Precup and Richard S. Sutton 1050

How to Dynamically Merge Markov Decision Processes,

Satinder Singh and David Cohn 1057

The Asymptotic Convergence-Rate of Q-karning, Csäba. Szepesväri . . . . . . . . 1064

Hybrid Reinforcement Learning and Its Application to Biped Robot Control, Satoshi Yamada, AkiraWatanabeandMichioNakashima 1071 Index ofAuthors 1079

Keyword Index 1083