silicon neurons that phase-lock

6
University of Pennsylvania University of Pennsylvania ScholarlyCommons ScholarlyCommons Departmental Papers (BE) Department of Bioengineering May 2006 Silicon Neurons That Phase-Lock Silicon Neurons That Phase-Lock John H. Wittig Jr University of Pennsylvania Kwabena Boahen Stanford University Follow this and additional works at: https://repository.upenn.edu/be_papers Recommended Citation Recommended Citation Wittig, J. H., & Boahen, K. (2006). Silicon Neurons That Phase-Lock. Retrieved from https://repository.upenn.edu/be_papers/90 Copyright 2006 IEEE. Reprinted from Proceedings of the 2006 IEEE International Symposium on Circuits and Systems, (ISCAS 2006), May 2006, 4 pages. This material is posted here with permission of the IEEE. Such permission of the IEEE does not in any way imply IEEE endorsement of any of the University of Pennsylvania's products or services. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to [email protected]. By choosing to view this document, you agree to all provisions of the copyright laws protecting it. This paper is posted at ScholarlyCommons. https://repository.upenn.edu/be_papers/90 For more information, please contact [email protected].

Upload: others

Post on 13-Feb-2022

2 views

Category:

Documents


0 download

TRANSCRIPT

University of Pennsylvania University of Pennsylvania

ScholarlyCommons ScholarlyCommons

Departmental Papers (BE) Department of Bioengineering

May 2006

Silicon Neurons That Phase-Lock Silicon Neurons That Phase-Lock

John H. Wittig Jr University of Pennsylvania

Kwabena Boahen Stanford University

Follow this and additional works at: https://repository.upenn.edu/be_papers

Recommended Citation Recommended Citation Wittig, J. H., & Boahen, K. (2006). Silicon Neurons That Phase-Lock. Retrieved from https://repository.upenn.edu/be_papers/90

Copyright 2006 IEEE. Reprinted from Proceedings of the 2006 IEEE International Symposium on Circuits and Systems, (ISCAS 2006), May 2006, 4 pages.

This material is posted here with permission of the IEEE. Such permission of the IEEE does not in any way imply IEEE endorsement of any of the University of Pennsylvania's products or services. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to [email protected]. By choosing to view this document, you agree to all provisions of the copyright laws protecting it.

This paper is posted at ScholarlyCommons. https://repository.upenn.edu/be_papers/90 For more information, please contact [email protected].

Silicon Neurons That Phase-Lock Silicon Neurons That Phase-Lock

Abstract Abstract We present a silicon neuron with a dynamic, active leak that enables precise spike-timing with respect to a time-varying input signal. Our neuron models the mammalian bushy cell, which enhances the phase-locking of its acoustically driven inputs. Our model enhances phase-locking by up to 38% (quantified by vector strength) across a 60 dB range of acoustic intensities, and up to 22% over a passive leak. Its conductance-based log-domain design yields a compact and efficient circuit, fabricated in 0.25 /spl mu/m CMOS, that is an ideal timing-enhancing component for neuromorphic speech recognition systems.

Comments Comments Copyright 2006 IEEE. Reprinted from Proceedings of the 2006 IEEE International Symposium on Circuits and Systems, (ISCAS 2006), May 2006, 4 pages.

This material is posted here with permission of the IEEE. Such permission of the IEEE does not in any way imply IEEE endorsement of any of the University of Pennsylvania's products or services. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to [email protected]. By choosing to view this document, you agree to all provisions of the copyright laws protecting it.

This conference paper is available at ScholarlyCommons: https://repository.upenn.edu/be_papers/90

Silicon Neurons That Phase-LockJohn H. Wittig Jr and Kwabena Boahen

Department of Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania 19104{jwittig, boahen}@seas .upenn. edu

Abstract-We present a silicon neuron with a dynamic, ac- LPFtive leak that enables precise spike-timing with respectto-ancingtime-varying input signal. Our neuron models the mammalian te _ - * Kbushy cell, which enhances the phase-locking of its acoustically sip

driven inputs. Our model enhances phase-locking by up to 38% >intensities, and up to 22% over a passive leak. Its conductance-based log-domain design yields a compact and efficient circuit, HIfabricated in 0.25 Mom CMOS, that is an ideal timing-enhancingmcomponent for neuromorphic speech recognition systems. j

. IMPROVING SPEECH RECOGNITION SYSTEMS l

Automatic speech recognition (ASR) has been sought by K Eengineers for more than 50 years [1]. Unlike human speechRErecognition, which degrades gracefully under progressively AN1 >21deteriorating acoustic conditions, ASR systems are sensitive ANF2 ~ ~ VLto variance in the sound source and to background noise. In >1 *contrast, human listeners improve recognition in noisy envi- _Nronments by incorporating knowledge of a speaker's location,or the pitch of his or her voice (i.e., active listening and ANF13the cocktail party problem [2]). To do so, they exploit the -Najspeech waveform's fine temporal structure [3], discarded byASR preprocessors, which instead extract the instantaneous Fig. 1. Bushy cell circuit. Vm, the membrane voltage, drives gH,energy (but not phase) of numerous frequency channels across the hyperpolarization-activated mixed-cation conductance YKLT the low-

threshold potassium conductance; and YNa, the spike-generating sodiumthe spectrum [1]. By not only including, but enhancing timing conductance. REQ, the spike request, is acknowledged by ACK, whichinformation during preprocessing, ASR systems may access drives 9K, the spike-resetting potassium conductance. ANFi, the auditorythe plethora of sound-cues that humans rely on. nerve input drives 9SYN the synaptic conductance. ICL the current-clamp

transistor, is an excitatory input controlled off chip. Currents produced byCustom-designed hardware models of the auditory periphery these conductances are integrated on Vintt which can be isolated from Vm

are well suited to perform ASR preprocessing they are power by VCL, the voltage-clamp circuitry. The low-pass filter building block isefficient yet can evaluate biologically-inspired computations outlined (LPF).in real-time. Several silicon cochleas have been created thatfrequency-decompose sound by modeling the observed me-chanical behavior of the inner ear [4,5]. Furthermore, inspired The only VLSI model of a CN neuron used a "moderatelyby the biological auditory periphery, circuit designers have high" leak and a fast refractory period to realize a bushymimicked the analog-to-digital conversion that transpires at cell-like response (spikes aligned at the onset of sound) [10].the auditory nerve (AN) by coding the cochlear output as a With a fixed leak, however, only a narrow range of stimulusspike rate [6,7]. intensities elicit this response-below which the neuron fails

In addition to using a spike-rate code to represent the to respond due to excessive inhibition and above which thespectral energy of sound (as decomposed along the length neuron spikes repetitively during the on-phase of a stimulusof the basilar membrane), the mammalian AN represents the cycle. In contrast, when a biological bushy cells is stimulatedphase of an acoustic waveform with spike timing. Sound local- with intense sound its active conductances dynamically limitization computations that compare phase differences between repetitive action-potentials (until the bombardment of synapticthe two ears require 10 ,us timing accuracy, which exceeds the input relaxes during the off-phase of the stimulus cycle).precision of individual AN fibers. Indeed, AN spike timing (as Physiologists identified a large potassium conductance thatwell as other acoustic features) is enhanced in the mammalian activates near rest, gKLT [11L], which has been shown tocochlear nucleus (CN) [8]. In the cat, Joris et al. recorded enhance phase-locking [12]. We show here that gKLT is alsofrom CN cells, and found that, for frequencies less than 1 kHz, responsible for intensity adaptation.globular bushy cells' spike timing was more precise than their Taking a neuromnorphic approach to achieving spike-timningAN inputs, with a sparse code of a single spike per cycle [9]. enhancement, we have implemnented a silicon model of the

0-7803-9390-2/06/$20.00 ©2006 IEEE 4535 ISCAS 2006

globular bushy cell that includes 9KLT. In Section II, we de- 200 Biological Bushy Cellscribe our bushy cell circuit. In Section III, we characterize the Boa1s0 Ktcircuit with static and time-varying stimuli, then demonstrate i Icthat 9KLT enables our bushy cell circuit to phase-lock to an 5 9Hacoustic stimulus over a wide intensity range. In Section IV, - vrest

we summarize our results and their implications for extracting 80 75.. 70.5.0.5.5.4.4.380 75 70 65 60 55 50 45 40 35both sound source pitch and location for ASR. Vmem (mV)

-4II. NEURON IMPLEMENTATION Silion Bushy Cell

9KItThe heart of our silicon bushy cell is a log-domain low pass 22 .

0~~~~~~~~~~~filter (LPF) whose output controls four active conductances . 9Hc - ~Vrest(Fig. 1). We represent the cell's membrane voltage as the 0

current flowing through a subthreshold pMOS device operating 5 1 0 15 29 25 30 35in saturation (Imem Xc et(Vdd-Vm)/UT). Our simplified imple- /memTflA)mentation of the conductance equation ((mem UT/ ) dlmem +I9KLT + /9H dt Fig. 2. Modeling bushy cell conductances. Activation curves for three1mem OC I'SYN +INa +IH ) assumes the relatively fast spike- conductances in the biological bushy cell (top) and in our silicon bushy cellI9KLT 19H /Hrel (bottom). Bias settings: Kilp = 2.3 V; Ksat = 2.0 V; Hdi, = 1.4 V;generating and resetting conductances (gNa and 9K) do not Hsat 1775 V; H=re 2.1 V; Nathr = 1.0 V; Vdd 25 Vcontribute appreciably to the membrane time-constant.

Each active conductance contributes to the cell's response.The fast-activating sodium conductance (gNa oC 1I2em) is re- isolating it from the active conductances' integration nodesponsible for generating spikes; the moderately fast-activating (Vint) by turning the voltage-clamp transistor off (VCL). Topotassium conductance that has a low activation-threshold current-clamp the cell, we apply an off-chip voltage that(9KLT oc Imem/IKslp) is responsible for enhancing timing; represents the log of the input current to Vint through a source-and the slow, hyperpolarization-activated conductance (9H oX follower (ICL). To apply synaptic input, we route 0.1 msIHdiv/Imem) is responsible for setting the resting potential pulses to the thirteen excitatory inputs (ANFJ).while decreasing the membrane time-constant. The propor- Over one thousand copies of our silicon bushy cell weretionality constants (gains) are set by the bias voltages NaThr, included in a 4410-neuron CN chip that we designed, sub-Kslp, and Hdiv, respectively. Additionally, the fast-activating mitted, and tested. With 544,692 transistors in 11.33 mm2potassium conductance (9K) is included to reset the neuron it was fabricated in TSMC's 0.25 pm CMOS process. Inafter it spikes. addition to 1080 bushy cells (15% of the cat's population), the

These active conductances' dependence on Imem is re- chip includes four other CN cell types that enhance differentalized using translinear (subthreshold CMOS) circuits, and acoustic features [8]. AN and CN spikes are communicatedtheir dynamics with LPFs, except for gNa, which responds digitally using the Address Event Representation (AER) [14],instantaneously; it simply uses positive feedback circuitry [13]. and routed to and from a desktop computer via an AER-9H and 9KLT use variants of the LPF with a translinear USB2.0 link. Individual currents (Fig. 1, *) are measured usinginput stage that implements the equations above, and limiting a scanner and scaled to compensate for a pad gain of 1000transistors that set the maximum conductance. By driving (estimated).the integration node (Vint) with current sources, both YH(scaled) and 9KLT dynamically control the membrane time-constant. Additionally, YH sets the resting level through a A. Conductance-Voltage Curvessource follower. Using voltage-clamp, we tuned our silicon bushy cell'sWe designed each conductance's subcircuit to be flexible active conductances to qualitatively match those observed bio-

yet compact. For 9KLT's dependence on Imem, Kslp and Ksat logically. Rothman and Manis [15] formulated g-v curves afteradjust the gain and the saturation level, respectively. We well- characterizing many bushy cells in vitro (Fig. 2 top). In steady-connected the Vm-driven transistor to neutralize kappa (z 0.6 state, their model bushy cell rests at -66 mV. At potentialsin this process), which would otherwise cause a square root- hyperpolarized to rest only YH is active. At potentials slightlylike increase in 9KLT at low Imem levels. For 9H's dependence, depolarized from rest, YKLT, the "low-threshold" conductance,Hdiv effectively shifts the value of Vm where limiting occurs. activates and eventually saturates as the spike-generating gNaHrel balances 9H's relative contributions to the neuron's time- becomes active. Our silicon bushy cell's g-I curves match theconstant and resting level. For 9Na's dependence, NaThr shifts biological ones (Fig. 2 bottom); it rests at inen_ 12 nA.the gain and onset (i.e.. spiking threshold). These biases We model YKLT and YH with static time-constants. whereasenabled us to readily match the biological g-v curves. lRothmran anld Manis use a bell-shaped voltage dependelnce. In

Our silicon bushy cell can be driven by voltage-clamp, their model, YKLT 's timne constant ranges fromn 1.5 to 6.4 ins,current-clamp, or silicon AN synapses. To voltage-clamp the peaking near rest. They did not mneasure the timne-constant ofcell, we tie Vm directly to an off-chip voltage source after gu, but in another CN neuron (octopus cell) it ranges fromn 30

4536

to 175 ms [16]. Measuring the time-constants of each silicon 500 0conductance in voltage-clamp, we set YKLT's to 3 ms (KT = :

E o2.29 V) and 9H'S to 80 ms (HT = 2.4 V). DE g eledTogether, YKLT and YH dynamically set the neuron's mem- -__

brane time-constant, yielding a 1 ms time-constant at rest inRothman and Manis's model. Measuring the membrane time- 50 0

constant in current-clamp by injecting a small excitatory pulse :lthat does not significantly activate YKLT, we set our bushy ..cell's membrane time-constant to 1.2 ms at rest by scaling

E

disabledCYKLT and 9H's g-I curves.

To probe the role of YKLT, we compare results when 'the conductance is enabled, disabled, and substituted with a '- 50 Q i i

E(large) passive leak. The disabled setting converts YKLT to a Elfast spike-reset conductance by reducing its input sensitivity 0(KsiP = 2.0 V) and time-constant (KT = 2.0 V) while 0 200 400 600 800 1000increasing its maximum output (Ksat 1.0 V). We simulate time (ms)a passive leak by increasing YH (Urel =2.5 V), in addition to ...apassive*leak by increasing 9H (H,,l 2.5 V), in addition to

Fig. 3 9KLT enhances temporal response to injected current. top With YKLTdisabling 9KLT- enabled, Imem follows the 10 Hz square wave's on (370 nA) and off (0 nA)

transitions; spikes (o) are relegated to the on transitions. middle With 9KLTB. Response to current injection disabled, Imem is slow to react, has an elevated resting level, and integrates

Oursilicon bushy cell follows time-varying current-cl charge over successive current pulses, spikes occur with no apparent phaseOur silicon bushy cell follows time-varying current-clamp relation to the stimulus waveform. In this case, we reduced the square wave's

input with both Imem and its spike timing (Fig. 3 top). Similar amplitude to 28 nA to obtain a similar spike-rate. bottom With a passive leak,to the current-clamp response observed experimentally, the Imem follows the input, has a resting level of zero, and spikes only during

membrae rthe on-phase. At this input level (1.69 /uA), slightly above threshold, spikesmembrane responds quickly to a large step input (membrane occur throughout the on-phase.time-constant of a millisecond), fires a solitary spike, thenquickly settles to an elevated level. At the offset of the currentstep, the membrane rapidly dips below the resting level and spike trains based on a computational model of the guinea-then settles back within 20 ms. pig cochlea-AN complex [17]. Our bushy cell is innervated

The fast response and solitary onset spike are a result of by thirteen AN fibers that exhibit a range of spontaneousshunting inhibition by YKLT. As Imem rises, YKLT activates rates, from 0.5 to 150 Hz. We applied 100 presentations ofand shunts the input current preventing spiking. However, AN spikes, generated in response to a 25 ms, 250 Hz tone, atdue to a delay, a solitary spike can occur when gNa elicits 70 dB SPL (with 25 ms between presentations).positive-feedback before YKLT sufficiently kicks in. We chose Compared to its aggregate AN inputs, the bushy cell firesa current-clamp level just sufficient to elicit spikes, which far fewer spikes overall, has a higher proportion of spikesoccur approximately every other cycle. at sound onset, and fires more precisely in phase with theWhen YKLT is disabled, neither Imem nor spike-times follow acoustic stimulus. These characteristics are in line with what

the input (Fig. 3 middle). As the stimulus turns on and off, we observed when exciting the bushy cell with a square-wave'mem rises and decays slowly. Over multiple cycles, Imem current input: YKLT limits spikes to the onset of a strong stimu-integrates up, eventually reaching spike threshold at a random lus and prevents multiple spikes during stimulation (decreasingstimulus phase. We see little relation between stimulus phase the overall spike rate).and spike-times with YKLT disabled. We quantify phase-locking ability by the magnitude of theWhen a passive leak replaces 9KLT, 'mem again follows normalized vector sum of the spike phases (vector strength,

the input, though spike-times are not relegated to the stimulus VS) [9]. A VS of one corresponds to perfect phase-locking,onset (Fig. 3 bottom). Imem rapidly settles to a level dictated where the neuron fires at exactly the same phase on everyby the difference between leak and input currents, unless the cycle. A VS of zero corresponds to no phase-locking, whereinput surpasses the leak, which causes Imem to integrate to the neuron fires at random phases. The bushy cell exhibits bet-spike threshold. As the leak does not increase with increasing ter phase locking (VS = 0.95) than its AN inputs (aggregate'mem, it cannot limit multiple-spikes during the on-phase of VS = 0.82) at 70 dB SPL. In contrast, the bushy cell's VSthe stimulus. In this case we chose an input level sightly above decreases to 0.90 with a passive leak.threshold, which elicits rapid and repetitive spiking throughout The benefits of YKLT (over a passive leak) are most apparentthe on-phase. at high intensity levels (Fig 4. right). At 110 dB SPL, YKLT

mnakes a dramatic difference: It limits mnultiple spikes andC. Response to synaptic input increases phase-locking (VS =0.97) despite a decrease inWith YKLT enabled, our silicon bushy cell accurately follows AN phase-locking (VS =0.60). The passive-leak neuron, on

the phase of a pure-tone acoustic stimulus (Fig. 4 left). To test the other hand, fired mnulLtiplLe timnes per cycle and decreasedour model's response to sound, we generated stochastic AN its phase-locking (VS =0.75). The superior phase-locking

4537

Pure Tone: 250 Hz, 70 dB SPL Pure Tone: 250 Hz, 1110 dB SPL

AN inputs: VS=0821 AN inputs:

KL enabled: VS= 0led: VS 09

passive leak: VS=0.901W passive leak: VS=075]0 0.005 0.01 0.015 0.02 0.025 0 0.5 1 0 0.005 0.01 0.015 0.02 0.025 0 0.5 1

time (sec) phase (cycles) time (sec) phase (cycles)

Fig. 4. 9KLT enhances temporal response to synaptic inputs. Spike rasters (100 trials) and phase histograms in response to 250 Hz pure tone (red) at 70and 110 dB SPL (left and right panels, respectively): top simulated AN fibers (all thirteen are collapsed into a single spike train in each trial); middle siliconbushy cell with gKLT; bottom silicon bushy cell with a passive leak. Phase histograms are computed from spikes occuring after 10 ms, and are normalizedand shifted to be maximal at 0.5 cycles.

1 _ REFERENCES

*- 7 [I] N. Morgan, H. Bourlard and H. Hermansky, "Automatic Speech Recog-5, // / )nition: An Auditory Perspective," in Speech Processing in the Auditoryco) 0.5 AN i MptSystem, S. Greenberg, et al., Eds. New York: Springer, 2004, pp.309-3380 / g enabled [2] S. Haykin and Z. Chen, "The cocktail party problem," Neural Compu-0 f gKLT tation, vol. 17, pp. 1875-902, 2005

(V-passive leak [3] Z. M. Smith, B. Delgutte and A. J. Oxenham, "Chimaeric sounds reveal0 50 60 70 80 90 100 111 120 dichotomies in auditory perception," Nature, vol. 416, pp. 87-90, 2002

Sound intensity (dB SPL) [4] R. E Lyon and C. Mead, "An analog electronic cochlea," IEEE Trans-iactions on Acoustics, Speech, and Signal Processing, vol. 36, pp. l 119-1134, 1988

Fig. 5. gKLT enhances phase-locking over a wide range of intensities. Vector [5] L. Watts, "Cochlear Mechanics: Analysis and Analog VLSI," Californiastrength values for the aggregate AN, our silicon bushy cell with 9KLT, and Institute of Technology, PhD Thesis, 1993with YKLT replaced by a passive leak. [6] A. McEwan and A. van Schaik, "An alternative analog VLSI imple-

mentation of the Meddis inner hair cell model," IEEE InteinationalSymposiia on Circulits and Systems, vol. 4, pp. IV-928-31, 2004

achieved with gKLT occurs from 60 to 120 dB SPL (Fig. 5) [7] A. van Schaik and S.-C. Liu, "AER EAR: A Matched Silicon CochleaPair with Address Event Representation Interface," IEEE Inteinational

IV. CONCLUSION Symposiui on Circulits and Systems, pp. 4213-4216, 2005[8] E. D. Young and D. Oertel, "Cochlear Nucleus," in The synaptic

We have implemented a silicon model of a bushy cell, organization of the brain, G. M. Sheperd, Eds. Oxford: University Press,including four active conductances, whose dependence on 2004, pp.125-164memhrane voltage qualitatively matches those of the

.g

[9] P. X. Joris, et al., "Enhancement of neural synchronization in thevoathesblologscal anteroventral cochlear nucleus. I. Responses to tones at the characteristic

bushy cell. YKLT enables the bushy cell to enhance timing frequency,"J Netrophysiol,vol. 71,pp. 1022-36, 1994over a range of 60 dB SPL. With a passive leak, timing [10] A. van Schaik, E. Fragniere and E. Vittoz, "An analogue electronic

model of Ventral Cochlear Nucleus neurons," IEEE Intermational Con-elnLhalnLcemelnLt degrades as inltelnsity increases. Our gKLT ctr- frence on Microelectronics fbr Neural Networks, vol. 52-59, 1996cuit is efficient and compact: It operates with sub-threshold [11] P. B. Manis and S. 0. Marx, "Outward currents in isolated ventralcurrents using a log-domain low pass filter comprised of only cochlear nucleus neurons," J Neuirasci, vol. 11, pp. 2865-80, 1991

six transistors. By endowigsilconnuron[12] G. Svirskis, et al., "Enhancement of signal-to-noise ratio and phasesix transistors. By endowing slicon neurons in an auditory locking for small inputs by a low-threshold outward current in auditorypreprocessor with this conductance, they will provide precise neurons," J Neurosci, vol. 22, pp. 11019-25, 2002timing cues for extracting sound source pitch and location, in- [13] E. Culurciello, R. Etienne-Cummings and K. Boahen, "A Biomorphic

Digital Image Sensor," IEEE Journal of Solid-State Circuits, vol. 38, no.creasing the rohustness of ASR systems in realL-world acousthc 2, pp. 281-294, 2003environments. [14] K. A. Boahen, "A Burst-Mode Word-Serial Address-Event Channel,"

IEEE Trian.sactions on Circuits and Systems I, vol. 51, no. 7, pp. 1269-ACKNOWLEDGMENT 1300, 2004

W aatheir [15] J. S. Rothman and P. B. Manis, "The roles potassium currents play inWe thank Donata Qertel anld Paul Manis for sharing thelr regulating the electrical activity of ventral cochlear nucleus neurons," Jexpertise in cochlear lnucleus physiology. This work was sup- Ne trophy iol, vol. 89, pp. 3097-11L3 2003ported hy the MOSIS Educational Program (chip fahrication) [16] B. Ramazana and D. Oertel, Hyperpolarizationa-activated, mixed-cation

.. . ' cu~~cOrrent (Ih) in octopus cells of the mammalian cochlear nucleus," Jthe University of Pennsylvania's NIDCD Training Grant in the terophy iol, vol. 84, pp. 806-817, 2000Neurohiology of OtorhinlolLarynlgology (felUlowship to JW), and [17] C, J. Sumner, et al., "A revised model of the inner-hair cell and auditory-the Packard Foundation (felUlowship to KB). nerve complex I Acoutftoc At vol. 111l pp. 21L78-88, 2002

4538